On how often code is cloned across repositories
Detecting code duplication in large code bases, or even across project boundaries, is problematic due to the massive amount of data involved. Large-scale clone detection also opens new challenges beyond asking for the provenance of a single clone fragment, such as assessing the prevalence of code clones on the entire code base, and their evolution. We propose a set of lightweight techniques that may scale up to very large amounts of source code in the presence of multiple versions. The common idea behind these techniques is to use bad hashing to get a quick answer. We report on a case study, the Squeaksource ecosystem, which features thousands of software projects, with more than 40 million versions of methods, across more than seven years of evolution. We provide estimates for the prevalence of type-1, type-2, and type-3 clones in Squeaksource.
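The idea of trading hash precision for speed can be illustrated with a minimal sketch. It is not the authors' implementation: the tokenizer, the function names (`type1_key`, `type2_key`, `group_clones`), the sample methods, and the use of SHA-1 are all assumptions made for illustration. The type-2 key is deliberately lossy, so methods that differ only in identifiers or numeric literals collide in the same bucket.

```python
import hashlib
import re
from collections import defaultdict

# Hypothetical, language-agnostic tokenizer: identifiers, integers, or any
# single non-whitespace character. A real tool would use a proper lexer.
TOKEN = re.compile(r"[A-Za-z_]\w*|\d+|\S")


def type1_key(source: str) -> str:
    """Hash that ignores only whitespace and layout differences."""
    tokens = TOKEN.findall(source)
    return hashlib.sha1(" ".join(tokens).encode("utf-8")).hexdigest()


def type2_key(source: str) -> str:
    """Deliberately lossy hash: every identifier becomes ID, every number NUM."""
    normalized = []
    for tok in TOKEN.findall(source):
        if tok[0].isalpha() or tok[0] == "_":
            normalized.append("ID")   # keywords collapse too; this is coarse on purpose
        elif tok[0].isdigit():
            normalized.append("NUM")
        else:
            normalized.append(tok)
    return hashlib.sha1(" ".join(normalized).encode("utf-8")).hexdigest()


def group_clones(methods, key):
    """Bucket method names by hash key; buckets with 2+ members are clone candidates."""
    buckets = defaultdict(list)
    for name, src in methods.items():
        buckets[key(src)].append(name)
    return [sorted(names) for names in buckets.values() if len(names) > 1]


# Made-up sample data, loosely Smalltalk-flavoured.
methods = {
    "A>>sum": "total := a + b.  ^ total",
    "B>>sum": "total := a + b. ^total",
    "C>>add": "result := x + y. ^ result",
}

type1_groups = group_clones(methods, type1_key)
type2_groups = group_clones(methods, type2_key)
```

Because each method version is reduced to a single key in one pass, grouping costs roughly linear time in the number of versions. The price is that candidates sharing a bucket may be false positives and would need a more precise comparison afterwards.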
Showing items related by title, author, creator and subject.
Brixtel R; Fontaine M; Lesner B; Bazin C; Robbes R (IEEE Computer Society, 2010) Clone detection is usually applied in the context of detecting small- to medium-scale fragments of duplicated code in large software systems. In this paper, we address the problem of clone detection applied to plagiarism ...
Cloning, expression and characterization of a beta-D-xylosidase from Lactobacillus rossiae DSM 15814(T) Pontonio E; Mahony J; Di Cagno E; O’Connell Motherway M; Lugli GA; O’Callaghan A; De Angelis M; Ventura M; Gobbetti M; van Sinderen D (2016) Background: Among the oligosaccharides that may positively affect the gut microbiota, xylo-oligosaccharides (XOS) and arabinoxylan oligosaccharides (AXOS) possess promising functional properties. Ingestion of XOS has been ...
Cloning, purification, crystallization and 1.57 angstrom resolution X-ray data analysis of AmsI, the tyrosine phosphatase controlling amylovoran biosynthesis in the plant pathogen Erwinia amylovora Benini S; Caputi L; Cianci M (WILEY-BLACKWELL, 2014) The Gram-negative bacterium Erwinia amylovora is a destructive pathogen of plants belonging to the Rosaceae family. Amongst its pathogenicity factors, E. amylovora produces the exopolysaccharide amylovoran, which contributes ...