CBCD: Cloned Buggy Code Detector

Download: PDF, slides (PowerPoint), TR UW-CSE-11-05-02.

“CBCD: Cloned Buggy Code Detector” by Jingyue Li and Michael D. Ernst. In ICSE 2012, Proceedings of the 34th International Conference on Software Engineering, (Zürich, Switzerland), June 2012, pp. 310-320.
A previous version appeared as University of Washington Department of Computer Science and Engineering technical report UW-CSE-11-05-02, (Seattle, WA, USA), May 2, 2011. Revised October 2011.

Abstract

Developers often copy, or clone, code in order to reuse or modify functionality. When they do so, they also clone any bugs in the original code. Or, different developers may independently make the same mistake. As one example of a bug, multiple products in a product line may use a component in a similar wrong way. This paper makes two contributions. First, it presents an empirical study of cloned buggy code. In a large industrial product line, about 4% of the bugs are duplicated across more than one product or file. In three open source projects (the Linux kernel, the Git version control system, and the PostgreSQL database) we found 282, 33, and 33 duplicated bugs, respectively. Second, this paper presents a tool, CBCD, that searches for code that is semantically identical to given buggy code. CBCD tests graph isomorphism over the Program Dependency Graph (PDG) representation and uses four optimizations. We evaluated CBCD by searching for known clones of buggy code segments in the three projects and compared the results with text-based, token-based, and AST-based code clone detectors, namely Simian, CCFinder, Deckard, and CloneDR. The evaluation shows that CBCD is fast when searching for possible clones of the buggy code in a large system, and it is more precise for this purpose than the other code clone detectors.

Download: PDF, slides (PowerPoint), TR UW-CSE-11-05-02.

BibTeX entry:

@inproceedings{LiE2012,
   author = {Jingyue Li and Michael D. Ernst},
   title = {{CBCD}: {Cloned} {Buggy} {Code} {Detector}},
   booktitle = {ICSE 2012, Proceedings of the 34th International
	Conference on Software Engineering},
   pages = {310--320},
   address = {Z{\"u}rich, Switzerland},
   month = jun,
   year = {2012}
}

(This webpage was created with bibtex2web.)

Back to Michael Ernst's publications.