English

Code Similarity on High Level Programs

Computer Vision and Pattern Recognition 2007-10-31 v1 Data Structures and Algorithms

Abstract

This paper presents a new approach for code similarity on High Level programs. Our technique is based on Fast Dynamic Time Warping, that builds a warp path or points relation with local restrictions. The source code is represented into Time Series using the operators inside programming languages that makes possible the comparison. This makes possible subsequence detection that represent similar code instructions. In contrast with other code similarity algorithms, we do not make features extraction. The experiments show that two source codes are similar when their respective Time Series are similar.

Keywords

Cite

@article{arxiv.0710.5547,
  title  = {Code Similarity on High Level Programs},
  author = {M. Miron Bernal and H. Coyote Estrada and J. Figueroa Nazuno},
  journal= {arXiv preprint arXiv:0710.5547},
  year   = {2007}
}

Comments

Proceedings of the 18th Autumn Meeting on Communications, Computers, Electronics and Industrial Exposition. (IEEE - ROCC07). Acapulco, Guerrero, Mexico. 2007

R2 v1 2026-06-21T09:37:45.828Z