Program logic based software plagiarism detection

Fangfang Zhang, Dinghao Wu, Peng Liu, Sencun Zhu

Research output: Contribution to journalConference article

20 Scopus citations

Abstract

Software plagiarism, an act of illegally copying others' code, has become a serious concern for honest software companies and the open source community. In this paper, we propose LoPD, a program logic based approach to software plagiarism detection. Instead of directly comparing the similarity between two programs, LoPD searches for any dissimilarity between two programs by finding an input that will cause these two programs to behave differently, either with different output states or with semantically different execution paths. As long as we can find one dissimilarity, the programs are semantically different, but if we cannot find any dissimilarity, it is likely a plagiarism case. We leverage symbolic execution and weakest precondition reasoning to capture the semantics of execution paths and to find path dissimilarities. LoPD is more resilient to current automatic obfuscation techniques, compared to the existing detection mechanisms. In addition, since LoPD is a formal program semantics-based method, it can provide a guarantee of resilience against many known obfuscation attacks. Our evaluation results indicate that LoPD is both effective and efficient in detecting software plagiarism.

Original languageEnglish (US)
Article number6982615
Pages (from-to)66-77
Number of pages12
JournalProceedings - International Symposium on Software Reliability Engineering, ISSRE
DOIs
StatePublished - Dec 11 2014
Event25th IEEE International Symposium on Software Reliability Engineering, ISSRE 2014 - Naples, Italy
Duration: Nov 3 2014Nov 6 2014

    Fingerprint

All Science Journal Classification (ASJC) codes

  • Software
  • Safety, Risk, Reliability and Quality

Cite this