M3: Semantic API Migrations

Software Engineering 2020-08-28 v1

Abstract

Library migration is a challenging problem, where most existing approaches rely on prior knowledge. This can be, for example, information derived from changelogs or statistical models of API usage. This paper addresses a different API migration scenario where there is no prior knowledge of the target library. We have no historical changelogs and no access to its internal representation. To tackle this problem, this paper proposes a novel approach (M3), where probabilistic program synthesis is used to semantically model the behavior of library functions. Then, we use an SMT-based code search engine to discover similar code in user applications. These discovered instances provide potential locations for API migrations. We evaluate our approach against 7 well-known libraries from varied application domains, learning correct implementations for 94 functions. Our approach is integrated with standard compiler tooling, and we use this integration to evaluate migration opportunities in 9 existing C/C++ applications with over 1 MLoC. We discover over 7,000 instances of these functions, of which more than 2,000 represent migration opportunities.
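
To make the idea of modeling a library function purely from its observable behavior concrete, the following is a minimal sketch of a toy enumerative search driven by input/output examples. It is not the probabilistic synthesizer described in the paper: the black-box function `blackbox_sum`, the candidate space (`INITS`, `OPS`), and the helper names are all hypothetical and chosen only for illustration.

```python
import itertools
import random

# Hypothetical black-box "library function": it can be called,
# but its source is assumed to be unavailable.
def blackbox_sum(xs):
    return sum(xs)

# Hypothetical candidate space: a fold over a list, described by an
# initial accumulator value and a binary update operator.
INITS = {"0": 0, "1": 1}
OPS = {
    "acc + x": lambda acc, x: acc + x,
    "acc * x": lambda acc, x: acc * x,
    "max(acc, x)": lambda acc, x: max(acc, x),
}

def run_candidate(init, op, xs):
    """Evaluate one candidate fold on a single input list."""
    acc = init
    for x in xs:
        acc = op(acc, x)
    return acc

def synthesize(blackbox, trials=200, seed=0):
    """Return the first candidate that agrees with the black box on
    every randomly generated input, or None if no candidate does."""
    rng = random.Random(seed)
    inputs = [
        [rng.randint(-9, 9) for _ in range(rng.randint(0, 6))]
        for _ in range(trials)
    ]
    expected = [blackbox(xs) for xs in inputs]
    for (init_name, init), (op_name, op) in itertools.product(
        INITS.items(), OPS.items()
    ):
        if all(
            run_candidate(init, op, xs) == out
            for xs, out in zip(inputs, expected)
        ):
            return init_name, op_name
    return None

if __name__ == "__main__":
    print(synthesize(blackbox_sum))
```

Agreement on random inputs is only evidence of equivalence, not a proof; a learned model of this kind would then serve as the pattern that a separate code search step tries to match in user applications.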

Cite

@article{arxiv.2008.12118,
  title  = {M3: Semantic API Migrations},
  author = {Bruce Collie and Philip Ginsbach and Jackson Woodruff and Ajitha Rajan and Michael O'Boyle},
  journal= {arXiv preprint arXiv:2008.12118},
  year   = {2020}
}

Comments

Accepted to ASE 2020
