Thu 12 Nov 2020 08:00 - 08:02 at Virtual room 1 - Analysis 3

Today, most developers bundle changes into commits that they submit to a shared code repository. Tangled commits intermix distinct concerns, such as a bug fix and a new feature. They cause issues for developers, reviewers, and researchers alike: they restrict the usability of tools such as git bisect, make patch comprehension more difficult, and force researchers who mine software repositories to contend with noise. We present a novel data structure, the 𝛿-NFG, a multiversion Program Dependency Graph augmented with name flows. A 𝛿-NFG directly and simultaneously encodes different program versions, thereby capturing commits, and annotates data flow edges with the names/lexemes that flow across them. Our technique, Flexeme, builds a 𝛿-NFG from commits, then applies Agglomerative Clustering using Graph Similarity to that 𝛿-NFG to untangle its commits. At the untangling task on a C# corpus, our implementation, Heddle, improves the state-of-the-art on accuracy by 0.14, achieving 0.81, in a fraction of the time: Heddle is 32 times faster than the previous state-of-the-art.
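The pipeline described above (a multiversion graph whose data-flow edges carry names, followed by agglomerative clustering on a similarity measure) can be illustrated with a small sketch. It is not Heddle's implementation: every identifier below (`Edge`, `names_touching`, `untangle`, the `threshold` value) is hypothetical, the graph is a toy, and Jaccard overlap of flowing names with single linkage is a simple stand-in assumed here in place of the graph similarity the abstract refers to.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class Edge:
    src: str
    dst: str
    kind: str       # "data" or "control"
    name: str = ""  # lexeme flowing along a data edge (empty for control edges)


def names_touching(node, edges):
    """Names that flow into or out of a node along annotated edges."""
    return {e.name for e in edges if e.name and node in (e.src, e.dst)}


def jaccard(a, b):
    """Set overlap in [0, 1]; two empty sets count as dissimilar."""
    return len(a & b) / len(a | b) if (a or b) else 0.0


def untangle(nodes, edges, threshold=0.5):
    """Single-linkage agglomerative clustering of the changed nodes.

    `nodes` maps a node id to its version label ("added", "deleted",
    or "unchanged"); only changed nodes are clustered. The similarity
    measure and the threshold are assumptions made for this sketch.
    """
    changed = [n for n, version in nodes.items() if version != "unchanged"]
    names = {n: names_touching(n, edges) for n in changed}
    clusters = [[n] for n in changed]

    def link(c1, c2):
        # Single linkage: similarity of the closest pair of members.
        return max(jaccard(names[a], names[b]) for a in c1 for b in c2)

    while len(clusters) > 1:
        i, j = max(
            combinations(range(len(clusters)), 2),
            key=lambda p: link(clusters[p[0]], clusters[p[1]]),
        )
        if link(clusters[i], clusters[j]) < threshold:
            break  # the remaining clusters are treated as separate concerns
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]  # safe because i < j
    return [sorted(c) for c in clusters]


# Toy commit: s1-s3 belong to one concern (name "count"),
# s4-s5 to another (name "logger"); s0 is unchanged context.
nodes = {"s0": "unchanged", "s1": "added", "s2": "added",
         "s3": "deleted", "s4": "added", "s5": "added"}
edges = [
    Edge("s0", "s1", "control"),
    Edge("s1", "s2", "data", "count"),
    Edge("s2", "s3", "data", "count"),
    Edge("s4", "s5", "data", "logger"),
]
clusters = untangle(nodes, edges)
print(clusters)
```

In this toy commit the statements that share the name `count` end up in one cluster and those that share `logger` in another, which is the intuition behind using name flows as an untangling signal.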

Full Presentation Slides (Flexeme_FSE_profirpp.pdf), 1.3 MiB

Conference Day
Thu 12 Nov

Displayed time zone: (UTC) Coordinated Universal Time

08:00 - 08:30
08:00
2m
Talk
Flexeme: Untangling Commits Using Lexical Flows
Research Papers
Profir-Petru Pârțachi (University College London, UK); Santanu Kumar Dash (University of Surrey, UK); Miltiadis Allamanis (Microsoft Research, UK); Earl T. Barr (University College London, UK)
08:03
1m
Talk
FREPA: An Automated and Formal Approach to Requirement Modeling and Analysis in Aircraft Control Domain
Industry Papers
Jincao Feng (East China Normal University); Weikai Miao (East China Normal University, China); Hanyue Zheng (East China Normal University); Yihao Huang (East China Normal University); Jianwen Li (East China Normal University, China); Zheng Wang (Beijing Sunwise Information Technology, China); Ting Su (East China Normal University, China); Bin Gu (Beijing Institute of Control Engineering, China); Geguang Pu (Shanghai Trusted Industrial Control Platform, China); Mengfei Yang (China Academy of Space Technology, China); Jifeng He (Shanghai Key Lab of Trustworthy Computing, China)
08:05
1m
Talk
Mining Assumptions for Software Components using Machine Learning
Research Papers
Khouloud Gaaloul (University of Luxembourg, Luxembourg); Claudio Menghi (University of Luxembourg, Luxembourg); Shiva Nejati (University of Ottawa, Canada / University of Luxembourg, Luxembourg); Lionel C. Briand (University of Ottawa, Canada / University of Luxembourg, Luxembourg); David Wolfe (QRA, Canada)
08:07
1m
Talk
Mining Input Grammars from Dynamic Control Flow
Research Papers
Rahul Gopinath (CISPA, Germany); Björn Mathis (CISPA, Germany); Andreas Zeller (CISPA, Germany)
08:09
1m
Talk
TypeWriter: Neural Type Prediction with Search-Based Validation
Research Papers
Michael Pradel (University of Stuttgart, Germany); Georgios Gousios (Facebook & Delft University of Technology); Jason Liu (Facebook, USA); Satish Chandra (Facebook, USA)
08:11
19m
Talk
Conversations on Analysis 3
Paper Presentations
Khouloud Gaaloul (University of Luxembourg, Luxembourg); Michael Pradel (University of Stuttgart, Germany); Profir-Petru Pârțachi (University College London, UK); Rahul Gopinath (CISPA, Germany); M: Dan Hao (Peking University, China)