BugsInPy: A Database of Existing Bugs in Python Programs to Enable Controlled Testing and Debugging Studies
The 2019 edition of Stack Overflow developer survey highlights that, for the first time, Python outperformed Java in terms of popularity. The gap between Python and Java further widened in the 2020 edition of the survey. Unfortunately, despite the rapid increase in Python's popularity, there are not many testing and debugging tools that are designed for Python. This is in stark contrast with the abundance of testing and debugging tools for Java. Thus, there is a need to push research on tools that can help Python developers.
One factor that contributed to the rapid growth of Java testing and debugging tools is the availability of benchmarks. A popular benchmark is the Defects4J benchmark; its initial version contained 357 real bugs from 5 real-world Java programs. Each bug comes with a test suite that can expose the bug. Defects4J has been used by hundreds of testing and debugging studies and has helped to push the frontier of research in these directions.
In this project, inspired by Defects4J, we create another benchmark database and tool that contain 493 real bugs from 17 real-world Python programs. We hope our benchmark can help catalyze future work on testing and debugging tools that work on Python programs.
Conference DayFri 13 NovDisplayed time zone: (UTC) Coordinated Universal Time change
01:00 - 01:30
|A Randomized Controlled Trial on the Effects of Embedded Computer Language Switching|
P. Merlin UesbeckUniversity of Nevada at Las Vegas, USA, Cole S. PetersonUniversity of Nebraska-Lincoln, USA, Bonita SharifUniversity of Nebraska-Lincoln, USA, Andreas StefikUniversity of Nevada at Las Vegas, USADOI
|BugsInPy: A Database of Existing Bugs in Python Programs to Enable Controlled Testing and Debugging Studies|
Ratnadira WidyasariSingapore Management University, Singapore, Sheng Qin SimSingapore Management University, Singapore, Camellia LokSingapore Management University, Singapore, Haodi QiSingapore Management University, Singapore, Jack PhanSingapore Management University, Singapore, Qijin TaySingapore Management University, Singapore, Constance TanSingapore Management University, Singapore, Fiona WeeSingapore Management University, Singapore, Jodie Ethelda TanSingapore Management University, Singapore, Yuheng YiehSingapore Management University, Singapore, Brian GohSingapore Management University, Singapore, Ferdian ThungSingapore Management University, Hong Jin KangSingapore Management University, Singapore, Thong HoangSingapore Management University, Singapore, David LoSingapore Management University, Ouh Eng LiehSingapore Management University, SingaporeDOI
|Do Code Review Measures Explain the Incidence of Post-Release Defects? Case Study Replications and Bayesian Networks|
|On the Naturalness of Hardware Descriptions|
Jaeseong LeeUniversity of Texas at Austin, USA, Pengyu NieUniversity of Texas at Austin, USA, Junyi Jessy LiUniversity of Texas at Austin, USA, Milos GligoricUniversity of Texas at AustinDOI
|Understanding Build Issue Resolution in Practice: Symptoms and Fix Patterns|
Yiling LouPeking University, China, Zhenpeng ChenPeking University, China, Yanbin CaoPeking University, China, Dan HaoPeking University, China, Lu ZhangPeking University, ChinaDOI
|Understanding Type Changes in Java|
Ameya KetkarOregon State University, USA, Nikolaos TsantalisConcordia University, Canada, Danny DigUniversity of Colorado Boulder, USADOI Pre-print Media Attached
|Conversations on Empirical 2|