Software Defect Prediction using Tree-Based Ensembles (ESEC/FSE 2020 - PROMISE 2020)

Fri 6 - Mon 16 November 2020 Sacramento, California, United States

Who

Hamoud Aljamaan, Amal Alazba

Track

ESEC/FSE 2020 PROMISE

Time Zone

Times are shown in Coordinated Universal Time (UTC).

When

Thu 5 Nov 2020 16:00 - 16:20 at Virtual room 1 - Defect

Abstract

Software defect prediction is an active research area in software engineering. Accurate prediction of software defects helps software engineers direct quality assurance activities so as to make the best use of testing resources, reduce maintenance costs, and deliver quality software products. In machine learning research, ensemble learning has been shown to improve prediction performance over individual machine learning models. Many boosting ensembles have recently been proposed in the literature, but their prediction capabilities have not been investigated in defect prediction. In this paper, we empirically investigate the prediction performance of seven tree-based ensembles in defect prediction: AdaBoost, Random Forest, Extra Trees, Gradient Boosting, Hist Gradient Boosting, XGBoost, and CatBoost. The study used 11 publicly available NASA MDP software defect datasets. Empirical results indicate the superiority of the Random Forest and Extra Trees ensembles over the other ensembles. However, none of the ensembles performed significantly worse than individual decision trees. Finally, AdaBoost was the worst-performing ensemble.

Hamoud Aljamaan

King Fahd University of Petroleum and Minerals

Saudi Arabia

Amal Alazba

King Saud University