TY - JOUR
T1 - A novel composite model approach to improve software quality prediction
AU - Bouktif, Salah
AU - Ahmed, Faheem
AU - Khalil, Issa
AU - Antoniol, Giuliano
PY - 2010/12
Y1 - 2010/12
N2 - Context:: How can quality of software systems be predicted before deployment? In attempting to answer this question, prediction models are advocated in several studies. The performance of such models drops dramatically, with very low accuracy, when they are used in new software development environments or in new circumstances. Objective: The main objective of this work is to circumvent the model generalizability problem. We propose a new approach that substitutes traditional ways of building prediction models which use historical data and machine learning techniques. Method: In this paper, existing models are decision trees built to predict module fault-proneness within the NASA Critical Mission Software. A genetic algorithm is developed to combine and adapt expertise extracted from existing models in order to derive a "composite" model that performs accurately in a given context of software development. Experimental evaluation of the approach is carried out in three different software development circumstances. Results: The results show that derived prediction models work more accurately not only for a particular state of a software organization but also for evolving and modified ones. Conclusion: Our approach is considered suitable for software data nature and at the same time superior to model selection and data combination approaches. It is then concluded that learning from existing software models (i.e., software expertise) has two immediate advantages; circumventing model generalizability and alleviating the lack of data in software-engineering.
AB - Context:: How can quality of software systems be predicted before deployment? In attempting to answer this question, prediction models are advocated in several studies. The performance of such models drops dramatically, with very low accuracy, when they are used in new software development environments or in new circumstances. Objective: The main objective of this work is to circumvent the model generalizability problem. We propose a new approach that substitutes traditional ways of building prediction models which use historical data and machine learning techniques. Method: In this paper, existing models are decision trees built to predict module fault-proneness within the NASA Critical Mission Software. A genetic algorithm is developed to combine and adapt expertise extracted from existing models in order to derive a "composite" model that performs accurately in a given context of software development. Experimental evaluation of the approach is carried out in three different software development circumstances. Results: The results show that derived prediction models work more accurately not only for a particular state of a software organization but also for evolving and modified ones. Conclusion: Our approach is considered suitable for software data nature and at the same time superior to model selection and data combination approaches. It is then concluded that learning from existing software models (i.e., software expertise) has two immediate advantages; circumventing model generalizability and alleviating the lack of data in software-engineering.
KW - Decision trees
KW - Fault-proneness
KW - Genetic algorithm
KW - Software quality prediction
UR - https://www.scopus.com/pages/publications/77957365028
UR - https://www.sciencedirect.com/science/article/pii/S095058491000220X
U2 - 10.1016/j.infsof.2010.07.003
DO - 10.1016/j.infsof.2010.07.003
M3 - Article
AN - SCOPUS:77957365028
SN - 0950-5849
VL - 52
SP - 1298
EP - 1311
JO - Information and Software Technology
JF - Information and Software Technology
IS - 12
ER -