Predicting continuous integration build failures using evolutionary search
Created by W.Langdon from
gp-bibliography.bib Revision:1.8051
- @Article{SAIDANI:2020:IST,
-
author = "Islem Saidani and Ali Ouni and Moataz Chouchen and
Mohamed Wiem Mkaouer",
-
title = "Predicting continuous integration build failures using
evolutionary search",
-
journal = "Information and Software Technology",
-
volume = "128",
-
pages = "106392",
-
year = "2020",
-
ISSN = "0950-5849",
-
DOI = "doi:10.1016/j.infsof.2020.106392",
-
URL = "http://www.sciencedirect.com/science/article/pii/S0950584920301579",
-
keywords = "genetic algorithms, genetic programming, Continuous
integration, Build prediction, Multi-Objective
optimization, Search-Based software engineering,
Machine learning",
-
abstract = "Context: Continuous Integration (CI) is a common
practice in modern software development and it is
increasingly adopted in the open-source as well as the
software industry markets. CI aims at supporting
developers in integrating code changes constantly and
quickly through an automated build process. However, in
such context, the build process is typically time and
resource-consuming which requires a high maintenance
effort to avoid build failure. Objective: The goal of
this study is to introduce an automated approach to cut
the expenses of CI build time and provide support tools
to developers by predicting the CI build outcome.
Method: In this paper, we address problem of CI build
failure by introducing a novel search-based approach
based on Multi-Objective Genetic Programming (MOGP) to
build a CI build failure prediction model. Our approach
aims at finding the best combination of CI built
features and their appropriate threshold values, based
on two conflicting objective functions to deal with
both failed and passed builds. Results: We evaluated
our approach on a benchmark of 56,019 builds from 10
large-scale and long-lived software projects that use
the Travis CI build system. The statistical results
reveal that our approach outperforms the
state-of-the-art techniques based on machine learning
by providing a better balance between both failed and
passed builds. Furthermore, we use the generated
prediction rules to investigate which factors impact
the CI build results, and found that features related
to (1) specific statistics about the project such as
team size, (2) last build information in the current
build and (3) the types of changed files are the most
influential to indicate the potential failure of a
given build. Conclusion: This paper proposes a
multi-objective search-based approach for the problem
of CI build failure prediction. The performances of the
models developed using our MOGP approach were
statistically better than models developed using
machine learning techniques. The experimental results
show that our approach can effectively reduce both
false negative rate and false positive rate of CI build
failures in highly imbalanced datasets",
- }
Genetic Programming entries for
Islem Saidani
Ali Ouni
Moataz Chouchen
Mohamed Wiem Mkaouer
Citations