New Hybrid Approach for Developing Automated Machine Learning Workflows: A Real Case Application in Evaluation of Marcellus Shale Gas Production
Created by W.Langdon from
gp-bibliography.bib Revision:1.8051
- @Article{pham:2021:Fuels,
-
author = "Vuong {Van Pham} and Ebrahim Fathi and
Fatemeh Belyadi",
-
title = "New Hybrid Approach for Developing Automated Machine
Learning Workflows: A Real Case Application in
Evaluation of Marcellus Shale Gas Production",
-
journal = "Fuels",
-
year = "2021",
-
volume = "2",
-
number = "3",
-
keywords = "genetic algorithms, genetic programming, TPOT",
-
ISSN = "2673-3994",
-
URL = "https://www.mdpi.com/2673-3994/2/3/17",
-
DOI = "doi:10.3390/fuels2030017",
-
abstract = "The success of machine learning (ML) techniques
implemented in different industries heavily rely on
operator expertise and domain knowledge, which is used
in manually choosing an algorithm and setting up the
specific algorithm parameters for a problem. Due to the
manual nature of model selection and parameter tuning,
it is impossible to quantify or evaluate the quality of
this manual process, which in turn limits the ability
to perform comparison studies between different
algorithms. In this study, we propose a new hybrid
approach for developing machine learning workflows to
help automated algorithm selection and hyperparameter
optimisation. The proposed approach provides a robust,
reproducible, and unbiased workflow that can be
quantified and validated using different scoring
metrics. We have used the most common workflows
implemented in the application of artificial
intelligence (AI) and ML in engineering problems
including grid/random search, Bayesian search and
optimisation, genetic programming, and compared that
with our new hybrid approach that includes the
integration of Tree-based Pipeline Optimisation Tool
(TPOT) and Bayesian optimisation. The performance of
each workflow is quantified using different scoring
metrics such as Pearson correlation (i.e., R2
correlation) and Mean Square Error (i.e., MSE). For
this purpose, actual field data obtained from 1567 gas
wells in Marcellus Shale, with 121 features from
reservoir, drilling, completion, stimulation, and
operation is tested using different proposed workflows.
A proposed new hybrid workflow is then used to evaluate
the type well used for evaluation of Marcellus shale
gas production. In conclusion, our automated hybrid
approach showed significant improvement in comparison
to other proposed workflows using both scoring
matrices. The new hybrid approach provides a practical
tool that supports the automated model and
hyperparameter selection, which is tested using real
field data that can be implemented in solving different
engineering problems using artificial intelligence and
machine learning. The new hybrid model is tested in a
real field and compared with conventional type wells
developed by field engineers. It is found that the type
well of the field is very close to P50 predictions of
the field, which shows great success in the completion
design of the field performed by field engineers. It
also shows that the field average production could have
been improved by 8percent if shorter cluster spacing
and higher proppant loading per cluster were used
during the frac jobs.",
-
notes = "also known as \cite{fuels2030017}",
- }
Genetic Programming entries for
Vuong Van Pham
Ebrahim Fathi
Fatemeh Belyadi
Citations