Skip to main content
Log in

Exploring the use of multi-gene genetic programming in regional models for the simulation of monthly river runoff series

  • ORIGINAL PAPER
  • Published:
Stochastic Environmental Research and Risk Assessment Aims and scope Submit manuscript

Abstract

The use of new data-driven approaches based on the so-called expert systems to simulate runoff generation processes is a promising frontier that may allow for overcoming some modeling difficulties related to more complex traditional approaches. The present study highlights the potential of expert systems in creating regional hydrological models, for which they can benefit from the availability of large database. Different soft computing models for the reconstruction of the monthly natural runoff in river basins are explored, focusing on a new class of heuristic models, which is the Multi-Gene Genetic Programming (MGGP). The region under study is Sicily (Italy), where a regression based rainfall-runoff model, here used as benchmark model, was previously built starting from the analysis of a regional database relative to several gauged watersheds across the region. In the present study, different models are created using the same dataset, including: six MGGPs generated considering different modeling set-up; a Multi-Layer Perceptron Artificial Neural Network (ANN); two new hybrid models (ANN-MGGP), combining a Classifier ANN and two MGGPs that simulate separately low and high runoff. Results show how all the soft computing models perform similarly and outperform the benchmark model, demonstrating that MGGP can be considered as a valid alternative to the much more consolidated ANN technique. The new introduced hybrid ANN-MGGP is the only model showing at least satisfactory performance (i.e. Nash–Sutcliffe Efficiency above 0.5) over the full range of 38 watersheds explored, representing a useful regional tool for reconstructing monthly runoff series also at ungauged sites.

Graphical abstract

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

Abbreviations

AdB:

Basin authority of Sicilian Region (Autorità di Bacino della Regione Sicilia)

ANN:

Artificial neural network

CN:

Curve number

DEM:

Digital elevation model

GA:

Genetic algorithm

GEP:

Gene expression programming

GIS:

Geographic information system

GP:

Genetic programming

HL:

Hidden layer

LGP:

Linear genetic programming

PDRMSE :

Percent difference in RMSE for soft computing models with respect to the benchmark model

NSE:

Nash–Sutcliffe efficiency

MGGP:

Multi-gene genetic programming

OL:

Output layer

QGIS:

Quantum GIS

SCS:

Soil conservation service

SM:

Supplementary material

SMA:

Simple moving average

sub-ET:

Sub-expression trees

RMSE:

Root mean squared error

RMSEBM :

Root mean squared error for the benchmark model

RMSESC :

Root mean squared error for soft computing model

Tri.Mo.Ti.S.:

Trinacria model for monthly time series

References

Download references

Acknowledgements

The authors thank anonymous reviewers for their helpful suggestions on the quality improvement of the present paper.

Funding

The authors declare that no funds, grants, or other support were received during the preparation of this manuscript.

Author information

Authors and Affiliations

Authors

Contributions

DP is the first and corresponding author. Both authors contributed to the study conception and design. Material preparation, data collection and analysis were performed by DP. The first draft of the manuscript was written by DP and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Dario Pumo.

Ethics declarations

Conflict of interest

The authors have no relevant financial or non-financial interests to disclose.

Data availability

Data used in this article can be found, on request, at the website of the Basin Authority of Sicilian Region (Autorità di Bacino della Regione Sicilia) through the following link: https://www.regione.sicilia.it/istituzioni/regione/strutture-regionali/presidenza-regione/autorita-bacino-distretto-idrografico-sicilia. Data can also be freely visualized at the website of ISPRA (Istituto Superiore per la Protezione e la Ricerca Ambientale, https://www.isprambiente.gov.it).

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 4053 kb)

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Pumo, D., Noto, L.V. Exploring the use of multi-gene genetic programming in regional models for the simulation of monthly river runoff series. Stoch Environ Res Risk Assess 37, 1917–1941 (2023). https://doi.org/10.1007/s00477-022-02373-1

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00477-022-02373-1

Keywords

Navigation