Development of interpretable, data-driven plasticity models with symbolic regression

doi:10.1016/j.compstruc.2021.106557

Computers & Structures

Volume 252, August 2021, 106557

https://doi.org/10.1016/j.compstruc.2021.106557 Get rights and content

Highlights

•
Symbolic regression is used to produce data-driven plasticity models in the form of yield equations.
•
Tests are performed to verify a known plasticity model can be reproduced from response data with 95% robustness.
•
A new, interpretable porous plasticity model is produced from response data of a representative volume element.

Abstract

In many applications, such as those which drive new material discovery, constitutive models are sought that have three characteristics: (1) the ability to be derived in automatic fashion with (2) high accuracy and (3) an interpretable nature. Traditionally developed models are usually interpretable but sacrifice development time and accuracy. Purely data-driven approaches are usually fast and accurate but lack interpretability. In the current work, a framework for the rapid development of interpretable, data-driven constitutive models is pursued. The approach is characterized by the use of symbolic regression on data generated with micromechanical finite element models. Symbolic regression is the search for equations of arbitrary functional form which match a given dataset. Specifically, an implicit symbolic regression technique is developed to identify a plastic yield potential from homogenized finite element response data. Through three controlled test cases of varying complexity, the approach is shown to successfully produce interpretable plasticity models. The controlled test cases are used to investigate the robustness and scalability of the method and provide reasonable recommendations for more complex applications. Finally, the recommendations are used in the application of the method to produce a porous plasticity model from data corresponding to a representative volume element of voids within a metal matrix.

Introduction

Though occasionally being criticized as less axiomatic than balance laws (e.g., mass, momentum), the development of constitutive models has helped enable the use of new materials and alloys. Specifically, elastoplastic constitutive models based on Tresca [1], Mises [2], and Hill [3] yield functions have stood as the bedrock for engineering analyses over the years. These three models, specifically focusing on plastic deformation, have reached this status because they posses two qualities: (1) they are accurate and tunable enough to be predictive, and (2) they are interpretable, allowing for good judgment in application and extension.

In contrast to the above constitutive models, which are rooted in physics and extended to fit observations, there has been a trend toward data-driven constitutive model development. Generally speaking these models embrace the notion that accuracy and predictive capability are more important then having a sound understanding of the physics involved. Neural network based constitutive models [4], [5], [6] are one example of this paradigm, wherein a neural network model is trained to predict a response (e.g., stress) as a function of load (e.g., deformation). The resulting neural net is often referred to as a black-box owing to its lack of physical insightfullness and interpretability. Another extreme of the data-driven paradigm is the recent work in data-driven computational mechanics which removes the idea of constitutive models altogether and use data directly for the constitutive relationship [7], [8], [9]. This method, by definition, is a black-box and offers no more insight into the constitutive relation than the underlying data itself.

There have also been examples of data-driven constitutive models that fall somewhere in the middle of the interpretablity spectrum. For instance, the work of Soare et al. [10], Soare and Barlat [11] that uses relatively opaque, higher-order polynomials as a plasticity model; however, the derivation and fitting of the coefficients preserves a well known aspect of plastic yield functions: convexity. Another example of partially interpretable constitutive models are based on hybrid modeling [e.g. 12], wherein the modeling error of a classical physics-based model is corrected using a black-box model.

Physics-informed machine learning is another paradigm that combines interpretable aspects of physics with black-box modeling [13], [14], [15]. In this variant of machine learning, physics are incorporated into the objective function for training of a black-box model; either in the form of a residual of a partial differential equation [14] or the weak form of the equivalent boundary value problem [15]. Because the approach in the current work also uses physics in training, it can be viewed as a physics-informed machine learning method; however, there are two key differences between the current work and other physics-informed machine learning works. Firstly, in contrast to hidden physics models [13] where coefficients in a fixed equation are found, a model is sought with arbitrary functional form. And secondly, the models that are produced in the current work are interpretable (in the form of relatively simple equations).

In many – if not most – cases the use of black-box models causes no issues, so long as the model is predictive. There are, however, advantages to interpretable plasticity models. Most importantly, interpretable constitutive models (1) can help drive knowledge discovery (2) may be used more confidently outside the range of their training data and (3) are usually easier to implement within current computational mechanics frameworks.

A common tool for the construction of interpretable, data-driven models is symbolic regression (SR). SR is a procedure for rapidly finding equations that fit a given dataset. Owing to recent advances in scalability, robustness, and efficiency [16], [17], [18], [19], [20], SR has reached maturity such that it can be used in many complex, real-wold applications. For example, Schmidt and Lipson [21] developed a method for performing implicit SR to discover conservation laws from experimental data. The focus of the current paper is the application of SR to the development of interpretable plasticity models. The two primary contributions of the current paper are (1) the formulation of the plasticity modeling problem as an implicit symbolic regression problem and (2) a framework for obtaining the relevant training data from micromechanical simulation.

The topic of using SR in the development of constitutive models has been investigated before. One early example is the work of Schoenauer et al. [22], which developed a three-dimensional hyperelastic model in terms of a strain energy function. The model had limited success for two reasons. First, inserting physical constraints upon the possible strain energy function proved difficult. Second, the presence of real-valued constants within the generated strain energy expressions caused difficulty in the SR optimization. The presence of real-valued constants have been an obstacle for the general SR community [23], [24]. Since the time of Schoenauer et al. [22], techniques have been developed to address the issue, such as restriction of the constants to coefficients of linearly combined SR terms (which allows for regression to obtain best-fit constants) [25], [26] as well as embedded nonlinear optimization of constants [27]. The latter is more computationally intensive but does not restrict the functional form obtained with SR.

For the most part, SR has been used in plasticity model development as a tool for modifying sub-components of a traditionally developed model. Examples include parametrically homogenized constitutive models, which use SR as a means of tying tunable parameters in a yield function to microstructural features [28], [29], [30]. Another example is the work of Versino et al. [31] who used SR in the creation of a flow stress model as part of a Mises ( $J_{2}$ ) plasticity framework. They worked with relatively sparse experimentally-based datasets, because designing and performing experiments that could capture the needed data was costly and difficult. They were able to overcome the difficulty by supplementing their datasets with synthetic data points which represented their expert knowledge. Nonetheless, generation of significant quantities of relevant experimental data can be a challenge. For some applications there are high data-output experimental techniques [32], [33], [34] to feed data-hungry SR; however, in the current work an alternative approach is taken wherein data for SR is derived from micromechanical finite element (FE) simulations. The primary benefit is that simulation data is usually easier to generate than experimental data. Because the data is more plentiful, more complex relationships can be investigated. The underlying assumption of such an approach is that the micromechanical model accurately represents the material of interest.

In the current work, a framework is developed for using SR in the formulation of plastic yield functions based on the homogenized responses of micromechanical FE models. Details involved in the construction of the micromechanical FE models, along with the synthesis of their response data with SR, are covered in Section 2. The approach is then tested in three test cases (Sections 3 Test cases, 4 Test case results) illustrating its ability to generate interpretable and accurate plasticity models. Section 5 then discusses some of the key aspects that affect the accuracy and robustness of the method. Using the insights produced in Section 5, the method is applied to the problem of porous plasticity in Section 6. The major findings of this work are summarized in Section 7.

Section snippets

Approach

The approach taken in the current work is a type of automatic multi-scale computational homogenization. Multi-scale computational homogenization is characterized by the coarse-graining of the response of a representative volume element (RVE) to a larger scale [e.g., [35], [36], [37]]. It requires that the RVE be well-characterized and that the scale of the RVE is significantly separated from the scale where the homogenized constitutive model is used (see the work of Geers et al. [38] for more

Test cases

The above approach is illustrated in three verification test cases. In all test cases, a single-element FE model with an isotropic elastic–plastic material model is used as a material point evaluator (MPE, see Fig. 2). A known plasticity model is prescribed to the element and then the model is subjected to a mechanical load. The effectiveness of the above approach is gauged by its ability to recover the prescribed plasticity model from the load-response data of the MPE.

The elastic portion of

Test case results

All SR test cases were repeated 20 times to ensure robustness and repeatability. In each SR run the solution populations are evolved until convergence. Convergence is defined as coming within 5% of the fitness value of the target (true) equation. In the more general circumstance where the target equation is not known a priori, convergence would instead need to be defined in a relative manner. In such a case, the use of a train/test split of the data could be used to quantify the performance of

Robustness and efficiency of test cases

Application of the proposed approach for plasticity model development on higher fidelity material RVEs and more complex yield functions will involve increased computational costs. For this reason, it is important to investigate how the robustness and efficiency of the SR-based approach are affected by computational resources and expected model complexity. The following sections include studies of such relationships.

Application to porous plasticity

The above approach is applied to the development of a constitutive model for porous plasticity in this section. This application represents a realistic scenario characterized by (1) the generation of response data from an inhomogeneous RVE, (2) a much more complex system, and (3) a yield function without an expected true result. The results of the current method are compared to two well-known porous plasticity models, the Gurson model [53] and the Gurson-Tvergaard-Needleman (GTN) model [54].

Conclusions

A framework has been presented in this work for the use of symbolic regression (SR) in the development of interpretable, data-driven plasticity models. The framework is a type of automatic computational homogenization wherein load-response data from RVE simulations of a material system are used to develop a homogenized model with SR. The framework was successful in three test cases, delivering equations which are readily interpretable in an automatic fashion. The successful test cases

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

This work was conducted with support of NASA’s Transformative Aeronautics Concepts Program as part of the Convergent Aeronautics Solutions Project and the Transformative Tools and Technology Project.

J.M Emery and J.D. Hochhalter (partially) were supported for this work by Sandia National Laboratories. Sandia National Laboratories is a multimission laboratory managed and operated by National Technology & Engineering Solutions of Sandia, L.L.C., a wholly owned subsidiary of Honeywell

References (55)

Sungmoon Jung et al.
Neural network constitutive model for rate-dependent materials
Comput Struct
(2006)
Trenton Kirchdoerfer et al.
Data-driven computational mechanics
Comput Methods Appl Mech Eng
(2016)
Francisco Chinesta et al.
Jose Vicente Aguado, Emmanuelle Abisset-Chavanne, and Elias Cueto. Data-driven computational plasticity
Proc Eng
(2017)
Robert Eggersmann et al.
Model-free data-driven inelasticity
Comput Methods Appl Mech Eng
(2019)
Stefan Soare et al.
On the use of homogeneous polynomials to develop anisotropic yield functions with applications to sheet forming
Int J Plast
(2008)
Stefan Soare et al.
Convex polynomial yield functions
J Mech Phys Solids
(2010)
Maziar Raissi et al.
Hidden physics models: Machine learning of nonlinear partial differential equations
J Comput Phys
(2018)
Maziar Raissi et al.
Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations
J Comput Phys
(2019)
Esteban Samaniego et al.
An energy approach to the solution of partial differential equations in computational mechanics via machine learning: Concepts, implementation and applications
Comput Methods Appl Mech Eng
(2020)
Shravan Kotha et al.
Parametrically homogenized constitutive models (phcms) from micromechanical crystal plasticity fe simulations, part i: Sensitivity analysis and parameter identification for titanium alloys
Int J Plast
(2019)

Deniz Ozturk et al.

Two-way multi-scaling for predicting fatigue crack nucleation in titanium alloys using parametrically homogenized constitutive models

J Mech Phys Solids

(2019)

Shravan Kotha et al.

Parametrically homogenized constitutive models (phcms) from micromechanical crystal plasticity fe simulations: Part ii: Thermo-elasto-plastic model with experimental validation for titanium alloys

Int J Plast

(2019)

Daniele Versino et al.

Data driven modeling of plastic deformation

Comput Methods Appl Mech Eng

(2017)

G.F. Bomarito et al.

Micromechanical investigation of ductile failure in Al 5083–H116 via 3D unit cell modeling

J Mech Phys Solids

(2015)

G.F. Bomarito et al.

Predicting the ductile failure of Al5083-H116 specimens with a mechanistic model and no free fitting parameters

Int J Solids Struct

(2017)

Marc G.D. Geers et al.

Multi-scale computational homogenization: Trends and challenges

J Comput Appl Math

(2010)

Viggo Tvergaard et al.

Analysis of the cup-cone fracture in a round tensile bar

Acta Metall

(1984)

Yuanli Bai et al.

A new model of metal plasticity and fracture with pressure and lode dependence

Int J Plasticity

(2008)

Henri É Tresca. Mémoire sur l’écoulement des corps solides soumis à de fortes pressions. Gauthier-Villars, 1864. URL...

Richard von Mises

Mechanik der festen körper im plastisch-deformablen zustand. Nachrichten von der Gesellschaft der Wissenschaften zu Göttingen

Mathematisch-Physikalische Klasse

(1913)

Rodney Hill

A theory of the yielding and plastic flow of anisotropic metals

Proc R Soc Lond A

(1948)

Rami Haj-Ali et al.

Simulated micromechanical models using artificial neural networks

J Eng Mech

(2001)

A.A. Javadi et al.

Neural network for constitutive modelling in finite element analysis

Comput Assisted Mech Eng Sci

(2003)

Rubén Ibáñez et al.

Hybrid constitutive modeling: data-driven learning of corrections to plasticity models

Int J Mater Form

(2019)

Jianjun Hu et al.

Continuous hierarchical fair competition model for sustainable innovation in genetic programming

Francisco Fernández et al.

An empirical study of multipopulation genetic programming

Genet Program Evolvable Mach

(2003)

Michael D. Schmidt et al.

Coevolution of fitness predictors

IEEE Trans Evol Comput

(2008)

Cited by (40)

Sparsity promoting algorithm for identification of nonlinear dynamic system based on Unscented Kalman Filter using novel selective thresholding and penalty-based model selection
2024, Mechanical Systems and Signal Processing
Identifying a nonlinear dynamic systems’ governing equation is crucial for many engineering applications, and yet a challenging task. In this study, the system’s dynamics are represented using a library of functions, and an Unscented Kalman Filter-based two-stage algorithm is proposed to identify true functions from the library. The algorithm promotes a sparse solution using the novel selective thresholding and penalty-based model selection performed within groups of correlated functions. In the first stage, selective thresholding is proposed which removes functions from the library which have no contribution to the dynamics of the system, but at the same time, it retains other functions without falsely eliminating them. In the second stage, groups of correlated functions are identified, and within each group, penalty-based model selection is carried out using the brute force method. The purpose of forming groups of correlated functions is to make the brute force method computationally feasible by decreasing the total number of models to compare. The number of models is decreased by a few orders when the groups are formed. From each group the best-performing model is selected and based on the functions those models contain, a sparse representation of the governing equation of motion is found. The algorithm was tested on a numerical simulation of a 5-story shear building with cubic nonlinearity in the column stiffness and on real data from a base-isolated University of Southern California (USC) hospital building subjected to the 1994 Northridge earthquake. The results from both cases affirm that the proposed algorithm can be used to obtain a sparse solution of the governing equation of motion of a nonlinear dynamic system which is described using a dictionary of functions.
Using machine learning to find exact analytic solutions to analytically posed physics problems
2024, Heliyon
We investigate the use of machine learning for solving analytic problems in theoretical physics. In particular, symbolic regression (SR) is making rapid progress in recent years as a tool to fit data using functions whose overall form is not known in advance. Assuming that we have a mathematical problem that is posed analytically, e.g. through equations, but allows easy numerical evaluation of the solution for any given set of input variable values, one can generate data numerically and then use SR to identify the closed-form function that describes the data, assuming that such a function exists. In addition to providing a concise way to represent the solution of the problem, such an obtained function can play a key role in providing insight and allow us to find an intuitive explanation for the studied phenomenon. We use a state-of-the-art SR package to demonstrate how an exact solution can be found and make an attempt at solving an unsolved physics problem. We use the Landau-Zener problem and a few of its generalizations as examples to motivate our approach and illustrate how the calculations become increasingly complicated with increasing problem difficulty. Our results highlight the capabilities and limitations of the presently available SR packages, and they point to possible modifications of these packages to make them better suited for the purpose of finding exact solutions as opposed to good approximations. Our results also demonstrate the potential for machine learning to tackle analytically posed problems in theoretical physics.
Discovering interpretable elastoplasticity models via the neural polynomial method enabled symbolic regressions
2024, Computer Methods in Applied Mechanics and Engineering
Conventional neural network elastoplasticity models are often perceived as lacking interpretability. This paper introduces a two-step machine learning approach that returns mathematical models interpretable by human experts. In particular, we introduce a surrogate model where yield surfaces are expressed in terms of a set of single-variable feature mappings obtained from supervised learning. A post-processing step is then used to re-interpret the set of single-variable neural network mapping functions into mathematical form through symbolic regression. This divide-and-conquer approach provides several important advantages. First, it enables us to overcome the scaling issue of symbolic regression algorithms. From a practical perspective, it enhances the portability of learned models for partial differential equation solvers written in different programming languages. Finally, it enables us to have a concrete understanding of the attributes of the materials, such as convexity and symmetries of models, through automated derivations and reasoning. Numerical examples have been provided, along with an open-source code to enable third-party validation at https://github.com/bbhm-90/SymPolyNN.
Interpretable machine learning for microstructure-dependent models of fatigue indicator parameters
2024, International Journal of Fatigue
Fatigue indicator parameters (FIPs) are typically computed using crystal plasticity finite element modeling (CPFEM) and used to predict microscale crack initiation. While informative, computing FIPs in this manner can limit their application in engineering use cases due to the computational demand of CPFEM. To address this limitation, an interpretable machine learning approach is developed and used to model FIPs in additive manufactured IN625 single-phase microstructures. Genetic programming based symbolic regression is used to evolve inherently interpretable expressions of FIPs from microstructure features. Once developed, these FIP models act as an efficient surrogate for CPFEM and, due to their symbolic representation, can be readily combined with engineering workflows.
Complementing a continuum thermodynamic approach to constitutive modeling with symbolic regression
2023, Journal of the Mechanics and Physics of Solids
An interpretable machine learning method, physics-informed genetic programming-based symbolic regression (P-GPSR), is integrated into a continuum thermodynamic approach to developing constitutive models. The proposed strategy for combining a thermodynamic analysis with P-GPSR is demonstrated by generating a yield function for an idealized material with voids, i.e., the Gurson yield function. First, a thermodynamic-based analysis is used to derive model requirements that are exploited in a custom P-GPSR implementation as fitness criteria or are strongly enforced in the solution. The P-GPSR implementation improved accuracy, generalizability, and training time compared to the same GPSR code without physics-informed fitness criteria. The yield function generated through the P-GPSR framework is in the form of a composite function that describes a class of materials and is characteristically more interpretable than GPSR-derived equations. The physical significance of the input functions learned by P-GPSR within the composite function is acquired from the thermodynamic analysis. Fundamental explanations of why the implemented P-GPSR capabilities improve results over a conventional GPSR algorithm are provided.
Data-driven modelling of the multiaxial yield behaviour of nanoporous metals
2023, International Journal of Mechanical Sciences
Nanoporous metals, built out of complex ligament networks, can be produced with an additional level of hierarchy. The resulting complexity of the structure makes modelling of the mechanical behaviour computationally expensive and time consuming. In addition, multiaxial stresses occur in the higher hierarchy ligaments. Therefore, knowledge of the multiaxial material behaviour, including the 6D yield surface, is required. Surrogate models, predicting the mechanical behaviour of the lower level of hierarchy, represented by finite element beam models, are a promising approach to overcome such challenges, when existing analytical models are not able to describe the material behaviour. Therefore, as a first step, we studied the elastic behaviour and the yield surfaces of representative volume elements with idealised diamond and Kelvin structure in finite element simulations. The yield surfaces showed pronounced anisotropy and could not be described by the Deshpande-Fleck model for isotropic solid foams. Instead, we used data-driven and hybrid artificial neural networks, as well as data-driven support vector machines and compared them regarding their potential for the prediction of yield surfaces. All considered methods were well suited and resulted in relative errors $< 4.5 %$ . Support vector machines showed the best generalisation and accuracy in 6D stress space and are suitable for extrapolation outside the range of training data.

View all citing articles on Scopus

View full text

Published by Elsevier Ltd.

Development of interpretable, data-driven plasticity models with symbolic regression

Highlights

Abstract

Introduction

Section snippets

Approach

Test cases

Test case results

Robustness and efficiency of test cases

Application to porous plasticity

Conclusions

Declaration of Competing Interest

Acknowledgments

Comput Struct

Comput Methods Appl Mech Eng

Proc Eng

Comput Methods Appl Mech Eng

Int J Plast

J Mech Phys Solids

J Comput Phys

J Comput Phys

Comput Methods Appl Mech Eng

Int J Plast

J Mech Phys Solids

Int J Plast

Comput Methods Appl Mech Eng

J Mech Phys Solids

Int J Solids Struct

J Comput Appl Math

Acta Metall

Int J Plasticity

Mechanik der festen körper im plastisch-deformablen zustand. Nachrichten von der Gesellschaft der Wissenschaften zu Göttingen

Mathematisch-Physikalische Klasse

A theory of the yielding and plastic flow of anisotropic metals

Proc R Soc Lond A

Simulated micromechanical models using artificial neural networks

J Eng Mech

Neural network for constitutive modelling in finite element analysis

Comput Assisted Mech Eng Sci

Hybrid constitutive modeling: data-driven learning of corrections to plasticity models

Int J Mater Form

Continuous hierarchical fair competition model for sustainable innovation in genetic programming

An empirical study of multipopulation genetic programming

Genet Program Evolvable Mach

Coevolution of fitness predictors

IEEE Trans Evol Comput