Stratified analysis of the age-related waist circumference cut-off model for the screening of dysglycemia at zero-cost

doi:10.1016/j.obmed.2022.100398

Obesity Medicine

Volume 31, May 2022, 100398

https://doi.org/10.1016/j.obmed.2022.100398 Get rights and content

Highlights

•
Novel artificial intelligence methods can simplify the screening of dysglycemia.
•
An age-related waist circumference cut-off model to detect dysglycemia.
•
Stratified analysis in sex, race and glycemia group maintaining good performance.
•
Good calibration of the model to detect dysglycemia.
•
Model is ideal to serve as zero-cost large-scale dysglycemia screening tool.

Abstract

Aims

We perform a stratified analysis of the recently published age-related waist circumference cut-off model to validate its performance in the screening of dysglycemia in the US population.

Methods

We use NHANES data as representative of the US population. Data were subdivided into sex, ethnic and glycemia groups. We evaluate the performance of the model separately in each group through the AUC. We also discuss the calibration of the model.

Results

For the sex-stratified analysis, we obtain AUC = 0.69–0.71 (95% C.I.) for male individuals and AUC = 0.75–0.78 (95% C.I.) for female individuals. The stratified analysis is performed in different ethnic groups, namely “Mexican American”, “Other Hispanic”, “Non-Hispanic White”, “Non-Hispanic Black” and “Other Race – Including Multi-Racial. We obtain, respectively, AUC = 0.74–0.75, AUC = 0.76–0.78, AUC = 0.73–0.75, AUC = 0.74–0.77 and AUC = 0.71–0.73 (95% C.I.). The model achieves AUC = 0.70–0.73 (95% C.I.) in the identification of individuals with prediabetes and AUC = 0.70–0.80 (95% C.I.) in the identification of individuals with diabetes.

Conclusions

The accuracy of the model turns out to be similar in each group considered in the stratified analysis, indicating that the model is suitable to be used as a screening tool for dysglycemia in the US population.

Introduction

The importance of developing screening models to identify subjects with early dysglycemia is an important task to prevent the development of manifest diabetes (Siu and U.S. Preventive Services Task Force, 2015; American Diabetes Association, 2020), to reduce the incidence of several fatal and nonfatal diabetes complication (Lindstrom and Tuomilehto, 2003) and to save the enormous costs of full-blown diabetic disease to the health system (Centers for Disease Control and Prevention, 2020).

Our recently published dysglycemia model (Buccheri et al., 2021) fits into this framework. The model, developed through the support of machine learning (ML) software Brain Project (BP) (Russo, 2016, 2020), predicts the dysglycemia status in a sample of the American adult population through the exclusive use of two zero-cost variables that do not require laboratory tests: waist circumference and age of an individual. As pointed out in Ref. (Buccheri et al., 2021), waist circumference is usually related to the visceral fat of an individual, thus testifying the existence of a strong correlation between obesity and dysglycemia. This result agrees with other important evidence previously published in the literature about genetic predisposition of developing diabetes in patients affected by obesity (Sheikhpour et al., 2020) and the major risk of diabetes and cardiovascular disease in the various obesity types (e.g. central obesity, visceral obesity, etc.) (Miklishanskaya et al., 2021).

Despite its simplicity, the model has similar performance than other complex models previously developed. Therefore, it is ideal to perform large scale screening of dysglycemia in the adult US population (Buccheri et al., 2021). The original model, which in turn consisted in an age-related waist circumference cut-off, was so far validated exclusively on a dataset representative of the general US population. However, its detailed analysis in different sex and ethnic groups, which can be considered intrinsic factors in the genesis of dysglycemia (American Diabetes Association, 2020), is an important step to fully validate the model. For example, it is known that risk to develop dysglycemia in female individuals increases more with age than for male individuals (Huebschmann et al., 2019) as well as certain ethnic groups have a greater genetic predisposition to have an altered blood sugar than others (Gurka et al., 2013).

The paper investigates the stratified performance of the model in different sex and ethnic groups of the US population, which is extremely important in the genesis of a zero-cost and easy-to-use model. We also evaluate the predictive performance individually on prediabetes and diabetes and discuss the calibration of the model. The present work, in turn, complements the findings of our previous work (Buccheri et al., 2021) through a stratified analysis of the model accuracy.

Section snippets

Study population (NHANES, 2007–2016)

The population used to test the model was derived from the data obtained in successive NHANES over 10 years (2007–2016). NHANES is a continuous collection of cross-sectional data conducted by the National Center for Health Statistics in the Center for Disease Control and Prevention (CDC). NHANES is a representation of US population through surveys with the typical and various ethnics US characteristics. NHANES reports the health and nutritional status of adults and children in the US, thus

Results

We have evaluated the performance of the model (Buccheri et al., 2021) separately for female and male individuals. Results of this sex-stratified analysis are described by the ROC curves shown in Fig. 1. We obtained AUC = 0.69–0.71 (95% C.I.) for male individuals and AUC = 0.75–0.78 (95% C.I.) for female individuals. The optimal trade-off between sensitivity and specificity, obtained with the procedure described in (Buccheri et al., 2021), turned out to be fairly similar for both sex groups,

Discussion

The results of the sub-analysis of the model highlighted various aspects that are certainly of scientific interest.

The accuracy of our model even in the lower-accuracy sex-group is still comparable to the overall accuracy achieved by state-of-the-art models (American Diabetes Association, 2020; Lindstrom and Tuomilehto, 2003). In turn, one could speculate that the reason of the slightly different accuracy in different sex-groups is possibly due to the weaker correlation of non-laboratory

Conclusions

In conclusion, the early identification of individuals with dysglycemia status is key to prevent type 2 diabetes as well as several related fatal and nonfatal complications. To help in the prevention of type 2 diabetes in the US population, we published a new model very simple and ideal to be used to perform large-scale zero-cost screening of dysglycemia. The model was previously validated exclusively on the general US population, thus not allowing to have a complete picture of its performance.

CRediT authorship contribution statement

Enrico Buccheri: Original idea, literature search, Study design, Data collection, Data interpretation, Writing – original draft. Daniele Dell’Aquila: Original idea, literature search, Study design, Data collection, Formal analysis, Data interpretation, Writing – original draft. Marco Russo: Original idea, Study design, Data collection, Formal analysis, Data interpretation, Writing - critical review.

Declarations of competing interest

None.

Acknowledgments

Data used in this study were collected by the National Health and Nutrition Examination Survey (NHANES) and they are free and publicly available on the National Center for Health Statistics of the Centers for Disease Control and Prevention (CDC) website. D.D. acknowledges funding support from the Italian Ministry of Education, University and Research (MIUR) through the “PON Ricerca e Innovazione 2014–2020, Azione I.2 A.I.M., D.D. 407/2018”.

References (23)

E. Buccheri et al.
Artiﬁcial intelligence in health data analysis: the Darwinian evolution theory suggests an extremely simple and zero-cost large-scale screening tool for prediabetes and type 2 diabetes
Diabetes Res. Clin. Pract.
(2021)
G. Campobello et al.
Neuro-genetic programming for multigenre classification of music content
Appl. Soft Comput.
(2020)
R.R. Kalyani et al.
Age-related and disease-related muscle loss: the effect of diabetes, obesity, and other diseases
Lancet Diabetes Endocrinol
(2014)
S.V. Miklishanskaya et al.
Types of obesity and their prognostic value
Obesity Med.
(2021)
M. Russo
A distributed neuro-genetic programming tool
Swarm. Evol. Comput.
(2016)
M. Russo et al.
Genetic programming for photovoltaic plant output forecasting
Sol. Energy
(2014)
M. Sheikhpour et al.
The Interaction between gene profile and obesity in type 2 diabetes: a review
Obesity Med.
(2020)
American Diabetes Association
2. Classification and diagnosis of diabetes: standards of medical care in diabetes–2020
Diabetes Care
(2020)
Centers for Disease Control and Prevention
National Diabetes Statistics Report
(2020)
Centers for Disease Control and Prevention. National Health and Nutrition Examination Survey....

K. De Silva et al.

A combined strategy of feature selection and machine learning to identify predictors of prediabetes

J. Am. Med. Inf. Assoc.

(2019)

Cited by (6)

Can artificial intelligence simplify the screening of muscle mass loss?
2023, Heliyon
Sarcopenia is a risk factor for morbidity and preventable mortality in old age, with consequent high costs for the national health system. Its diagnosis requires costly radiological examinations, such as the DEXA, which complicate screening in medical centers with a high prevalence of sarcopenia.
Developing a nearly zero-cost screening tool to emulate the performance of DEXA in identifying patients with muscle mass loss. This can crucially help the early diagnosis of sarcopenia at large-scale, contributing to reduce its prevalence and related complications with timely treatments.
We exploit cross-sectional data for about 14,500 patients and 38 non-laboratory variables from successive NHANES over 7 years (1999–2006). Data are analyzed through a state-of-the-art artificial intelligence approach based on decision trees.
A reduced number of anthropometric parameters allows to predict the outcome of DEXA with AUC between 0.92 and 0.94. The most complex model derived in this paper exploits 6 variables, related to the circumference of key corporal segments and to the evaluation of body fat. It achieves an optimal trade-off sensitivity of 0.89 and a specificity of 0.82. Restricting exclusively to variables related to lower limb, we obtain an even simpler tool with only slightly lower accuracy (AUC 0.88–0.90).
Anthropometric data seem to contain the entire informative content of a more complex set of non-laboratory variables, including anamnestic and/or morbidity factors. Compared to previously published screening tools for muscle mass loss, the newly developed models are less complex and achieve a better accuracy. The new results might suggest a possible inversion of the standard diagnostic algorithm of sarcopenia. We conjecture a new diagnostic scheme, which requires a dedicated clinical validation that goes beyond the scope of the present study.
A novel multi-layer modular approach for real-time fuzzy-identification of gravitational-wave signals
2023, Machine Learning: Science and Technology
Nuclear physics midterm plan at Legnaro National Laboratories (LNL)
2023, European Physical Journal Plus
Understanding Heavy-ion Fusion Cross Section Data Using Novel Artificial Intelligence Approaches
2023, Journal of Physics: Conference Series
Can Artificial Intelligence Simplify the Screening of Muscle Mass Loss?
2022, SSRN
A Novel Multi-Layer Modular Approach for Real-Time Fuzzy-Identification of Gravitational-Wave Signals
2022, arXiv

View full text

Stratified analysis of the age-related waist circumference cut-off model for the screening of dysglycemia at zero-cost

Highlights

Abstract

Aims

Methods

Results

Conclusions

Introduction

Section snippets

Study population (NHANES, 2007–2016)

Results

Discussion

Conclusions

CRediT authorship contribution statement

Declarations of competing interest

Acknowledgments

Diabetes Res. Clin. Pract.

Appl. Soft Comput.

Lancet Diabetes Endocrinol

Obesity Med.

Swarm. Evol. Comput.

Sol. Energy

Obesity Med.

2. Classification and diagnosis of diabetes: standards of medical care in diabetes–2020

Diabetes Care

National Diabetes Statistics Report

A combined strategy of feature selection and machine learning to identify predictors of prediabetes

J. Am. Med. Inf. Assoc.