Predicting 2-year neurodevelopmental outcomes in extremely preterm infants using graphical network and machine learning approaches

PENUT Consortium

doi:10.1016/j.eclinm.2022.101782

Predicting 2-year neurodevelopmental outcomes in extremely preterm infants using graphical network and machine learning approaches

PENUT Consortium

Research output: Contribution to journal › Article › peer-review

Abstract

Background: Infants born extremely preterm (<28 weeks’ gestation) are at high risk of neurodevelopmental impairment (NDI) with 50% of survivors showing moderate or severe NDI when at 2 years of age. We sought to develop novel models by which to predict neurodevelopmental outcomes, hypothesizing that combining baseline characteristics at birth with medical care and environmental exposures would produce the most accurate model. Methods: Using a prospective database of 692 infants from the Preterm Epo Neuroprotection (PENUT) Trial, which was carried out between December 2013 and September 2016, we developed three predictive algorithms of increasing complexity using a Bayesian Additive Regression Trees (BART) machine learning approach to predict both NDI and continuous Bayley Scales of Infant and Toddler Development 3rd ed subscales at 2 year follow-up using: 1) the 5 variables used in the National Institute of Child Health and Human Development (NICHD) Extremely Preterm Birth Outcomes Tool, 2) 21 variables associated with outcomes in extremely preterm (EP) infants, and 3) a hypothesis-free approach using 133 potential variables available for infants in the PENUT database. Findings: The NICHD 5-variable model predicted 3–4% of the variance in the Bayley subscale scores, and predicted NDI with an area under the receiver operator curve (AUROC, 95% CI) of 0.62 (0.56–0.69). Accuracy increased to 12–20% of variance explained and an AUROC of 0.77 (0.72–0.83) when using the 21 pre-selected clinical variables. Hypothesis-free variable selection using BART resulted in models that explained 20–31% of Bayley subscale scores and AUROC of 0.87 (0.83–0.91) for severe NDI, with good calibration across the range of outcome predictions. However, even with the most accurate models, the average prediction error for the Bayley subscale predictions was around 14–15 points, leading to wide prediction intervals. Higher total transfusion volume was the most important predictor of severe NDI and lower Bayley scores across all subscales. Interpretation: While the machine learning BART approach meaningfully improved predictive accuracy above a widely used prediction tool (NICHD) as well as a model utilizing NDI-associated clinical characteristics, the average error remained approximately 1 standard deviation on either side of the true value. Although dichotomous NDI prediction using BART was more accurate than has been previously reported, and certain clinical variables such as transfusion exposure were meaningfully predictive of outcomes, our results emphasize the fact that the field is still not able to accurately predict the results of complex long-term assessments such as Bayley subscales in infants born EP even when using rich datasets and advanced analytic methods. This highlights the ongoing need for long-term follow-up of all EP infants. Funding: Supported by the National Institute of Neurological Disorders and Stroke U01NS077953 and U01NS077955.

Original language	English (US)
Article number	101782
Journal	EClinicalMedicine
Volume	56
DOIs	https://doi.org/10.1016/j.eclinm.2022.101782
State	Published - Feb 2023

Keywords

Extreme prematurity
Outcomes
neurodevelopmental impairment
prediction

ASJC Scopus subject areas

General Medicine

Access to Document

10.1016/j.eclinm.2022.101782

Cite this

@article{0c2c09ab3642497dafb098499d263323,

title = "Predicting 2-year neurodevelopmental outcomes in extremely preterm infants using graphical network and machine learning approaches",

abstract = "Background: Infants born extremely preterm (<28 weeks{\textquoteright} gestation) are at high risk of neurodevelopmental impairment (NDI) with 50% of survivors showing moderate or severe NDI when at 2 years of age. We sought to develop novel models by which to predict neurodevelopmental outcomes, hypothesizing that combining baseline characteristics at birth with medical care and environmental exposures would produce the most accurate model. Methods: Using a prospective database of 692 infants from the Preterm Epo Neuroprotection (PENUT) Trial, which was carried out between December 2013 and September 2016, we developed three predictive algorithms of increasing complexity using a Bayesian Additive Regression Trees (BART) machine learning approach to predict both NDI and continuous Bayley Scales of Infant and Toddler Development 3rd ed subscales at 2 year follow-up using: 1) the 5 variables used in the National Institute of Child Health and Human Development (NICHD) Extremely Preterm Birth Outcomes Tool, 2) 21 variables associated with outcomes in extremely preterm (EP) infants, and 3) a hypothesis-free approach using 133 potential variables available for infants in the PENUT database. Findings: The NICHD 5-variable model predicted 3–4% of the variance in the Bayley subscale scores, and predicted NDI with an area under the receiver operator curve (AUROC, 95% CI) of 0.62 (0.56–0.69). Accuracy increased to 12–20% of variance explained and an AUROC of 0.77 (0.72–0.83) when using the 21 pre-selected clinical variables. Hypothesis-free variable selection using BART resulted in models that explained 20–31% of Bayley subscale scores and AUROC of 0.87 (0.83–0.91) for severe NDI, with good calibration across the range of outcome predictions. However, even with the most accurate models, the average prediction error for the Bayley subscale predictions was around 14–15 points, leading to wide prediction intervals. Higher total transfusion volume was the most important predictor of severe NDI and lower Bayley scores across all subscales. Interpretation: While the machine learning BART approach meaningfully improved predictive accuracy above a widely used prediction tool (NICHD) as well as a model utilizing NDI-associated clinical characteristics, the average error remained approximately 1 standard deviation on either side of the true value. Although dichotomous NDI prediction using BART was more accurate than has been previously reported, and certain clinical variables such as transfusion exposure were meaningfully predictive of outcomes, our results emphasize the fact that the field is still not able to accurately predict the results of complex long-term assessments such as Bayley subscales in infants born EP even when using rich datasets and advanced analytic methods. This highlights the ongoing need for long-term follow-up of all EP infants. Funding: Supported by the National Institute of Neurological Disorders and Stroke U01NS077953 and U01NS077955.",

keywords = "Extreme prematurity, Outcomes, neurodevelopmental impairment, prediction",

author = "{PENUT Consortium} and Juul, {Sandra E.} and Wood, {Thomas R.} and Kendell German and Law, {Janessa B.} and Kolnik, {Sarah E.} and Mihai Puia-Dumitrescu and Ulrike Mietzsch and Semsa Gogcu and Comstock, {Bryan A.} and Sijia Li and Mayock, {Dennis E.} and Heagerty, {Patrick J.} and Rajan Wadhawan and Courtney, {Sherry E.} and Tonya Robinson and Ahmad, {Kaashif A.} and Ellen Bendel-Stenzel and Mariana Baserga and LaGamma, {Edmund F.} and Downey, {L. Corbin} and Raghavendra Rao and Nancy Fahim and Andrea Lampland and Frantz, {Ivan D.} and Janine Khan and Michael Weiss and Gilmore, {Maureen M.} and Nishant Srinivasan and Perez, {Jorge E.} and Victor McKay",

note = "Publisher Copyright: {\textcopyright} 2022 The Author(s)",

year = "2023",

month = feb,

doi = "10.1016/j.eclinm.2022.101782",

language = "English (US)",

volume = "56",

journal = "EClinicalMedicine",

issn = "2589-5370",

publisher = "Lancet Publishing Group",

}

TY - JOUR

T1 - Predicting 2-year neurodevelopmental outcomes in extremely preterm infants using graphical network and machine learning approaches

AU - PENUT Consortium

AU - Juul, Sandra E.

AU - Wood, Thomas R.

AU - German, Kendell

AU - Law, Janessa B.

AU - Kolnik, Sarah E.

AU - Puia-Dumitrescu, Mihai

AU - Mietzsch, Ulrike

AU - Gogcu, Semsa

AU - Comstock, Bryan A.

AU - Li, Sijia

AU - Mayock, Dennis E.

AU - Heagerty, Patrick J.

AU - Wadhawan, Rajan

AU - Courtney, Sherry E.

AU - Robinson, Tonya

AU - Ahmad, Kaashif A.

AU - Bendel-Stenzel, Ellen

AU - Baserga, Mariana

AU - LaGamma, Edmund F.

AU - Downey, L. Corbin

AU - Rao, Raghavendra

AU - Fahim, Nancy

AU - Lampland, Andrea

AU - Frantz, Ivan D.

AU - Khan, Janine

AU - Weiss, Michael

AU - Gilmore, Maureen M.

AU - Srinivasan, Nishant

AU - Perez, Jorge E.

AU - McKay, Victor

PY - 2023/2

Y1 - 2023/2

N2 - Background: Infants born extremely preterm (<28 weeks’ gestation) are at high risk of neurodevelopmental impairment (NDI) with 50% of survivors showing moderate or severe NDI when at 2 years of age. We sought to develop novel models by which to predict neurodevelopmental outcomes, hypothesizing that combining baseline characteristics at birth with medical care and environmental exposures would produce the most accurate model. Methods: Using a prospective database of 692 infants from the Preterm Epo Neuroprotection (PENUT) Trial, which was carried out between December 2013 and September 2016, we developed three predictive algorithms of increasing complexity using a Bayesian Additive Regression Trees (BART) machine learning approach to predict both NDI and continuous Bayley Scales of Infant and Toddler Development 3rd ed subscales at 2 year follow-up using: 1) the 5 variables used in the National Institute of Child Health and Human Development (NICHD) Extremely Preterm Birth Outcomes Tool, 2) 21 variables associated with outcomes in extremely preterm (EP) infants, and 3) a hypothesis-free approach using 133 potential variables available for infants in the PENUT database. Findings: The NICHD 5-variable model predicted 3–4% of the variance in the Bayley subscale scores, and predicted NDI with an area under the receiver operator curve (AUROC, 95% CI) of 0.62 (0.56–0.69). Accuracy increased to 12–20% of variance explained and an AUROC of 0.77 (0.72–0.83) when using the 21 pre-selected clinical variables. Hypothesis-free variable selection using BART resulted in models that explained 20–31% of Bayley subscale scores and AUROC of 0.87 (0.83–0.91) for severe NDI, with good calibration across the range of outcome predictions. However, even with the most accurate models, the average prediction error for the Bayley subscale predictions was around 14–15 points, leading to wide prediction intervals. Higher total transfusion volume was the most important predictor of severe NDI and lower Bayley scores across all subscales. Interpretation: While the machine learning BART approach meaningfully improved predictive accuracy above a widely used prediction tool (NICHD) as well as a model utilizing NDI-associated clinical characteristics, the average error remained approximately 1 standard deviation on either side of the true value. Although dichotomous NDI prediction using BART was more accurate than has been previously reported, and certain clinical variables such as transfusion exposure were meaningfully predictive of outcomes, our results emphasize the fact that the field is still not able to accurately predict the results of complex long-term assessments such as Bayley subscales in infants born EP even when using rich datasets and advanced analytic methods. This highlights the ongoing need for long-term follow-up of all EP infants. Funding: Supported by the National Institute of Neurological Disorders and Stroke U01NS077953 and U01NS077955.

AB - Background: Infants born extremely preterm (<28 weeks’ gestation) are at high risk of neurodevelopmental impairment (NDI) with 50% of survivors showing moderate or severe NDI when at 2 years of age. We sought to develop novel models by which to predict neurodevelopmental outcomes, hypothesizing that combining baseline characteristics at birth with medical care and environmental exposures would produce the most accurate model. Methods: Using a prospective database of 692 infants from the Preterm Epo Neuroprotection (PENUT) Trial, which was carried out between December 2013 and September 2016, we developed three predictive algorithms of increasing complexity using a Bayesian Additive Regression Trees (BART) machine learning approach to predict both NDI and continuous Bayley Scales of Infant and Toddler Development 3rd ed subscales at 2 year follow-up using: 1) the 5 variables used in the National Institute of Child Health and Human Development (NICHD) Extremely Preterm Birth Outcomes Tool, 2) 21 variables associated with outcomes in extremely preterm (EP) infants, and 3) a hypothesis-free approach using 133 potential variables available for infants in the PENUT database. Findings: The NICHD 5-variable model predicted 3–4% of the variance in the Bayley subscale scores, and predicted NDI with an area under the receiver operator curve (AUROC, 95% CI) of 0.62 (0.56–0.69). Accuracy increased to 12–20% of variance explained and an AUROC of 0.77 (0.72–0.83) when using the 21 pre-selected clinical variables. Hypothesis-free variable selection using BART resulted in models that explained 20–31% of Bayley subscale scores and AUROC of 0.87 (0.83–0.91) for severe NDI, with good calibration across the range of outcome predictions. However, even with the most accurate models, the average prediction error for the Bayley subscale predictions was around 14–15 points, leading to wide prediction intervals. Higher total transfusion volume was the most important predictor of severe NDI and lower Bayley scores across all subscales. Interpretation: While the machine learning BART approach meaningfully improved predictive accuracy above a widely used prediction tool (NICHD) as well as a model utilizing NDI-associated clinical characteristics, the average error remained approximately 1 standard deviation on either side of the true value. Although dichotomous NDI prediction using BART was more accurate than has been previously reported, and certain clinical variables such as transfusion exposure were meaningfully predictive of outcomes, our results emphasize the fact that the field is still not able to accurately predict the results of complex long-term assessments such as Bayley subscales in infants born EP even when using rich datasets and advanced analytic methods. This highlights the ongoing need for long-term follow-up of all EP infants. Funding: Supported by the National Institute of Neurological Disorders and Stroke U01NS077953 and U01NS077955.

KW - Extreme prematurity

KW - Outcomes

KW - neurodevelopmental impairment

KW - prediction

UR - http://www.scopus.com/inward/record.url?scp=85144765631&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85144765631&partnerID=8YFLogxK

U2 - 10.1016/j.eclinm.2022.101782

DO - 10.1016/j.eclinm.2022.101782

M3 - Article

C2 - 36618896

AN - SCOPUS:85144765631

SN - 2589-5370

VL - 56

JO - EClinicalMedicine

JF - EClinicalMedicine

M1 - 101782

ER -

Predicting 2-year neurodevelopmental outcomes in extremely preterm infants using graphical network and machine learning approaches

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this