Skip to main content

Using machine-learning models to predict extubation failure in neonates with bronchopulmonary dysplasia

Abstract

Aim

To develop a decision-support tool for predicting extubation failure (EF) in neonates with bronchopulmonary dysplasia (BPD) using a set of machine-learning algorithms.

Methods

A dataset of 284 BPD neonates on mechanical ventilation was used to develop predictive models via machine-learning algorithms, including extreme gradient boosting (XGBoost), random forest, support vector machine, naïve Bayes, logistic regression, and k-nearest neighbor. The top three models were assessed by the area under the receiver operating characteristic curve (AUC), and their performance was tested by decision curve analysis (DCA). Confusion matrix was used to show the high performance of the best model. The importance matrix plot and SHapley Additive exPlanations values were calculated to evaluate the feature importance and visualize the results. The nomogram and clinical impact curves were used to validate the final model.

Results

According to the AUC values and DCA results, the XGboost model performed best (AUC = 0.873, sensitivity = 0.896, specificity = 0.838). The nomogram and clinical impact curve verified that the XGBoost model possessed a significant predictive value. The following were predictive factors for EF: pO2, hemoglobin, mechanical ventilation (MV) rate, pH, Apgar score at 5 min, FiO2, C-reactive protein, Apgar score at 1 min, red blood cell count, PIP, gestational age, highest FiO2 at the first 24 h, heart rate, birth weight, pCO2. Further, pO2, hemoglobin, and MV rate were the three most important factors for predicting EF.

Conclusions

The present study indicated that the XGBoost model was significant in predicting EF in BPD neonates with mechanical ventilation, which is helpful in determining the right extubation time among neonates with BPD to reduce the occurrence of complications.

Peer Review reports

Introduction

Bronchopulmonary dysplasia (BPD) is one of the most common and serious complications in preterm infants. The diagnosis is based on the recommended criterion, that is, any newborn with oxygen dependence (fraction of inspired oxygen [FiO2] > 0.21) for 28 days) [1]. Extubation failure (EF) is a difficult problem in newborns and neonates diagnosed with BPD, and it could result in poorer prognosis, such as pneumothorax, subglottic injury, nosocomial infection, and neurodevelopmental impairment [2]. EF is defined as the need for re-intubation within 72 h after planned extubation [3], and timely and effective extubation is an important goal [4]. However, premature extubation in unprepared neonates causes EF, prolonged mechanical ventilation (MV) duration, and increased morbidity and mortality in high-risk neonates [5,6,7]. A variety of prospective and retrospective studies have tried to identify predictors of EF, and different factors predicting EF have been studied, including birth weight, Apgar score [8], FiO2 prior to extubation [9], spontaneous breathing tests [10], cardiorespiratory variability [11], airway trauma, and feeding difficulties [12].

With the rapid development of precision medicine, novel machine-learning (ML) techniques have offered improved predictive performance relative to traditional prediction methods [13]. A previous study based on 486 premature neonates developed different ML algorithms to predict EF, but the algorithms performed worse than expert clinicians [14]. Kanbar et al. [15] mainly applied a single classifier to predict EF among 241 extremely preterm infants in a multicenter study. The authors obtained the receiver operating characteristic (ROC) curves of clinical, cardiorespiratory, and clinical and cardiorespiratory classifiers. The performance of their models was unsatisfactory (all area under the ROC curve [AUC] values < 0.75), and they failed to further explore visualization results. Onu et al. [16] intended to model the sequence of respiratory patterns for the 5-minute period of Endo Tracheal Tube Continuous Positive Airway Pressure (ETT-CPAP) and use a Markov chain model to convert state information into probabilities and predict the probability of EF in extremely preterm newborns. Robles-Rubio et al. [17] used an automated analysis of Respiratory Inductive Plethysmography (RIP) signals in a limited number of infants, which was a simple classification using two metrics and was a hypothesis generator. However, the pathophysiological relevance of the metrics derived from an Automated Unsupervised Respiratory Event Analysis (AUREA) remains unknown.

Extreme gradient boosting (XGBoost) is an interpretable model that has been used in different disease models, including in neonates [18,19,20,21]. However, there have been no reports on the use of XGBoost to study EF. In this study, we proposed an EF prediction and feature analysis model using XGBoost. The results indicated that the prediction model was helpful in increasing the current successful extubation rate in neonates under MV.

Materials and methods

Selection of participants

The study was approved by the Ethics Committee of the Children’s Hospital of Soochow University, with a waiver for informed consent because of the retrospective nature of the study. Eligible neonates were those who had been mechanically ventilated due to a diagnosis of BPD between January 1, 2015, and May 31, 2022, and survived to their first extubation from MV. The study excluded neonates (1) who were never mechanically ventilated or died prior to their first extubation, (2) who experienced unplanned extubation episodes and required immediate re-intubation without a trial of noninvasive respiratory support, (3) who were diagnosed with severe congenital malformations, complex congenital heart disease, chromosomal abnormalities, genetic metabolism sexual diseases, severe infectious diseases; and (4) who were missing too many medical records.

BPD was diagnosed based on the recommended diagnostic criterion, that is, any newborn with oxygen dependence (FiO2 > 0.21) for 28 days). If a newborn’s gestational age (GA) was less than 32 weeks, BPD was graded according to the corrected GA of 36 weeks or oxygen concentration at discharge. If the GA was more than 32 weeks, BPD was graded according to the oxygen demand concentration at 56 days after birth or at discharge as follows: (1) mild, no oxygen; (2) moderate, FiO2 < 0.30; and (3) severe, FiO2 < 0.30 or requiring mechanical ventilation [1].

EF was defined as the need for re-intubation within 72 h after planned extubation if blood gas values, vital signs, saturation, and overall clinical stability of the neonate were abnormal, according to the commonly used definition [3]. The criteria for extubation in the present study were as follows: (1) The neonates’ primary disease improved, infection was controlled, and they were in good condition. (2) When the blood gas analysis was normal, the ventilator parameters were gradually reduced, and spontaneous breathing was exercised and enhanced. We reduced FiO2 and PIP first; then, the MV rate was reduced, chest movements were observed, and SaO2 and arterial blood gas results were monitored. (3) When PIP was ≤ 18 cm H2O, PEEP was 2–4 cm H2O, rate was ≤ 10 times/min, FiO2 was ≤ 0.4, and the arterial blood gas results were normal, the MV was removed. The criteria for re-intubation in the present study were as follows: (1) symptoms of dyspnea persisted during treatment after extubation; (2) the arterial pCO2 was above 70 mm Hg; and (3) the arterial pO2 was below 50 mm Hg [3].

All of the neonates in this study received post-extubation noninvasive respiratory support, and only the first extubation episode was evaluated.

Data collection

Clinical and laboratory variables, vital signs, and MV parameters were obtained within 6 h prior to extubation. The recorded characteristics included sex, birth weight (BW), GA, delivery mode, Apgar scores at 1 min and 5 min, delivery room (DR) resuscitation (chest compression, epinephrine), multiple-pregnancy status, intrauterine distress, prenatal maternal use of dexamethasone and antibiotics, placental abnormality, premature rupture of membranes, diseases during pregnancy (including pregnancy-induced hypertension and gestational diabetes), highest FiO2 at the first 24 h, laboratory data (arterial blood gas, full blood count, and C-reactive protein [CRP]), vital signs (respiratory rate and heart rate), and MV parameters (PIP, FiO2, PEEP, and MV rate). For variables with multiple measurements, the average values were assessed.

Statistical analysis

The baseline characteristics were compared between the successful-extubation group and the EF group. Continuous variables without normal distribution were reported as medians (interquartile ranges) and analyzed using the Wilcoxon rank-sum test, while continuous variables with normal distribution were presented as means (standard deviations) and analyzed using the Student’s t test. Categorical variables were reported as numbers and analyzed using the chi-square test, continuity-corrected chi-square test, or Fisher’s exact-probability method for intergroup comparisons. All of the statistical analyses were conducted using Python (version 3.6) and R software. P values lower than 0.05 were defined as being statistically significant.

Prediction models

To address the issue of imbalanced data, the oversampling method with Adaptive Synthetic Sampling (ADASYN) algorithm was utilized. Unlike other oversampling techniques, ADASYN generates synthetic samples based on the density between different samples to achieve a balanced class distribution. We used a dynamic K value approach to more accurately capture the data characteristics of each class. Initially, each class sample was assigned a starting K value of 5. Local density was then calculated by assessing the proportion of majority class samples among the K nearest neighbors, estimating the sample’s regional density. K values were dynamically adjusted based on this density. Lower density samples received higher K values to better represent their data distribution, while higher density samples got smaller K values due to denser surrounding data. Synthetic samples were generated using these adjusted K values, particularly for minority class samples, selecting from their K nearest neighbors to enhance quality and diversity [22]. Six ML methods, namely, K-nearest neighbor (KNN), support vector machine (SVM), logistic regression (LR), naïve Bayes (NB), random forest (RF), and XGBoost, were preliminarily used to predict EF. Ten-fold cross validation was conducted in the complete dataset and the average metrics values were exported to access model performance in current study. During the process of hyperparameters tuning, the models were initialized separately. Afterwards, batch gradient descent algorithm was applied iteratively to update these parameters until convergence is achieved. In each iteration, the batch gradient descent algorithm adjusts the values of parameters based on the gradient of the loss function, and then gradually stabilizing the loss function values of these models. While the XGBoost algorithm can handle missing values automatically, most of the other models cannot analyze data with missing values. Therefore, features with missing degrees and other missing values were imputed using multiple imputation. In the model-comparison phase, the performances of the six predictive models were tested by the AUC values and compared, and decision curve analysis (DCA) was used to select the optimal model that achieved the highest overall diagnostic value for further verification. Confusion matrix was used to show the high performance of the best model. Feature importance analysis was performed to identify the top 15 features that were helpful for EF prediction. SHapley Additive exPlanations (SHAP)—an interpretable tool for predicting the output of ML models—was used to reflect the influence of the features in each data sample and also show positive and negative effects [23]. Finally, the nomogram and clinical impact curve (CIC) were plotted to evaluate the clinical usefulness and applicability of the model with the best diagnostic value.

Results

Characteristics affecting EF in BPD neonates

As shown in Fig. 1, a total of 284 neonates were included in this cohort. The dataset was then divided into a success set (n = 191) and a failure set (n = 93). Comparisons of the baseline characteristics, vital signs, laboratory parameters, and MV parameters before extubation between the successful-extubation and EF groups are shown in Table 1. There were significant differences in BW, GA, Apgar scores at 1 min and 5 min, DR chest compression, use of PS and respiratory stimulant, maternal disease (pregnancy-induced hypertension), and highest FiO2 in the first 24 h between the two groups (P < 0.05). The neonates in the EF group had a higher heart rate, MV rate, FiO2, PEEP, PIP, abnormal arterial blood gas (pH, pO2, and pCO2), and full blood count (red blood count [RBC], hemoglobin [Hb]) within 6 h prior to extubation (P < 0.05). Significantly prolonged MV duration, younger age at extubation, and higher CRP levels within 6 h before extubation were also found in the EF group. However, sex, premature rupture of membranes, intrauterine distress, abnormal placenta, delivery mode, multiple pregnancy, maternal disease (such as gestational diabetes), DR epinephrine, PMA at extubation, PLT, and WBC were not significantly different between the two groups.

Fig. 1
figure 1

Flowchart of the study

Table 1 Baseline characteristics between the successful extubation group and the EF group

Predictive models for EF in BPD neonates

In imbalanced data, oversampling method with ADASYN algorithm was applied to achieve a balanced class distribution. In the development of the ML model, 23 key features were finally selected, including BW, GA, Apgar scores at 1 and 5 min, DR chest compression, the use of PS and respiratory stimulants, maternal disease (pregnancy-induced hypertension), highest FiO2 at the first 24 h, heart rate, MV rate, FiO2, PEEP, PIP, pH, pO2, pCO2, RBC, Hb, and CRP within 6 h prior to extubation, MV duration, and age at extubation. First, the AUC values of the six models (shown in Fig. 2A) were obtained, and the sensitivity and specificity analyses of these predictive methods are summarized in Table 2. The XGBoost, LR, and RF models showed the largest AUC values (XGBoost, AUC = 0.873; RF, AUC = 0.836; LR, AUC = 0.787). According to the DCA of the above mentioned three prediction models, the net benefit of the XGBoost model was larger than that of the LR and RF models, suggesting that the XGBoost model was optimal and the LR model was inferior (Fig. 2B). Confusion matrix showed the high performance of the best model (Fig. 3) and had superior results (Table 3). Figure 4A shows the feature importance analysis of the XGBoost model to intuitively reflect the importance of the features. The top 15 features were selected, and it turned out that three features, namely, pO2, Hb, and MV rate, were the most important predictors of EF. To obtain more complex correlations with EF, the SHAP values were calculated to show positive and negative effects. Visualization results are shown in Fig. 4B, with blue indicating a negative contribution and red indicating a positive contribution. Specifically, MV rate, FiO2, CRP, PIP, highest FiO2 in the first 24 h, heart rate, and pCO2 were positive predictors of EF, as increasing the values of these features increased the prediction of EF. In contrast, pO2, Hb, pH, Apgar score at 1 and 5 min, RBC, GA and BW were negatively correlated with EF, as smaller values of these features indicated better prediction of the model.

Fig. 2
figure 2

(A) Receiver operating characteristic (ROC) curves of six machine-learning models. The three best performing models were the XGBoost, RF, and LR models. XGBoost AUC = 0.873; RF AUC = 0.836; LR AUC = 0.787; NB AUC = 0.771; SVM AUC = 0.758; KNN AUC = 0.643. (B) Decision curve analysis (DCA) of the three prediction models. The net benefit curves for the three prognostic models are shown. X-axis indicates the threshold probability for critical care outcome, and Y-axis indicates the net benefit. Solid red line = XGboost model; solid black line = RF model; solid blue line = LR model. The preferred model is the XGboost model, the net benefit of which was larger over the range of the RF and LR models

Table 2 Model performance in our study
Fig. 3
figure 3

Confusion matrix of the XGBoost

Table 3 The results of confusion matrix of XGBoost model
Fig. 4
figure 4

Top 15 features selected using XGBoost. (A) The corresponding variable importance score. X-axis indicates the importance score, which is the relative number of a variable that is used to distribute the data. Y-axis indicates the top 15 weighted variables. (B) The SHapley additive exPlanations (SHAP) framework for the features in the XGBoost model. Input variables are ranked in a descending order of feature importance. Red indicates positive contribution; blue indicates negative contribution

Validation of predictive model of XGBoost

To visualize the XGBoost model, the risk nomogram integrating 15 selected variables for the incidence of EF is shown in Fig. 5A. A CIC analysis was performed (Fig. 5B) to evaluate the clinical applicability of the risk prediction nomogram. The X-axis of the curve represents the number of patients or tests, and the Y-axis represents the number of true positives and false positives. The red curve (number of high-risk individuals) indicates the number of people who are classified as positive (high risk) by the model at each threshold probability. The blue curve (number of high-risk individuals with outcome) is the number of true positives at each threshold probability. In the CIC, when the threshold is greater than 0.65, the prediction model and the actual result are highly matched. The CIC visually showed that the nomogram had a superior overall net benefit within the wide and practical range of threshold probabilities and affected patient outcomes, indicating that the XGBoost model possessed significant predictive value.

Fig. 5
figure 5

Risk nomogram and clinical impact curve (CIC) analysis of the XGBoost model. (A) Nomogram to estimate the risk of re-intubation. To use the nomogram, we first draw a line from each parameter value to the score axis for the score; the points for all of the parameters are then added; finally, a line from the total score axis is drawn to determine the risk of mortality on the lower line of the nomogram. (B) CIC analysis of the XGBoost model. The red curve (number of high-risk individuals) indicates the number of people who are classified as positive (high risk) by the model at each threshold probability; the blue curve (number of high-risk individuals with outcome) is the number of true positives at each threshold probability. CIC visually indicates that nomogram confers high clinical net benefit and confirms the clinical value of the XGboost model

Discussion

In this study, the AUC values and DCA demonstrated that using the XGboost model could effectively predict EF in neonates. The pO2, Hb, MV rate, pH, Apgar score at 5 min, FiO2, C-reactive protein, Apgar score at 1 min, red blood cell count, PIP, GA, highest FiO2 at the first 24 h, heart rate, BW, pCO2 were good predictors of EF in neonates. To our knowledge, this is the first mathematical model to predict EF in neonates with BPD.

XGBoost, a decision tree–based algorithm, was found to be a better algorithm for ML and prediction [24]. There is growing evidence that XGBoost-based ML algorithm is a competitive alternative to traditional regression analysis, and it is used in predicting clinical adverse outcomes due to its excellent precision value and performance [25,26,27,28]. In the present study, we compared the predictive efficiency of different models and found that the XGBoost model demonstrated advantages over other models, including the LR and RF models.

The definition of EF has been quite variable among previous studies, ranging from 2 to 7 days after extubation [3]. A shorter period (prior to 48 h) may better distinguish between re-intubation for pulmonary insufficiency and re-intubation for other reasons such as sepsis or surgery [29]. Conversely, a longer period (7 days) accounts for a greater proportion of EF [30]. In this study, we used a time frame ≤ 72 h to define EF, and the results indicated that EF could be predicted effectively.

In the present study, 23 variables were evaluated to improve the usability of our model, and eventually 15 key features were selected, including pO2, Hb, MV rate, pH, Apgar score at 5 min, FiO2, C-reactive protein, Apgar score at 1 min, red blood cell count, PIP, GA, highest FiO2 at the first 24 h, heart rate, BW, pCO2. Previous studies have indicated that lower GA, BW, and Apgar score were important factors associated with an increased risk of EF [7, 8, 10, 31], which is consistent with the present study. Lower GA, BW, and Apgar score usually indicate underdeveloped lungs, low lung compliance, high airway resistance, and an immature central respiratory drive. A high FiO2 requirement on day 1 of life could also be a marker of the severity of respiratory distress syndrome and brain immaturity [6]. In our study, the highest FiO2 during the first 24 h of life was a significant predictor of EF.

We speculated that pH, pO2, and pCO2 within 6 h before extubation were significantly predictive of extubation readiness. Lower pH and pO2 and higher pCO2 indicate hypoventilation or severe pulmonary disease [2, 32], and are remarkable predictive factors for EF according to their SHAP values. In addition, abnormal vital signs such as heart rate were associated with a higher EF risk. The basic factor is commonly used in clinical practice, representing the vital status of a patient, and is included in many prediction models [33, 34].

As expected, pre-extubation Hb, RBC, and CRP levels were significantly different between the two groups. It remains controversial whether anemia causes apnea in premature neonates [25,26,27, 35], and this issue has not been well-studied in full-term newborns. Understandably, patients may be affected if the major oxygen transporter (erythrocyte and Hb) is reduced to a certain level. The association between CRP and EF could be explained by the fact that CRP is a measure of infection in the lungs, and, as shown in Fig. 3B, a higher CRP level was related to a higher EF risk.

The current study also showed that MV parameters before extubation, such as MV rate, FiO2, PIP, contribute to EF prediction in neonates. Previous studies have explored the benefits of MV parameters prior to extubation [25,26,27,28]. These studies indicated that there is a higher risk of EF when the effort of breathing results in a lower spontaneous tidal volume and there is an increased load on the respiratory muscles, as indicated by a high PIP, MV rate, and FiO2. In addition, when the patients were extubated from a high level of ventilator support, as shown by higher FiO2, PIP, and MV rates, their ability to sustain spontaneous breathing after extubation was overestimated. Therefore, the results of the present study provide support for serious consideration of EF in neonates prior to extubation.

This study has several limitations. First, there is disagreement regarding the definition of EF. The definition adopted in the present study was a time frame ≤ 72 h prior to re-intubation. However, a shorter period may miss a proportion of EF. Second, the data was from a single center, and there was no additional data to validate the robustness and universality of the model. Thus, further multicenter prospective study with larger samples should be conducted to validate the prediction model.

In conclusion, the current study screened 15 key features associated with EF and developed an XGBoost model that can better predict EF than other predictive models in neonates diagnosed with BPD.

Data availability

The authors confirm that data supporting the findings of this study are available within the article.

References

  1. Jobe AH, Bancalari E. Bronchopulmonary dysplasia. Am J Respir Crit Care Med. 2001;163:1723–9.

    Article  CAS  PubMed  Google Scholar 

  2. Shalish W, Kanbar L, Keszler M, et al. Patterns of reintubation in extremely preterm infants: a longitudinal cohort study. PEDIATR RES. 2018;83:969–75.

    Article  PubMed  Google Scholar 

  3. Al-Mandari H, Shalish W, Dempsey E, et al. International survey on periextubation practices in extremely preterm infants. Arch Dis Child Fetal Neonatal Ed. 2015;100:F428–31.

    Article  CAS  PubMed  Google Scholar 

  4. Boles JM, Bion J, Connors A, et al. Weaning from mechanical ventilation. Eur Respiratory J. 2007;29:1033–56.

    Article  Google Scholar 

  5. Jensen EA, DeMauro SB, Kornhauser M et al. Effects of multiple ventilation courses and duration of mechanical ventilation on respiratory outcomes in extremely low-birth-weight infants. 2015; 169:1011–7.

  6. Chawla S, Natarajan G, Shankaran S, et al. Markers of successful extubation in extremely Preterm infants, and Morbidity after failed extubation. J Pediatr. 2017;189:113–9.

    Article  PubMed  PubMed Central  Google Scholar 

  7. Manley BJ, Doyle LW, Owen LS, et al. Extubating extremely Preterm infants: predictors of success and outcomes following failure. J Pediatr. 2016;173:45–9.

    Article  PubMed  Google Scholar 

  8. Hermeto F, Martins BM, Ramos JR, et al. Incidence and main risk factors associated with extubation failure in newborns with birth weight < 1,250 grams. J Pediatr (Rio J). 2009;85:397–402.

    Article  PubMed  Google Scholar 

  9. von Merkel J, Gebauer C, Bläser A, et al. [Prediction of extubation failure in ELBW preterm infants]. KLIN PADIATR. 2012;224:324–30.

    Google Scholar 

  10. Chawla S, Natarajan G, Gelmini M, et al. Role of spontaneous breathing trial in predicting successful extubation in premature infants. Pediatr Pulmonol. 2013;48:443–8.

    Article  PubMed  Google Scholar 

  11. Shalish W, Latremouille S, Papenburg J, Sant’Anna GM. Predictors of extubation readiness in preterm infants: a systematic review and meta-analysis. Arch Dis Child Fetal Neonatal Ed. 2019;104:F89–97.

    Article  PubMed  Google Scholar 

  12. Shalish W, Kanbar LJ, Rao S, et al. Prediction of Extubation readiness in extremely preterm infants by the automated analysis of cardiorespiratory behavior: study protocol. BMC PEDIATR. 2017;17:167.

    Article  PubMed  PubMed Central  Google Scholar 

  13. Luo JC, Zhao QY, Tu GW. Clinical prediction models in the precision medicine era: old and new algorithms. Ann Transl Med. 2020;8:274.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Mueller M, Almeida JS, Stanislaus R, Wagner CL. Can Machine Learning methods Predict Extubation Outcome in premature infants as well as clinicians? J Neonatal Biol. 2013;2:1000118.

    PubMed  PubMed Central  Google Scholar 

  15. Kanbar LJ, Shalish W, Onu CC, et al. Automated prediction of extubation success in extremely preterm infants: the APEX multicenter study. Pediatr Res. Mar; 2023;93:1041–9.

    Article  CAS  PubMed  Google Scholar 

  16. Onu CC, Kanbar LJ, Shalish W et al. A Semi-markov chain approach to modeling respiratory patterns prior to extubation in preterm infants. Annu Int Conf IEEE Eng Med Biol Soc. 2017:2022–6.

  17. Robles-Rubio CA, Kaczmarek J, Chawla S, et al. Automated analysis of respiratory behavior in extremely preterm infants and extubation readiness. Pediatr Pulmonol. 2015;50(5):479–86.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. Khera R, Haimovich J, Hurley NC, et al. Use of Machine Learning models to predict death after Acute myocardial infarction. JAMA CARDIOL. 2021;6:633–41.

    Article  PubMed  Google Scholar 

  19. Yu D, Liu Z, Su C, et al. Copy number variation in plasma as a tool for lung cancer prediction using Extreme Gradient Boosting (XGBoost) classifier. THORAC CANCER. 2020;11:95–102.

    Article  CAS  PubMed  Google Scholar 

  20. Zhang Z, Ho KM, Hong Y. Machine learning for the prediction of volume responsiveness in patients with oliguric acute kidney injury in critical care. CRIT CARE. 2019;23:112.

    Article  PubMed  PubMed Central  Google Scholar 

  21. Batista A, Diniz C, Bonilha EA, et al. Neonatal mortality prediction with routinely collected data: a machine learning approach. BMC PEDIATR. 2021;21:322.

    Article  PubMed  PubMed Central  Google Scholar 

  22. Priyadharshini M, Banu AF, Sharma B, et al. Hybrid Multi-label classification model for medical applications based on adaptive Synthetic Data and Ensemble Learning. Sens (Basel). 2023;23(15):6836.

    Article  Google Scholar 

  23. Lundberg S, Lee SI. A Unified Approach to interpreting model predictions. in Pro Adv Neural Inf Proce Syst. 2017; 4765–74.

  24. Chen T, Guestrin C, XGBoost:. A Scalable Tree Boosting System. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining-KDD 2016, San Francisco, CA, USA. 2016; 785–94.

  25. Vidotto MC, Sogame LC, Gazzotti MR, et al. Analysis of risk factors for extubation failure in subjects submitted to non-emergency elective intracranial surgery. Respir Care. 2012;57:2059–66.

    Article  PubMed  Google Scholar 

  26. Vidotto MC, Sogame LC, Calciolari CC, et al. The prediction of extubation success of postoperative neurosurgical patients using frequency-tidal volume ratios. NEUROCRIT CARE. 2008;9:83–9.

    Article  PubMed  Google Scholar 

  27. Farhadi R, Lotfi HR, Alipour A, et al. Comparison of two levels of pressure support ventilation on success of Extubation in Preterm neonates: a Randomized Clinical Trial. Glob J Health Sci. 2015;8:240–7.

    Article  PubMed  PubMed Central  Google Scholar 

  28. Cabello B, Thille AW, Roche-Campo F, et al. Physiological comparison of three spontaneous breathing trials in difficult-to-wean patients. Intensive Care Med. 2010;36:1171–9.

    Article  PubMed  Google Scholar 

  29. Xie J, Cheng G, Zheng Z, Luo H, Ooi OC. To extubate or not to extubate: risk factors for extubation failure and deterioration with further mechanical ventilation. J Card Surg. 2019;34:1004–11.

    Article  PubMed  Google Scholar 

  30. Mallat J, Baghdadi FA, Mohammad U, et al. Central venous-to-arterial PCO2 difference and central venous oxygen saturation in the detection of extubation failure in critically ill patients. Crit Care Med. 2020;48:1454–61.

    Article  CAS  PubMed  Google Scholar 

  31. Silva-Cruz AL, Velarde-Jacay K, Carreazo NY, et al. Risk factors for extubation failure in the intensive care unit. Rev Bras Ter Intensiva. 2018;30:294–300.

    Article  PubMed  PubMed Central  Google Scholar 

  32. Manley BJ, Doyle LW, Owen LS, Davis PG. Extubating extremely Preterm infants: predictors of success and outcomes following failure. J Pediatr. 2016;173:45–9.

    Article  PubMed  Google Scholar 

  33. Costa AC, Schettino RC, Ferreira SC. [Predictors of extubation failure and reintubation in newborn infants subjected to mechanical ventilation]. Rev Bras Ter Intensiva. 2016;26:51–6.

    Article  Google Scholar 

  34. Ammari A, Suri M, Milisavljevic V, et al. Variables associated with the early failure of nasal CPAP in very low birth weight infants. J Pediatr. 2005;147:341–7.

    Article  PubMed  Google Scholar 

  35. Spasojevic S, Doronjski A. Risk factors associated with failure of extubation in very-low-birth-weight newborns. J Matern Fetal Neonatal Med. 2018;31:300–4.

    Article  PubMed  Google Scholar 

Download references

Funding

The authors declare that no funds, grants, or other support was received during the preparation of this manuscript.

Author information

Authors and Affiliations

Authors

Contributions

Wan-liang Guo and Xin Ding designed the study. Wan-liang Guo, Yue Tao, and Xin Ding collected data. Guo and Tao analyzed the data. Wan-liang Guo and Yue Tao wrote the manuscript. All of the authors read and approved the final version of the manuscript.

Corresponding author

Correspondence to Wan-liang Guo.

Ethics declarations

Ethical approval

The study was approved by the Ethics Committee of the Children’s Hospital of Soochow University, with a waiver for informed consent because of the retrospective nature of the study.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Tao, Y., Ding, X. & Guo, Wl. Using machine-learning models to predict extubation failure in neonates with bronchopulmonary dysplasia. BMC Pulm Med 24, 308 (2024). https://doi.org/10.1186/s12890-024-03133-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12890-024-03133-3

Keywords