Incubation period, clinical and lung CT features for early prediction of COVID-19 deterioration: development and internal verification of a risk model
BMC Pulmonary Medicine volume 22, Article number: 188 (2022)
Most severe, critical, or mortal COVID-19 cases often had a relatively stable period before their status worsened. We developed a deterioration risk model of COVID-19 (DRM-COVID-19) to predict exacerbation risk and optimize disease management on admission.
We conducted a multicenter retrospective cohort study with 239 confirmed symptomatic COVID-19 patients. A combination of the least absolute shrinkage and selection operator (LASSO), change-in-estimate (CIE) screened out independent risk factors for the multivariate logistic regression model (DRM-COVID-19) from 44 variables, including epidemiological, demographic, clinical, and lung CT features. The compound study endpoint was progression to severe, critical, or mortal status. Additionally, the model's performance was evaluated for discrimination, accuracy, calibration, and clinical utility, through internal validation using bootstrap resampling (1000 times). We used a nomogram and a network platform for model visualization.
In the cohort study, 62 cases reached the compound endpoint, including 42 severe, 18 critical, and two mortal cases. DRM-COVID-19 included six factors: dyspnea [odds ratio (OR) 4.89;confidence interval (95% CI) 1.53–15.80], incubation period (OR 0.83; 95% CI 0.68–0.99), number of comorbidities (OR 1.76; 95% CI 1.03–3.05), D-dimer (OR 7.05; 95% CI, 1.35–45.7), C-reactive protein (OR 1.06; 95% CI 1.02–1.1), and semi-quantitative CT score (OR 1.50; 95% CI 1.27–1.82). The model showed good fitting (Hosmer–Lemeshow goodness, X2(8) = 7.0194, P = 0.53), high discrimination (the area under the receiver operating characteristic curve, AUROC, 0.971; 95% CI, 0.949–0.992), precision (Brier score = 0.051) as well as excellent calibration and clinical benefits. The precision-recall (PR) curve showed excellent classification performance of the model (AUCPR = 0.934). We prepared a nomogram and a freely available online prediction platform (https://deterioration-risk-model-of-covid-19.shinyapps.io/DRMapp/).
We developed a predictive model, which includes the including incubation period along with clinical and lung CT features. The model presented satisfactory prediction and discrimination performance for COVID-19 patients who might progress from mild or moderate to severe or critical on admission, improving the clinical prognosis and optimizing the medical resources.
The global pandemic of COVID-19 caused by the severe acute respiratory syndrome coronavirus (SARS-COV-2) has started in December 2019 and it has been around for 2 year now [1, 2]. As of May 9, 2021, the World Health Organization (WHO) reported that more than 1.5 billion infected people worldwide, and more than 3.29 million deaths occurred. The fatality rate in the early stage of the disease is more than 7.0% [3, 4]. This pandemic poses a significant threat to global health.
The reported clinical outcomes of different severity grades are heterogeneous, and the mild and moderate cases often rely on their immune ability to recover [5, 6]. However, most severe or critical COVID-19 patients are asymptomatic at the initial stage of onset, and the median time from onset to sepsis is 10.0 days [interquartile range (IQR) 7.0–14.0] . Early screening and active intervention in critical patients could reduce mortality . A deterioration model of early prediction of COVID-19 progression from mild or moderate to severe, critical or mortal might help front-line clinicians to optimize the patient triage and develop appropriate treatment strategies.
Many multivariate clinical prognostic models for predicting the deterioration of COVID-19 have been published [9,10,11,12,13]. The predictors mainly include demographic, clinical, and laboratory factors. However, the included factors are rarely involved in epidemiology and chest imaging features, such as the incubation period. Although the incubation period was the key feature and essential basis in the study of epidemic control and prediction , there were relatively few studies on the deterioration of COVID-19. Early studies found that the incubation period of travelers to Hubei was shorter than that of non-travelers . The incubation period was negatively correlated with the severity of COVID-19 . Furthermore, high CT scores characterized severe/critical COVID-19 pneumonia [16, 17]. Further research is still needed to determine whether epidemiology and lung CT features can improve the predictive ability of the deterioration model.
In this study, we present a prediction model of COVID-19 (DRM-COVID-19) with epidemiological, clinical, and pulmonary CT characteristics, which could predict the risk of COVID-19 deterioration on admission. The COVID-19 epidemic is still raging globally, and we hope our model can provide convenience for front-line clinicians to make individualized treatment decisions, reduce the deterioration and optimize the use of medical resources.
Participants, compound endpoint, and design
In this work, we established a retrospective cohort study on patients from five hospitals in Hunan Province (Loudi Central Hospital, Xiangtan Central Hospital, Yueyang First People's Hospital, Shaoyang Central Hospital, and Huaihua First People's Hospital) between January 11 and February 28, 2020. Considering the particularity of the COVID-19 disease, all the data were collected anonymously and retrospectively, protecting the patients' privacy. The ethics committee of the Loudi central hospital approved this study and waived the need for written informed consent for the new infectious diseases. Our study followed the World Medical Association’s Declaration of Helsinki.
According to the World Health Organization (WHO) living guidance of COVID-19 , 246 patients diagnosed with COVID-19 by pharyngeal or nasopharyngeal swabs were enrolled in this study. According to the WHO guidelines , the cases were classified as mild, moderate, severe, or critical [including acute respiratory distress syndrome (ARDS), sepsis, septic shock]. As the primary outcome, deterioration refers to the progression from mild or moderate to severe, critical or mortal [9, 10].
We excluded three COVID-19 patients who were asymptomatic since onset and four patients who were diagnosed as severe or critical on admission. Then, the remaining 239 patients were followed up for 30 days, among which 62 patients worsened and reached the primary outcome, and were included in the deterioration group. Hence, 177 patients represented the stable group (Fig. 1).
The collected data included the demographics, comorbidities, clinical symptoms, laboratory results, and imaging data, all of which were cross-verified by two experienced physicians from the electronic health records (ERH) in each COVID-19 treatment center. Clinical symptoms, laboratory results, and image data were collected at the day of admission.
The demographic and epidemiological data included the following: age, gender, Wuhan origin (living in Wuhan, traveling or taking public transport through Wuhan), and the incubation period (the time from the first exposure to onset ), and the length of hospital day. Comorbidities data included the following: number of comorbidities, coronary heart disease, hypertension, endocrine system disease (diabetes, obesity, hyperlipidemia, or hyperthyroidism), chronic lung disease (chronic obstructive pulmonary disease, asthma, chronic bronchitis, emphysema, or bronchiectasis), malignant tumor, and chronic digestive system diseases (viral hepatitis, liver cirrhosis, fatty liver, or drug-induced liver injury). The symptoms at admission included: fever (the highest body temperature before admission), cough, dyspnea, headache, dizziness, muscle pain, fatigue, and gastrointestinal symptoms. The laboratory data included: white blood cell count, neutrophil count, lymphocyte count, neutrophil/lymphocyte ratio, platelet count, hemoglobin, D-dimer, albumin, total bilirubin, direct bilirubin, creatine kinase, creatine kinase isoenzyme, lactate dehydrogenase, myoglobin, urea nitrogen, creatinine, blood glucose, C-reactive protein, and procalcitonin. As for the imaging data, a semi-quantitative scoring system was used to evaluate the score of each affected lobe as follows: 0, no involvement; 1, less than 5%; 2, 5–25%; 3, 26–49%; 4, 50–75%; 5, more than 75%; then, the scores were summed to obtain the total pulmonary involvement score [16, 17]. Two radiologists independently scored the image analysis according to the semi-quantitative score system, and then the average value was calculated and used.
Statistical methods and variable selection
The baseline data table presented continuous variables as median (IQR) and categorical variables as n (%). The Mann–Whitney U test, Chi-square test, or Fisher's exact test were used to compare the differences between the stable group and deterioration group when appropriate. Statistical analysis was performed using the R software (version 3.6.3, R Foundation). All the cases were enrolled in the variable selection and risk model development. All the required diseases information and variable values must be collected at admission. In case of outpatients, there must be no missing values. The L1-penalized LASSO regression was applied to reduce the data dimensionality to avoid potential collinearity and overfitting among variables. The best lambda value was selected in LASSO regression using tenfold cross-validation. Under the lambda compression (lambda.1se), the variables with small regression coefficients were directly compressed to 0 to eliminate the corresponding variables. Finally, only the most robust predictors were retained in the regression model. The LASSO regression was completed with the glmnet package .
Risk model development and internal validation
We started by analyzing the variables selected by LASSO regression through single-factor regression. According to the previously reported characteristic variables [4, 20], we followed the change-in-estimate (CIE) [21, 22] approach to simplify the complete model. CIE is a data-driven independent variable screening method. CIE removes variables from the multivariable regression model that contribute less than 0.1 (10%) to change in odds ratio (OR) of essential variables . A binary logistic regression model was established using the R package "caret"  and evaluated by the Hosmer–Lemeshow test. We performed internal validation with 1000 times of bootstrap resampling by counting R-squared (R2) and c-statistic for the evaluation and used nomogram and network calculator to visualize the predictive ability of DRM-COVID-19 using the DynNom  package.
Evaluating the risk model
The binary logistic regression prediction models were separately established for the incubation period, clinical (clinical symptoms and laboratory data) or CT score. The distinguishing ability between the above-mentioned prediction models and DRM-COVID-19 was compared by using the receiver operating characteristic (ROC) curve. The model sensitivity (true positive rate) and specificity (true negative rate) were evaluated in the ROC curve, and the area under the receiver operating characteristic curve (AUC) was calculated. Due to the imbalance in our data, the risk model was further evaluated by the metrics of accuracy, precision, recall, F1 score, and precision-recall curve (PRC). We wanted our risk model to avoid a missed diagnosis of deteriorating risk patients. Meanwhile, we wanted to improve the prediction precision while ensuring a good recall. Since the values of precision and recall could not be simultaneously high, the comprehensive evaluation index of F1 score was introduced. We used the R package "modEvA" to generate the curve of the four evaluation indicators and calculate the number of cases to construct the confusion matrix . We assessed the model accuracy by the logistic calibration curve, which visually demonstrated the consistency between the predicted and real results of DRM-COVID-19 (rms packets) . Meanwhile, the calibration curve included c-statistic (ROC), R2, and Brier score to evaluate the model performance. In order to assess the clinical usefulness of DRM-COVID-19, we used different decision thresholds (Pt 10–95%) to establish a decision curve evaluating the net benefit of DRM-COVID-19 based on the following equation :
The clinical impact curve shows the practical clinical value.
Secondary outcome analyses
The analysis of secondary outcomes included two aspects. First of all, the developed DRM-COVID-19 was used to calculate the predicted model of each case in the dataset, including stable and deterioration (severe, critical, and mortality) groups. The R package "ggstatsplot"  was used to visualize the data distribution, and the Kruskal–Wallis test was performed. Finally, the stable group was compared with other groups. Second, we applied univariate COX proportional hazard regression (Cox regression) to estimate successively the correlation between the predicted value of DRM-COVID-19 and exacerbation within 15 days, and the length of hospital stay. According to the optimal cutoff value of the predicted value, the patients were divided into the low-risk group (≤ 0.263) and high-risk group (> 0.263). The Kaplan–Meier method estimated the time-event curve and compared it with the bilateral log-rank test.
Patients' demographics and characteristics
The data of 239 COVID-19 patients were used to train the deterioration risk model. Among these, 62 patients (25.9%) reached the study compound endpoint, including 42/62 severe, 18/62 critical (admission to the intensive care unit (ICU), mechanical ventilation), and 2/62 eventually died; the remaining patients were discharged. The deterioration group had the following characteristics compared with the stable group: older age (median 54.8 vs. 41.5), slightly more men (32/62 vs. 30/62), fever (> 38.0, 66.1% vs. 37.3%), dyspnea (74.2% vs. 9.0%), cough (90.3% vs. 72.3%) and fatigue (54.1% vs. 27.1%), respectively.
Similarly, the patients in the deterioration group were more likely to have comorbidities compared with the stable group (61.3% vs. 22.6%), especially endocrine and metabolic diseases (29.0% vs. 6.78%), hypertension (30.6% vs. 8.47%), respiratory system diseases (12.9% vs. 4.52%) and coronary heart disease (11.3% vs. 2.82%) respectively. There was only one case of the malignant tumor. The study noted that the patients with a short incubation period also had a higher risk of deterioration than those in the stable group (5.0 days vs. 7.0 days, respectively). The hospitalization time of the deterioration group was longer than that of the stable group (median 17.5 days vs. 12.0 days). All the basic features are shown in Table 1.
The laboratory data showed significant differences in several blood tests between the deterioration and stable groups. For example, the deterioration group showed an increase in the following median compared with the stable group: neutrophilic-lymphocyte ratio (NLR, 5.30 vs. 3.02), myoglobin (91.2 vs. 55.0), creatine kinase (90.0 vs. 73.0), creatine kinase MB (14.0 vs. 12.0), blood urea nitrogen (4.77 vs. 3.90), lactate dehydrogenase (LDH, 255 vs. 203), D-dimer (0.42 vs. 0.25), C-reactive protein (CRP, 30.4 vs. 5.8), blood glucose (7.20 vs. 5.60), and CT score (10.5 vs. 3.0), as well as a decrease in the lymphocyte count (0.74 vs. 1.12), platelet count (151 vs. 202) and albumin (37.0 vs. 39.9), respectively (Table 2).
The 44 variables measured at admission (Tables 1, 2) were included in the LASSO regression (Additional file 1: Fig. S1). A total of 9 variables with non-zero coefficients were obtained, including the dyspnea, incubation period, number of comorbidities, age, lymphocyte count, D-dimer, CRP, blood glucose, and CT score. In the univariate regression analysis (Table 3), all the variables were independent risk factors for deterioration (P < 0.05). However, when all the variables were incorporated in the logistic regression model, dyspnea, incubation period, CRP, and CT score were correlated with the risk of deterioration (P < 0.05). Those might be related to the existence of confounding or intermediate variables. Then, according to CIE, to adjust the independent variable blood glucose, age, and lymphocyte count in the regression model, it was found that the OR value for dyspnea was less than 10%, while that for adjusting latency, complications, D-dimer, CRP and CT scores was more than 10% (Additional file 1: Table S1). Furthermore, combined with the clinical and published studies [6, 7], the factors of dyspnea, incubation period, number of comorbidities, D-dimer, CRP and CT score were selected as risk model variables.
Construction of deterioration risk score
In the multivariate analysis (Table 3), the logistic regression model identified the correlation between 6 variables and DRM-COVID-19. The factors of the incubation period (OR 0.83; 95% CI 0.68–0.99; P = 0.049) were negatively correlated, while the dyspnea (OR 4.89; 95% CI 1.53–15.80; P = 0.007), number of comorbidities (OR 1.76; 95% CI 1.03–3.05; P = 0.039), D-dimer (OR 7.05; 95% CI 1.35–45.7; P = 0.029), CRP (OR 1.06; 95% CI 1.02–1.1; P = 0.007) and CT score (OR 1.50; 95% CI 1.27–1.82; P < 0.001) were positively correlated. The model fits well (Hosmer–Lemeshow test, X2(8) = 7.0194, P = 0.53). A nomogram for DRM-COVID-19 containing the dyspnea, incubation period, number of comorbidities, D-dimer, CRP, and CT score was constructed (Fig. 2a). Furthermore, an online calculator based on the nomogram was developed, allowing the clinicians to automatically calculate the deterioration risk of COVID-19 patients (and 95% CI) (Fig. 2b), available online at (https://deterioration-risk-model-of-covid-19.shinyapps.io/DRMapp/).
Risk model evaluation and internal validation
We obtained the deterioration risk scores in four models based on DRM-COVID-19, incubation period, clinical, and CT scores. The ROC curves were plotted for the above-mentioned models (Fig. 3a), and the AUC values were 0.971 (95% CI: 0.949–0.992; specificity 0.938, sensitivity 0.935), 0.675 (95% CI: 0.593–0.757; specificity 0.751, sensitivity 0.548), 0.946 (95% CI: 0.917–0.975; specificity 0.876, sensitivity 0.887), and 0.896 (95% CI: 0.842–0.950; specificity 0.876, sensitivity 0.806), respectively. Therefore, the discriminative ability, sensitivity, and specificity of DRM-COVID-19 for patients with high-risk deterioration showed better values than other models. The optimal cut-off value of the ROC curve for DRM-COVID-19 was 0.263. The area under the precision-recall curve (AUCPR = 0.934) was provided by the PR curve, which showed the good classification performance of the model (Fig. 3b). The confusion matrix was constructed In the PR curve of DRM-COVID-19 (Fig. 3d). The accuracy, precision, recall, and F1 score values were 0.933, 0.829, 0.935, and 0.879, respectively (Fig. 3c). These results indicate that DRM-COVID-19 achieved positive performance in the deterioration and stable groups. Meanwhile, we also noted that the recall was higher than the precision, meeting the requirement of low misjudgment risk. As shown in Fig. 4, the model had good calibration for the COVID-19 deterioration prediction with no significant overestimation or underestimation (c-statistic = 0.971; R2 = 0.794). Measure precision by Brier score (0.051, 95% CI, 0.03–0.072) (Fig. 4a). The decision curve showed a good net benefit across the range of 0 to 100% (Fig. 4b). The clinical impact curve of DRM-COVID-19 among 1000 patients visually indicates the number of high-risk deteriorations (solid red line) versus the actual number of deterioration (dotted blue line, Fig. 4c). We performed internal validation of the logistic regression model and obtained the c-statistic 0.957 and R-squared 0.588 through bootstrap resampling for 1000 times. As a result, the prediction ability of the model is acceptable.
In the scatter chart (Fig. 4d), the Kruskal–Wallis test showed a statistical difference between the stable and deterioration groups (P < 0.001). Compared with the stable group, there was a statistical significance in the other three groups (P < 0.05). In contrast, the mortality group had a P value of 0.014, which may be related to the small number of cases. As the COVID-19 severity increased, the median predictive value of each clinical classification also showed an upward trend (stable, 0.02; severe, 0.87; critical, 0.99; mortality, 0.99), which suggests that the DRM-COVID-19 model has a predictive value for severity.
Univariate Cox regression analysis showed the optimal cut-off value of DRM-COVID-19 had an excellent predictive ability for deterioration within 15 days (HR 74.54 95% CI 26.81–207.2), and the Log-rank test showed significance P value < 0.0001 (Additional file 1: Fig. S2a). Schoenfeld Individual was calculated to test the proportional risk hypothesis test (P = 0.331) (Additional file 1: Fig. S2b). Similarly, the high-risk group spent more time in the hospital than the low-risk group (HR 20.68 95% CI 7.49–57.15, Log-rank test P < 0.0001, Additional file 1: Fig. S2c, d), and the case with a hospital stay of up to 50 days was a hemodialysis patient with a repeated nucleic acid test positive.
In this study, based on a COVID-19 multicenter retrospective cohort with 239 cases, we developed and internally validated a predictive model to help clinicians predict the deterioration risk of the patients upon admission, thus providing possible help for early triage and management of these patients. The internal verification indicated that the proposed DRM-COVID-19 model fits well. In the predictive model, the factors of dyspnea, incubation period, number of comorbidities, D-dimer, CRP and CT score were the most significant risk factors. These parameters are routinely measured for COVID-19 inpatients. At the same time, we prepared a network platform, which is convenient for the clinicians to operate.
Since the emergence of the COVID-19 pandemic, many predictive models concerning the diagnosis and prognosis have emerged [5, 9, 16, 29,30,31]. In the development of COVID-19 prediction models, artificial intelligence, including machine learning and deep learning, has been widely used to improve the accuracy and expansibility of the prediction models . Lasso regression and Logistic regression used in our research belong to the machine learning algorithms. The most important thing of a development regression model is to strike a balance between the influencing factors and the control bias to avoid over-fitting and under-fitting. Specifically, when datasets have few events, penalty regression is superior to standard regression and provides better prediction . In this study, we adopted Lasso regression (lambda.1se) to obtain the nine optimal variables due to the data with few events. Usually, the events per variables ratio should be 10 or more [33, 34]. Sixty-two cases in the cohort met the primary outcome of deterioration, so the variables of the DRM-COVID-19 should not exceed six. The conventional stepwise regression is passive to eliminate some variables through covariable coefficients, quickly leading to invalid estimation and predictive effects [21, 22]. The change-in-estimate (CIE) represents a standard method in epidemic disease studies. Therefore, to positively control confounders, we adopted the measure of combining CIE with the background knowledge of COVID-19  to select and adjust nine variables. Finally, we constructed predictive models including the dyspnea, incubation period, number of comorbidities, D-dimer, CRP, and CT score. This compound parameter screening method is an exciting attempt in our study, which achieved an excellent predictive performance (AUC 0.971, 95% CI 0.949–0.992). Moreover, the calibration curve showed high coherence between the predicted and actual deterioration probability. The clinical decision-making curve and clinical impact curve show that when DRM-COVID-19 is used to determine whether the patient has the risk of being hospitalized or not, better clinical benefits than "full deterioration" or "non-deterioration" will be obtained.
For unbalanced datasets, the ROC curve is considered to be deceptive to the interpretability and reliability of the model classification performance to a certain extent . Therefore, other evaluation methods are often introduced into machine learning . The obtained results of the PR curve (AUCPR 0.934), accuracy (0.933), precision (0.829), recall (0.935) and F1 score (0.879) showed a good classification performance of RDS-COVID-19. In our predicted model, recall is higher than precision to avoid missing the cases that may aggravate.
The variables commonly found in published literature, such as dyspnea, number of comorbidities, D-dimer, and CRP, were associated with clinical endpoints requiring mechanical ventilation, ICU admission, and mortality [5,6,7]. These variables were also included in our risk model. Since the COVID-19 pandemic, numerous diagnostic and prognostic models have emerged . Different prediction models have different prediction factors [5, 9]. Even with similar research purposes [9,10,11,12,13], the predictors might not be identical. Reasons might include different endpoints, different populations, or different study methods. In our study, the variables screened by LASSO regression included age and lymphocyte count, which were common risk factors in the COVID-19 prediction model , but were not included in our model, which may also be related to the reasons mentioned earlier.
We found that the longer the incubation period, the lower the risk of developing into severe or critical illness, following the previously published results . The main reason for the short incubation period is the more significant virus load in the body. Respiratory viruses induce the immune response through inflammatory mediators and cytokines, leading to clinical symptoms, thus determining the incubation period . Pneumonia caused by SARS-CoV-2 is also related to the immune response. The shorter incubation period was associated with more pulmonary exudate lesions , and a greater risk for aggravation or hospitalization. Therefore, the incubation period is an independent risk factor for the DRM-COVID-19 model.
Another prominent variable was the CT score. The CT score of the deterioration group (mean value ± SD, 10.8 ± 5.0) was significantly higher than the stable group (3.5 ± 2.5, P < 0.0001). The study of Francone and colleagues  showed that the CT score of the critical group (20.3 ± 3) and the severe group (17.4 ± 3.1) were significantly higher than those of the mild group (8.7 ± 4, P < 0.001). The CT score of the deterioration group in our study was lower than that of the above-mentioned severe group and slightly higher than that of the mild group. These differences may be related to the different detection times of lung CT. In our study, lung CT was detected at admission or before aggravation. Another study showed that the cut-off value of the CT score was 7, with good sensitivity (80.0%) and excellent specificity (82.8%) . Interestingly, the CT values of our deterioration group ranged from 7.25 to 14.0 in the quartile. The CT score can act as an independent risk factor for the deterioration of COVID-19.
In addition, the scatter diagram was prepared based on the DRM-COVID-19 predictive models of the stable (mild/moderate), severe, critical and mortal groups. The predictive model could distinguish the stable group well from the other groups (P < 0.05). With 0.25 as the distinguishing line on the violin chart, 93.5% of the deterioration cases could be distinguished, 92.9% of the severe cases could be accurately predicted, and 95% of the critical cases (including the mortality cases) could be identified. Interestingly, the cutoff value of the ROC curve was 0.263, with a sensitivity of 93.5% and a specificity of 93.8%. Therefore, when the DRM-COVID-19 network calculator is used for triage of hospitalized patients with a score greater than 0.263, we need aggressive treatment to prevent further deterioration. The high-risk and low-risk groups based on the optimal cutoff value also achieved excellent results in predicting the risk of deterioration within 15 days and the length of hospitalization.
This study has some limitations. First, the dataset included partial data collected from one province. Due to the relatively small number of cases, LASSO regression and binary logistics did not follow a training set and testing set to evaluate the generalization ability of the model. Therefore, we adopted a tenfold cross-validation method in LASSO regression and 1000 times bootstrap resampling internal verification in binary logistic regression. There was also not enough data as a validation set for external validation. Those would increase the risk of overfitting the model. Second, our prediction model is only based on Chinese data from the first wave of the epidemic, and the treatment and care of patients at that time were not homogeneous and standardized. It should be verified whether it applies to other countries or regions or currently the main variants cases (delta-variant or omicron-variant). Those are essential steps in verifying the generalization ability of the model. Third, due to the rapid control of the epidemic, we could not collect more data, especially positive data. Although good results were obtained in the analysis of unbalanced data, we were still cautious in the triage of the cases. Finally, the control of confounding factors and the elimination of intermediate variables still pose the risk of misjudgment.
In this study, we used CIE to screen variables based on the Lasso regression to avoid the risk of over-fitting for the prediction model due to the small sample size. We first developed a COVID-19 aggravation risk prediction model based on the incubation period, clinical, and chest images. The predicted value of DRM-COVID-19 can effectively predict the risk of deterioration within 15 days. The prediction model can triage each symptomatic COVID-19 patient and ensure the appropriate level of care according to the risk of deterioration, thus reducing deterioration rate, optimizing the medical resources and alleviating medical stress.
Availability of data and materials
All data relevant to this study are included in the article or uploaded as supplementary information. The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.
Cucinotta D, Vanelli M. WHO declares COVID-19 a pandemic. Acta Biomed. 2020;91(1):157–60. https://doi.org/10.23750/abm.v91i1.9397.
Viruses CSGO. The species severe acute respiratory syndrome-related coronavirus: classifying 2019-nCoV and naming it SARS-CoV-2. Nat Microbiol. 2020;5(4):536–44. https://doi.org/10.1038/s41564-020-0695-z.
Onder G, Rezza G, Brusaferro S. Case-fatality rate and characteristics of patients dying in relation to COVID-19 in Italy. JAMA. 2020;323(18):1775–6. https://doi.org/10.1001/jama.2020.4683.
Deng X, Yang J, Wang W, et al. Case fatality risk of the first pandemic wave of novel coronavirus disease 2019 (COVID-19) in China. Clin Infect Dis. 2021;73(1):e79–85. https://doi.org/10.1093/cid/ciaa578.
Liang W, Liang H, Ou L, et al. Development and validation of a clinical risk score to predict the occurrence of critical illness in hospitalized patients with COVID-19. JAMA Intern Med. 2020;180(8):1081–9. https://doi.org/10.1001/jamainternmed.2020.2033.
Petrilli CM, Jones SA, Yang J, et al. Factors associated with hospital admission and critical illness among 5279 people with coronavirus disease 2019 in New York City: prospective cohort study. BMJ. 2020;369:m1966. https://doi.org/10.1136/bmj.m1966.
Zhou F, Yu T, Du R, et al. Clinical course and risk factors for mortality of adult inpatients with COVID-19 in Wuhan, China: a retrospective cohort study. Lancet. 2020;395(10229):1054–62. https://doi.org/10.1016/S0140-6736(20)30566-3.
Sun Q, Qiu H, Huang M, Yang Y. Lower mortality of COVID-19 by early recognition and intervention: experience from Jiangsu Province. Ann Intensive Care. 2020;10(1):33. https://doi.org/10.1186/s13613-020-00650-2.
Gupta RK, Harrison EM, Ho A, et al. Development and validation of the ISARIC 4C deterioration model for adults hospitalised with COVID-19: a prospective cohort study. Lancet Respir Med. 2021;9(4):349–59. https://doi.org/10.1016/S2213-2600(20)30559-2.
Zeng Z, Wu C, Lin Z, et al. Development and validation of a simple-to-use nomogram to predict the deterioration and survival of patients with COVID-19. BMC Infect Dis. 2021;21(1):356. https://doi.org/10.1186/s12879-021-06065-z.
Vultaggio A, Vivarelli E, Virgili G, et al. Prompt predicting of early clinical deterioration of moderate-to-severe COVID-19 patients: usefulness of a combined score using IL-6 in a preliminary study. J Allergy Clin Immunol Pract. 2020;8(8):2575–81. https://doi.org/10.1016/j.jaip.2020.06.013.
Mauer E, Lee J, Choi J, et al. A predictive model of clinical deterioration among hospitalized COVID-19 patients by harnessing hospital course trajectories. J Biomed Inform. 2021;118:103794. https://doi.org/10.1016/j.jbi.2021.103794.
Francis NA, Stuart B, Knight M, Vancheeswaran R, Oliver C, Willcox M, Barlow A, Moore M. Predictors of clinical deterioration in patients with suspected COVID-19 managed in a “virtual hospital” setting: a cohort study. BMJ Open. 2021;11(3):e45356. https://doi.org/10.1136/bmjopen-2020-045356.
Huang S, Li J, Dai C, Tie Z, Xu J, Xiong X, Hao X, Wang Z, Lu C. Incubation Period of Coronavirus Disease 2019: new implications for intervention and control. Int J Environ Health Res. 2021. https://doi.org/10.1080/09603123.2021.1905781.
Leung C. The difference in the incubation period of 2019 novel coronavirus (SARS-CoV-2) infection between travelers to hubei and nontravelers: the need for a longer quarantine period. Infect Control Hosp Epidemiol. 2020;41(5):594–6. https://doi.org/10.1017/ice.2020.81.
Francone M, Iafrate F, Masci GM, et al. Chest CT score in COVID-19 patients: correlation with disease severity and short-term prognosis. Eur Radiol. 2020;30(12):6808–17. https://doi.org/10.1007/s00330-020-07033-y.
Li K, Wu J, Wu F, Guo D, Chen L, Fang Z, Li C. The clinical and chest CT features associated with severe and critical COVID-19 pneumonia. Investig Radiol. 2020;55(6):327–31. https://doi.org/10.1097/RLI.0000000000000672.
World Health Organization. COVID-19 clinical management: living guidance. 2021; 25 January. https://apps.who.int/iris/bitstream/handle/10665/338882/WHO-2019-nCoV-clinical-2021.1-chi.pdf.
Friedman J, Hastie T, Tibshirani R. Regularization paths for generalized linear models via coordinate descent. J Stat Softw. 2010;33(1):1–22.
Xu PP, Tian RH, Luo S, et al. Risk factors for adverse clinical outcomes with COVID-19 in China: a multicenter, retrospective, observational study. Theranostics. 2020;10(14):6372–83. https://doi.org/10.7150/thno.46833.
Greenland S, Pearce N. Statistical foundations for model-based adjustments. Annu Rev Public Health. 2015;36:89–108. https://doi.org/10.1146/annurev-publhealth-031914-122559.
Greenland S. Modeling and variable selection in epidemiologic analysis. Am J Public Health. 1989;79(3):340–9. https://doi.org/10.2105/ajph.79.3.340.
Kuhn M. Caret: Classification and regression training. R package version 6.0-86. 2020. https://CRAN.R-project.org/package=caret.
Jalali A, Roshan D, Alvarez-Iglesias A, Newell J. Visualising statistical models using dynamic nomograms. R package version 5.0.1. 2019.
Márcia Barbosa A, Real R, Muñoz AR, Brown JA. New measures for assessing model equilibrium and prediction mismatch in species distribution models. Divers Distrib. 2013;19(10):1333–8. https://doi.org/10.1111/ddi.12100.
Harrell F. Rms: regression modeling strategies. R package version 6.0-1. 2020. https://CRAN.R-project.org/package=rms.
Vickers AJ, Elkin EB. Decision curve analysis: a novel method for evaluating prediction models. Med Decis Making. 2006;26(6):565–74. https://doi.org/10.1177/0272989X06295361.
Patil I. Visualizations with statistical details: the “Ggstatsplot” approach. J Open Source Softw. 2021;61(6):3167. https://doi.org/10.21105/joss.03167.
Abdulaal A, Patel A, Charani E, Denny S, Mughal N, Moore L. Prognostic modeling of COVID-19 using artificial intelligence in the United Kingdom: model development and validation. J Med Internet Res. 2020;22(8):e20259. https://doi.org/10.2196/20259.
Ko H, Chung H, Kang WS, et al. An artificial intelligence model to predict the mortality of COVID-19 patients at hospital admission time using routine blood samples: development and validation of an ensemble model. J Med Internet Res. 2020;22(12):e25442. https://doi.org/10.2196/25442.
Krysko O, Kondakova E, Vershinina O, et al. Artificial intelligence predicts severity of COVID-19 based on correlation of exaggerated monocyte activation, excessive organ damage and hyperinflammatory syndrome: a prospective clinical study. Front Immunol. 2021;12:715072. https://doi.org/10.3389/fimmu.2021.715072.
Abdulaal A, Patel A, Al-Hindawi A, Charani E, Alqahtani SA, Davies GW, Mughal N, Moore L. Clinical utility and functionality of an artificial intelligence-based app to predict mortality in COVID-19: mixed methods analysis. JMIR Form Res. 2021;5(7):e27992. https://doi.org/10.2196/27992.
Pavlou M, Ambler G, Seaman SR, Guttmann O, Elliott P, King M, Omar RZ. How to develop a more accurate risk prediction model when there are few events. BMJ. 2015;351:h3868. https://doi.org/10.1136/bmj.h3868.
Peduzzi P, Concato J, Kemper E, Holford TR, Feinstein AR. A simulation study of the number of events per variable in logistic regression analysis. J Clin Epidemiol. 1996;49(12):1373–9. https://doi.org/10.1016/s0895-4356(96)00236-3.
Saito T, Rehmsmeier M. The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets. PLoS ONE. 2015;10(3):e118432. https://doi.org/10.1371/journal.pone.0118432.
Wynants L, Van Calster B, Collins GS, et al. Prediction models for diagnosis and prognosis of Covid-19 infection: systematic review and critical appraisal. BMJ. 2020;369:m1328. https://doi.org/10.1136/bmj.m1328.
Lai C, Yu R, Wang M, et al. Shorter incubation period is associated with severe disease progression in patients with COVID-19. VIRULENCE. 2020;11(1):1443–52. https://doi.org/10.1080/21505594.2020.1836894.
Hermesh T, Moltedo B, López CB, Moran TM. Buying time-the immune system determinants of the incubation period to respiratory viruses. Viruses. 2010;2(11):2541–58. https://doi.org/10.3390/v2112541.
The authors would like to express their gratitude to EditSprings (https://www.editsprings.com/) for the expert linguistic services provided. The authors thank Prof. Yuechun, Hu and Prof. Wen, Mao from the Radiology Department of Loudi Central Hospital for their help in lung CT score and all the front line medical workers.
HP is supported by the Special Topic Project for Pneumonia Epidemic Infected by New Coronavirus in Loudi City (Grant Lou Caijiao  No. 43), and CH is supported by the COVID-19 emergency in Xiangtan City (Grant SFYB20201006).
Ethics approval and consent to participate
All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki declaration and its later amendments or comparable ethical standard. The ethics committee of the Loudi central hospital approved this study. Meanwhile, The ethics committee of the Loudi central hospital waived the need for informed consent because of the observational nature of the study, the use of anonymized data and new infectious diseases.
Consent for publication
The authors declare no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Peng, H., Hu, C., Deng, W. et al. Incubation period, clinical and lung CT features for early prediction of COVID-19 deterioration: development and internal verification of a risk model. BMC Pulm Med 22, 188 (2022). https://doi.org/10.1186/s12890-022-01986-0
- Prediction model
- Incubation period
- Semi-quantitative CT score
- Change-in-estimate (CIE)