Subjects selection
From April 2019 to August 2020, COPD patients who met the inclusion and exclusion criteria in respiratory department of 13 general hospitals in Ningxia, China were investigated. Patients were followed up from May 2019 to September 2020. Ethics approval for the data collection and the use of clinical data in the study were obtained from the Ethics Committee of the General Hospital of Ningxia Medical University (2020-643), and all subjects signed informed consent. 1200 patients were enrolled. Inclusion criteria: (1) it meets the GOLD (2021) diagnostic criteria for COPD: that is, it has the following characteristics: ① Previous diagnosis of COPD; ② Smoking more than 20 cigarettes a day for more than 15 years; ③Have a long-term history of exposure to a large number of biofuels or occupational dust in an enclosed space; ④ Symptoms, chronic course and progressive aggravation above 40 years old; ⑤ Chronic cough or expectoration, and dyspnea after gradual activity; ⑥ Symptoms persist, with less daytime variation. After excluding the possibility of pulmonary tuberculosis, the peak expiratory flow rate (PEFR) was measured. The PEFR of 2 receptor agonist salbutamol 2 was improved by less than 20% after repeated determination 15 min after spray; FEV1/FVC0.7; (2) COPD atients with stable condition: the patient has stable or mild cough, expectoration, shortness of breath and other symptoms; (3) Age ≥ 60 years old. According to the following exclusion criteria, a total of 142 patients were excluded: (1) errors or missing items exceeding 20% and completely identical questionnaires were 83 cases; (2) 16 cases with audio-visual impairment and unable to communicate; (3) 21 cases requested to withdraw from the study; (4) During the study, 22 cases could not be contacted due to change of contact information or other reasons. A total of 1058 patients were included (Fig. 1).
Research tools
General data questionnaire: designed by researchers according to the purpose and content of the research, through literature review and preliminary investigation, including age, sex, marital status, education level, smoking status, comorbidity, hospitalization times of acute exacerbation of COPD in the past 1 year, etc.
Modified Medical Research Council Dyspnea Scale (mMRC) are used to assess the severity of dyspnea in COPD patients, and verified to have good testing characteristics in Chinese population, it is divided into five levels. Level 0: difficulty breathing only during strenuous activity; Level 1: shortness of breath when walking briskly on flat ground or walking on a hill; Level 2: walking on flat ground slower than peers or needing to stop to rest due to shortness of breath; Level 3: walk on flat ground for about 100 m or need to stop for breath after a few minutes; Level 4: unable to leave the house because of severe breathing difficulties, or having difficulty breathing when putting on or undressing [13].
Activities of Daily Living (ADL) are used to evaluate patients’ ability of Daily Living Activities. It was first published in 1965 by Dorothy Barthel and Florence Mahone, and it was verified to have good test characteristics among Chinese people [14] includes 10 items. The total score is 100 points, and the evaluation standard is: 81~100 points, life completely self-care. 61~80 points, mild dysfunction, able to complete daily activities independently; 41~60 points, moderate dysfunction, need help in life ≤ 40 points, severely dysfunctional or totally dependent, most daily activities cannot be completed or require human care [15].
The Geriatric Depression Scale (GDS) is a measure of Depression in the elderly over the last 1 week. Prepared by Brink Equal in 1982, proved to have good test characteristics in The Chinese population [16] includes 30 entries. The total score is 30 and the assessment standard is: 0~10 is normal, that is no depression; 11 to 20 are classified as likely to have depressive symptoms; Between 21 and 30 is classified as depression [17].
The Mini Nutritional Assessment-Short Form (MNA-SF) is designed to evaluate Nutritional status. In 2001, it was proposed by Rubenstein et al. On the basis of Mini Nutritional Assessment and verified to have good testing characteristics in Chinese population [18] includes 8 questions. With a total score of 14, 12 to 14 is classified as normal nutritional status, 8 to 11 as at risk of malnutrition and 0 to 7 as undernourished [19].
COPD assessment test (CAT) scale used to assess the severity of COPD. Proposed by Jones on the basis of the St George Respiratory Questionnaire (SGRQ) in 2009, it has proven to have good test characteristics in a Chinese population [20] Includes 8 questions. The total score is 40 points, with the total score < 10 indicating mild illness, 10 < total score ≤ 20 indicating moderate illness, 20 < total score ≤ 30 indicating serious illness, and > 30 indicating very serious illness [21].
Data collection methods
The researchers conducted a preliminary survey of 48 elderly patients with COPD and improved the questionnaire. The investigator of each hospital shall be trained uniformly (Investigators having worked in respiratory medicine department for 5 y or more, having bachelor’s degree and above with the qualification certificate of supervisor nurse, having questionnaire investigation experience), the training contents shall include the research purpose, research contents, questionnaire filling requirements, etc., trained investigator ask the patient about each item of the questionnaire and fill out each item according to the patient’s answers. Check if there is any missing item, take it back after checking. Patients were followed up to 30 days after discharge, including whether they were readmitted for acute exacerbation within 30 days after discharge, whether they took medication regularly and whether they took rehabilitation exercise, etc., and the readmission season of patients was recorded.
Construction and verification of risk prediction model
A total of 28 variables were included in the screening of indicators for the construction of risk prediction model. After single factor analysis, a total of 15 variables such as age, education level and smoking status entered the regression analysis. Finally, 8 meaningful factors of logistic regression analysis were included in the construction of the model. The research objects are divided into training set (n = 741) and test set (n = 317) according to the ratio of 7:3. Based on the influencing factors, four kernel function models of SVM, linear SVM, polynomial SVM, sigmoid SVM and RBF SVM, are constructed. The prediction efficiency of the four kernel functions is analyzed and compared through the precision, recall, accuracy, F1 index and the area under the ROC curve (AUC) respectively.
Data analysis
Epidata 3.1 was used for data entry, SPSS 25.0 and MATLAB R2020b were used for statistical analysis. Measurement data were described by median and quartile, while counting data were described by frequency and percentage. The measurement data of the two groups were compared by Mann–Whitney U test, the grading data were compared by Wilcoxon W test, and the counting data were compared by χ2 test. precision, recall, accuracy, F1 index and AUC evaluation model prediction ability. In all analyses, statistical significance was set at P < 0.05.