Skip to main content

Learning and depicting lobe-based radiomics feature for COPD Severity staging in low-dose CT images

Abstract

Background

Chronic obstructive pulmonary disease (COPD) is a prevalent and debilitating respiratory condition that imposes a significant healthcare burden worldwide. Accurate staging of COPD severity is crucial for patient management and treatment planning.

Methods

The retrospective study included 530 hospital patients. A lobe-based radiomics method was proposed to classify COPD severity using computed tomography (CT) images. First, we segmented the lung lobes with a convolutional neural network model. Secondly, the radiomic features of each lung lobe are extracted from CT images, the features of the five lung lobes are merged, and the selection of features is accomplished through the utilization of a variance threshold, t-Test, least absolute shrinkage and selection operator (LASSO). Finally, the COPD severity was classified by a support vector machine (SVM) classifier.

Results

104 features were selected for staging COPD according to the Global initiative for chronic Obstructive Lung Disease (GOLD). The SVM classifier showed remarkable performance with an accuracy of 0.63. Moreover, an additional set of 132 features were selected to distinguish between milder (GOLD I + GOLD II) and more severe instances (GOLD III + GOLD IV) of COPD. The accuracy for SVM stood at 0.87.

Conclusions

The proposed method proved that the novel lobe-based radiomics method can significantly contribute to the refinement of COPD severity staging. By combining radiomic features from each lung lobe, it can obtain a more comprehensive and rich set of features and better capture the CT radiomic features of the lung than simply observing the lung as a whole.

Peer Review reports

Background

Chronic obstructive pulmonary disease (COPD) is a global public health challenge due to its widespread prevalence and its lasting impact on disability and mortality [1, 2]. COPD is currently ranked as the fourth most prevalent global health concern, marked by persistent airflow limitation and a range of debilitating symptoms [3].

Pulmonary function assessments, specifically the measurement of the ratio of forced expiratory volume in one second (FEV1) to forced vital capacity (FVC), serve as a primary diagnostic and risk assessment tool for COPD [4]. However, early COPD patients are easy to be ignored because of asymptomatic and mild symptoms [5, 6]. Most patients are often diagnosed with moderate to severe, which seriously affects the quality of life, and the cost of treatment has risen sharply [7]. Consequently, early identification and staging are important to reduce the risk of exacerbations, fewer concurrent health issues, and decreased healthcare expenses [8].

COPD is a multifaceted and remarkably diverse clinical condition with various imaging phenotypes and histopathological characteristics including encompassing parenchymal degradation, thickening of bronchial walls, interstitial lung abnormalities, bronchiectasis, and so on [9]. Computed tomography (CT)-based research in the field of COPD has yielded remarkable outcomes which has proven to be a powerful tool in studying COPD by providing detailed, three-dimensional images of the lungs [10,11,12]. CT imaging provides crucial insights into lung function, disease severity categorization, and the prediction of outcomes for individuals with COPD by examining typical CT features like lung tissue, airways, pulmonary blood vessels, and the chest wall. It has paved the way for more accurate diagnosis, personalized treatment strategies, and the development of innovative therapies, ultimately improving the lives of individuals living with COPD.

Radiomics was proposed by Lambin et al. in 2012 [13]. It entails the extraction and examination of numerous quantitative features, including texture, statistical, histogram, and shape features. [14]. Radiomics features in lung disease imaging have been considered cutting-edge tools for healthcare professionals [15]. Nevertheless, the evolution of radiomics features in COPD has been comparatively slower compared to other lung conditions like lung cancer and pulmonary nodules. As of 2020, Refaee et al. noted that there had been limited exploration of radiomics features in COPD [16]. Nevertheless, there are promising prospects for employing radiomics features in COPD diagnosis, treatment, and monitoring, as well as directions for future research [16]. Additionally, the significance of lung radiomics features in evaluating COPD has been substantiated [17].

Previous studies in radiomics for pulmonary conditions, specifically COPD, have primarily focused on analyzing radiomics features derived from the entire lung region. This study, however, overlooks the heterogeneity of COPD manifestations across different lung lobes. The severity of COPD and the distribution of lesions are known to vary considerably within the lung, often localized to specific lobes rather than uniformly affecting the entire lung region. Our study addresses this gap by proposing a lobe-specific radiomic analysis, allowing for a granular investigation into the distinct structural and functional characteristics of each lobe. The justification for the proposed method lies in the premise that individual lung lobes may contribute disparately to the pathology of COPD. Lesions within different lobes can exhibit unique radiomic signatures, which, when analyzed independently, provide more precise insights into the localized nature of the disease. This lobe-specific analysis enables a more accurate understanding of disease distribution and severity. By integrating radiomic features from each lobe, the proposed method seeks to offer a more nuanced understanding of COPD, pinpointing which lobes are most susceptible to the disease and the extent of their involvement.

Machine learning focuses on the development of algorithms and computer models. It learns from data and makes predictions without being explicitly programmed [18]. Statistical techniques are also involved to enable computers to automatically improve their performance on a specific task [19], for example, classification and staging of COPD [20, 21]. Yang et al. proposed to characterize and classify COPD stages based on multi-layer perceptron [22]. Makimoto and colleagues conducted a comparative analysis of various feature selection and classification methods, ultimately demonstrating that the combination of Elastic Net with a Linear-SVM classifier outperforms others for identifying COPD [23]. Puchakayala et al. demonstrated that radiomics features, particularly parenchymal texture, and shape features of the lung and airway, could accurately diagnose COPD in both standard-dose and low-dose CT images [24]. Support vector machine (SVM) is a supervised machine learning algorithm used for classification and regression tasks. It is particularly well-suited for classification problems by finding the optimal hyperplane. SVM is widely used in various fields which can be applied in diagnosing COPD and classifying its severity [25, 26].

We propose a novel lobe-based radiomic workflow and train a machine learning model for COPD severity staging. More specifically, five pulmonary lobes are segmented from lung CT images and radiomics features of each lobe are extracted, integrated, and dimension reduced. Then, an SVM classifier is employed to classify different severity of COPD, which has excellent performance. By combining radiomic features from each lung lobe, we can obtain a more comprehensive and rich set of features. This helps improve the performance of predictive models.

The contributions of the paper are as follows:

  • Unlike the current radiomic features extracted from the whole lung region, this study introduces a novel approach by combining radiomics features from each lung lobe.

  • The proposed workflow is effective in distinguishing COPD severity, exhibiting strong performance in both binary and multi-class classification tasks.

  • By analyzing the contribution of each lung lobe separately, the study provides empirical evidence that COPD’s impact is localized, which enhances the understanding of the disease’s heterogeneity.

Methods

Datasets

The study received approval from the hospital’s ethics committee (Reference: ES-2023-045-01), and all participants gave informed consent in accordance with the Declaration of Helsinki (2000). COPD is diagnosed by evaluating the post-bronchodilator ratio of forced expiratory volume in one second (FEV1) to forced vital capacity (FVC), which is below 0.7 and further categorized into four stages based on the Global Initiative for Chronic Obstructive Lung Disease (GOLD) criteria: GOLD I (mild, FEV1 ≥ 80% predicted), GOLD II (moderate, FEV1 between 50% and < 80% predicted), GOLD III (severe, FEV1 between 30% and < 50% predicted), and GOLD IV (very severe, FEV1 < 30% predicted) [27].

Dataset 1, as a training dataset, collected 530 patients from the First Affiliated Hospital of Guangzhou Medical University. The number of patients in each stage is 114 in GOLD I, 204 in GOLD II, 154 in GOLD III, and 58 in GOLD IV. The excluded criteria are acute exacerbation of COPD or respiratory infection 4 weeks ago, concomitant pulmonary disease, pulmonary resection, and active malignancy within the past 5 years. Furthermore, Table 1 presents the clinical characteristics and parameter configurations for CT image acquisition.

Table 1 Baseline characteristics of dataset 1

Dataset 2 is from the Central Hospital Affiliated to Shenyang Medical College (CH-SMC) and the Second Hospital of Dalian Medical University (SH-DLMU) as an external validation dataset. Dataset 2 consists of 290 patients: 25 GOLD I, 69 GOLD II, 116 GOLD III, 70 GOLD IV. A summary of the stages of COPD patients in Dataset 2 is shown in Table 2.

Table 2 Summary of stages of COPD patients in the dataset 2

Overview of the study procedure

The pipeline of the proposed lobe-based method for this study is depicted in Fig. 1. First, five lobes are segmented from the lung CT images by the trained Seg-Lobe model, including the right upper (RU), right middle (RM), and right lower (RL) lobe in the right lung, left upper (LU) and the left lower (LL) lobe in the left lung. Second, radiomics features are extracted from regions of interest (ROI) in the lung (the region of five segmented lobes), respectively. And the radiomics features from each lobe are combined. Third, the integrated features are selected using the LASSO algorithm. Finally, an SVM classifier is built to classify the severity staging of COPD.

Fig. 1
figure 1

The lobe-based radiomics workflow for COPD stage classification

Preprocessing

The segmentation of lung lobes is performed by using the Seg-Lobe model, which is based on 3D U-Net [28] and implemented as the automatic pipeline to train, validate, and test the network using the input CT images [29, 30]. The details are provided in our previous work [27]. The Seg-Lobe architecture, illustrated in Fig. 2, comprises five encoder-decoder pairs. Each encoder or decoder is constructed with two blocks, each consisting of a sequence of operations: convolution, instance normalization (IN) [31], and Leaky Rectified Linear Unit (Leaky ReLU). Although Batch Normalization (BN) [32] is commonly used to enhance training speed and stability, its effectiveness diminishes when applied to small batch sizes.

In Seg-Lobe model architecture, strided convolution is performed instead of an ordinary pooling operation in downsampling, which serves to enhance the accuracy of the convolutional neural network while concurrently reducing the model’s size [33, 34]. For upsampling, transposed convolution is employed. Furthermore, a skip connection [35] is implemented between the encoder and decoder to establish a connection between feature maps. This connection allows the decoder to access the information concealed within the encoder, facilitating more effective information flow.

Fig. 2
figure 2

The architecture of the Seg-Lobe

Feature extraction

Pyradiomics [36] is employed for feature extraction from the region of interest in CT images. It is a crucial and advanced Python package in the field of medical image analysis. The extracted features can be divided into seven groups: (a) first order features, (b) shape features, (c) gray level co-occurrence matrix (GLCM), (d) gray level size zone matrix (GLSZM), (e) gray level run length matrix (GLRLM), (f) neighboring gray Tone difference matrix (NGTDM), and (g) gray level dependence matrix (GLDM). For comprehensive definitions and detailed explanations of these texture features, the Pyradiomics official documents should be consulted [37].

Feature selection

Feature selection helps improve the performance of predictive models by identifying and retaining the most relevant and informative features, mitigates the risk of overfitting and reduces dimensionality by eliminating irrelevant or redundant variables. This study employs three methods for the gradual selection of optimal features.

In the feature selection of the four categories of COPD severity, the variance threshold is first used to identify and retain features that exhibit significant variation across the dataset. Features with low variance, implying minimal variation across data points, are often considered less informative. In this experiment, we set the threshold to 1 and features failing to meet this threshold are subsequently discarded. In the feature selection of the two categories of COPD severity, COPD groups are classified according to GOLD stage in groups of mild severity (GOLD I + GOLD II, FEV1 < 50%) and great severity (GOLD III + GOLD IV, FEV1 ≥ 50%). The T-test is used to evaluate whether there is a significant difference between the two categories. Features exhibiting a significant difference (p < 0.05) are retained as they are considered discriminatory.

Finally, the least absolute shrinkage and selection operator (LASSO) method was utilized for the identification of the ultimate discriminative features [38]. LASSO incorporates a regularization term into the linear regression objective function. This method can reduce the coefficients of variables with minimal impact on the regression to zero during the fitting process, thereby achieving variable screening and complexity adjustment [39, 40]. LASSO involves a tuning parameter responsible for controlling the penalty applied to the linear model. This parameter is designed to maintain a minimal penalty while deriving a model with a reduced set of features. In this context, the penalty is expressed through the mean square error (MSE). The optimization objective of LASSO is as follows:

$$\begin{array}{c}y=\left(\frac{1}{2*{n}_{samples}}\right)*{\left|\left|\gamma -X\omega \right|\right|}^{2}+\alpha *\left|\left|\omega \right|\right|\end{array}$$
(1)

In the equation, X represents the matrix of radiomics features, \(\gamma\) is the sample vector marker, n denotes the sample number, \(\omega\) is the coefficient vector of the regression model, and \(\alpha *\left|\left|\omega \right|\right|\) represents the LASSO penalty term.

Machine-learning classification model

In this study, SVM is used to establish four-category and two-category classification models. SVM, a non-probabilistic supervised learning method, excels at forming multi-dimensional hyperplanes that efficiently separate the covariate space into distinct groups for classification. To evaluate the reliability of our proposed approach, we implemented a 5-fold cross-validation procedure on the complete dataset. More precisely, we randomly divided the data into five groups, using each set of four groups for training and the remaining one for testing.

Statistical analysis and experimental setup

The classification performance is evaluated by the accuracy (ACC), precision, recall, F1-score, receiver operating characteristic (ROC) curve, and area under the curve (AUC).

$$\begin{array}{c}Accuracy= \frac{{\sum }_{i=1}^{4}T{P}_{i}}{{\sum }_{i=1}^{4}{TP}_{i}+F{P}_{i}+F{N}_{i}+T{N}_{i}}\end{array}$$
(2)
$$\begin{array}{c}Precision=\frac{T{P}_{i}}{T{P}_{i}+F{P}_{i}}, i=1, 2, 3, 4.\end{array}$$
(3)
$$\begin{array}{c}Recall=\frac{T{P}_{i}}{T{P}_{i}+F{N}_{i}}, i=1, 2, 3, 4.\end{array}$$
(4)
$$\begin{array}{c}F1-score=2*\frac{\text{P}\text{r}\text{e}\text{c}\text{i}\text{s}\text{i}\text{o}\text{n}\text{*}\text{R}\text{e}\text{c}\text{a}\text{l}\text{l}}{\text{P}\text{r}\text{e}\text{c}\text{i}\text{s}\text{i}\text{o}\text{n}+\text{R}\text{e}\text{c}\text{a}\text{l}\text{l}}, i=1, 2, 3, 4.\end{array}$$
(5)

As described in our previous study [41], let \(T=({T}_{1},{T}_{2},{T}_{3},{T}_{4},)\), i.e. represent the target labels in the test set. The predicted classes in the test set, denoted as \(P=\left({P}_{1},{P}_{2},{P}_{3},{P}_{4},\right)\), are determined. True positive (TP) is defined as the count of predicted labels matching the target labels, i.e. \(T{P}_{i}=\left|{P}_{i}\cap {T}_{i}\right|, i=1, 2, 3, 4\). False positive (FP) represents the count of predicted labels that do not match the actual target labels, i.e. \(F{P}_{i}=\left|{P}_{i}\backslash {T}_{i}\right|, i=1, 2, 3, 4\). False negative (FN) is the count of predictions that belong to the ground-truth label but are falsely predicted, i.e. \(F{N}_{i}=\left|{T}_{i}\backslash {P}_{i}\right|, i=1, 2, 3, 4\). True negative (TN) is the count of predictions that neither belong to the ground-truth label nor are classified, i.e. \(T{N}_{i}=\sum _{d?C,d\ne c}|{T}_{d}|\).For the classification of two categories of COPD severity, the evaluation index is calculated similarly to the above process, with the adjustment i = 1, 2.

The ROC curve illustrates the association between false positive rate (FPR) on the x-axis and true positive rate (TPR) on the y-axis [42]. The area under the curve (AUC) is computed from the ROC curve and serves as a metric reflecting the performance of a classifier. A higher AUC value, nearing 1.0, indicates a more effective classifier [43].

In this study, the machine learning models, such as SVM, KNN, and Decision Tree, were primarily implemented using the scikit-learn library. For the implementation of more advanced ensemble methods, specifically gradient boosting techniques, we employed the XGBClassifier from the XGBoost package and the CatBoostClassifier from the CatBoost package. Both of these classifiers were utilized with their default parameter settings. The use of default parameters also facilitates more straightforward comparisons between the different models and highlights the out-of-the-box capabilities of each algorithm.

Results

Radiomic feature selection in four and two categories of severity staging of COPD

In the lobe-combined experiment, 8840 radiomic features were extracted from a total of five lung lobes. Feature dimensionality reduction was carried out for two and four categories in the severity staging of COPD.

In the four categories of COPD severity, 1370 features were selected by a variance threshold method. In Fig. 3(a), the MSE in LASSO is depicted concerning Lambda, while Fig. 3(b) illustrates the change in each feature coefficient corresponding to Lambda. As shown by the dotted line in Fig. 3(a), MSE attains its minimum value, resulting in the reduction of features to 104 through the LASSO algorithm.

Fig. 3
figure 3

Changes in Mean Squared Error (MSE) and the coefficients of individual features concerning Lambda in the LASSO. (a) The MSE for the four categories. (b) The coefficient of each feature for the four categories. (c) The MSE for the two categories. (d) The coefficient of each feature for the two categories

Fig. 4
figure 4

Optimal coefficients of features within LASSO for the staging of COPD severity. (a) The four categories. (b) The two categories

From the initial pool of 8840 features, 4764 features were chosen using the t-test method in the two categories. In Fig. 3(c), the MSE is depicted throughout the Lambda parameter optimization in the LASSO algorithm. Figure 3(d) illustrates that the LASSO algorithm selected 132 optimal features when the MSE reached its minimum, as denoted by the dotted line. Figure 4(a) presents the coefficients of the selected features in the LASSO model for the four categories, while Fig. 4(b) shows the coefficients of the selected features in the LASSO model for the two categories of COPD severity staging.

Performance comparisons in four and two categories of severity staging of COPD with different feature selection methods

Table 3 presents the outcomes of various radiomics experiments applied to stage the severity of COPD. The SVM classifier is trained for both four and two categories of COPD severity staging, utilizing selected radiomics features from three distinct ROIs: the entire lung region, each lobe individually, and a combination of all five lobes.

We present the performance evaluation of lobe-based radiomics methods compared to other-region radiomics methods for staging COPD severity in Dataset 1. The task is categorized into four categories representing GOLD I vs. GOLD II vs. GOLD III vs. GOLD IV, and two categories representing GOLD I + GOLD II vs. GOLD III + GOLD IV. For the four-category classification task, the whole lung method achieves an accuracy of 0.49, with precision, recall, and F1-score all around 0.45. Among the lobe-based methods, RM (Right Middle) exhibits the highest performance metrics, with accuracy, precision, recall, and F1-score ranging from 0.55 to 0.67. Notably, our proposed method outperforms both the whole lung and lobe-based methods, achieving an accuracy of 0.63 and precision, recall, and F1-score all above 0.60, albeit with a slightly lower AUC of 0.49.

In the two-category classification task, the performance of all methods notably improves, with the proposed method consistently outperforming others. Specifically, the proposed method achieves an accuracy of 0.87, with precision, recall, and F1-score all at 0.87, and an AUC of 0.93, indicating its robustness and effectiveness in distinguishing between less severe and more severe COPD stages.

Overall, the results demonstrate the superiority of our proposed lobe-based radiomics method over other-region radiomics methods in accurately staging COPD severity, particularly evident in the two-category classification task, where it achieves the highest performance metrics across all evaluated criteria. The evaluation of two COPD severity classifications, based on selected radiomic features from all five lung lobes and analyzed using SVM, is depicted in Fig. 5(b) through ROC curve analysis.

The above experiments demonstrate that regardless of whether COPD severity is classified into four or two categories, the results of the radiomics model using combined features of five lung lobes are superior to that of using radiomic features directly from the whole lung region and each lung lobe. Overall, the results demonstrate the superiority of our proposed lobe-based radiomics method over other-region radiomics methods in accurately staging COPD severity, particularly evident in the two-category classification task, where it achieves the highest performance metrics across all evaluated criteria.

Table 3 Performance of lobe-based radiomics method compared with other-region radiomics method for COPD severity staging in dataset 1. Four categories represent GOLD I vs. GOLD II vs. GOLD III vs. GOLD IV and two categories represent GOLD I + GOLD II vs. GOLD III + vs. GOLD IV on Dataset1
Fig. 5
figure 5

The ROC curves using SVM for COPD severity staging based on a combination of five lobes radiomic features. (a) Four categories; (b) Two categories

In Table 4, we present the performance comparison between the LASSO and PCA methods for COPD severity staging on Dataset 1, categorized into four categories (representing GOLD I vs. GOLD II vs. GOLD III vs. GOLD IV) and two categories (representing GOLD I + GOLD II vs. GOLD III + GOLD IV).

For the four-category classification task, the PCA method achieves an accuracy of 0.45, with precision, recall, and F1-score all around 0.41. In contrast, the LASSO method outperforms PCA significantly, with an accuracy of 0.63 and precision, recall, and F1-score all above 0.60.

In the two-category classification task, both PCA and LASSO methods demonstrate improved performance compared to the four-category classification. However, the LASSO method continues to exhibit superior performance, achieving an accuracy of 0.87 and precision, recall, and F1-score all at 0.87, whereas PCA achieves an accuracy of 0.65 with slightly lower precision, recall, and F1-score.

Overall, the results suggest that the LASSO method outperforms PCA in accurately staging COPD severity, particularly evident in the higher accuracy, precision, recall, and F1-score achieved across both classification tasks. This underscores the effectiveness of LASSO as a feature selection method for radiomics-based COPD severity staging on Dataset 1.

Table 4 Performance of LASSO compared with PCA method for COPD severity staging on Dataset1

Performance comparisons in four and two categories of severity staging of COPD with different machine learning methods

In Table 5, we present a comprehensive evaluation of various machine learning classifiers for COPD severity staging on Dataset 1 (Training dataset). The task encompasses both four-category (GOLD I vs. GOLD II vs. GOLD III vs. GOLD IV) and two-category (GOLD I + GOLD II vs. GOLD III + GOLD IV) classification scenarios. For the four-category classification, classifiers including KNN, Decision Tree, AdaBoost, Gradient Boosting, XGBoost, Random Forest, CatBoost, and our proposed lobe-based radiomics method are examined. In the task of four-category classification, the proposed approach demonstrates superior performance compared to other classifiers, attaining an accuracy of 0.63, precision of 0.71, recall of 0.63, and F1 score of 0.62. The training time of the five classifiers is also provided in Table 5.

In the two-category classification task, the proposed method again shows superior performance, achieving perfect scores of 0.87 across all metrics. The runner-up, Gradient Boosting and CatBoost, scores 0.84 across all metrics.

Although the accuracy of our proposed method is slightly lower than XGBoost and CatBoost in the four-way classification task of COPD severity, it is still competitive, but it has a clear advantage in the two-way classification of COPD severity. These results demonstrate the effectiveness of the proposed method in staging COPD severity, providing strong evidence for its potential use in clinical settings.

Table 5 Performance of proposed lobe-based radiomics method compared with other machine learning methods for COPD severity staging on Dataset1

Performance of testing on dataset 2

We extended the application of our methodology to an external dataset, Dataset 2 was tested on the previously trained model. Utilizing an SVM classifier, we categorized the features—post feature selection—into two categories of COPD severity.

As shown in Table 6, for the four-category classification task, the proposed method achieves an accuracy of 0.50, precision of 0.47, recall of 0.50, F1-score of 0.45, and AUC of 0.48. In the two-category classification task, the proposed method demonstrates improved performance, achieving an accuracy of 0.81, precision of 0.81, recall of 0.81, F1-score of 0.81, and AUC of 0.88. These results suggest that the proposed method performs well in distinguishing between less severe (GOLD I and II) and more severe (GOLD III and IV) COPD stages.

It is important to note that the training was conducted on Dataset 1, consisting of low-dose lung CT scans, whereas Dataset 2, employed for external validation, contained standard-dose CT images. This difference in dosing may account for the marginally lower performance observed on Dataset 2.

Table 6 The performance of the proposed method for COPD severity staging on dataset 2

Depicting the importance of each lobe radiomics feature in COPD staging

Fig. 6
figure 6

The feature species and the contribution of each lung lobe in feature selection of COPD severity. (a) Four categories; (b) Two categories

We analyze the importance of the radiomics feature in each lobe for COPD staging. Figure 6 shows the contribution of radiomics features of each lung lobe in the feature selection of four classifications of COPD severity. Seven types of features are collected, including first order features, shape features, GLCM, GLSZM, GLRLM, NGTDM, and GLDM.

In the four-category classification task. First Order Features are most prevalent in the RL lobe with 5 occurrences, totaling 17 across all lobes. Shape Features are relatively rare, with just 2 occurrences across all lobes and only appearing in the RM and LU lobes. GLCM features are most common in the LL lobe with 7 occurrences, with a total of 23 across all lobes. GLSZM features are evenly distributed across the RM, RL, and LU lobes with 7 occurrences each, totaling 30 across all lobes. GLRLM and GLDM features are most prevalent in the RU lobe, with 4 and 8 occurrences respectively.

In the two-category classification task. First Order Features are most prevalent in the RU and LL lobes with 7 occurrences each, totaling 27 across all lobes. Shape Features are relatively rare, with just 4 occurrences across all lobes and appearing mostly in the RM lobe. GLCM features are most common in the LL lobe with 9 occurrences, with a total of 37 across all lobes. GLSZM features are most common in the RM lobe with 9 occurrences, totaling 29 across all lobes. GLDM features are most prevalent in the RL lobe with 6 occurrences.

In our study, three critical features are important in our model’s decision-making process. (1) First Order Features: These features, capturing basic statistical properties of the image intensities within the ROI, were predominantly influential in the RL and RU lobes. Their high frequency of occurrence and significance suggest their utility in capturing intensity variations that are crucial for COPD severity distinction. (2) GLCM Features: Particularly dominant in the LL lobe, these texture features are critical as they capture the spatial relationships between pixel intensities, which are pertinent to understanding tissue heterogeneity in COPD. (3) GLSZM and GLRLM Features: Their uniform presence across several lobes highlights their role in assessing larger area variations and the length of uniform runs in pixel values, respectively, providing key insights into the structural changes within the lung tissues.

In both tasks, the total number of features across all lobes and categories is 104 for the four categories and 132 for the two categories. This data provides valuable insights into the distribution and prevalence of radiomics features across different lung lobes, which could be instrumental in the development and refinement of machine learning models for lung disease diagnosis and severity staging.

Discussion

In this study, we proposed a novel lobe-based radiomics model for the precise staging of COPD severity using CT images. Our study found that when assessing COPD severity, combining radiomic features from each lung lobe offered a superior overview compared to using features from the entire lung. This method provided a more detailed and localized perspective of COPD severity, highlighting regional variations in disease progression.

Our approach offers many advantages. The localized analysis not only provided a comprehensive understanding of the regional variations in COPD severity but also allowed for a more personalized assessment of disease progression. Additionally, our method facilitated finer biological insights into the differential impact of COPD across the lung lobes. These insights could potentially inform targeted therapeutic strategies in the future. Finally, our predictive models, which combined radiomic features from each lobe, showed superior performance compared to traditional models using features from the entire lung.

The previous studies have focused on using radiomic features derived from the entire lung, as shown in Table 7. Li et al. proposed a method of binary classification of COPD severity with LR and SVM classifiers based on 2D ROI manually marked by doctors to generate 3D VOI to extract image radiomics features. The accuracy was 76.3% [37]. Makimoto et al. [23] and Puchakayala et al. [24] first extracted the features from the whole lung region. Then, binary classification of COPD severity was performed using Linear-SVM and CatBoost. Finally, an AUC was achieved with 78% and 90%, respectively. In the study by Yang and colleagues, the severity of COPD was stratified into four categories. This classification utilized features extracted from the entire lung region [44]. Gonzalez et al. joined four specific CT slices into an image to classify COPD severity using the CNN method, with an accuracy of 51.1% [45]. Sun et al. used three-channel information, including raw CT volumes, segmented lung parenchyma, and emphysema features, as input for 3D ResNet with an accuracy of 76.4% [46].

Table 7 Previous studies using radiomic features or deep learning for COPD severity staging

While these studies have significantly contributed to the field, our approach to analyzing each lung lobe has shown additional benefits. In the experiment, we classify the severity of COPD by extracting the radiomic features of five lung lobes after lung CT segmentation and propose that it is of great significance to classify the severity of COPD based on the combined radiological features of five lung lobes. We also classify COPD severity quadripartite and bipartite based on the radiomic features of whole lung area and single lung lobe of Dataset 1. It is demonstrated that radiomic features extracted from the lung lobe are more specific than those from the whole lung region. The combined radiomic features of all five lung lobes are more complete than the radiomic features of the individual lung lobes, which are more suitable for COPD severity classification. We try to classify the COPD severity of each lung lobe as a separate whole after extracting features from each lung lobe through the trained model. After the segmentation of lung lobes and the extraction of the radiomic features of each lobe, we no longer group the features of each lobe, but carry out feature selection on the features of the five lobes respectively, and then carry out the four-way classification and two-way classification of COPD severity. In addition, we also segment the whole lung area and then extract the radiomic features, and compare the classification results with the combined radiomic features based on 5 lung lobes.

The result shows that radiomic features based on the combination of lung lobes can better represent the details of the lungs, and can be better applied in the classification of COPD severity. The segmentation of the lung into 5 lobes can better capture the CT radiomic features of the lung than simply observing the lung as a whole and also have a better performance in the classification of COPD severity. Because the location of COPD is not fixed, a single lung lobe is not enough to be used as a standard for classification, and the five lung lobes should be evaluated as a whole. In addition, this study demonstrates the significance of the pulmonary lobe in the classification of COPD severity.

Despite these promising findings, our study has certain limitations that warrant mention. First, our sample size is relatively small, limiting the generalizability of our findings. Our study focuses on a single cohort, which may not fully represent the broader COPD patient population. While our model shows improved predictive performance, there’s a need for external validation in larger, diverse cohorts to confirm its clinical utility. Secondly, we utilized spirometry as the reference standard for identifying and staging COPD. However, the performance of this method is suboptimal when categorizing COPD into four severity levels because these categories rely on setting thresholds for spirometry measurements, which operate on a continuous scale. This means that even a slight change in an individual’s pulmonary function may lead to a different COPD classification. By incorporating additional clinical information such as the 6-minute walk distance, body mass index, exacerbation history, and total scores from the St George’s Respiratory Questionnaire, it becomes feasible to enhance the accuracy of GOLD staging.

Given these limitations, future research should aim to validate our findings in larger, more diverse populations. There is also a need to explore the utility of our proposed method in longitudinal studies to assess its predictive performance over time. Furthermore, while our study focused on radiomic features, integrating clinical and genetic data could potentially enhance predictive accuracy. In addition, Multimodal Measures are currently being applied to the diagnosis of a variety of diseases [47]. In the future, we will study the severity classification of COPD by multimodality combining imaging omics features with other factors. Lastly, exploring the use of advanced machine learning and deep learning algorithms could provide additional improvements.

Conclusions

The study demonstrates that combining radiomic features from five lobes offers a promising approach for staging COPD severity. The lobe-based radiomics features, which are extracted from five lobes in CT images, can provide more localized formation beyond what can be captured by considering the lung as a whole. The proposed method also provides a more granular understanding of disease distribution and severity, potentially improving patient management strategies. Moreover, we aim to refine the radiomics model further, enabling precise COPD staging and providing enhanced personalized support for individual patients in the future.

While the results are encouraging, we acknowledge certain limitations that must be addressed in future work. Firstly, the study’s sample size was restricted, which may limit the generalizability of our findings. Larger, multi-center studies will be essential to validate our approach and ensure its applicability across diverse populations. Additionally, our study focused on only radiomic features, clinical and genetic data have the potential for the prediction of disease progression over time.

Data availability

The data will be available upon reasonable request from the corresponding author (Shouliang Qi, qisl@bmie.neu.edu.cn) after approval by the Ethic Committee of the First Affiliated Hospital of Guangzhou Medical University.

Abbreviations

COPD:

Chronic Obstructive Pulmonary Disease

CT:

Computed Tomography

GOLD:

Global Initiative for Chronic Obstructive Lung Disease

Leaky ReLU:

Leaky Rectified Linear Unit

GLCM:

Gray Level Co-occurrence Matrix

GLSZM:

Gray Level Size Zone Matrix

GLRLM:

Gray Level Run Length Matrix

NGTDM:

Neighboring Gray Tone Difference Matrix

GLDM:

Gray Level Dependence Matrix

LASSO:

Least Absolute Shrinkage and Selection Operator

SVM:

Support Vector Machine

MSE:

Mean Square Error

References

  1. Roth GA, Abate D, Abate KH, et al. Global, regional, and national age-sex-specific mortality for 282 causes of death in 195 countries and territories, 1980–2017: a systematic analysis for the global burden of Disease Study 2017. Lancet. 2018;392(10159):1736–88.

    Article  Google Scholar 

  2. Halpin DMG, Criner GJ, Papi A, et al. Global initiative for the diagnosis, management, and prevention of chronic obstructive lung disease. The 2020 GOLD science committee report on COVID-19 and chronic obstructive pulmonary disease. Am J Respir Crit Care Med. 2021;203(1):24–36. https://doi.org/10.1164/rccm.202009-3533so.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  3. Roman-Rodriguez M, Kaplan A. GOLD 2021 strategy report: implications for asthma–COPD overlap. Int J Chronic Obstr Pulm Dis. 2021;1709–15. https://doi.org/10.2147/copd.s300902.

  4. Marçôa R, Rodrigues DM, Dias M, et al. Classification of chronic obstructive pulmonary disease (COPD) according to the new Global Initiative for Chronic Obstructive Lung Disease (GOLD) 2017: comparison with GOLD 2011. COPD: J Chronic Obstr Pulmonary Disease. 2018;15(1):21–6. https://doi.org/10.1080/15412555.2017.1394285.

    Article  Google Scholar 

  5. Zhong N, Wang C, Yao W, et al. Prevalence of chronic obstructive pulmonary disease in China: a large, population-based survey. Am J Respir Crit Care Med. 2007;176(8):753–60. https://doi.org/10.1164/rccm.200612-1749oc.

    Article  PubMed  Google Scholar 

  6. Mapel DW, Dalal AA, Blanchette CM et al. Severity of COPD at initial spirometry-confirmed diagnosis: data from medical charts and administrative claims. Int J Chronic Obstr Pulm Dis, 2011: 573–81. https://doi.org/10.2147/copd.s16975.

  7. Bellamy D, Smith J. Role of primary care in early diagnosis and effective management of COPD. Int J Clin Pract. 2007;61(8):1380–9. https://doi.org/10.1111/j.1742-1241.2007.01447.x.

    Article  CAS  PubMed  Google Scholar 

  8. Wu Y, Du R, Feng J, et al. Deep CNN for COPD identification by Multi-view snapshot integration of 3D airway tree and lung field. Biomed Signal Process Control. 2023;79:104162. https://doi.org/10.1016/j.bspc.2022.104162.

    Article  Google Scholar 

  9. Lynch DA, Austin JHM, Hogg JC, et al. CT-definable subtypes of chronic obstructive pulmonary disease: a statement of the Fleischner Society. Radiology. 2015;277(1):192–205. https://doi.org/10.1148/radiol.2015141579.

    Article  PubMed  Google Scholar 

  10. Bodduluri S, Newell JD Jr, Hoffman EA, et al. Registration-based lung mechanical analysis of chronic obstructive pulmonary disease (COPD) using a supervised machine learning framework. Acad Radiol. 2013;20(5):527–36. https://doi.org/10.1016/j.acra.2013.01.019.

    Article  PubMed  PubMed Central  Google Scholar 

  11. Feragen A, Petersen J, Grimm D et al. Geometric tree kernels: Classification of COPD from airway tree geometry. in Information Processing in Medical Imaging: 23rd International Conference, IPMI 2013, Asilomar, CA, USA, June 28–July 3, 2013. Proceedings 23. 2013. Springer. https://doi.org/10.1007/978-3-642-38868-2_15.

  12. Cheplygina V, Pena IP, Pedersen JH, et al. Transfer learning for multicenter classification of chronic obstructive pulmonary disease. IEEE J Biomedical Health Inf. 2017;22(5):1486–96. https://doi.org/10.1109/jbhi.2017.2769800.

    Article  Google Scholar 

  13. Lambin P, Rios-Velazquez E, Leijenaar R, et al. Radiomics: extracting more information from medical images using advanced feature analysis. Eur J Cancer. 2012;48(4):441–6. https://doi.org/10.1016/j.ejca.2011.11.036.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Lambin P, Leijenaar RTH, Deist TM, et al. Radiomics: the bridge between medical imaging and personalized medicine. Nat Reviews Clin Oncol. 2017;14(12):749–62. https://doi.org/10.1038/nrclinonc.2017.141.

    Article  Google Scholar 

  15. Frix AN, Cousin F, Refaee T, et al. Radiomics in lung diseases imaging: state-of-the-art for clinicians. J Personalized Med. 2021;11(7):602. https://doi.org/10.3390/jpm11070602.

    Article  Google Scholar 

  16. Wu G, Ibrahim A, Halilaj I, et al. The emerging role of radiomics in COPD and lung cancer. Respiration. 2020;99(2):99–107.

    Article  PubMed  Google Scholar 

  17. Yang K, Yang Y, Kang Y, et al. The value of radiomic features in chronic obstructive pulmonary disease assessment: a prospective study. Clin Radiol. 2022;77(6):e466–72. https://doi.org/10.1016/j.crad.2022.02.015.

    Article  CAS  PubMed  Google Scholar 

  18. Medicine TLR. Opening the black box of machine learning. 2018. 801. https://doi.org/10.1016/s2213-2600(18)30425-9.

  19. Mekov E, Miravitlles M, Petkov R. Artificial intelligence and machine learning in respiratory medicine. Expert Rev Respir Med. 2020;14(6):559–64. https://doi.org/10.1080/17476348.2020.1743181.

    Article  CAS  PubMed  Google Scholar 

  20. Wu Y, Qi S, Feng J, et al. Attention-guided multiple instance learning for COPD identification: to combine the intensity and morphology. Biocybernetics Biomedical Eng. 2023;43(3):568–85. https://doi.org/10.1016/j.bbe.2023.06.004.

    Article  Google Scholar 

  21. Xu C, Qi S, Feng J, et al. DCT-MIL: deep CNN transferred multiple instance learning for COPD identification using CT images. Phys Med Biol. 2020;65(14):145011. https://doi.org/10.1088/1361-6560/ab857d.

    Article  PubMed  Google Scholar 

  22. Yang Y, Wang S, Zeng N, et al. Lung radiomics features selection for COPD stage classification based on auto-metric graph neural network. Diagnostics. 2022;12(10):2274. https://doi.org/10.3390/diagnostics12102274.

    Article  PubMed  PubMed Central  Google Scholar 

  23. Makimoto K, Au R, Moslemi A, et al. Comparison of feature selection methods and machine learning classifiers for Predicting Chronic Obstructive Pulmonary Disease using texture-based CT lung Radiomic features. Acad Radiol. 2023;30(5):900–10. https://doi.org/10.1016/j.acra.2022.07.016.

    Article  PubMed  Google Scholar 

  24. Amudala Puchakayala PR, Sthanam VL, Nakhmani A, et al. Radiomics for Improved Detection of Chronic Obstructive Pulmonary Disease in low-dose and standard-dose chest CT scans. Radiology. 2023;307(5):e222998. https://doi.org/10.1148/radiol.222998.

    Article  PubMed  Google Scholar 

  25. Hussain A, Ugli IKK, Kim BS et al. Detection of different stages of copd patients using machine learning techniques. in. 2021 23rd International Conference on Advanced Communication Technology (ICACT). 2021. IEEE. https://doi.org/10.23919/icact51234.2021.9370958.

  26. Vora S, Shah C. COPD classification using machine learning algorithms. Int Res J Eng Technol. 2019;6:608–11. https://doi.org/10.1109/aimv53313.2021.9671016.

    Article  Google Scholar 

  27. Vogelmeier CF, Criner GJ, Martinez FJ, et al. Global strategy for the diagnosis, management, and prevention of chronic obstructive lung disease 2017 report. GOLD executive summary. Am J Respir Crit Care Med. 2017;195(5):557–82. https://doi.org/10.1164/rccm.201701-0218pp.

    Article  CAS  PubMed  Google Scholar 

  28. Çiçek Ö, Abdulkadir A, Lienkamp SS et al. 3D U-Net: learning dense volumetric segmentation from sparse annotation. in Medical Image Computing and Computer-Assisted Intervention–MICCAI 2016: 19th International Conference, Athens, Greece, October 17–21, 2016, Proceedings, Part II 19. 2016. Springer. https://doi.org/10.1007/978-3-319-46723-8_49.

  29. Pang H, Wu Y, Qi S, et al. A fully automatic segmentation pipeline of pulmonary lobes before and after lobectomy from computed tomography images. Comput Biol Med. 2022;147:105792. https://doi.org/10.1016/j.compbiomed.2022.105792.

    Article  PubMed  Google Scholar 

  30. Isensee F, Jaeger PF, Kohl SAA, et al. nnU-Net: a self-configuring method for deep learning-based biomedical image segmentation. Nat Methods. 2021;18(2):203–11. https://doi.org/10.1038/s41592-020-01008-z.

    Article  CAS  PubMed  Google Scholar 

  31. Ulyanov D, Vedaldi A, Lempitsky V. Instance normalization: the missing ingredient for fast stylization. arXiv Preprint arXiv:1607.08022, 2016.

  32. Ioffe S, Szegedy C. Batch normalization: Accelerating deep network training by reducing internal covariate shift. in International conference on machine learning. 2015. pmlr.

  33. Ayachi R, Afif M, Said Y et al. Strided convolution instead of max pooling for memory efficiency of convolutional neural networks. in Proceedings of the 8th International Conference on Sciences of Electronics, Technologies of Information and Telecommunications (SETIT’18), Vol. 1. 2020. Springer. https://doi.org/10.1007/978-3-030-21005-2_23.

  34. Springenberg JT, Dosovitskiy A, Brox T et al. Striving for simplicity: The all convolutional net arXiv preprint arXiv:1412.6806, 2014.

  35. Drozdzal M, Vorontsov E, Chartrand G et al. The importance of skip connections in biomedical image segmentation. in International Workshop on Deep Learning in Medical Image Analysis, International Workshop on Large-Scale Annotation of Biomedical Data and Expert Label Synthesis. 2016. Springer. https://doi.org/10.1007/978-3-319-46976-8_19.

  36. Van Griethuysen JJM, Fedorov A, Parmar C, et al. Computational radiomics system to decode the radiographic phenotype. Cancer Res. 2017;77(21):e104–7. https://doi.org/10.1158/0008-5472.can-17-0339.

    Article  PubMed  PubMed Central  Google Scholar 

  37. Li Z, Liu L, Zhang Z, et al. A novel CT-based radiomics features analysis for identification and severity staging of COPD. Acad Radiol. 2022;29(5):663–73. https://doi.org/10.1016/j.acra.2022.01.004.

    Article  PubMed  Google Scholar 

  38. Avanzo M, Stancanello J, El Naqa I. Beyond imaging: the promise of radiomics. Physica Med. 2017;38:122–39. https://doi.org/10.1016/j.ejmp.2017.05.071.

    Article  Google Scholar 

  39. Tibshirani R. The lasso method for variable selection in the Cox model. Stat Med. 1997;16(4):385–95. https://doi.org/10.1002/(sici)1097-0258(19970228)16:4%3C385::aid-sim380%3E3.0.co;2-3.

  40. Bühlmann P. Regression shrinkage and selection via the Lasso: a retrospective (Robert Tibshirani): comments on the presentation. J R Stat Soc B. 2011;73(3):277–9.

    Google Scholar 

  41. Wu Y, Qi S, Sun Y, et al. A vision transformer for emphysema classification using CT images. Phys Med Biol. 2021;66(24):245016. https://doi.org/10.1088/1361-6560/ac3dc8.

    Article  Google Scholar 

  42. Fawcett T. An introduction to ROC analysis. Pattern Recognit Lett. 2006;27(8):861–74. https://doi.org/10.1016/j.patrec.2005.10.010.

    Article  Google Scholar 

  43. Sun X, Xu W. Fast implementation of DeLong’s algorithm for comparing the areas under correlated receiver operating characteristic curves. IEEE Signal Process Lett. 2014;21(11):1389–93. https://doi.org/10.1109/lsp.2014.2337313.

    Article  Google Scholar 

  44. Yang Y, Li W, Guo Y, et al. Lung radiomics features for characterizing and classifying COPD stage based on feature combination strategy and multi-layer perceptron classifier. Math Biosci Eng. 2022;19(8):7826–55. https://doi.org/10.3934/mbe.2022366.

    Article  PubMed  Google Scholar 

  45. Gonzalez G, Ash SY, Vegas-Sánchez-Ferrero G, et al. Disease staging and prognosis in smokers using deep learning in chest computed tomography. Am J Respir Crit Care Med. 2018;197(2):193–203. https://doi.org/10.1164/rccm.201705-0860oc.

    Article  PubMed  PubMed Central  Google Scholar 

  46. Sun J, Liao X, Yan Y, et al. Detection and staging of chronic obstructive pulmonary disease using a computed tomography–based weakly supervised deep learning approach. Eur Radiol. 2022;32(8):5319–29. https://doi.org/10.1007/s00330-022-08632-7.

    Article  PubMed  Google Scholar 

  47. Li Y, Yang X, Zhao M et al. FPT-Former: A Flexible Parallel Transformer of Recognizing Depression by Using Audiovisual Expert-Knowledge-Based Multimodal Measures International Journal of Intelligent Systems, 2024, 2024. https://doi.org/10.1155/2024/1564574.

Download references

Acknowledgements

Not applicable.

Funding

This work was partly supported by the National Natural Science Foundation of China (Nos. 82072008, 82270044), and the Fundamental Research Funds for the Central Universities (N2424010-19).

Author information

Authors and Affiliations

Authors

Contributions

Meng Zhao: Methodology, Software, Investigation, Writing – original draft. Yanan Wu: Writing – review & editing, Formal analysis. Yifu Li: Data curation, Investigation. Xiaoyu Zhang: Data curation, Investigation. Shuyue Xia: Supervision, Writing – review & editing. Jiaxuan Xu: Data collection, Formal analysis. Rongchang Chen: Supervision, Funding acquisition. Zhenyu Liang: Supervision, Writing – review & editing, Funding acquisition. Shouliang Qi: Supervision, Writing – review & editing, Resources, Funding acquisition.

Corresponding authors

Correspondence to Zhenyu Liang or Shouliang Qi.

Ethics declarations

Ethics approval and consent to participate

All procedures performed in studies involving human participants were in accordance with the ethical standards of the ethics committee of First Affiliated Hospital of Guangzhou Medical University and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards. Consent to participate Informed consent was obtained from all individual participants included in the study.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhao, M., Wu, Y., Li, Y. et al. Learning and depicting lobe-based radiomics feature for COPD Severity staging in low-dose CT images. BMC Pulm Med 24, 294 (2024). https://doi.org/10.1186/s12890-024-03109-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12890-024-03109-3

Keywords