Multi-channel lung sounds intelligent diagnosis of chronic obstructive pulmonary disease

Yu, Hui; Zhao, Jing; Liu, Dongyi; Chen, Zhen; Sun, Jinglai; Zhao, Xiaoyun

doi:10.1186/s12890-021-01682-5

Research
Open access
Published: 15 October 2021

Multi-channel lung sounds intelligent diagnosis of chronic obstructive pulmonary disease

Hui Yu^1,2,
Jing Zhao^1,2,
Dongyi Liu²,
Zhen Chen²,
Jinglai Sun² &
…
Xiaoyun Zhao³

BMC Pulmonary Medicine volume 21, Article number: 321 (2021) Cite this article

2570 Accesses
4 Citations
7 Altmetric
Metrics details

Abstract

Background

Chronic obstructive pulmonary disease (COPD) is a chronic respiratory disease that seriously threatens people’s health, with high morbidity and mortality worldwide. At present, the clinical diagnosis methods of COPD are time-consuming, invasive, and radioactive. Therefore, it is urgent to develop a non-invasive and rapid COPD severity diagnosis technique suitable for daily screening in clinical practice.

Results

This study established an effective model for the preliminary diagnosis of COPD severity using lung sounds with few channels. Firstly, the time-frequency-energy features of 12 channels lung sounds were extracted by Hilbert–Huang transform. And then, channels and features were screened by the reliefF algorithm. Finally, the feature sets were input into a support vector machine to diagnose COPD severity, and the performance with Bayes, decision tree, and deep belief network was compared. Experimental results show that high classification performance using only 4-channel lung sounds of L1, L2, L3, and L4 channels can be achieved by the proposed model. The accuracy, sensitivity, and specificity of mild COPD and moderate + severe COPD were 89.13%, 87.72%, and 91.01%, respectively. The classification performance rates of moderate COPD and severe COPD were 94.26%, 97.32%, and 89.93% for accuracy, sensitivity, and specificity, respectively.

Conclusion

This model provides a standardized evaluation with high classification performance rates, which can assist doctors to complete the preliminary diagnosis of COPD severity immediately, and has important clinical significance.

Peer Review reports

Background

COPD is a common preventable and treatable disease characterized by continuous airflow restriction. Airflow restriction develops progressively, related to the increased chronic inflammatory response to toxic particles or gases in the airway and lungs. COPD has high morbidity and mortality worldwide and has become the fourth leading cause of death in China and the third leading cause of death globally [1]. According to the global initiative for chronic obstructive lung disease (GOLD), in 2020, the prevalence of COPD was 11.7%, and there were approximately 3 million deaths each year. As populations age in high-income countries and the increase of smokers in developing countries, it is roughly calculated that by 2060, over 5.4 million people will die each year from COPD and related diseases [2].

The severity of COPD is graded according to FEV1/FVC, FEV1%, and symptoms [3]. FEV1 refers to the forced expiratory volume in one second, FVC refers to the forced vital capacity. FEV1/FVC is a sensitive index to evaluate airflow limitation, and FEV1% is a good indicator to evaluate COPD severity. The severity of COPD is classified into five grades by GOLD. Patients with COPD0 have FEV1% higher than 85%, COPD1 have FEV1% higher than 80%, COPD2 have FEV1% between 50% and 80%, COPD3 have FEV1% between 30% and 50%, COPD4 have FEV1% less than 30%, or less than 50% but suffer from chronic respiratory failure. In addition, FEV1/FVC is less than 70% in all patients except patients with COPD0.

However, studies have shown that the association between patients’ health status, symptoms, and FEV1 is weak [4, 5]. In addition, the number of classifications obtained by this way is excessive, which is rarely used in clinical practice. Through communication with doctors, it is known that the severity of COPD is often classified into three grades of mild, moderate, and severe in clinical practice, and then the specific diagnosis and treatment plan was determined according to specific symptoms. Generally speaking, mild patients will get better after taking tracheal relaxants, moderate patients will decide outpatient or hospitalization according to specific symptoms, and severe patients should be hospitalized immediately or even need ICU rescue. At present, the clinical diagnosis of COPD includes pulmonary function examination, chest X-ray examination, chest CT examination, blood gas examination, and other diagnostic methods [6,7,8]. These methods are time-consuming, invasive, and radioactive, unsuitable for daily screening. Hence, it is urgent to develop a non-invasive and rapid COPD severity diagnosis technique suitable for daily screening.

Lung sound, as a physiological sound signal produced in the process of gas exchange between the human body and the outside world, contains a lot of physiological and pathological information, representing the health status of the human respiratory system. Pulmonary auscultation plays an essential role in the diagnosis of respiratory diseases and their severity. Previous studies have shown that pulmonary auscultation can be used as an index for preliminary diagnosis of COPD and its severity, worthy of clinical promotion and application [9,10,11,12,13,14,15]. The traditional artificial auscultation method requires experienced respiratory doctors and is limited by environmental factors. Therefore, the diagnosis technology of COPD based on lung sounds is of great significance to clinical practice and research, provides basic theory and experience for further development of diagnostic equipment for COPD.

Previously, most studies have used multi-dimensional features in the diagnosis of COPD. For example, QianWang et al. used the transfer learning algorithm based on balanced probability distribution and instances to diagnose COPD, and the accuracy rate reached 95.2% [16]; Jun Ying et al. utilized DBN to predict the exacerbation frequency of COPD, and the accuracy reached 91.99% [17]. Some studies diagnose COPD based on lung sounds, extract features through the short-time Fourier transform, wavelet transforms, or HHT, and then use an artificial neural network or deep learning algorithm for recognition and classification, achieving high accuracy. For instance, Morillo et al. inputted the features of lung sounds extracted by short-time Fourier transform into an artificial neural network classifier to identify COPD, and an accuracy of 81.8% has been achieved [18]. Altan et al. used the 3D mapping technique to extract features and the DBN classifier model to separate the severity of two COPD, which is utilized for preliminary diagnosis of COPD, with an accuracy of 95.84% [19]. Based on the statistical characteristics of HHT of lung sounds, Altan et al. used DBN to separate COPD patients from healthy subjects with an accuracy of 93.67% [20]. Altan et al. used the 3D second-order difference plot to extract characteristic abnormalities on lung sounds and then used the deep extreme learning machines classifier to classify the severity of COPD, with an accuracy of 94.31% [21]. Ahmet extracted the features of lung sounds through empirical wavelet transform and then input them into many models to distinguish COPD patients from healthy subjects [22]. Altan et al. used HHT to extract the features of lung sounds and fed the feature set into the proposed Deep ELM with HessELM-AE to distinguish COPD patients from healthy subjects, achieving an accuracy rate of 92.22% [23].

However, there are still some problems in the current research. These methods using multi-dimensional features require much information, including lung function, health status, BODE index, and risk assessment based on the GOLD. The process of collecting information is time-consuming and requires experienced respiratory doctors, making it difficult to make a preliminary diagnosis of COPD quickly. Due to doctors’ experience, human ear resolution, and environmental factors, it is challenging to diagnose the severity of COPD quickly and accurately. The current diagnosis based on lung sounds primarily focuses on separating COPD patients from healthy subjects and classifying COPD0 and COPD4, ignoring the clinical needs. Although few studies have completed five classifications of COPD severity, this classification method is rarely used in clinical practice. In addition, these researches adopt full-channel lung sounds, which increases the difficulty of collection and calculation. Moreover, it will lead to over-fitting the model [24] and be challenging to achieve a rapid and real-time diagnosis, reducing the practicability and portability of future diagnosis equipment and limiting the development of online diagnosis tools.

In this paper, firstly, the time-frequency-energy features of 12-channel lung sounds were extracted by HHT. Then, channels and features were selected by the reliefF algorithm to simplify the collection process and the number of features. Finally, the feature sets were input into SVM, Bayes, decision tree, and DBN to separate the auscultation records of mild, moderate, and severe COPD patients. This model can complete the preliminary diagnosis of COPD severity in a short time through the 4-channel lung sounds, considerably shortening the diagnosis time and saving the cost by canceling some additional diagnostic tools, which have important clinical significance.

The main contributions of this paper are as follows:

1.
This study focuses on diagnosing COPD severity, categorizing it into clinically common grades: mild COPD, moderate COPD, and severe COPD. The model proposed in this study is able to reduce the diagnosis time, save medical resources, and assist doctors in completing the preliminary diagnosis of COPD severity. This model has been recognized by respiratory doctors in Chest Hospital of Tianjin University and Tianjin Medical University General Hospital in China.
2.
The channel selection model based on the reliefF algorithm is used to determine the optimal channels for diagnosing the severity of COPD, which proves that use only 4-channel lung sounds of L1, L2, L3, and L4 channels that can undertake the diagnosis of COPD severity without 12-channels. It reduces acquisition difficulty and calculating pressure, contributing to developing portable diagnostic equipment for COPD.

The rest of this paper is structured as follows: The materials and methods are briefly introduced in “Materials and methods” section. “Experimental results” section provides the experimental process and results in detail. “Discussion” section presents the discussion. The conclusions of this paper and the following work are described in “Conclusion” section.

Table 1 The distribution of COPD patients by gender

Full size table

Materials and methods

Database

As a public multimedia respiratory database, RespiratoryDatabase@TR contains 12-channel lung sounds, 4-channel heart sounds, spirometry metrics, and chest X-rays for each subject [25]. The respiratory data were obtained by two pulmonologist clinicians using a Littmann3200 digital stethoscope to record the left (L) and right (R) channels in each lung region simultaneously. The sampling frequency is 4000Hz, and lung auscultation areas are shown in Fig. 1.

The database contains lung sounds of 42 COPD patients with varying degrees of severity, ranging in age from 38 to 68, including 34 males and 8 females. The distribution of COPD patients by gender is shown in Table 1. Subjects had never used cigarettes or tobacco products, and none of their close relatives had asthma or COPD [25]. At the start of each recording, subjects were required to cough for the signal synchronization of each lung region’s left and right channels. The duration of each recording was at least 17s. The labeling of respiratory data was determined by two pulmonologist clinicians based on the COPD severity rating given by the GOLD. RespiratoryDatabase@TR has an ethical committee approval confirmed by Mustafa Kemal University, Turkey.

Technical architecture

The proposed model in this paper contains four main parts: data preprocessing, feature extraction, channel selection, and COPD severity diagnosis. The technical architecture of our model is shown in Fig. 2. Firstly, the 12-channel lung sounds are preprocessed by segmentation and denoising, and the data is enhanced by adding noise. Then, the multi-dimensional features are extracted by HHT based on ensemble empirical mode decomposition (EEMD). Next, the channels and features are filtered based on the reliefF algorithm. Finally, SVM is used to diagnose the severity of COPD, which is classified into mild COPD, moderate COPD, and severe COPD.

Data preprocessing

31 COPD patients with different severity were randomly selected from RespiratoryDatabase@TR (6 patients with COPD0, 5 patients with COPD1, 5 patients with COPD2, 5 patients with COPD3, 10 patients with COPD4). Combined with the professional advice of clinicians, it is necessary to auscultation two or three respiratory cycles in each auscultation site for pathological significance to realize the objective diagnosis. Hence, lung sounds were divided into 15 seconds with 60000 sample points in each segment for subsequent analysis, and the start point of segmentation was set as the first inhalation after the coughing peak [20]. Patients with COPD usually produce continuous and persistent abnormal lung sounds with a frequency above 400Hz [26]. The low-pass filter was abandoned to avoid losing important information, and only a 7.5Hz first-order Butterworth high-pass filter was used to eliminate DC offset.

According to the classification of COPD severity in clinical practice, COPD0 and COPD1 are called mild COPD, COPD2 and COPD3 are combined as moderate COPD, and COPD4 is severe COPD. The unbalanced sample size will lead to the poor training effect of the model and can not truly reflect the accuracy and recall rate of positive and negative samples. Therefore, to ensure a balanced sample size and facilitate subsequent classification, the respiratory data of COPD0 patients were increased to 10 groups by adding noise. The noise was Gaussian white noise with a mean of 0 and a standard deviation of 1; the noise factor was set at 0.06.

Hilbert–Huang transform

HHT is a new nonstationary and nonlinear signal analysis method proposed by Huang E after an in-depth study and summary of previous signal analysis methods [27], which has been widely used in the model research of disease diagnosis, such as chronic respiratory diseases [20, 22, 28] and heart diseases [29]. The method mainly includes empirical mode decomposition (EMD), Hilbert transform (HT), and its spectral analysis, among which EMD is the core part of the algorithm [30].

EMD is a method of adaptively decomposing the signal into a series of intrinsic mode functions (IMFs) based on its characteristics. The implementation steps are as follows:

1.
Calculate all local extremum points (including maximum and minimum points) of this signal, and fit them respectively to get the upper (${\mathrm {f}}_{\max }({\mathrm {t}})$) and lower (${\mathrm {f}}_{\min }({\mathrm {t}})$) envelopes with the cubic spline interpolation algorithm.
2.
The average values of the upper and lower envelopes are calculated and used as the mean envelope n(t) of the original signal:
$$\begin{aligned} {\mathrm {n}}({\mathrm {t}})=\frac{{\mathrm {f}}_{\max }({\mathrm {t}})+{\mathrm {f}}_{\min }({\mathrm {t}})}{2} \end{aligned}$$
(1)
3.
Subtract the mean envelope n(t) from the unprocessed signal s(t):
$$\begin{aligned} m_{1}({\mathrm {t}})={\mathrm {s}}({\mathrm {t}})-{\mathrm {n}}({\mathrm {t}}) \end{aligned}$$
(2)
4.
Determine whether $m_{1}({\mathrm {t}})$ is an IMF. If so, ${\mathrm {c}}_{1}({\mathrm {t}})=m_{1}({\mathrm {t}})$. If not, re-do the analysis of $1) \sim 3)$ based on this signal until $m_{1}({\mathrm {t}})$ is an IMF.
5.
After obtaining the first IMF using the above method, subtract ${\mathrm {c}}_{1}({\mathrm {t}})$ from the original signal s(t) to get the residual component ${\mathrm {r}}_{1}({\mathrm {t}}):$
$$\begin{aligned} {\mathrm {r}}_{1}({\mathrm {t}})={\mathrm {s}}({\mathrm {t}})-{\mathrm {c}}_{1}({\mathrm {t}}) \end{aligned}$$
(3)
6.
Let ${\mathrm {r}}_{1}({\mathrm {t}})$ as the new original signal, IMF2 can be obtained through the analysis of $1) \sim 5)$, and so on until the predetermined stop criterion is met. The EMD decomposition is completed.

Finally, the original signal s(t) is broken down the sum of a residual component and a series of IMFs:

$$\begin{aligned} {\mathrm {s}}({\mathrm {t}})=\sum _{{\mathrm {i}}=1}^{{\mathrm {n}}} {\mathrm {c}}_{{\mathrm {i}}}({\mathrm {t}})+{\mathrm {r}}_{{\mathrm {n}}}({\mathrm {t}}) \end{aligned}$$

(4)

EMD has good properties, including adaptability, completeness, and approximate orthogonality, but it also has some shortcomings, the most important of which is the mode-mixing problem. In order to avoid this phenomenon, EEMD was used to decompose lung sounds. EEMD is essentially a multiple EMD with Gaussian white noise. Gaussian white noise has the characteristic of uniform spectrum distribution, so adding it to the signal will automatically separate the signals of different time scales into the corresponding reference scale. The white noise is canceled by averaging the corresponding IMFs obtained by multiple EMDs [31].

The HT of the signal is the output response $x_{h}(t)$, which is of a continuous-time signal x(t) through a linear system with the impulse response ${\mathrm {h}}({\mathrm {t}})=\frac{1}{\pi t}$. The physical meaning of HT is to postpone all frequency components’ phrases by 90 degrees.

The analytic function of x (t) ’s HT:

$$\begin{aligned} {\mathrm {H}}(\omega , {\mathrm {t}})={\text {Re}} \sum _{{\mathrm {i}}=1}^{{\mathrm {n}}} {\mathrm {c}}_{{\mathrm {i}}}({\mathrm {t}}) {\mathrm {e}}^{{\mathrm {j}} \theta ({\mathrm {t}})}={\text {Re}} \sum _{{\mathrm {i}}=1}^{{\mathrm {n}}} {\mathrm {c}}_{{\mathrm {i}}}({\mathrm {t}}) {\mathrm {e}}^{{\mathrm {j}} \int \omega _{{\mathrm {i}}}({\mathrm {t}}) {\mathrm {dt}}} \end{aligned}$$

(5)

Re indicates the real part, $c_{i}(t)$ indicates the instantaneous amplitude, and $\omega _{i}(t)$ indicates the instantaneous frequency. By integrating the time, the Hilbert marginal spectrum was obtained:

$$\begin{aligned} {\mathrm {h}}(\omega )=\int _{0}^{{\mathrm {T}}} {\mathrm {H}}(\omega , {\mathrm {t}}) {\mathrm {dt}} \end{aligned}$$

(6)

It accumulates the amplitudes of each frequency, showing the global contribution of each frequency.

By squaring the amplitude of the Hilbert marginal spectrum and integrating the frequency, the instantaneous energy density IE (t) was obtained:

$$\begin{aligned} {\mathrm {IE}}({\mathrm {t}})=\int _{\omega _{1}}^{\omega _{2}} {\mathrm {H}}(\omega , {\mathrm {t}})^{2} {\mathrm {d}} \omega \end{aligned}$$

(7)

ReliefF feature selection

Since scale differences among features will affect the selection of valuable features, and the subsequent classification algorithm also requires data normalization, the feature sets of samples were normalized. This paper used the Mapminmax function in Matlab to normalize the feature sets to [0,1].

The relief algorithm is a feature weight algorithm, first proposed by Kira, which gives different weights to each feature, referring to the statistical correlation principle. The advantage of the algorithm is simplicity and high accuracy, while the disadvantage is that it can only deal with binary classification problems [32]. In order to solve the limitation of relief, Kononenko proposed the reliefF algorithm. The steps of the reliefF algorithm are as shown below:

1.
Sample set:S, Sampling times:m, Feature set:F, Nearest neighbor sample number: k, Category set:L.
2.
Set the output value as the feature weight vector W of each feature.
3.
Set the initial values of all feature weights as zero and the feature weight vector W as an empty set.
4.
Loop search

For i=1 to m, do

Select a sample R from the sample set S randomly;

For $N \in L$ and $N \ne {\text {Label}}(R)$, do

Select k nearest neighbor samples ${\mathrm {H}}_{{\mathrm {j}}}$ (j=1,2, ......, k) from the same kind of samples of R and k nearest neighbor samples ${}_{{\mathrm {j}}}$ (j=1,2, ......, k) from the different kind of samples of R;

End

End
5.
For A=1 to the number of features in feature set F, do

Update the weights according to the weight formula;

End

Weight formula:

$$\begin{aligned} \begin{array}{c} {\mathrm {W}}({\mathrm {A}})={\mathrm {W}}({\mathrm {A}})-\frac{\sum _{{\mathrm {j}}=1}^{{\mathrm {k}}} {\text {diff}}\left( {\mathrm {A}}, {\mathrm {R}}, {\mathrm {H}}_{{\mathrm {j}}}\right) }{{\mathrm {mk}}}+\frac{\sum _{{\mathrm {c}} \notin {\mathrm {class}}({\mathrm {R}})}\left[ \frac{{\mathrm {p}}({\mathrm {C}})}{1-{\mathrm {P}}({\mathrm {class}}({\mathrm {R}}))} \sum _{{\mathrm {j}}=1}^{{\mathrm {k}}} {\text {diff}}\left( {\mathrm {A}}, {\mathrm {R}}, {\mathrm {H}}_{{\mathrm {j}}}({\mathrm {C}})\right) \right] }{{\mathrm {mk}}} \end{array} \end{aligned}$$

(8)

Support vector machine

SVM is an extensively used machine learning algorithm that refers to statistical learning theory, suitable for small samples, which can account for model complexity and learning ability. The core idea of SVM is to use nonlinear kernel functions to map nonlinear problems, which can not be separated in low-dimensional feature space, to higher-dimensional feature space, and then find a hyperplane in high-dimensional feature space to separate different kinds of samples. Thus, the nonlinear problems that can not be separated in low-dimensional feature space are transformed into the linear problems that can be separated in high-dimensional feature space [33]. The hyperplane requires to separate different kinds of samples and maximize the distance of the target point from the plane, which can be defined as follows:

$$\begin{aligned} {\mathrm {H}}: \omega ^{{\mathrm {T}}} {\mathrm {x}}+{\mathrm {b}}=0 \end{aligned}$$

(9)

In the process of SVM classification, the choice of kernel function is of capital importance. In this paper, radial basis kernel function (RBF) was selected:

$$\begin{aligned} {\mathrm {K}}\left( {\mathrm {x}}_{{\mathrm {i}}}, {\mathrm {x}}_{{\mathrm {j}}}\right) =\exp \left( -\frac{\left\| {\mathrm {x}}_{{\mathrm {i}}}-{\mathrm {x}}_{{\mathrm {j}}}\right\| ^{2}}{2 \sigma ^{2}}\right) \end{aligned}$$

(10)

Experimental setup

The hardware environment is Intel Core i7-9700 CPU@3.60GHz processor, DDR4 32G memory, NVIDIA GeForce RTX 2080Ti 11G graphics card. The Python version is 3.7.3, and the Matlab version is R2020a.

Table 2 The sorting results of channels

Full size table

Experimental results

The EEMD was used to decompose the pretreated lung sounds. The standard of noise deviation is generally set as 0.2 times the signal standard deviation [31]. Meanwhile, in the case of an appropriate noise level, when the noise number of EEMD is several hundred times, the error caused by residual noise will be less than 1%. Hence, the standard of noise deviation was set as 0.015 and the noise number was 100. In this study, lung sounds were first processed with EMD. It was found that the number of IMFs generated was between 8 and 10. Besides, 80% of lung sounds had 10 IMFs. In EEMD, the number of IMFs is allowed to be specified. Therefore, the IMFs’ number was set to 10, and the remaining IMFs were added to form the residual signal. Figure 3 shows the decomposition result of EEMD. Lung sound is the original signal, IMF1-IMF10 are the 10 IMFs obtained by decomposition, IMF11 is the residual signal.

The Hilbert marginal spectrum of each of IMFs was calculated, and then the HHT-based statistical characteristics were calculated for each IMF, including standard deviation, variance, kurtosis, maximum, median, mode, mean, minimum, energy, and skewness. Figure 4 is the Hilbert marginal spectrum of 10 IMFs for a lung sound signal. The statistical features, a total of 1200, were collected as a dataset.

Standard deviation:

$$\begin{aligned} \sigma =\sqrt{\frac{1}{{\mathrm {N}}} \sum _{{\mathrm {i}}=1}^{{\mathrm {N}}}\left( {\mathrm {x}}_{{\mathrm {i}}}-\mu \right) ^{2}} \end{aligned}$$

(11)

Variance:

$$\begin{aligned} s^{2}=\frac{1}{N} \sum _{i=1}^{N}\left( x_{i}-\mu \right) ^{2} \end{aligned}$$

(12)

Energy:

$$\begin{aligned} {\mathrm {E}}({\mathrm {t}})=\int _{0}^{{\mathrm {T}}} {\mathrm {H}}(\omega , {\mathrm {t}})^{2} {\mathrm {d}} \omega \end{aligned}$$

(13)

Kurtosis:

$$\begin{aligned} {\text {Kurt}}({\mathrm {X}})={\mathrm {E}}\left[ \left( \frac{{\mathrm {X}}-\mu }{\sigma }\right) ^{4}\right] \end{aligned}$$

(14)

Skewness:

$$\begin{aligned} {\text {Skew}}(X)={\mathrm {E}}\left[ \left( \frac{{\mathrm {X}}-\mu }{\sigma }\right) ^{3}\right] \end{aligned}$$

(15)

The reliefF algorithm was applied to the feature set of lung sounds. The features with weights less than 0 were eliminated. Then, the weights of respective features were added up to get the total weight of each channel, which was arranged from big to small. Table 2 shows the sorting results of channels under different categories. According to Table 2 in the classification of mild COPD and moderate + severe COPD, the most weighted channels were the 2, 4, 1, and 9, namely L2, L4, L1, and R3 channels. The most weighted channels were 4, 3, 12, and 7 to classify moderate COPD and severe COPD, namely L4, L3, R6, and R1 channels.

By comparing the classification results of SVM using different kernel functions, RBF with strong nonlinear mapping ability was selected, and grid-search was used to optimize penalty factor C and RBF parameter $\gamma$ on the training samples. Finally, C= 10 and $\gamma$=0.2 were selected in the classification of mild COPD and moderate + severe COPD, C=3 and $\gamma$=1.0 in the classification of moderate COPD and severe COPD.

Mild COPD/moderate + severe COPD

A total of 15 COPD0 and COPD1 samples were taken, labeled as 0. A total of 15 COPD2, COPD3, and COPD4 samples were taken, marked as 1. Features of the different number of optimal channels were selected to form feature sets, and the top 50 optimal features of each feature set were input into SVM, respectively. The train$\_$test$\_$split function of python was used to randomly select 30% of the samples as the test set and 70% samples as the training set. Due to the small amount of data, the test results varied greatly. The average accuracy was obtained by repeated calculations 1000 times to ensure the stability of the results. The results are shown in Fig. 5A.

As shown in Fig. 5A, when the characteristic quantity was set to 50 and the optimal three channels, L2, L4, and L1 channels, were selected, we can already get a high accuracy rate.

The feature set composed of different quantitative features of L2, L4, and L1 channels was input into SVM to test the accuracy, and Fig. 5B was obtained. As shown in the figure, when the number of features is 25, the accuracy rate reaches the highest, 89.13%.

Figure 6A, B show the box plots of IMFs and features for mild COPD and moderate + severe COPD classification. According to these figures, we can see that when the 25 features were used to achieve the highest recognition performance, the most responsible feature sets were IMF3 and IMF6, while IMF7, IMF9, and IMF10 did not appear in the feature set at all. For features, the degree of responsibility for standard deviation, kurtosis, maximum, mean, energy, and skewness was similar, while the other features were less responsible. Further observation revealed that the energy, kurtosis, maximum, and standard deviation of IMF3 were the most responsible features.

Moderate COPD/severe COPD

The analysis focused on 20 subjects. 5 patients with COPD2 and 5 patients with COPD3 were labeled as 0, and 10 patients with COPD4 samples were labeled as 1. Same as the above analysis method, features of the different number of optimal channels were selected to form feature sets, and the top 50 optimal features of each feature set were input into SVM, respectively. The calculations were repeated 1000 times to get the average accuracy, and Fig. 5C showed these results.

As shown in Fig. 5C, when the feature quantity is 50 and the optimal two channels, namely L4 and L3 channels, were selected, we can already get a high accuracy rate. Therefore, we input feature sets composed of L4 and L3 channels into SVM and obtained Fig. 5D by controlling the input feature quantity. It can be seen that when the optimal 33 features of L4 and L3 channels were input, the accuracy rate was the highest, reaching 94.26%.

Figure 6C, D show the box plots of IMFs and features for moderate COPD and severe COPD classification. According to these figures, we can see that when the 33 features were used to achieve the highest recognition performance, the most responsible feature sets were IMF3, IMF2, and IMF1, IMF7, and IMF8 were less responsible, while IMF4, IMF5, and IMF10 did not appear in the feature set at all. For features, the degree of responsibility was similar except that the median value was less responsible, and the mode and minimum value did not appear in the feature set. After further observation, it was found that the most responsible features were the kurtosis of IMF1, the variance and energy of IMF3, and the variance and maximum value of IMF2.

In summary, COPD can be diagnosed only by collecting lung sounds of L1, L2, L3, and L4 channels. The classification of mild COPD and moderate + severe COPD required the optimal 25 features of L2, L4, and L1 channels, with accuracy, sensitivity, and specificity of 89.13%, 87.72%, and 91.01%, respectively. The classification of moderate COPD and severe COPD required the optimal 33 features of L3 and L4 channels, and the accuracy, sensitivity, and specificity were 94.26%, 97.32%, 89.93%, respectively.

Table 3 Performance of different machine learning algorithms in the classification of mild COPD and moderate + severe COPD

Full size table

Discussion

In order to evaluate the classification effect of SVM, we compared the performance of SVM with Bayes, decision tree, and DBN, calculated the confusion matrix of SVM under the two classifications, as shown in Tables 3, 4 and Fig. 7. The classification principle of Bayes is to use the Bayes formula to calculate the posterior probability through the prior probability of an object, and the class with the maximum posterior probability is select as the class to which the object belongs. The decision tree is a tree-like prediction model in machine learning. Internal nodes represent tests on an attribute, each branch represents a test output, and each leaf node represents a category. DBN network is a deep learning network that consists of several unsupervised restricted Boltzmann machines and a supervised backpropagation network. DBN had achieved good results in disease diagnosis by using physiological signals such as electrocardiogram [34], electroencephalogram [35], and lung sounds [19, 20, 28].

Bayes and decision tree used the same feature set as SVM, and DBN worked poorly on this feature set. The feature set composed of the optimal 200 features of 12-channel lung sounds worked best through repeated tests. The Bayes model adopted naive Bayes with a prior polynomial distribution, alpha=1.0, and the prior probability was considered. The decision tree model adopted the gradient promotion decision tree, the maximum number of iterations was set as 100, and the learning rate was 1. The DBN iterated 50 cycles, the number of samples for each training was 5, and the learning rate was 0.001. In addition, the output function of DBN was selected as sigmoid. DBN includes an input layer, an output layer, and several hidden layers. In this paper, two hidden layers were used, of which the first layer had 10 neurons and the second layer had 100 neurons, to achieve the highest classification performance. Figure 8 depicts four machine learning algorithms’ receiver operating characteristic (ROC) curves under two classifications.

For small samples, it is not liable to exploit the advantage of the model relied on self-learning. Furthermore, the traditional machine learning algorithm fully reflects that the decision information has more advantages in robustness and expansibility. By comparison, it is observed that although DBN had achieved good results in classifying healthy subjects and COPD patients[18], in this study, the performance of DBN even with the use of full-channel feature sets was far lower than that of SVM. In the two classifications, the classification effect of SVM was superior to Bayes, decision tree, and DBN in terms of accuracy, sensitivity, specificity, Area Under Curve (AUC), F1-score, and Kappa Score. It can also be seen from the confusion matrix that SVM had a good classification effect under the two classifications.

In this study, the weight of each channel was determined by the reliefF channel selection model, and COPD severity was classified by SVM. The study found that the model could obtain a good classification effect by using the 4-channel lung sounds of L1, L2, L3, and L4 channels. The auscultation areas of these 4 channels were all located in the back and the left lung region. For this reason, we consulted much literature and consulted doctors. Although lung auscultation can be performed in both the front chest and back, lung sounds from the back are usually clearer due to the influence of the heart. For the phenomenon that left-side lung sounds can provide more information than right-side lung sounds in COPD patients, no reasonable explanation has been found, and further research is needed.

Previous studies primarily focused on the recognition and classification of abnormal lung sounds. For example, wavelet analysis combined with BP neural network was used to realize automatic recognition of wheezing, crackles, and other abnormal lung sounds [36], and EMD decomposition was used to identify crackles [37]. However, considering that different respiratory diseases may have the same abnormal lung sounds, abnormal lung sounds cannot be used to diagnose COPD severity. For example, COPD is a common cause of wheezing, but asthma, bronchitis, laryngitis may also occur wheezing. Using multi-dimensional features to diagnose COPD has high accuracy and can save diagnosis and treatment costs to some extent [16, 17]. Nevertheless, it is difficult to quickly diagnose the severity of COPD due to a large amount of information required and the need for experienced doctors.

Few studies on COPD diagnosis based on lung sounds focus on separating the digital auscultation records of COPD patients from healthy subjects [18, 20]. Altan et al. investigated how lung sounds could be used to diagnose COPD severity. First, they combined the three-dimensional feature-extraction technique with DBN to classify COPD0 and COPD4 [19]. After that, they used 3D second-order difference plot to extract characteristic abnormalities on lung sounds and then used the deep extreme learning machines classifier to complete the five classifications of COPD severity [21]. Compared with them, our research has the advantage of being more suitable for clinical needs and establishing a channel selection model based on the reliefF algorithm. It reduces the number of features and the calculating pressure, more importantly, reduces the number of channels and simplifies the acquisition process, thus completing the diagnosis of COPD severity more quickly, which helps to improve the portability and practicability of future diagnostic devices, provides a theoretical basis for the advancement of online COPD diagnosis and treatment tools.

It should be noted that despite the rapid development of the application of artificial intelligence in the medical field, due to the lack of standardized datasets, limited by the quantity and quality of data, the models proposed by the current research focus on the innovation of methods. It is not remarkable significant to make a detailed comparison of the quantitative results of these studies from the different sample datasets. It is hoped that a sizeable standardized dataset of lung sounds can be established as soon as possible to accelerate the application of artificial intelligence in the diagnosis of COPD and verify the clinical feasibility of the model.

The clinical diagnosis of COPD is very complicated, which is time-consuming, invasive, and radioactive. Pulmonary auscultation, as an essential part of the diagnosis process, requires experienced respiratory doctors to complete. The proposed model can diagnose the severity of COPD quickly through the 4-channel lung sounds of L1, L2, L3, and L4 channels, which eliminates the long-term clinical examination and saves medical resources.

Table 4 Performance of different machine learning algorithms in the classification of moderate COPD and severe COPD

Full size table

Conclusion

In this study, firstly, the time-frequency-energy features of 12 channels lung sounds were extracted by HHT; secondly, channels and features were screened by the reliefF algorithm; finally, the feature sets were input into SVM to diagnose COPD severity and compared the performance of SVM with Bayes, decision tree, and DBN. As the experimental results demonstrate, COPD severity can be effectively diagnosed within 5 minutes only by feature extraction, feature selection, and SVM classification of 4-channel lung sounds collected. In the classification of mild COPD and moderate + severe COPD, feature extraction based on HHT took 297.81 s, feature selection based on reliefF algorithm took 0.246s, and SVM classification took 0.836s. In the classification of mild COPD and moderate + severe COPD, feature extraction, feature selection, and SVM classification took 201.97s, 0.238s, and 0.801s, respectively. The noise number of EEMD was set as 100 to reduce the error caused by residual noise, which increased the time. If EMD is used instead of EEMD, only 4.70s and 3.01s are needed for feature extraction under the two categories.

Compared with previous studies on COPD, on the one hand, this model uses the channel selection algorithm, which not only reduces the calculating pressure and prevents over-fitting, more importantly, reduces the difficulty of acquisition and speeds up the diagnosis speed, which is conducive to improving the portability and practicability of future diagnostic devices and promoting the advancement of online diagnosis tools. On the other hand, this study aims to help doctors quickly complete the preliminary diagnosis of mild COPD, moderate COPD, and severe COPD, which is more consistent with the clinical needs and has important clinical significance. The weakest aspect of this research is the quantity and quality of the data, which is currently available exclusively from the RespiratoryDatabase@TR. To better evaluate the reliability of this model in clinical application, we are using a large amount of clinical data in the following research and improving the generalization performance of the model through transfer learning.

Availability of data and materials

The dataset of lung sounds used in the study is available at https://data.mendeley.com/datasets/p9z4h98s6j/1.

References

Wang C, Xu J, Yang L, Xu Y, Zhang X, Bai C, Kang J, Ran P, Shen H, Wen F-Q, Huang K, Yao W, Sun T, Shan G, Yang T, Lin Y, Wu S, Zhu J, Wang R, He J. Prevalence and risk factors of chronic obstructive pulmonary disease in china (the china pulmonary health [CPH] study): a national cross-sectional study. The Lancet. 2018;391:1706–17. https://doi.org/10.1016/S0140-6736(18)30841-9.
Article Google Scholar
Global strategy for the diagnosis, management and prevention of COPD, global initiative for chronic obstructive lung disease (gold) 2020 report (2020)
Lca B, Sc C, Mm D. Global strategy for the diagnosis, management, and prevention of chronic obstructive lung disease 2019 report: future challenges—sciencedirect. Archivos de Bronconeumología. 2020;56(2):65–7.
Google Scholar
Jones PW. Health status and the spiral of decline. COPD: J Chronic Obstr Pulm Dis. 2009;6(1):59–63. https://doi.org/10.1080/15412550802587943.
Article Google Scholar
Han MK, Muellerova H, Curran-Everett D, Dransfield MT, Washko GR, Regan EA, Bowler RP, Beaty TH, Hokanson JE, Lynch DA, Jones PW, Anzueto A, Martinez FJ, Crapo JD, Silverman EK, Make BJ. Gold 2011 disease severity classification in COPDGene: a prospective cohort study. Lancet Respir Med. 2013;1(1):43–50. https://doi.org/10.1016/S2213-2600(12)70044-9.
Article PubMed Google Scholar
Patel A, Patel A, Singh S, Singh S, Khawaja I. Global initiative for chronic obstructive lung disease: the changes made. Cureus. 2019;11:1–7. https://doi.org/10.7759/cureus.4985.
Article CAS Google Scholar
Kinney G, Santorico S, Young K, Cho M, Castaldi P, Estépar R, Ross J, Dy J, Make B, Regan E, Lynch D, Curran-Everett D, Lutz S, Silverman E, Washko G, Crapo J, Hokanson J. Identification of chronic obstructive pulmonary disease axes that predict all-cause mortality: the COPDGene study. Am J Epidemiol. 2018;187:2109–16. https://doi.org/10.1093/aje/kwy087.
Article PubMed PubMed Central Google Scholar
Gershon A, Hwee J, Croxford R, Aaron S, To T. Patient and physician factors associated with pulmonary function testing for COPD a population study. Chest. 2013;145:272–81. https://doi.org/10.1378/chest.13-0790.
Article Google Scholar
Chen S, Huang M, Peng X, Yuan Y, Zhao H. Lung sounds can be used as an indicator for assessing severity of chronic obstructive pulmonary disease at the initial diagnosis. Nan fang yi ke da xue xue bao = Journal of Southern Medical University. 2020;40(2):177–82.
PubMed Google Scholar
Sarkar M, Bhardwaz R, Madabhavi I, Modi M. Physical signs in patients with chronic obstructive pulmonary disease. Lung India. 2019;36(1):38.
Article Google Scholar
Jácome C, Oliveira A, Marques A. Computerized respiratory sounds: a comparison between patients with stable and exacerbated COPD. Clin Respir J. 2015;11(9):612–20.
PubMed Google Scholar
Baroi S, McNamara RJ, McKenzie DK, Gandevia S, Brodie MA. Advances in remote respiratory assessments for people with chronic obstructive pulmonary disease: a systematic review. Telemed e-Health. 2018;24(6):415–24. https://doi.org/10.1089/tmj.2017.0160.
Article Google Scholar
Jácome C, Marques A. Computerized respiratory sounds: novel outcomes for pulmonary rehabilitation in COPD. Respir Care. 2017;62(2):199–208.
Article Google Scholar
Tokuda Y, Miyagi S. Physical diagnosis of chronic obstructive pulmonary disease. Internal Med. 2007;46(23):1885–91.
Article Google Scholar
Adhi P, Stuart B, Esther RV, Thomas P. Automatic adventitious respiratory sound analysis: a systematic review. PLoS ONE. 2017;12(5):0177926.
Google Scholar
Wang Q, Wang H, Wang L, Yu F. Diagnosis of chronic obstructive pulmonary disease based on transfer learning. IEEE Access. 2020;PP:1. https://doi.org/10.1109/ACCESS.2020.2979218.
Article Google Scholar
Ying J, Dutta J, Guo N, Hu C, Zhou D, Sitek A, Li Q. Classification of exacerbation frequency in the COPDGene cohort using deep learning with deep belief networks. IEEE J Biomed Health Inform. 2020;24(6):1805–13. https://doi.org/10.1109/JBHI.2016.2642944.
Article PubMed Google Scholar
Daniel SM, Antonio LJ, Astorga MS. Computer-aided diagnosis of pneumonia in patients with chronic obstructive pulmonary disease. J Am Med Inform Assoc Jamia. 2013;20(e1):111–7.
Article Google Scholar
Altan G, Kutlu Y, Pekmezci AO, Nural S. Deep learning with 3D-second order difference plot on respiratory sounds. Biomed Signal Process Control. 2018;45(8):58–69.
Article Google Scholar
Altan G, Kutlu Y, Allahverdi N. Deep learning on computerized analysis of chronic obstructive pulmonary disease. IEEE J Biomed Health Inform. 2020;24(5):1344–50.
Article Google Scholar
Altan G, Kutlu Y, Gken A. Chronic obstructive pulmonary disease severity analysis using deep learning on multi-channel lung sounds. Turk J Electr Eng Comput Sci. 2020;28(5):2979–96.
Article Google Scholar
Gökçen A. Computer-aided diagnosis system for chronic obstructive pulmonary disease using empirical wavelet transform on auscultation sounds. Comput J. 2021. https://doi.org/10.1093/comjnl/bxaa191.
Article Google Scholar
Altan G, Kutlu Y. Hessenberg Elm autoencoder kernel for deep learning. J Eng Technol Appl Sci. 2018;3:141–51. https://doi.org/10.30931/jetas.450252.
Article Google Scholar
Otaiby T, Abd El-Samie F, Alshebeili S, Ahmad I. A review of channel selection algorithms for EEG signal processing. EURASIP J Adv Signal Process. 2015;2015:66. https://doi.org/10.1186/s13634-015-0251-9.
Article Google Scholar
Altan G, Kutlu Y, Garbi Y, Pekmezci A, Nural S. Multimedia respiratory database (RespiratoryDatabase@Tr): auscultation sounds and chest X-rays. Nat Eng Sci. 2017;2458–8989(2):59–72. https://doi.org/10.28978/nesciences.349282.
Article Google Scholar
Meslier N, Charbonneau G, RAcIneux JL. Wheezes. Eur Respir J. 1995;8(11):1942–8.
Article CAS Google Scholar
Yan R, Gao R. A tour of the tour of the Hilbert–Huang transform: an empirical tool for signal analysis. Instrum Meas Mag IEEE. 2007;10:40–5. https://doi.org/10.1109/MIM.2007.4343566.
Article Google Scholar
Allahverdi, N., Altan, G., Kutlu, Y.: Deep learning for copd analysis using lung sounds. In: The 6th international conference on control and optimization with industrial applications COIA 2018; July 11–13, 2018 Baku, Azerbaijan (2018)
Altan G, Kutlu Y, Allahverdi N. A new approach to early diagnosis of congestive heart failure disease by using Hilbert–Huang transform. Comput Methods Progr Biomed. 2016;137:23–34. https://doi.org/10.1016/j.cmpb.2016.09.003.
Article Google Scholar
Huang N, Shen Z, Long S, Wu MLC, Shih H, Zheng Q, Yen N-C, Tung C-C, Liu H. The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. Proc R Soc Lond Ser A Math Phys Eng Sci. 1998;454:903–95. https://doi.org/10.1098/rspa.1998.0193.
Article Google Scholar
Wu Z, Huang N. Ensemble empirical mode decomposition: a noise-assisted data analysis method. Adv Adapt Data Anal. 2009;1:1–41. https://doi.org/10.1142/S1793536909000047.
Article Google Scholar
Kira, K., Rendell, L.: The feature selection problem: traditional methods and a new algorithm. In: Proceedings tenth national conference on artificial intelligence, p. 129–134 (1992)
Mahendra Y, Tjandrasa H, Fatichah C. Klasifikasi data eeg untuk mendeteksi keadaan tidur dan bangun menggunakan autoregressive model dan support vector machine. JUTI: Jurnal Ilmiah Teknologi Informasi. 2017;15:35. https://doi.org/10.12962/j24068535.v15i1.a633.
Article Google Scholar
Altan G, Kutlu Y, Allahverdi N. A multistage deep belief networks application on arrhythmia classification. Int J Intell Syst Appl Eng. 2016. https://doi.org/10.18201/ijisae.2016SpecialIssue-146978.
Article Google Scholar
Altan G, Kutlu Y, Allahverdi N. Deep belief networks based brain activity classification using EEG from slow cortical potentials in stroke. Int J Appl Math Electron Comput. 2016. https://doi.org/10.18100/ijamec.270307.
Article Google Scholar
Kandaswamy A, Kumar C, Ramanathan RP, Jayaraman S, Malmurugan N. Neural classification of lung sounds using wavelet coefficients. Comput Biol Med. 2004;34(6):523–37.
Article CAS Google Scholar
Charleston-Villalobos S, Gonzalez Camarena R, Chi-Lem G, Corrales T. Crackle sounds analysis by empirical mode decomposition. nonlinear and nonstationary signal analysis for distinction of crackles in lung sounds. IEEE Eng Med Biol Mag Quart Mag Eng Med Biol Soc. 2021;26:40–7.
Google Scholar

Download references

Acknowledgements

The authors express thanks to the respiratory doctors of Chest Hospital of Tianjin University for providing full help in the research process.

Funding

This study is funded by National Key Research and Development Project, China Under Grant No.2019YFC0119402 and Major Science and Technology Projects of Tianjin, China Under Grant No.18ZXZNSY00240.

Author information

Authors and Affiliations

Academy of Medical Engineering and Translational Medicine, Tianjin University, Tianjin, 300072, China
Hui Yu & Jing Zhao
Department of Biomedical Engineering, Tianjin Key Laboratory of Biomedical Detecting Techniques and Instruments, Tianjin University, Tianjin, 300072, China
Hui Yu, Jing Zhao, Dongyi Liu, Zhen Chen & Jinglai Sun
Chest Hospital of Tianjin University, Tianjin, 300051, China
Xiaoyun Zhao

Authors

Hui Yu
View author publications
You can also search for this author in PubMed Google Scholar
Jing Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Dongyi Liu
View author publications
You can also search for this author in PubMed Google Scholar
Zhen Chen
View author publications
You can also search for this author in PubMed Google Scholar
Jinglai Sun
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoyun Zhao
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

JZ completed most of the experimental research, wrote the code and paper. HY provided important research ideas, reviewed and revised the manuscript together with DL. ZC supervised the project. JS and XZ provided theoretical and technical support for the project. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Jinglai Sun or Xiaoyun Zhao.

Ethics declarations

Ethics approval and consent to participate

RespiratoryDatabase@TR has an ethical committee approval confirmed by Mustafa Kemal University, Turkey.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Yu, H., Zhao, J., Liu, D. et al. Multi-channel lung sounds intelligent diagnosis of chronic obstructive pulmonary disease. BMC Pulm Med 21, 321 (2021). https://doi.org/10.1186/s12890-021-01682-5

Download citation

Received: 28 July 2021
Accepted: 29 September 2021
Published: 15 October 2021
DOI: https://doi.org/10.1186/s12890-021-01682-5

Multi-channel lung sounds intelligent diagnosis of chronic obstructive pulmonary disease