Skip to main content

Cluster features in fibrosing interstitial lung disease and associations with prognosis

Abstract

Background

Clustering is helpful in identifying subtypes in complex fibrosing interstitial lung disease (F-ILD) and associating them with prognosis at an early stage of the disease to improve treatment management. We aimed to identify associations between clinical characteristics and outcomes in patients with F-ILD.

Methods

Retrospectively, 575 out of 926 patients with F-ILD were eligible for analysis. Four clusters were identified based on baseline data using cluster analysis. The clinical characteristics and outcomes were compared among the groups.

Results

Cluster 1 was characterized by a high prevalence of comorbidities and hypoxemia at rest, with the worst lung function at baseline; Cluster 2 by young female patients with less or no smoking history; Cluster 3 by male patients with highest smoking history, the most noticeable signs of velcro crackles and clubbing of fingers, and the severe lung involvement on chest image; Cluster 4 by male patients with a high percentage of occupational or environmental exposure. Clusters 1 (median overall survival [OS] = 7.0 years) and 3 (OS = 5.9 years) had shorter OS than Clusters 2 (OS = not reached, Cluster 1: p < 0.001, Cluster 3: p < 0.001) and 4 (OS = not reached, Cluster 1: p = 0.004, Cluster 3: p < 0.001). Clusters 1 and 3 had a higher cumulative incidence of acute exacerbation than Clusters 2 (Cluster 1: p < 0.001, Cluster 3: p = 0.014) and 4 (Cluster 1: p < 0.001, Cluster 3: p = 0.006). Stratification by using clusters also independently predicted acute exacerbation (p < 0.001) and overall survival (p < 0.001).

Conclusions

The high degree of disease heterogeneity of F-ILD can be underscored by four clusters based on clinical characteristics, which may be helpful in predicting the risk of fibrosis progression, acute exacerbation and overall survival.

Peer Review reports

Introduction

Fibrosing interstitial lung disease (F-ILD) encompasses a heterogeneous group of diseases of various etiology, including idiopathic pulmonary fibrosis (IPF), connective tissue disease-associated interstitial lung disease (CTD-ILD), idiopathic non-specific interstitial pneumonia (iNSIP), fibrotic hypersensitivity pneumonitis (FHP); unclassifiable idiopathic interstitial pneumonia (uIIP), and occupational induced ILDs [1, 2]. Accurate diagnosis is essential for these diseases to management. However, a highly heterogeneous disease course often delays correct diagnosis and anti-fibrotic treatment in clinical practice.

Researchers have tried to find patients with irreversible rapidly progressive fibrosis, which may indicate higher mortality [3]. The term “progressive fibrosing ILD (PF-ILD)” used in a hallmark clinical trial was proposed to cover several diseases featuring high-resolution computed tomography (HRCT)-documented increase in the extent of pulmonary fibrosis, a decline in lung function, worsening respiratory symptoms, and a high risk of early mortality despite available treatments, with a clinical course similar to that of IPF [1, 2, 4, 5]. Recently, the newly updated American Thoracic Society (ATS)/European Respiratory Society (ERS)/Japanese Respiratory Society (JRS)/Asociación Latinoamericana de Tórax (ALAT) guideline suggests using the term “progressive pulmonary fibrosis (PPF)” instead. Still, the criteria of PF-ILD and PPF are different [6]. It is difficult to predict the proportion of patients with non-IPF ILDs who will develop a progressive fibrotic pattern; however, the likely predictors at high risk of progression and mortality at baseline have been investigated, such as male sex, increasing age, smoking, and HRCT-documented usual interstitial pneumonia (UIP) pattern [2, 3, 5, 7,8,9].

Using the baseline clinical features, a new trend of clustering method was aimed to help reveal a novel subtype of F-ILD, which was associated with the prognosis and treatment and had distinct clinical features. Patients with similar characteristics are grouped in the same cluster so that each cluster will be the most homogeneous and heterogeneous compared to the other clusters [10]. Previously, in chronic ILDs, clinical characteristics determined using cluster analysis illustrated considerable predictive accuracy for clinical outcomes, including progression-free survival, transplant-free survival, and acute exacerbation (AE) [11]. However, they didn’t include patients with occupational or environmental related ILDs, though occupational and environmental dust exposure is essential for clustering. Besides, the possible relationship between clusters and fibrosis progression has not been explored.

To elucidate potential relationships between clusters and outcomes, including fibrosis progression, AE, and all-cause mortality, we aimed to employ cluster analysis on a patient cohort obtained from a regional tertiary referral center specializing in the management of ILDs. We also conducted a comparison of the clinical characteristics among patients who met different criteria for fibrosis progression.

Methods

Study population and design

Patients with F-ILD aged ≥ 18 years between January 1, 2016, and January 1, 2021, from Beijing Chao-Yang Hospital, China, were retrospectively screened for the study. The diagnoses were based on multidisciplinary discussion, following ATS/ ERS guidelines [12, 13]. Patients enrolled in the study were those whose baseline medical records, pulmonary function tests, and HRCT findings were accessible, and who had at least two documented follow-up visits with follow-up intervals of less than one year. Following were the exclusion criteria: (1) < 10% fibrosis documented on baseline HRCT, (2) diagnosis of pulmonary embolism, decompensated heart failure, lower respiratory tract infections, or acute respiratory distress syndrome at baseline, (3) lung cancer at baseline, (4) missing available baseline data, (5) loss to follow-up (Supplemental Material Figure S1). The study was approved by the Ethics Committee of Beijing Chao-Yang Hospital (2020-KY-437) and performed by the principles of the Declaration of Helsinki.

Data collection

The electronic medical record was reviewed to extract pertinent data at the first visit, including age, sex, body mass index (BMI), symptoms, signs, smoking history, occupational and environmental exposures based on each patient’s reported history, comorbidities (pulmonary hypertension [defined by the right heart catheterization demonstrating a mean pulmonary artery pressure greater than 20 mmHg], diabetes, coronary heart disease, hypertension, hypothyroidism, gastroesophageal reflux disease [GERD]). All patients provided laboratory test results, pulmonary function measures [forced vital capacity (FVC), percent predicted for FVC (FVC% pred), and percent predicted for diffusion capacity of the lung for carbon monoxide (DLCO% pred)] and HRCT images. Laboratory data included arterial blood gas test results, antinuclear antibody (ANA) titers, routine blood counts, the derivative blood cell count inflammation indexes (neutrophil-to-lymphocyte ratio, monocyte-to-lymphocyte ratio, platelet-to-lymphocyte ratio), and serum oncomarkers (squamous cell carcinoma [SCC] antigen, carcinoembryonic antigen [CEA], cytokeratin fraction 21–1[CYFRA21-1], carbohydrate antigen 125 [CA125], and neuron-specific enolase [NSE]). Hypoxemia was defined as a partial pressure of oxygen in the arterial blood of less than 80 mm Hg obtained from an arterial blood gas test at rest. Two radiologists blinded to the clinical data independently reviewed the HRCT scans. The disagreements were resolved via consensus. An HRCT-documented UIP-like pattern was considered a definitive UIP or probable UIP pattern according to the Clinical Practice Guideline of IPF [6, 12].

We reviewed each case to gather treatment options. The choice of treatment depended on the decision of physicians and the use of each treatment was documented if it was ever used for at least one month. The treatment regimens for patients consisted of corticosteroids, both in monotherapy and in combination with immunosuppressive drugs (cyclophosphamide, mycophenolate mofetil, cyclosporine, azathioprine or tacrolimus), antifibrotic therapy (nintedanib or pirfenidone), and long-term oxygen therapy (at least 15 h per day). Each patient’s Gender-Age-Physiology (GAP) score was calculated and assigned to their respective GAP stage. The GAP index was preferred over the ILD-GAP index, as some patients had ILD diagnoses that did not align with the ILD categories in the ILD-GAP index [14, 15].

Follow-up and outcomes of the study

Follow-up data were obtained from outpatient follow-up records (usually every 3–6 months), hospitalization details, and telephone interviews and the follow-up period ended on January 1, 2022. The outcome of the study was the (1) occurrence of fibrosis progression, (2) occurrence of AE, and (3) all-cause mortality. AE was defined as an acute, clinically significant respiratory deterioration that occurred in less than one month and was accompanied by new radiologic abnormalities on HRCT, such as diffuse, bilateral ground-glass opacification, with no obvious clinical cause, such as fluid overload, left heart failure, or pulmonary embolism [16, 17]. Overall survival (OS) was measured from the first visit to death from any cause.

Fibrosis progression

Both two different definitions explain fibrosis progression in the cohort. Firstly, PPF was defined as at least two of three criteria (worsening symptoms, radiological progression, and physiological progression) occurring within 12 months (Criteria A) [6]. Physiological progression was defined as either of the following: (1) absolute decline in FVC% pred > 5%; (2) absolute decline in DLCO% pred > 10% within 12 months of follow-up [6]. Notably, our cohort included patients with IPF. Secondly, Patients meeting any of the following criteria within 24 months period have experienced fibrosis progression (Criteria B): (a) a relative decline of ≥ 10% in FVC% pred; (b) a relative decline of ≥ 15% in DLCO% pred; (c) worsening symptoms or a worsening radiological appearance accompanied by a ≥ 5–<10% relative decrease in FVC% pred [1].

Statistical analysis

Statistical analyses were performed using the SPSS Statistics software, version 26 (IBM, Inc., Chicago, IL, USA). After excluding those with missing baseline data, the complete data of 575 patients were used for cluster analysis. Based on previous literature, 17 variables were identified with substantial clinical relevance for inclusion in the “two-step” cluster analysis [1, 11, 18]. These variables were as follows: male, age, BMI, heavy smoking, exposure history, hypoxemia, signs [velcro crackles, clubbing of fingers], lung function [FVC, FVC% pred, DLCO% pred], HRCT features [UIP-like pattern, emphysema], ANA titers and comorbidities [pulmonary hypertension, diabetes, coronary heart disease]). The choice of a similarity measure was based on the log-likelihood distance. To determine which number of clusters is “best”, each of the cluster solutions was compared using Schwarz’s Bayesian Criterion as the clustering criterion. The optimal number of four clusters was automatically determined. The silhouette coefficient was 0.5 and the cluster quality was fair. When we specified the number at three or five, the silhouette coefficient was lower than 0.5. The discriminant function analysis was performed to validate the cluster analysis results, and 93.7% of original grouped cases were correctly classified.

Data were expressed as means (standard deviation, SD) or medians (quartile) depending on distribution and numbers (percentage), respectively. Mann–Whitney U, Kruskal–Wallis test, or One-way ANOVA was used for continuous variables and the chi-squared test or Fisher’s exact test for categorical variables with Bonferroni post-hoc tests to compare the difference between every two groups. Survival curves were obtained using the Kaplan-Meier method. Survival was assessed using unadjusted log-rank testing and univariate and multivariable Cox proportional hazards regression. All statistical tests were 2-sided, and a p-value < 0.05 was considered statistically significant.

Results

Demographic and clinical characteristics of the whole cohort

The final cohort included 575 patients with 3.5 years median follow-up time. A summary of the demographic and clinical characteristics of all the patients included in this cohort was shown in Table 1. 94 (16.3%), 299 (52%), 2 (0.3%), 48 (8.3%), 36 (6.3%), 93 (16.2%), and 3(1.8%) patients were diagnosed with IPF, CTD-ILD, iNSIP, FHP, uIIP, occupational induced ILDs (asbestos/chronic silicosis) and sarcoidosis, respectively (Table 2).

Table 1 Demographics and clinical characteristics of the patients in four clusters
Table 2 Distinct diagnosis of the enrolled patients

Clinical characteristics of the clusters

Seventeen variables were selected to apply the two-step method for clustering. A 4-cluster model best fit the overall dataset.

Cluster 1 (n = 181) featured a higher prevalence of hypoxemia at rest (66.3%), lowest median FVC% pred (79.7%) and diabetes (50.3%) (Fig. 1; Table 1). The median age of patients was 65 years with almost equal distributions of males (51.5%) and females (48.5%). The cluster was with the second-lowest rate of smoking (17.1%) or heavy smoking (13.3%), and 53.6% of patients had autoimmune features (ANA > 1:100). Cluster 1 had the highest expression of white blood cell (WBC, 8.1 × 109), neutrophils (5.7 × 109), platelet (241.7 × 109), neutrophil-to-lymphocyte ratio (4.8), platelet-to-lymphocyte ratio (180.7), and the highest expression of SCC (1.3 ng/ml), CYFRA21-1 (5.6 ng/ml), and NSE (17.2 ng/ml) (Table S2).

Fig. 1
figure 1

Radar plot of top 10 important predictors. Plot scale: 1, 0.6 standard deviations (SDs) below the mean; 2, 0.2 SDs below the mean; 3, > 0.2 SDs above the mean; 4, > 0.6 SDs above the mean; 5, > 1 SDs above the mean; Variables were displayed clockwise based on predictor’s importance. Abbreviations: ANA: antinuclear antibody; DLCO: diffusion capacity of the lung for carbon monoxide; FVC: forced vital capacity; UIP: usual interstitial pneumonia

Cluster 2 (n = 164) described a cohort of predominantly female patients (81.7%) with the second-lowest rate of UIP-like fibrotic pattern (9.1%), least heavy smoking history (3.7%), and no hypoxemia was found (Fig. 1; Table 1). The median age was the youngest (59.5 years). The positive rate of ANA (ANA > 1:100) was the highest (54.6%). No emphysema was found in this cluster. It featured preserved FVC% pred (89.6%) and DLCO% pred (61%), the lowest prevalence of comorbidities. Regarding routine blood counts (Table S2), Cluster 2 had the lowest expression of WBC (6.7 × 109), neutrophils (4.3 × 109), lymphocytes (1.7 × 109), monocyte (0.4 × 109), and hemoglobin (125.0 g/L), and the lowest expression of SCC (0.9 ng/ml), CEA (2.2 ng/ml), and CA125 (16.4 U/ml).

Cluster 3 (n = 134) included a cohort of male patients (88.8%) with the highest rate of smoking (79.9%) or heavy smoking (67.9%), UIP-like fibrotic pattern (73.9%) on HRCT (Fig. 1; Table 1). The median age was 66 years and the percentages of clubbing of fingers (55.2%) and velcro crackles (95.5%) were the highest. The FVC% pred (85.1%) was preserved, while the DLCO% pred (58%) was the lowest. Regarding laboratory data (Table S2), it had the highest expression of monocyte (0.5 × 109), hemoglobin (141.1 g/L), red cell distribution width (coefficient of variation) (15.9%), SCC (1.3 ng/ml), CEA (3.9 ng/ml), CYFRA21-1 (4.7 ng/ml) and CA125 (29.9 U/ml).

Cluster 4 (n = 96) featured the least UIP-like fibrotic pattern (2.1%), highest proportion of male patients (90.6%), and the most frequent exposure history (87.5%) (Fig. 1; Table 1). The clinical signs of clubbing of fingers (1%) and velcro crackles (29.2%) and emphysema (11.5%) on HRCT were presented least (Table 1). The median FVC% pred was 87.3% and DLCO% pred was 71.1%. The positive rate of ANA (ANA > 1:100) was the lowest (12.5%) and it had the highest expression of monocyte-to-lymphocyte ratio (4.9) and lowest of platelet-to-lymphocyte ratio (122.7) (Table S2).

To assess baseline disease severity among the clusters, we evaluated the GAP score (Table 1). The median GAP score was similar among Cluster 1, Cluster 3 and Cluster 4 (GAP = 3). Cluster 2 had the lowest median score compared with others (GAP = 2, p < 0.001).

Clinical outcomes among the clusters

The clinical outcomes of 4 clusters at the end of the study were shown in Table 3. The percentage for AE and mortality of the 4 clusters was 37%, 17.1%, 27.6%, 14.6%, and 24.9%, 6.7%, 33.6%, 12.5%, respectively. OS was significantly shorter in Cluster 1 (median OS = 7.0 years) and Cluster 3 (median OS = 5.9 years) than in Cluster 2 (Cluster 1: p < 0.001, Cluster 3: p < 0.001) and Cluster 4 (Cluster 1: p = 0.004, Cluster 3: p < 0.001) (Fig. 2). The accumulative incidence rate of AE was higher in Clusters 1 and 3 than in Clusters 2 (Cluster 1: p < 0.001, Cluster 3: p = 0.014) and 4 (Cluster 1: p < 0.001, Cluster 3: p = 0.006). When stratified by diagnosis, OS was significantly shorter in patients with IPF (median OS = 5.0 years) than in CTD-ILD (p < 0.001), FHP (p = 0.004), and occupational induced ILDs (p < 0.001). Patients with IPF (p = 0.04) and (p = 0.043) had higher cumulative incidence rates of AE than FHP and patients with CTD-ILD had higher incidence rates than occupational induced ILDs (p = 0.003). Patients with uIIPs also had higher cumulative incidence rates than occupational-induced ILDs (p = 0.028). OS was shortest in patients with iNSIP than in others (p < 0.001), but only two patients were included in this group. Cluster 1 (Criteria A: 29.8%, Criteria B: 47%) and Cluster 3 (Criteria A: 35.8%, Criteria B: 49.3%) showed a higher rate of fibrosis progression despite different criteria for progression (Table 3). In addition, 150 out of 575 (26.1%) patients met Criteria A of fibrosis progression while 239 (41.6%) met Criteria B. Statistical significance was not observed in routine blood counts, oncomarkers and outcomes between patients with different criteria (Tables S1 and S3).

Table 3 Clinical outcomes of the patients in four clusters
Fig. 2
figure 2

Overall survival (OS) and acute exacerbation (AE). (a) OS in 4 clusters. (b) OS in distinct diagnosis. (c) AE in 4 clusters. (d) AE in distinct diagnosis. Abbreviations: CTD: connective tissue disease; FHP: fibrotic hypersensitivity pneumonitis; ILD: interstitial lung disease; iNSIP: idiopathic non-specific interstitial pneumonia; IPF: idiopathic pulmonary fibrosis; uIIP: unclassifiable idiopathic interstitial pneumonia

When evaluating AE and OS according to univariate Cox regression analysis, predictors included phenotypic clusters (p < 0.001), diagnosis category (p < 0.001) and GAP score (p < 0.001). After adjusting for diagnosis category, corticosteroid use, immunosuppressive therapy and anti-fibrotic treatment, multivariable analysis also suggested that phenotypic clusters (p < 0.001) and GAP score (p < 0.001) independently predicted OS and AE (Table 4).

Table 4 Variables predicting acute exacerbation and overall survival

Discussion

In this retrospective cohort study, a heterogeneous group of patients with F-LID was grouped into four clusters according to clinical and comorbidities variables present before clinical diagnosis. After forming the clusters, we compared the clinical features, laboratory data, and clinical outcomes between clusters and tried to identify the group of patients with the poorest prognosis.

Cluster 1, in which 53.6% of the patients had autoimmune features, included a group of gender equally distributed patients with a higher prevalence of some common comorbidities in aging people and a higher prevalence of hypoxemia at rest, and the lung function measures at baseline were worst among four clusters. Like Cluster 1, 54.3% of patients in Cluster 2 had autoimmune features. It featured mainly young female patients with less or no smoking history. Cluster 3 included predominantly male patients with the highest rate of smoking history. The signs of velcro crackles and clubbing fingers, and the UIP pattern were presented most. Besides, we identified a cluster of patients (Cluster 4) with the highest percentage of occupational or dust exposure history. The median age for this group with the highest proportion of male patients was younger, and the clinical signs and UIP-like fibrotic pattern and emphysema on HRCT were presented least. Regarding the prognosis, Clusters 1 and 3 had a higher risk for AE, and OS was shorter in the two Clusters, consistent with clinical practice. The prevalence of fibrosis progression was also higher in Clusters 1 and 3, despite the different definitions.

Cluster analysis is a valuable approach to investigating patients with respiratory diseases [19, 20], such as asthma [21], chronic obstructive pulmonary disease [22], idiopathic pulmonary arterial hypertension [23], sarcoidosis [24], and IPF [25], as it allows the identification of distinct characteristics among the heterogeneous disease courses. A cluster analysis of patients with idiopathic interstitial pneumonia and emphysema concluded that the cluster consisting mostly of IPF patients had significantly poor mortality [26]. Another cluster analysis was performed with a cohort of 770 chronic ILD patients (37% were diagnosed with IPF) [11]. Patients with the most rapid decline in lung function, increased fibrosis progression, and poor survival were usually clustered together. And a group of elderly white male smokers with severe honeycombing had poor mortality, similar to Cluster 3 in our study. However, the study did not include patients with ILD of known etiology, of which occupational exposure is an essential factor. Our clustering was of clinical feasibility and practice in exploratory inclusion of occupational induced ILD patients. Patients in Cluster 4 in our study was characterized by non-UIP pattern on HRCT and the highest frequency of occupational exposure history. The primary underlying diagnosis for this cluster was occupational related ILDs. Generally, patients with occupational-related ILDs progress slowly, which may account for the better OS observed in this cluster [27, 28]. Conversely, Cluster 1, characterized by a high prevalence of CTD-ILD, displayed a poorer survival outcome. One plausible explanation for this observation is that patients in this cluster may have been influenced by baseline lower lung function and comorbid conditions, such as diabetes, a known risk factor for the development of IPF and vascular complications [29].

Patients with a rapid decline in lung function, increased extent of pulmonary fibrosis on chest images and worsened symptoms showed a higher risk of AE and poorer survival [2, 9]. Previously, a clinical trial reporting a beneficial effect of antifibrotic treatment in non-IPF ILDs with fibrosis progression suggested criteria B as the definition for fibrosis progression [1, 4, 30]. Recently, updated Criteria A was published [6]. Here, we adopted both two measures. In total, 26.1% of patients fulfilled Criteria A while 41.6% the Criteria B. No statistically significant difference was found in clinical features. However, the percentage of patients with AE and mortality was higher in criteria A than B, suggesting the possible sensitivity of the newly updated definition for poor prognosis.

Given the similar clinical features and prognosis of patients with PPF and IPF, the acceptance of antifibrotic treatment for IPF led to investigation of such treatment in other fibrotic lung diseases [2, 8, 31, 32]. Previously, clinical trials such as INBUILD [4], RELIEF [32] and pirfenidone on uIIP [33] reported a beneficial effect of antifibrotic treatment in non-IPF ILDs with PPF. The annual rate of decline in the FVC in patients with PF-ILD was significantly lower among patients who received nintedanib (-80.8 mL/year) than among those who received a placebo (-187.8 mL/year) [4]. In this study, OS was shorter in patients with anti-fibrotic treatment than in non-anti-fibrotic therapy in the whole cohort. After cluster and criteria stratification, no difference was found in subgroups (Figure S2). It is not unexpected that patients who received anti-fibrotic therapy would have higher mortality because the medication varied due to the compliance, socio-economic situation and medical insurance of the patients and the doctors’ prescriptions in this retrospective cohort [34]. The treatment choices could be clustered as factors in a prospective cohort in further research.

This study has several strengths. In contrast to prior studies [11, 21], we specifically included occupational induced ILD of known etiology and assessed the presence of fibrosis progression in various clusters. For occupationally related exposures such as chronic silicosis and asbestosis, the majority of patients tend to experience slow progression [27, 28]. Next, our exploration involving the inclusion of PF-ILD/PPF patients and comparison of patients meeting PF-ILD criteria to those meeting PPF criteria represented a novel and pioneering endeavor, provided valuable additional findings. Using cluster analysis, comprehensive information including demographics, physiological values, chest imaging, laboratory values and complications was provided in this study.

Several limitations should be considered. First, the monocentric and retrospective design with inherent limitations may have led to a selection and information recall bias, and the exclusion of cases with missing values during the clustering analysis undoubtedly introduces selection bias. The aims of this study and the explanations of the results should be interpreted with caution. Secondly, though we employed clinical variables that could represent patients’ demographic, historical, physical, laboratory, and radiographic information, the numbers and optimal combination of variables used for the clustering needed to be validated. A previous study has indicated that other variables like race and GERD could contribute to clustering [11], which would have provided important information regarding the outcomes of ILD. However, this was a single ethnic study and the presence of GERD was not recorded reliably in our medical system. GERD was sometimes diagnosed due to proton pump inhibitors used by patients to be eligible for government health insurance coverage. Furthermore, we compared clinical features of chronic silicosis and asbestosis patients, considering potential variations in disease progression due to distinct exposures. Our findings indicated no statistically significant differences in prognosis between these groups (Table S6and Figure S4). Given the limited sample size and disease categories when stratifying disease types, a comprehensive exposure dataset is anticipated.

Conclusion

Using baseline clinical features, our study identified four clusters of the heterogeneous group of F-ILD patients. The four clinical clusters differed in expressions of laboratory data, risks of fibrosis progression, AE, and survival, highlighting the significant heterogeneity between clusters and homogeneity within clusters and may give a clue to predict clinical prognosis and develop management strategies for these patients. Further studies are warranted to optimize the cluster variables and clinical classifications of F-ILD with prospective and multicenter study designs.

Data Availability

The datasets used and/or analyzed during the current study are available from the corresponding author upon reasonable request.

Abbreviations

AE:

acute exacerbation

ALAT:

Asociación Latinoamericana de Tórax

ANA:

antinuclear antibody

ATS:

American Thoracic Society

BMI:

body-mass index

CA125:

carbohydrate antigen 125

CEA:

carcinoembryonic antigen

CTD:

connective tissue disease

CYFRA21-1:

cytokeratin fraction 21–1

DLCO:

diffusion capacity of the lung for carbon monoxide

ERS:

European Respiratory Society

FHP:

fibrotic hypersensitivity pneumonitis

F-ILD:

fibrosing interstitial lung disease

FVC:

forced vital capacity

GAP:

gender-age-physiology

GERD:

gastroesophageal reflux disease

HRCT:

high-resolution computed tomography

ILD:

interstitial lung disease

iNSIP:

idiopathic non-specific interstitial pneumonia

IPF:

idiopathic pulmonary fibrosis

JRS:

Japanese Respiratory Society

OS:

overall survival

PF-ILD:

progressive fibrosing- interstitial lung disease

PPF:

progressive pulmonary fibrosis

NSE:

neuron-specific enolase

SCC:

squamous cell carcinoma antigen

uIIP:

unclassifiable idiopathic interstitial pneumonia

UIP:

usual interstitial pneumonia

WBC:

white blood cell

References

  1. Cottin V, Hirani NA, Hotchkin DL, et al. Presentation, diagnosis and clinical course of the spectrum of progressive-fibrosing interstitial lung Diseases. Eur Respir Rev. 2018;27(150):180076.

    Article  PubMed  PubMed Central  Google Scholar 

  2. Wongkarnjana A, Scallan C, Kolb MRJ. Progressive fibrosing interstitial lung Disease: treatable traits and therapeutic strategies. Curr Opin Pulm Med. 2020;26(5):436–42.

    Article  CAS  PubMed  Google Scholar 

  3. Nasser M, Larrieu S, Si-Mohamed S, et al. Progressive fibrosing interstitial lung Disease: a clinical cohort (the PROGRESS study). Eur Respir J. 2021;57(2):2002718.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  4. Flaherty KR, Wells AU, Cottin V, et al. Nintedanib in Progressive Fibrosing interstitial lung Diseases. N Engl J Med. 2019;381(18):1718–27.

    Article  CAS  PubMed  Google Scholar 

  5. George PM, Spagnolo P, Kreuter M, et al. Progressive fibrosing interstitial lung Disease: clinical uncertainties, consensus recommendations, and research priorities. Lancet Respir Med. 2020;8(9):925–34.

    Article  PubMed  Google Scholar 

  6. Raghu G, Remy-Jardin M, Richeldi L, et al. Idiopathic Pulmonary Fibrosis (an update) and Progressive pulmonary fibrosis in adults: an Official ATS/ERS/JRS/ALAT Clinical Practice Guideline. Am J Respir Crit Care Med. 2022;205(9):e18–e47.

    Article  PubMed  PubMed Central  Google Scholar 

  7. Schimmelpennink MC, Meek DB, Vorselaars ADM, et al. Characterization of the PF-ILD phenotype in patients with advanced pulmonary sarcoidosis. Respir Res. 2022;23(1):169.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  8. Hoffmann-Vold AM, Allanore Y, Alves M, et al. Progressive interstitial lung Disease in patients with systemic sclerosis-associated interstitial lung Disease in the EUSTAR database. Ann Rheum Dis. 2021;80(2):219–27.

    Article  CAS  PubMed  Google Scholar 

  9. Walsh SLF, Devaraj A, Enghelmayer JI, et al. Role of imaging in progressive-fibrosing interstitial lung Diseases. Eur Respir Rev. 2018;27(150):180073.

    Article  PubMed  PubMed Central  Google Scholar 

  10. Dalmaijer ES, Nord CL, Astle DE. Statistical power for cluster analysis. BMC Bioinformatics. 2022;23(1):205–5.

    Article  PubMed  PubMed Central  Google Scholar 

  11. Adegunsoye A, Oldham JM, Chung JH, et al. Phenotypic clusters predict outcomes in a longitudinal interstitial lung Disease Cohort. Chest. 2018;153(2):349–60.

    Article  PubMed  Google Scholar 

  12. Raghu G, Remy-Jardin M, Myers JL, et al. Diagnosis of Idiopathic Pulmonary Fibrosis. An Official ATS/ERS/JRS/ALAT Clinical Practice Guideline. Am J Respir Crit Care Med. 2018;198(5):e44–e68.

    Article  PubMed  Google Scholar 

  13. Travis WD, Costabel U, Hansell DM, King TE Jr, Lynch DA, Nicholson AG, et al. An official American Thoracic Society/European Respiratory Society statement: update of the international multidisciplinary classification of the idiopathic interstitial pneumonias. Am J Respir Crit Care Med. 2013;188(6):733–48.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Ley B, Ryerson CJ, Vittinghoff E, et al. A multidimensional index and staging system for Idiopathic Pulmonary Fibrosis. Ann Intern Med. 2012;156:684–91.

    Article  PubMed  Google Scholar 

  15. Ryerson CJ, Vittinghoff E, Ley B, et al. Predicting survival across chronic interstitial lung Disease: the ILD-GAP model. Chest. 2014;145:723–8.

    Article  PubMed  Google Scholar 

  16. Collard HR, Ryerson CJ, Corte TJ, Jenkins G, Kondoh Y, Lederer DJ, Lee JS, Maher TM, Wells AU, Antoniou KM, et al. Acute exacerbation of Idiopathic Pulmonary Fibrosis. An international working group report. Am J Respir Crit Care Med. 2016;194(3):265–75.

    Article  CAS  PubMed  Google Scholar 

  17. Suzuki A, Kondoh Y, Brown KK, et al. Acute exacerbations of fibrotic interstitial lung Diseases. Respirology. 2020;25(5):525–34.

    Article  PubMed  Google Scholar 

  18. American Thoracic Society/European. Respiratory Society international multidisciplinary consensus classification of the idiopathic interstitial pneumonias. Am J Respir Crit Care Med. 2002;165(2):277–304.

    Article  Google Scholar 

  19. Lia Y, Fana Y, Wanga Y, et al. Phenotypic clusters and survival analyses in interstitial Pneumonia with myositis-specific autoantibodies. Sarcoidosis Vasc Diffuse Lung Dis. 2022;38(4):e2021047.

    PubMed  Google Scholar 

  20. Zhang J, Xue Y, Liu X, et al. Identification of 4 subgroups in juvenile dermatomyositis by principal component analysis-based cluster analysis. Clin Exp Rheumatol. 2022;40(2):443–9.

    Article  CAS  PubMed  Google Scholar 

  21. Wu W, Bang S, Bleecker ER, et al. Multiview Cluster Analysis identifies variable corticosteroid response phenotypes in severe Asthma. Am J Respir Crit Care Med. 2019;199(11):1358–67.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  22. Yoon HY, Park SY, Lee CH, et al. Prediction of first acute exacerbation using COPD subtypes identified by cluster analysis. Int J Chron Obstruct Pulmon Dis. 2019;14:1389–97.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  23. Badagliacca R, Rischard F, Papa S, et al. Clinical implications of idiopathic pulmonary arterial Hypertension phenotypes defined by cluster analysis. J Heart Lung Transplant. 2020;39(4):310–20.

    Article  PubMed  Google Scholar 

  24. Lin NW, Arbet J, Mroz MM, et al. Clinical phenotyping in sarcoidosis using cluster analysis. Respir Res. 2022;23(1):88.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Marks-Garber K, Bdolah-Abram T, Nusair S. Cluster analysis based clinical profiling of Idiopathic Pulmonary Fibrosis patients according to comorbidities evident prior to diagnosis: a single-center observational study. Eur J Intern Med. 2020;80:18–23.

    Article  PubMed  Google Scholar 

  26. Sato S, Tanino Y, Misa K, et al. Identification of clinical phenotypes in idiopathic interstitial Pneumonia with Pulmonary Emphysema. Intern Med. 2016;55(12):1529–35.

    Article  PubMed  Google Scholar 

  27. O’Reilly KM, Mclaughlin AM, Beckett WS, Sime PJ. Asbestos-related lung Disease. Am Fam Physician. 2007;75(5):683–8.

    PubMed  Google Scholar 

  28. Hoy RF, Chambers DC. Silica-related Diseases in the modern world. Allergy. 2020;75(11):2805–17.

    Article  PubMed  Google Scholar 

  29. Rajasurya V, Gunasekaran K, Surani S. Interstitial lung Disease and Diabetes. World J Diabetes. 2020;11(8):351–7.

    Article  PubMed  PubMed Central  Google Scholar 

  30. Collins BF, Raghu G. Antifibrotic therapy for fibrotic lung Disease beyond Idiopathic Pulmonary Fibrosis. Eur Respir Rev. 2019;28(153):190022.

    Article  PubMed  PubMed Central  Google Scholar 

  31. Aimo A, Spitaleri G, Nieri D, et al. Pirfenidone for Idiopathic Pulmonary Fibrosis and Beyond. Card Fail Rev. 2022;8:e12.

    Article  PubMed  PubMed Central  Google Scholar 

  32. Behr J, Prasse A, Kreuter M, et al. Pirfenidone in patients with Progressive fibrotic interstitial lung Diseases other than Idiopathic Pulmonary Fibrosis (RELIEF): a double-blind, randomised, placebo-controlled, phase 2b trial. Lancet Respir Med. 2021;9(5):476–86.

    Article  CAS  PubMed  Google Scholar 

  33. Maher TM, Corte TJ, Fischer A, et al. Pirfenidone in patients with unclassifiable Progressive fibrosing interstitial lung Disease: a double-blind, randomised, placebo-controlled, phase 2 trial. Lancet Respir Med. 2020;8(2):147–57.

    Article  CAS  PubMed  Google Scholar 

  34. Wuyts WA, Papiris S, Manali E, et al. The Burden of Progressive Fibrosing interstitial lung Disease: a DELPHI Approach. Adv Ther. 2020;37(7):3246–64.

    Article  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

We thank all patients who participated in this study. We thank Miss Moyang Xu of the University of Michigan, Ann Arbor, for polishing the language and grammar of the manuscript. The supplemental methods, figures and tables can be found in the Supplemental Materials.

Funding

The work was sponsored by High Level Public Health Technology Talent Construction Project (DL-02-21) (to QY) and the Reform and Development Program of Beijing Institute of Respiratory Medicine (Ggyfz202321) (to QY).

Author information

Authors and Affiliations

Authors

Contributions

The authors were responsible for all content of the manuscript. YW collected, analyzed and interpreted data, and wrote the manuscript. DS and JW were responsible for recruiting the patients and data collecting data. NW and SY helped review chest images. QY contributed as the primary investigator and was responsible for designing the study, analyzing and interpreting the data, and revising the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Qiao Ye.

Ethics declarations

Ethics approval and consent to participate

The study was approved by the Ethics Committee of Beijing Chao-Yang Hospital (2020-KY-437) and performed by the principles of the Declaration of Helsinki. The requirement of obtaining informed consent was waived by the Ethics Committee of Beijing Chao-Yang Hospital due to the retrospective design of the study.

Competing interests

The authors declare no competing interests.

Consent for publication

Not Applicable.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, Y., Sun, D., Wang, J. et al. Cluster features in fibrosing interstitial lung disease and associations with prognosis. BMC Pulm Med 23, 420 (2023). https://doi.org/10.1186/s12890-023-02735-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12890-023-02735-7

Keywords