- Research article
- Open Access
A cross-omics integrative study of metabolic signatures of chronic obstructive pulmonary disease
BMC Pulmonary Medicine volume 20, Article number: 193 (2020)
Chronic obstructive pulmonary disease (COPD) is a common lung disorder characterized by persistent and progressive airflow limitation as well as systemic changes. Metabolic changes in blood may help detect COPD in an earlier stage and predict prognosis.
We conducted a comprehensive study of circulating metabolites, measured by proton Nuclear Magnetic Resonance Spectroscopy, in relation with COPD and lung function. The discovery sample consisted of 5557 individuals from two large population-based studies in the Netherlands, the Rotterdam Study and the Erasmus Rucphen Family study. Significant findings were replicated in 12,205 individuals from the Lifelines-DEEP study, FINRISK and the Prospective Investigation of the Vasculature in Uppsala Seniors (PIVUS) studies. For replicated metabolites further investigation of causality was performed, utilizing genetics in the Mendelian randomization approach.
There were 602 cases of COPD and 4955 controls used in the discovery meta-analysis. Our logistic regression results showed that higher levels of plasma Glycoprotein acetyls (GlycA) are significantly associated with COPD (OR = 1.16, P = 5.6 × 10− 4 in the discovery and OR = 1.30, P = 1.8 × 10− 6 in the replication sample). A bi-directional two-sample Mendelian randomization analysis suggested that circulating blood GlycA is not causally related to COPD, but that COPD causally increases GlycA levels. Using the prospective data of the same sample of Rotterdam Study in Cox-regression, we show that the circulating GlycA level is a predictive biomarker of COPD incidence (HR = 1.99, 95%CI 1.52–2.60, comparing those in the highest and lowest quartile of GlycA) but is not significantly associated with mortality in COPD patients (HR = 1.07, 95%CI 0.94–1.20).
Our study shows that circulating blood GlycA is a biomarker of early COPD pathology.
Chronic obstructive pulmonary disease (COPD) is a progressive inflammatory lung disease and currently the third leading cause of death worldwide [1, 2]. COPD is characterised by chronic airway inflammation, airway remodelling and airflow limitation . A reduced ratio of the Forced Expiratory Volume in 1 s (FEV1) to Forced Vital Capacity (FVC) is a measure of obstruction and is used to diagnose COPD even before the onset of clinical symptoms [3, 4]. Smoking is the most important risk factor for COPD and related to impaired lung function . COPD is a complex heterogeneous disease in which systemic features beyond airflow obstruction, including systemic inflammation, oxidative stress, muscle dysfunction, cachexia and vascular pathology occur [5, 6]. Understanding these systemic effects may give new insights into the pathogenesis and progression of COPD but may alternatively yield important clues for preventive research.
Recent developments in metabolomics have made it possible to investigate the associations between circulating metabolites and COPD. Glycoprotein acetyls (GlycA) was found to be predictive for several chronic diseases, among which COPD . In a previous metabolomics study using proton Nuclear Magnetic Resonance (1H-NMR), lower levels of lipoproteins, N,N-dimethylglycine and higher levels of glutamine, phenylalanine, 3-methylhistidine and ketone bodies were found in the circulation of ex-smoking COPD patients compared with ex-smoking controls . In severe COPD patients, branched chain amino acids (BCAAs) were found to be lower, compared with controls . Interestingly, BCAAs, 3-methylhistidine, ketone bodies, and triglycerides were negatively correlated with cachexia and positively correlated with systemic inflammation , but these findings have not been replicated. Another question that remains to be answered is whether the metabolic changes are a cause or a consequence of COPD. If the latter is true, the metabolites may be relevant for the disease progression and prognosis.
To answer these questions, we performed a comprehensive integrative metabolic analysis to identify plasma metabolic measures associated with COPD and lung function levels, defined as FEV1/FVC, using the NMR approach in a set of large epidemiological studies, in depth characterized for genetic and environmental risk factors. The discovery phase of the study was conducted in two population-based studies in the Netherlands, the Rotterdam Study (RS)  and the Erasmus Rucphen Family study (ERF) [10, 11]. A replication meta-analysis was conducted in the Lifelines-DEEP study (LLDEEP) , two cohorts of the FINRISK study [13, 14] and the Prospective Investigation of the Vasculature in Uppsala Seniors (PIVUS) study [15, 16].
Studies included in the discovery sample
The RS is a population-based study of 14,926 people older than 45 years, from the Ommoord area of Rotterdam, incorporating three independent cohorts: RS-I (established in 1989), RS-II (2000) and RS-III (2006), with multiple subsequent visits . Participants filled in questionnaires, underwent physical examination and provided fasting blood samples at each visit. For this analysis, three independent samples from different RS cohorts were enrolled: Sample 1) visit 4 of RS-I (RS-I-4); sample 2) a combined sample, which we collectively call RS-E5 in this manuscript, comprising of visit 5 of RS-I (RS-I-5), visit 3 of RS-II (RS-II-3), and visit 2 of RS-III (RS-III-2); and sample 3) another independent set from RS-III-2.
ERF is a population-based study from the south-west of the Netherlands. It is a genetically isolated population comprising 3465 living descendants of 22 couples from the nineteenth century and their spouses . The baseline data collection was performed in 2002–2005 when participants underwent physical examinations, provided blood samples and completed questionnaires. A follow-up of the participants was performed in 2015–2018, reviewing the medical records at the general practitioner’s office.
Studies included in the replication sample
LLDEEP is a sub-cohort of the large general population-based cohort study Lifelines, which was initiated to study genes, exposures and their interactions in the aetiology of complex multifactorial diseases and healthy ageing [17, 18]. LLDEEP consists of 1500 participants who registered at the Lifelines research site in Groningen between April and August 2013. These subjects gave additional biological materials, including blood samples for metabolite and inflammation profiling, and extensive phenotype information . Metabolic and lung function data were available for 717 LLDEEP individuals and these subjects are included in the current study.
The FINRISK cohorts comprise cross-sectional population surveys that are carried out every 5 years since 1972, to assess the risk factors of chronic diseases (e.g. cardio-vascular disease, diabetes, obesity, cancer) and health behaviour in the working age population (25–74 years of age), in 3–5 large study areas of Finland. The FINRISK surveys are conducted by the National Institute for Health and Welfare, THL (previously National Public Health Institute, KTL). Extensive information from each participant was collected at baseline via questionnaire and health examination with blood collection. The cohorts were followed up by linking them to national health registers. The cohorts FINRISK 1997 (total of 6898 participants) and an extension of FINRISK 2007, known as DIetary, Lifestyle and Genetic determinants of Obesity and Metabolic syndrome (DILGOM) study  (total of 4600 participants) are included in our replication sample for COPD analysis.
The PIVUS study started in 2001 with the aim to investigate endothelial function as a prospective cardiovascular risk factor in elderly subjects. A random sample of Uppsala city residents were invited from the register of inhabitants within 1 month following their 70th birthday. No exclusion criteria were applied except that participants were required to have a Swedish identification number. In PIVUS, 1016 subjects agreed to participate, resulting in a participation rate of 50.1% of all invited, whereof 51.5% were female. The participants have undergone a range of physical measurements, and given information about their medical history, lifestyle habits and regular medication. In addition, blood samples were drawn.
Assessment of COPD status and lung function measurements
COPD in the RS was defined as pre-bronchodilator FEV1/FVC < 0.7, assessed either by spirometry at the RS research center or by reviewing medical histories of the participants. Spirometry was performed in the RS by trained paramedical personnel, according to the guidelines of the American Thoracic Society/European Respiratory Society (ATS/ERS). When spirometry measurements were absent or uninterpretable, all files from specialists and general practitioners were reviewed to set a diagnosis of COPD. In total, this analysis included 541 incident COPD subjects and 4407 subjects without COPD which had metabolomics data available from all three RS cohorts.
For the ERF study, the doctor’s diagnosis of COPD was confirmed by reviewing medical records based on FEV1/FVC < 0.7, with or without medication use. If the information on FVC was missing, the following criteria for COPD were used: FEV1 < 80% of predicted, use of respiratory medication and a COPD diagnosis mentioned in the report of the respiratory specialist to the general practitioner. In total, 61 incident and prevalent COPD subjects and 548 subjects without COPD which had metabolomics data available were included from ERF study. For ERF participants, we did not have lung function measurements at the time of the metabolomics measurements, so we did not include this cohort in the FEV1/FVC analysis.
For LLDEEP, COPD was also defined as a FEV1/FVC < 0.7. Pre-bronchodilator spirometry was performed according to the ATS/ERS guidelines using a Welch Allyn Version 22.214.171.1249, PC-based Spiroperfect with CA Workstation. Technical quality and results were assessed by well-trained assistants and abnormal results were re-evaluated by lung physicians.
In the FINRISK study the COPD information was extracted based on diagnoses and reimbursement information from the National health register, which include the Drug Reimbursement Register, the Care Register for Health Care, the Register for Prescribed Drug Purchases, the Causes-of-Death Register, and the Cancer Register. The maximum retrospective time period available for obtaining prevalent disease events was 20 years for DILGOM and 10 years for FINRISK97.
In the PIVUS study FEV1 and FVC were assessed with spirometry using a Vitalograph Alpha spirometer (Vitalograph Ltd. Buckingham, United Kingdom) according to the American Thoracic Society recommendations [20, 21]. The best value of three acceptable recordings was used. FEV1 and FVC expressed as percent of predicted values, were adjusted for age, sex and height according to Hedenström’s formula [22, 23]. PIVUS study was included only in the FEV1/FVC analysis, as this study does not have confirmed diagnosis of COPD by lung specialist.
Assessment of blood metabolites
Metabolic profiling in RS, ERF and LLDEEP was done as part of the 4th Rainbow Project of the BioBanking for Medical Research Infrastructure of the Netherlands (BBMRI-NL) (https://www.bbmri.nl/omics-metabolomics/). For all studies used in the discovery and replication samples, to quantify the metabolite biomarkers random selection of fasting EDTA plasma samples were used for quantitative high-throughput 1H-NMR metabolomics platform performed by the same company using the same standardized quality control protocol (Nightingale Ltd., Helsinki, Finland). All samples were stored at − 80 °C which ensures the biological stability. Details of the protocol and advantages of the NMR-based metabolomics analyses using plasma were described elsewhere [24, 25]. The protocol describes steps for quality control and sample preparation, data storage and spectral analyses. If metabolite values were flagged to be unreliable by the quality control protocol, they were treated as missing. If distributions of the metabolites deviated from normal, every cohort applied normalization steps as suggested by Nightingale. Those included natural logarithm transformation and scaling to standard deviation units. Using this method, we were able to quantify a wide range of blood metabolite biomarkers such as lipoprotein fractions, amino-acids, cholesterol levels, glycerides, phospholipids, fatty acids, ketone bodies and metabolites related to inflammation and glycolysis. In total, 161 metabolites, overlapping between RS and ERF, were used in the discovery analysis.
Association of COPD and FEV1/FVC with metabolites
Per cohort, we used transformed metabolite levels as independent variable and COPD status or FEV1/FVC as dependent variables in logistic and linear regression models, respectively. The models were adjusted for age, sex, body mass index (BMI, kg/m2), lipid lowering medication (LLM) use and smoking status (current, ex- or never smokers). For the discovery sample, the results from ERF, RS-I-4, RS-E5 and RS-III-2 were meta-analysed using fixed effect models in “METAL” software . As the metabolites are known to be highly correlated, we applied the method by Li and Ji  to assess the number of independent metabolites. Using this method, we calculated that for the 161 metabolites, the number of independent tests was 45, which resulted in the Bonferroni significance threshold of P = 0.001 (0.05/45). Significant metabolites were further tested for replication in the meta-analysis of LLDEEP, FINRISK1997 and DILGOM studies for the COPD analysis and of LLDEEP and PIVUS studies for the FEV1/FVC analysis. Again, the same regression models were used for the fixed effect meta-analysis in “METAL” software.
For significant metabolites, we calculated the odds ratios per quartile of the metabolite distribution in the discovery sample. To investigate the effects of smoking on this association, we used two logistic regression models, one adjusted for age, sex, BMI and LLM use, and a second model additionally adjusted for smoking status (current, ex- and never smokers). Results from each cohort were combined using inverse-variance weighted fixed effects meta-analysis in “rmeta” package in R.
Exploring causality of the association between COPD and metabolites
We have used a bi-directional approach in which we examined whether: 1) the genetic determinants of the significant metabolites are associated with COPD and lung function, which would lead to the conclusion that the metabolites are most likely driving the disease; 2) the genetic determinants of COPD are associated with significant metabolites when the metabolites would most likely be altered as an integral part of the disease pathophysiology and may be biomarkers. The R package “TwoSampleMR” was used for the two sample Mendelian Randomization (MR) tests [28, 29]. We used the genetic information from previously published genome-wide association studies (GWAS) on metabolites (Model 1)  and COPD (Model 2) . In brief, the genetic score was based on the top single nucleotide polymorphisms (SNPs, P-value < 5 × 10− 8) with linkage disequilibrium (LD) R2 < 0.05 within 500 kb clumping distance. Harmonization was checked, including the strand issues and palindromic SNPs. It resulted in eight independent SNPs for COPD (R2 = 1.7%), and nine SNPs for GlycA (R2 = 2.3%). Inverse variance weighted MR, Maximum likelihood MR, MR Egger analysis and median-based estimator were performed to check the significant results.
Association with morbidity and mortality
We wanted to investigate whether an identified metabolite in the circulation is a biomarker of early pathology thus can be used as a predictive or diagnostic biomarker or rather prognostic biomarker for mortality in COPD patients. To this end, we performed an analysis in the Rotterdam Study in which we associated identified metabolite to the future risk of COPD. We determined the relative risk by quartile of the metabolite concentration in the circulation, using the lowest quartile as a reference. Only incident patients are included in this analysis (whole RS sample, in total 541 case and 4407 controls); prevalent COPD patients are excluded. To investigate whether metabolites have utility in predicting COPD, we constructed classical receiver operating curves (ROC) and compared areas under the curve (AUC) . To further investigate whether the identified metabolites may act as biomarker of the disease prognosis, we performed a survival analysis in SPSS, similar to the previous study by Fischer and colleagues for all-cause mortality, ignoring any underlying morbidity . To check whether the metabolites associated with mortality in COPD patients, we performed the Cox proportional hazards model in three RS cohorts. Analyses were adjusted for age at sampling, sex and smoking. We further performed a similar analysis using four quartiles of metabolite, testing in COPD cases and controls.
Descriptive characteristics of the samples
Descriptive characteristics of all cohorts used in the analysis are presented in Table 1. Comparing the discovery cohorts, ERF participants were younger (mean age 49.0 ± 13.3) and had a higher percentage of current smokers compared to the participants of the three RS cohorts (RS-I-4 mean age 74.8 ± 6.5; RS-E5 mean age 68.4 ± 5.7; RS-III-2 mean age 62.8 ± 5.8). The RS cohorts had a higher percentage of users of the LLM, compared to ERF (Table 1).
The mean FEV1/FVC and BMI were comparable across the studies. Descriptive characteristics for COPD cases and subjects without COPD separately in the discovery cohorts are provided in eTable 1 in the Supplement. In general, COPD subjects were older and more often smokers compared to subjects without COPD. Since FINRISK97 and DILGOM studies are based on the data from National health registers, and thus do not have minimum age entry criteria, the percentage of COPD cases is lower compared with discovery sample, containing elderly population.
Association of COPD and FEV1/FVC with metabolites
At nominal significance, higher levels of GlycA (odds ratio (OR) = 1.16; P = 5.6 × 10− 4), 3-hydroxybutyrate (OR = 1.13; P = 0.003), free cholesterol in medium high-density lipoprotein (HDL, OR = 1.10; P = 0.045) and acetoacetate (OR = 1.09; P = 0.047) were associated with a higher prevalence of COPD. Higher levels of histidine and 18:2 linoleic acid (OR = 0.91 for both, P = 0.04 and P = 0.05 respectively) were associated with a lower prevalence of COPD. When considering the multiple testing correction threshold, only GlycA was significantly associated with COPD (P = 5.6 × 10− 4). We tested all six metabolites for replication in the independent samples. The association of higher levels of GlycA with COPD was significantly replicated (OR = 1.30, P = 1.8 × 10− 6) in the 12,205 participants of the replication sample, after multiple testing correction.
Findings for the FEV1/FVC ratio were not consistent over the discovery and replication studies. Adjusting for multiple testing, we found in the discovery cohorts that lower levels of valine (β = 0.005, P = 2.5 × 10− 4) and higher levels of GlycA (β = − 0.005, P = 4.5 × 10− 4) were associated with a lower FEV1/FVC ratio (Table 3, Fig. 1).
Other metabolites that reached nominal significance in the discovery included albumin which was positively associated with FEV1/FVC, and glutamine, triglycerides in very large HDL and phenylalanine which were negatively associated with FEV1/FVC (Table 3, Fig. 1). Only the association of FEV1/FVC to albumin showed nominal significance in the replication samples (β = 0.005, P = 0.03), but none were significantly associated when considering multiple testing correction. Meta-analysis results of all metabolites tested for the association with COPD and FEV1/FVC in the discovery sample are provided in the supplementary material (eTable 4 and eTable 5, respectively).
Exploring causality of COPD and circulating GlycA
Next, we performed a Mendelian Randomisation experiment investigating the hypothesis that: 1) GlycA is increasing the risk of COPD and therefore the genetic determinants of GlycA (used as instrumental variables) are also associated with COPD and 2) the opposite scenario is true in which (pre)clinical COPD pathology increases GlycA levels. The results of both models are presented in Table 4.
The genetic risk score (GRS) for Model 1 included nine independent SNPs (R2 = 2.3%) and yielded no significant evidence for association (P = 0.97 for inverse variance weighted method). In Model 2, we found that genes associated with a higher risk of COPD are also associated with higher levels of GlycA, through the COPD (Table 4, P = 0.00068 for inverse variance weighted method), suggesting that COPD pathology increased GlycA levels. The results of weighted median and weighted mode were significant as well (P-value< 0.05). This analysis is based on eight independent SNPs in the GRS (R2 = 1.7%). No heterogeneity or pleiotropic effect were detected. Leaving out either SNP did not change the significance of the MR results. The detailed MR output are shown in supplementary information.
Is circulating GlycA predictive biomarker for COPD?
Compared to the lowest quartile, those subjects in the highest quartile of GlycA had a 1.99-fold (95% Confidence interval: 1.52–2.60) higher risk of developing COPD, after adjustment for age, sex, BMI and LLM use (eTable 2). Smoking accounted for a part of the observed association between plasma GlycA and COPD attenuating the OR for those in the highest quartile of GlycA to 1.74, while the association remained significant (95% Confidence interval: 1.32–2.28). To test whether circulating GlycA adds to the predictive value, we compared the AUC curves for the models including: 1) age and sex (AUC = 0.601); 2) age, sex and smoking (AUC = 0.670) and 3) age, sex, smoking and circulating GlycA levels in blood (AUC = 0.675). The AUC comparing model 2 and 1 shows that smoking is associated with an increase in AUC by 0.069. Adding circulating GlycA increased the AUC further by only 0.005 (eFigure 1).
Is circulating GlycA a prognostic biomarker for mortality in COPD?
A previous study has shown that GlycA is a predictor of all-cause mortality in the general population . We confirm this in our current study, after adjustment for age, sex and smoking (hazard ratio (HR) = 1.16, P = 4.39 × 10− 9) (eTable 3). The mean follow-up time in years was 6.94, ranging from 0.04 to 15.96. We first performed the analysis with continuous GlycA and then compared mortality across the quartiles of GlycA. We found that those in the highest quartile have 1.4-fold (95% Confidence interval: 1.22–1.61, P = 1.64 × 10− 6) higher risk of mortality during follow-up compared to those in the lowest quartile (eTable 3). However, when stratifying these analyses by COPD status, we observed that this association is driven by controls (eTable 3; eFigure 2). In COPD patients, circulating GlycA levels are not significantly associated with mortality when studying GlycA as a continuous variable (HR = 1.06, P = 0.32) nor for those in the highest quartile (HR = 1.07, P = 0.70 in COPD cases). In those without COPD, the association of continuous GlycA to mortality is stronger and significant (HR = 1.18, P = 1.43 × 10− 9).
In our metabolome-wide discovery analysis, we identified 11 plasma metabolites associated with COPD or lung function levels (FEV1/FVC) at marginal significance. Of these 11 metabolites, only higher levels of GlycA were significantly associated with COPD when adjusting for multiple testing and this is the only metabolite we could replicate in the independent cohorts. Our MR analysis suggested a causal relation between COPD and higher GlycA levels in the circulation by showing that the genetic predisposition to COPD associates with GlycA. The GlycA level seemed to be an early biomarker of COPD since it was associated with the incidence of COPD, even after adjustment for smoking. Although GlycA was found to be a predictor of mortality in the general population , the metabolite did not predict mortality in COPD patients.
GlycA is the most convincing and interesting finding of our study. This metabolite was recently associated with the incidence of a variety of disorders, including COPD based on record linkage . Using two population-based cohorts, we identified new associations with GlycA including alcoholic liver disease, chronic renal failure, glomerular diseases and inflammatory polyarthropathies. The GlycA associations were for a large part independent of that of high-sensitivity C-reactive protein (hsCRP), but GlycA and hsCRP also share contributions to mortality risk, suggesting chronic inflammation as the common pathway. GlycA is shown to be a biomarker for chronic inflammation, neutrophil activity and risk of future severe infection, even superior compared with CRP [34, 35].
The present study extends previous research by widening the number of NMR metabolites studied and we found that GlycA is the only metabolite significantly associated with COPD after adjusting for multiple testing. Our analyses were adjusted for smoking and the association between GlycA and COPD is thus not explained by smoking. We used data integration approach (MR) to test the hypothesis that GlycA increases the risk of COPD causally or rather is a bystander biomarker that is part of the disease pathogenesis (marker of the disease). Our findings suggest that the latter is more likely, as the genes associated with COPD also associate with GlycA levels. In contrast, no support was found for the hypothesis that GlycA is a causal determinant of COPD: the genes that are known to be associated with GlycA levels are not associated with the risk of COPD. The findings of the MR are in line with the finding that GlycA was not consistently associated with the FEV1/FVC ratio across the discovery and replication cohorts, which suggests that GlycA is more likely increased as an early consequence of the developed disease. This is in line with other studies on different diseases involving systemic inflammation. However, as many other factors, aside from genetics, play a role in this complex disease and blood metabolic patterns, our MR results need further corroboration using experimental animal models to support the causality.
Although it is known to be a marker of acute inflammation, it has also been shown that it is predictive of long-term risk of severe infection, and high levels correlated with an increased risk of hospitalization and death from septicaemia and pneumonia . This is particularly important for exacerbations of COPD and the prognosis. In the present paper we do not find evidence that GlycA is associated with COPD mortality. Such relationship was seen for cardiovascular disease. GlycA not only increased the risk of incident cardiovascular disease [7, 36] but was also associated with a 5-fold increased 12-year risk of mortality in those with the highest GlycA levels . This suggests that our analysis would benefit from increasing the sample size even more.
GlycA, is a composite NMR-based signal related to changes in multiple circulating glycoproteins, mainly orosomucoids , which are a positive acute phase proteins, and their concentration increases in response to systemic tissue injury, inflammation or infection [38, 39]. Even in apparently healthy people high GlycA was related with elevation in many inflammatory cytokines suggesting they may be in a state of chronic inflammatory response up to 10 years . Another acute phase protein modulating the immune response, whose deficiency has an established effect on COPD pathogenesis, is alpha 1-antitrypsin (AAT). It is found that although alpha-1-acid-glycoprotein had a strongest correlation with GlycA, it was the AAT variation that had the most predictive properties for morbidity and mortality for many different diseases . Moreover, protein haptoglobin, also included in GlycA signal, was estimated to be the strongest predictor of chronic lower respiratory diseases of all proteins included in this signal . GlycA is mainly produced by the liver, but it is also synthesized in myelocytes and released by activated neutrophils . Being a type I acute phase protein, GlycA is induced by cytokines, interleukins and tumour necrosis factor alpha (TNFα) [41, 42], which among others stimulate a systemic inflammatory response in COPD patients who lose weight . GlycA is one of the main drug binding proteins, carrying basic and neutral lipophilic drugs such as steroid hormones or medications in blood .
A strength of our study is that it is the largest and most comprehensive metabolic study of COPD and lung function. Another strength is the use of the NMR platform, which is valued for being non-invasive, non-destructive, fast and for providing highly reproducible results . A limitation of this study is our COPD definition, mainly based on pre-bronchodilator lung function measurements or review of medical records and national registries, which may have introduced some selection bias. Nevertheless, we do identify and replicate significant results which should be further corroborated in studies with post-bronchodilator measures. Our MR approach allowed us to gain more insight into the direction of the effects, suggesting that GlycA is an independent biomarker of COPD. Yet we have to acknowledge that MR is limited to the knowledge of the genetic determinants of both COPD and GlycA. In addition, we acknowledge possible limitations of MR due to pleiotropy, the lack of trans-ethnic studies and remaining bias due to canalization.
Altogether, combining the epidemiological data with our MR analyses suggests that GlycA is a biomarker of COPD inflammatory pathways, present in higher concentrations even before the COPD is clinically present. Further studies should investigate the possibility for GlycA to serve as a prediction tool for COPD morbidity and severity. Further functional studies investigating the role of GlycA in COPD will provide more insight into the pathogenesis, prognosis and treatment response of patients with COPD. Our study highlights the power of cross-omics and epidemiological data integration.
Availability of data and materials
The datasets generated and analysed during the current study are not publicly available due to the stringent consent form requirements signed by the study participants, but are available from the corresponding author on reasonable request.
Proton Nuclear Magnetic Resonance
American thoracic society
Area under the curve
Branched chain amino acids
Body mass index
Chronic Obstructive Pulmonary Disease
DIetary, Lifestyle and Genetic determinants of Obesity and Metabolic syndrome study
Erasmus Rucphen Family study
European respiratory society
- FEV1 :
Forced Expiratory Volume in 1 s
Forced Vital Capacity
Genetic risk score
Genome-wide association study
Lifelines Deep study
Lipid lowering medication
Prospective Investigation of the Vasculature in Uppsala Seniors study
Receiver operating curves
Single nucleotide polymorphism
Tumor necrosis factor alpha
World Health Organizaon - WHO. World Health Statistics 2008: World Health Organization; 2008..
Lozano R, Naghavi MFK. Global and regional mortality from 235 causes of death for 20 age groups in 1990 and 2010: a systematic analysis for the global burden of disease study 2010. Lancet. 2012;380(380):2095–128 https://doi.org/10.1016/S0140-6736(12)61728-0.
Vestbo J, Hurd SS, Agustí AG, et al. Global strategy for the diagnosis, management, and prevention of chronic obstructive pulmonary disease GOLD executive summary. Am J Respir Crit Care Med. 2013;187(4):347–65. https://doi.org/10.1164/rccm.201204-0596PP.
Wain LV, Shrine N, Artigas MS, et al. Supplementary: genome-wide association analyses for lung function and chronic obstructive pulmonary disease identify new loci and potential druggable targets. Nat Genet. 2017;49(3):416–25. https://doi.org/10.1038/ng.3787.
Vestbo J, Anderson W, Coxson HO, et al. Evaluation of COPD longitudinally to identify predictive surrogate end-points (ECLIPSE). Eur Respir J. 2008;31(4):869–73. https://doi.org/10.1183/09031936.00111707.
Agusti A, Calverley PMA, Celli B, et al. Characterisation of COPD heterogeneity in the ECLIPSE cohort. Respir Res. 2010;11(1):122.
Kettunen J, Ritchie SC, Anufrieva O, et al. Biomarker glycoprotein acetyls is associated with the risk of a wide Spectrum of incident diseases and stratifies mortality risk in angiography patients. Circ Genomic Precis Med. 2018;11(11):e002234. https://doi.org/10.1161/CIRCGEN.118.002234.
Ubhi BK, Riley JH, Shaw PA, et al. Metabolic profiling detects biomarkers of protein degradation in COPD patients. Eur Respir J. 2012;40(2):345–55. https://doi.org/10.1183/09031936.00112411.
Ikram MA, Brusselle G, Ghanbari M, et al. Objectives, design and main findings until 2020 from the Rotterdam study. Eur J Epidemiol. 2020. https://doi.org/10.1007/s10654-020-00640-5.
Pardo LM, MacKay I, Oostra B, van Duijn CM, Aulchenko YS. The effect of genetic drift in a young genetically isolated population. Ann Hum Genet. 2005;69(3):288–95. https://doi.org/10.1046/j.1529-8817.2005.00162.x.
Henneman P, Aulchenko YS, Frants RR, van Dijk KW, Oostra BA, van Duijn CM. Prevalence and heritability of the metabolic syndrome and its individual components in a Dutch isolate: the Erasmus Rucphen family study. J Med Genet. 2008;45(9):572–7. https://doi.org/10.1136/jmg.2008.058388.
Tigchelaar EF, Zhernakova A, Dekens JAM, et al. Cohort profile: LifeLines DEEP, a prospective, general population cohort study in the northern Netherlands: study design and baseline characteristics. BMJ Open. 2015;5(8):e006772. https://doi.org/10.1136/bmjopen-2014-006772.
Borodulin K, Tolonen H, Jousilahti P, et al. Cohort Profile: The National FINRISK Study. Int J Epidemiol. 2018;47(3):696–696i. https://doi.org/10.1093/ije/dyx239.
Borodulin K, Vartiainen E, Peltonen M, et al. Forty-year trends in cardiovascular risk factors in Finland. Eur J Pub Health. 2015;25(3):539–46. https://doi.org/10.1093/eurpub/cku174.
Lind L, Fors N, Hall J, Marttala K, Stenborg A. A comparison of three different methods to evaluate endothelium-dependent vasodilation in the elderly: the prospective investigation of the vasculature in Uppsala seniors (PIVUS) study. Arterioscler Thromb Vasc Biol. 2005;25(11):2368–75. https://doi.org/10.1161/01.ATV.0000184769.22061.da.
Salihovic S, Ganna A, Fall T, et al. The metabolic fi ngerprint of p , p ’-DDE and HCB exposure in humans. Environ Int. 2016. https://doi.org/10.1016/j.envint.2015.12.015.
Scholtens S, Smidt N, Swertz MA, et al. Cohort profile: LifeLines, a three-generation cohort study and biobank. Int J Epidemiol. 2015;44(4):1172–80. https://doi.org/10.1093/ije/dyu229.
Stolk RP, Rosmalen JGM, Postma DS, et al. Universal risk factors for multifactorial diseases. Eur J Epidemiol. 2008;23(1):67–74. https://doi.org/10.1007/s10654-007-9204-4.
Konttinen H, Llewellyn C, Silventoinen K, et al. Genetic predisposition to obesity, restrained eating and changes in body weight: a population-based prospective study. Int J Obes. 2018;42(4):858–65. https://doi.org/10.1038/ijo.2017.278.
Engström G, de Verdier MG, Dahlbäck M, Janson C, Lind L. BP variability and cardiovascular autonomic function in relation to forced expiratory volume: a population-based study. Chest. 2009;136(1):177–83. https://doi.org/10.1378/chest.08-2529.
Standardization of Spirometry, 1994 Update. American Thoracic Society. Am J Respir Crit Care Med. 1995;152(3):1107–36. https://doi.org/10.1164/ajrccm.152.3.7663792.
Hedenström H, Malmberg P, Fridriksson HV. Reference values for lung function tests in men: regression equations with smoking variables. Ups J Med Sci. 1986;91(3):299–310 http://www.ncbi.nlm.nih.gov/pubmed/3811032. Accessed 15 Nov 2018.
Hedenström H, Malmberg P, Agarwal K. Reference values for lung function tests in females. Regression equations with smoking variables. Bull Eur Physiopathol Respir. 21(6):551–7 http://www.ncbi.nlm.nih.gov/pubmed/4074961. Accessed 15 Nov 2018.
Soininen P, Kangas AJ, Wurtz P, Suna T, Ala-Korpela M. Quantitative serum nuclear magnetic resonance metabolomics in cardiovascular epidemiology and genetics. Circ Cardiovasc Genet. 2015;8(1):192–206. https://doi.org/10.1161/Circgenetics.114.000216.
Kettunen J, Demirkan AA, Würtz P, et al. Genome-wide study for circulating metabolites identifies 62 loci and reveals novel systemic effects of LPA. Nat Commun. 2016;7:11122. https://doi.org/10.1038/ncomms11122.
Willer CJ, Li Y, Abecasis GR. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics. 2010;26(17):2190–1. https://doi.org/10.1093/bioinformatics/btq340.
Li J, Ji L. Adjusting multiple testing in multilocus analysis using the eigenvalues of a correlaton matrix. Heredity (Edinb). 2005;95(3):1–7.
Leidinger P, Keller A, Heisel S, et al. Novel autoantigens immunogenic in COPD patients. Respir Res. 2009;10(1):20. https://doi.org/10.1186/1465-9921-10-20.
Hemani G, Zheng J, Elsworth B, et al. The MR-base platform supports systematic causal inference across the human phenome. Elife. 2018;7. https://doi.org/10.7554/eLife.34408.
Hobbs BD, De Jong K, Lamontagne M, et al. Genetic loci associated with chronic obstructive pulmonary disease overlap with loci for lung function and pulmonary fibrosis. Nat Genet. 2017;49(3):426–32. https://doi.org/10.1038/ng.3752.
Akobeng AK. Understanding diagnostic tests 3: receiver operating characteristic curves. Acta Paediatr. 2007;96(5):644–7. https://doi.org/10.1111/j.1651-2227.2006.00178.x.
Fischer K, Kettunen J, Würtz P, et al. Biomarker Profiling by Nuclear Magnetic Resonance Spectroscopy for the Prediction of All-Cause Mortality : An Observational Study of 17 , 345 Persons. PLoS Med. 2014;11(2):e1001606. https://doi.org/10.1371/journal.pmed.1001606 Minelli C, ed.
Lawler PR, Akinkuolie AO, Chandler PD, et al. Circulating N-linked glycoprotein acetyls and longitudinal mortality risk. Circ Res. 2016;118(7):1106–15. https://doi.org/10.1161/CIRCRESAHA.115.308078.
Ritchie SC, Würtz P, Nath AP, et al. The biomarker GlycA is associated with chronic inflammation and predicts long-term risk of severe infection. Cell Syst. 2015;1(4):293–301. https://doi.org/10.1016/j.cels.2015.09.007.
Otvos JD, Shalaurova I, Wolak-Dinsmore J, et al. GlycA: a composite nuclear magnetic resonance biomarker of systemic inflammation. Clin Chem. 2015;61(5):714–23. https://doi.org/10.1373/clinchem.2014.232918.
Gruppen EG, Riphagen IJ, Connelly MA, Otvos JD, Bakker SJL, Dullaart RPF. GlycA, a Pro-Inflammatory Glycoprotein Biomarker, and Incident Cardiovascular Disease: Relationship with C-Reactive Protein and Renal Function. PLoS One. 2015;10(9):e0139057. https://doi.org/10.1371/journal.pone.0139057 Shimosawa T, ed.
Fournier T, Medjoubi-N N, Porquet D. Alpha-1-acid glycoprotein. Biochim Biophys Acta Protein Struct Mol Enzymol. 2000;1482(1):157–71 https://doi.org/10.1016/S0167-4838(00)00153-9.
Gabay C, Kushner I. Acute-phase proteins and other systemic responses to inflammation. N Engl J Med. 1999;340(6):448–54. https://doi.org/10.1056/NEJM199902113400607.
Ritchie SC, Kettunen J, Brozynska M, et al. Elevated serum alpha-1 antitrypsin is a major component of GlycA-associated risk for future morbidity and mortality. PLoS One. 2019;14(10):e0223692. https://doi.org/10.1371/journal.pone.0223692 Feng Y-M, ed.
Theilgaard-Mönch K, Jacobsen LC, Rasmussen T, et al. Highly glycosylated alpha1-acid glycoprotein is synthesized in myelocytes, stored in secondary granules, and released by activated neutrophils. J Leukoc Biol. 2005;78(2):462–70. https://doi.org/10.1189/jlb.0105042.
Moshage H. REVIEW ARTICLECytokines and the hepatic acute phase response. J Pathol. 1997;266(August 1996):257–66.
Bistrian BR. Acute phase proteins and the systemic inflammatory response. Crit Care Med. 1999;27(3):452–3. https://doi.org/10.1097/00003246-199903000-00004.
de Godoy I, Donahoe M, Calhoun WJ, Mancino J, Rogers RM. Elevated TNF-alpha production by peripheral blood monocytes of weight-losing COPD patients. Am J Respir Crit Care Med. 1996;153(2):633–7. https://doi.org/10.1164/ajrccm.153.2.8564110.
Israili ZH, Dayton PG. Human alpha 1-acid glycoprotein and its interactions with drugs. Drug Metab Rev. 2001;33(2):161–235.
Pan Z, Raftery D. Comparing and combining NMR spectroscopy and mass spectrometry in metabolomics. Anal Bioanal Chem. 2007;387(2):525–7. https://doi.org/10.1007/s00216-006-0687-8.
The authors are grateful to all study participants, the staff and the participating general practitioners, specialists and pharmacists. The authors are grateful to P. Veraart for her help in genealogy, J. Vergeer for the supervision of the laboratory work, Sven van der Lee and Ashley van der Spek for follow-up data collection, Peter Snijders for his help in data collection and Dirkje Postma for help with COPD case confirmation in ERF study. We want to give special thanks to the members of the International COPD Genetics Consortium and Michael Cho for sharing the data of their published GWAS of COPD.
This work was performed within the framework of the BBMRI Metabolomics Consortium funded by BBMRI-NL, a research infrastructure financed by the Dutch government (NWO, grant nr 184.021.007 and 184033111). DvdP and NA were supported by grant number 4.1.13.007 of Lung Foundation Netherlands (Longfonds).
The Rotterdam Study is funded by Erasmus Medical Center and Erasmus University, Rotterdam, Netherlands Organization for the Health Research and Development (ZonMw), the Research Institute for Diseases in the Elderly (RIDE), the Ministry of Education, Culture and Science, the Ministry for Health, Welfare and Sports, the European Commission (DG XII), and the Municipality of Rotterdam.
The ERF study as a part of EUROSPAN (European Special Populations Research Network) was supported by European Commission FP6 STRP grant number 018947 (LSHG-CT-2006-01947) and also received funding from the European Community’s Seventh Framework Programme (FP7/2007–2013)/grant agreement HEALTH-F4–2007-201413 by the European Commission under the programme “Quality of Life and Management of the Living Resources” of 5th Framework Programme (no. QLG2-CT-2002-01254). The ERF study was further supported by ENGAGE consortium and CMSB. High-throughput analysis of the ERF data was supported by joint grant from Netherlands Organization for Scientific Research and the Russian Foundation for Basic Research (NWO-RFBR 047.017.043).
LLDEEP was funded by the Netherlands Heart Foundation (IN-CONTROL CVON grant 2012–03 to A.Z. and J.F.); by the Netherlands Organization for Scientific Research (NWO) (NWO-VIDI 864.13.013 to JF, NWO-VIDI 016.178.056 to AZ, and by the European Research Council (ERC) (ERC Starting Grant 715772) to AZ. AZ also holds a Rosalind Franklin Fellowship from the University of Groningen. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
FINRISK surveys are mainly funded by the budgetary funds of the National Institute for Health and Welfare. Additional funding has been obtained from the Finnish Academy (#139635 and 118065) and from domestic foundations, such as the Finnish Foundation for Cardiovascular Research (to VS). JK was supported through funds from the Academy of Finland (grant numbers 297338 and 307247) and Novo Nordisk Fonden (grant number NNF17OC0026062). MAK and JK are funded by a research grant from the Sigrid Juselius Foundation, Finland.
PIVUS study was funded by Uppsala University Hospital.
Ethics approval and consent to participate
Both RS and ERF were approved by the Medical Ethics committee of the Erasmus Medical Center and all participants gave informed consent for participation in the study and for evaluation of the available information from their physicians.
LLDEEP was approved by the ethics committee of the University Medical Center Groningen and all participants signed an informed consent prior to enrolment.
The FINRISK 1997 study was approved by the Ethical Committee of the National Public Health Institute, while the DILGOM study was approved by the Coordinating Ethical Committee of the Hospital District of Helsinki and Uusimaa. All participants have signed an informed consent, allowing the use of their data and samples for studying environmental and genetic risk factors of chronic diseases.
The Ethics Committee of the University of Uppsala approved the study and the participants gave informed consent (approval number 00–419).
Consent for publication
All authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. VS has participated in a conference trip sponsored by Novo Nordisk and received a modest honorarium from the same source for participating in an advisory board meeting. He also has ongoing research collaboration with Bayer Ltd.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Prokić, I., Lahousse, L., de Vries, M. et al. A cross-omics integrative study of metabolic signatures of chronic obstructive pulmonary disease. BMC Pulm Med 20, 193 (2020). https://doi.org/10.1186/s12890-020-01222-7
- Mendelian randomization
- Glycoprotein acetyls