A burden of rare variants in BMPR2 and KCNK3 contributes to a risk of familial pulmonary arterial hypertension

Background Pulmonary arterial hypertension (PAH) is a severe lung disease with only few effective treatments available. Familial cases of PAH are usually recognized as an autosomal dominant disease, but incomplete penetrance of the disease makes it difficult to identify pathogenic variants in accordance with a Mendelian pattern of inheritance. Methods To elucidate the complex genetic basis of PAH, we obtained whole exome- or genome-sequencing data of 17 subjects from 9 families with heritable PAH and applied gene-based association analysis with 9 index patients and 300 PAH-free controls. Results A burden of rare variants in BMPR2 significantly contributed to the risk of the disease (p = 6.0 × 10−8). Eight of nine families carried four previously reported single nucleotide variants and four novel insertion/deletion variants in the gene. One of the novel variants was a large 6.5 kilobase-deletion. In the remaining one family, the patient carried a pathogenic variant in a member of potassium channels, KCNK3, which was the first replicative finding of channelopathy in an Asian population. Conclusions The variety of rare pathogenic variants suggests that gene-based association analysis using genome-wide sequencing data from increased number of samples is essential to tracing the genetic heterogeneity and developing an appropriate panel for genetic testing. Electronic supplementary material The online version of this article (doi:10.1186/s12890-017-0400-z) contains supplementary material, which is available to authorized users.


Background
Pulmonary arterial hypertension (PAH, MIM #178600) is a rare vascular lung disease presenting increased pulmonary vascular resistance and elevation of mean pulmonary arterial pressure, leading to a grave prognosis of right heart failure without treatment. While survival rates are increasing with a number of recently developed treatments such as epoprostenol, the optimal care of patients with these therapies is unclear due to phenotypic variations and genetic backgrounds.
Although identification of individuals who carry genetic variants that increase the risk of developing PAH offers an opportunity for earlier diagnosis and finding a therapeutic strategy, the majority of previous studies was only focused on the protein coding regions of the most frequently mutated gene BMPR2 with the use of conventional methods such as Sanger sequencing [6,14]. Thus, for the patients who have no mutation in BMPR2, i.e., around 30% of HPAH and 60-90% of IPAH, another approach, which is practical for multiple genes, is necessary. Considering that current state-of-the-art sequencing technologies allow us to access exome-and genome-wide variants with reasonable costs, unbiased screening of pathogenic variants is beneficial to continue expanding the genetic diagnosis catalogue. However, conventional segregation-based approaches, e.g., linkage analysis, do not have enough power to pinpoint the true pathogenic variants from these genome-wide candidate variants without functional evaluations, especially for diseases with phenotypic and genetic heterogeneity. In this study, applying a gene-based association test, we statistically evaluate the significance of rare variant enrichment in genes responsible for PAH. The strategy we employed here would be useful to unbiasedly elucidate the pathogenicity of multiple rare variants arising from independent founder events in Mendelian diseases as well as common diseases.

Subjects
We consecutively enrolled five families with PAH and four individual cases with a family history of PAH to this study (Fig. 1). The patients have been diagnosed in the National Hospital Organization Okayama Medical Center between 1996 and 2014. All subjects who participated in our study were approved by the Institutional Review Board of our institutes in which donors gave written informed consent in accordance with institutional and national guidelines.  variant. Nucleotide and amino acid changes for BMPR2 and KCNK3 are described on NM_001204.6 and NM_002246.2, respectively. Index patients of each family are pointed with arrows. The subjects whose DNAs were available are indicated in plus signs. b All the possible pathogenic variants discovered in the eight PAH families were located before or in the kinase domain. Four previously reported and four novel variants were indicated with black and red letters, respectively

Next-generation sequencing and data analysis
To understand comprehensive genetic background of these 9 PAH families, we applied whole exome-or genomesequencing to 12 patients and 5 healthy family members whose DNA were available ( Fig. 1 and Additional file 1: Table S1). For the exome sequencing, DNA fragments were enriched by SureSelectXT Human All Exon v4 + UTR (Agilent Technologies, Santa Clara, CA, USA) and then applied to SOLiD™ 5500XL sequencer (Thermo Fisher Scientific inc., Waltham, MA, USA). The whole genome sequencing was conducted with the Illumina HiSeq X sequencer (Illumina Inc., San Diego, CA, USA). After aligning the sequence reads onto the reference genome (NCBI Build 37) using the Burrows-Wheeler Aligner [15], downstream processes including the duplication removal, the recalibration of base quality values, the local realignment, the variant call, and the variant quality score recalibration were analyzed using GATK [16]. The variants were called with an exome sequencing data set of 300 control samples obtained from the Human Genome Variation Database (accession ID: HGV0000004) [17]. The resulting VCF file has been deposited on the same database under accession HGV0000005.

Gene-based association study
To identify genes responsible for the pathogenesis of PAH, we applied a gene-based association test (Variable Threshold test [18]) to the 60,367 damaging variants extracted from the nine PAH patients and the 300 control samples. These variants were located within 10,744 gene regions. Despite the small sample size, the burden of association between BMPR2 and PAH was highly significant (p = 6.0 × 10 −8 ) compared to the genome-wide significance threshold (p < 2.4 × 10 −6 ) after Bonferroni correction for approximately 21,000 genes (Fig. 2). No other gene was found beyond the threshold.

Identifying the pathogenic variants
The spectrum of rare variants found in BMPR2 is summarized in Table 1. Of the nine families, four carried previously reported single nucleotide pathogenic variants (2 missenses, 1 nonsense, and 1 splice site) [1,4,6] and four carried novel insertions/deletions (indels) in this gene (88.9%). One of the novel indels was a large deletion of 6.5 kilobases in length by which one allele lacks the entire region of exon 3 (Additional file 2: Figure S1). Two variants are suspected to be pathogenic although showing incomplete penetrance, since clinically unaffected subjects in the families harbored the same variants found in the patients (Table 1). There were no   940.89 -BMPR2 variants observed in the remaining one of nine families, but we identified one heterozygous missense variant (p.Gly203Asp) in KCNK3 by screening the previously reported pathogenic variants [25]. This variant was shown to disrupt the ion-channel function by patchclump analysis [11]. None of the pathogenic variants we identified was observed in the 300 control samples or in the public database for the Japanese population [17]. Of these, all three missense variants were occurred at highly conserved nucleotides among vertebrates and were assumed to be damaging to the protein function by at least three in silico prediction programs [26][27][28][29] (Table 1).

Discussion
To our knowledge, this is the first report of gene-based genome-wide association analysis of HPAH. A burden of rare variants in BMPR2 significantly contributes to risk of the disease (p = 6.0 × 10 −8 ). The approach robustly detected the gene having a large effect on the pathogenesis of PAH, despite the genetic heterogeneity. Eight probands in the nine families harbored possible pathogenic variants in BMPR2. Half of these variants were novel indels. One of the novel indels was a large 6.5 kilobase deletion spanning the entire region of exon 3. Another novel indel was a three base insertion (NM_001204.6:c.1277-10_1277-9insGGG) in intron 9 (Additional file 3: Figure S2). Although we could not dismiss that this insertion has no responsibility to the disease pathogenicity, a potential creation of a new splice acceptor site by this insertion was strongly suggested from the multiple splice site prediction tools (Table 1, Additional file 4: Figure S3 and Additional file 5: Figure  S4) [31,32]. The remaining one patient harbored a missense variant in KCNK3, which was the first replicative finding of channelopathy in Japanese population. Among the nine families, all variants identified here were mutually exclusive, suggesting that the variants have originated from independent genetic founder events.
Patients who suffered from chronic lung diseases such as chronic obstructive pulmonary disease (COPD) and pulmonary fibrosis are prone to pulmonary hypertension (PH) development. They are categorized as Group 3 in the latest guidelines [33]. Most patients with COPD develop mild PH but 3-5% of them show a further rise in mean pulmonary arterial pressure >35 mmHg. It is unknown how "severe PH-COPD", formerly known as "outof-proportion PH" is induced. Furthermore, the PAHapproved drugs are yet to be approved for the patients with Group 3 PH. In this study, a male patient (OM0195 in HPAH005) had been treated at another hospital for COPD. He was later diagnosed with PH and referred to our hospital. This patient could be categorized as "severe PH-COPD", if none of his family members developed PAH. Since we had treated his daughter for IPAH at our hospital, we clinically diagnosed them as HPAH. Genetic testing revealed an in-frame-deletion (c.1443_1445del-GAA) in BMPR2 in both patients. Underlying genetic predisposition might be one of the reasons for developing "severe PH-COPD". Given our finding and a similarity of morphological appearance of vascular lesions between Group 1 and "severe PH-COPD" patients [33], PAHapproved drug treatments tailored to genetic diagnosis could well be a therapeutic strategy for such patients.

Conclusions
According to the genetic testing registry at the National Institutes of Health, the available panels for clinical genetic testing for PAH do not include KCNK3 and the detection methods are limited. Considering that pathogenic variants could occur within or spanning noncoding regions with a variety of sizes, the sequencing of the entire region of candidate genes is recommended to further understand the genetic factors relevant to PAH. This strategy will be essential for improving genetic diagnosis and counseling for PAH.