- Technical advance
- Open Access
Application of structured statistical analyses to identify a biomarker predictive of enhanced tralokinumab efficacy in phase III clinical trials for severe, uncontrolled asthma
BMC Pulmonary Medicine volume 19, Article number: 129 (2019)
Tralokinumab is an anti–interleukin (IL)-13 monoclonal antibody investigated for the treatment of severe, uncontrolled asthma in two Phase III clinical trials, STRATOS 1 and 2. The STRATOS 1 biomarker analysis plan was developed to identify biomarker(s) indicative of IL-13 activation likely to predict tralokinumab efficacy and define a population in which there was an enhanced treatment effect; this defined population was then tested in STRATOS 2.
The biomarkers considered were blood eosinophil counts, fractional exhaled nitric oxide (FeNO), serum dipeptidyl peptidase-4, serum periostin and total serum immunoglobulin E. Tralokinumab efficacy was measured as the reduction in annualised asthma exacerbation rate (AAER) compared with placebo (primary endpoint measure of STRATOS 1 and 2). The biomarker analysis plan included negative binomial and generalised additive models, and the Subgroup Identification based on Differential Effect Search (SIDES) algorithm, supported by robustness and sensitivity checks. Effects on the key secondary endpoints of STRATOS 1 and 2, which included changes from baseline in standard measures of asthma outcomes, were also investigated. Prior to the STRATOS 1 read-out, numerous simulations of the methodology were performed with hypothetical data.
FeNO and periostin were identified as the only biomarkers potentially predictive of treatment effect, with cut-offs chosen by the SIDES algorithm of > 32.3 ppb and > 27.4 ng/ml, respectively. The FeNO > 32.3 ppb subgroup was associated with greater AAER reductions and improvements in key secondary endpoints compared with the periostin > 27.4 ng/ml subgroup. Upon further evaluation of AAER reductions at different FeNO cut-offs, ≥37 ppb was chosen as the best cut-off for predicting tralokinumab efficacy.
A rigorous statistical approach incorporating multiple methods was used to investigate the predictive properties of five potential biomarkers and to identify a participant subgroup that demonstrated an enhanced tralokinumab treatment effect. Using STRATOS 1 data, our analyses identified FeNO at a cut-off of ≥37 ppb as the best assessed biomarker for predicting enhanced treatment effect to be tested in STRATOS 2. Our findings were inconclusive, which reflects the complexity of subgroup identification in the severe asthma population.
Interleukin (IL)-13 is a type-2 pleiotropic cytokine thought to play a central role in asthma pathophysiology . Overexpression of pulmonary IL-13 in transgenic mice led to development of features typical of asthma, such as eosinophilic airway inflammation, increased mucus production, sub-epithelial fibrosis and airway hyper-responsiveness . In addition, in mice sensitised to ovalbumin, neutralisation of IL-13 resulted in the attenuation of airway hyper-responsiveness, goblet cell metaplasia and lung eosinophilia [3, 4]. Clinical data have demonstrated that people with atopic and non-atopic asthma have increased concentrations of IL-13 mRNA and IL-13 in sputum samples and bronchial biopsies compared with those without asthma [5,6,7,8,9].
The presumed role of IL-13 in asthma led to the clinical development of anti–IL-13 treatment strategies such as tralokinumab, an immunoglobulin (Ig) G4 human monoclonal antibody (mAb) that potently and specifically neutralises IL-13 by preventing its interaction with the IL-13 receptor α1 and α2 subunits [10, 11]. A Phase IIa tralokinumab trial in participants with moderate-to-severe uncontrolled asthma showed no improvement in asthma control in the all-comers population, but did show increases in forced expiratory volume in 1 s (FEV1). Analysis of participants by IL-13 axis activation revealed better outcomes with tralokinumab in those participants with IL-13 activation (sputum IL-13 ≥ 10 pg/ml) compared with participants with low or no activation (sputum IL-13 < 10 pg/ml), or those receiving placebo . In a follow-up Phase IIb trial in a similar population, tralokinumab did not reduce the annualised asthma exacerbation rate (AAER) in the all-comers population. However, post-hoc analyses indicated enhanced benefits in participants with evidence of IL-13 axis activation, assessed by elevated serum concentrations of periostin or dipeptidyl peptidase-4 (DPP-4), which are biomarkers induced by IL-13 . The data from these two Phase II trials suggested that tralokinumab would only be effective in severe asthma when there was evidence of IL-13 activation. This concept was supported by data from clinical trials of another anti–IL-13 mAb, lebrikizumab [14, 15]. It was also consistent with emerging evidence that underlying patterns of airway inflammation, and thus response to treatment, vary among people with severe asthma .
The tralokinumab late-stage clinical development programme in severe, uncontrolled asthma  was specifically designed to include two similar pivotal Phase III trials, STRATOS 1 (NCT02161757) and STRATOS 2 (NCT02194699), which were conducted in parallel but with staggered analyses (Fig. 1) . STRATOS 1 primarily investigated the efficacy of tralokinumab in an all-comers population and, using an exploratory biomarker analysis plan, investigated several biomarkers that were potentially predictive of tralokinumab efficacy. The candidate biomarkers considered were blood eosinophil counts, fractional exhaled nitric oxide (FeNO), serum DPP-4 concentration, serum periostin concentration and total serum IgE concentration. These biomarkers are all continuous in nature and are either associated with IL-13 activation [19, 20] or with previous successful treatment of asthma with a biologic therapy [17, 21]. IL-13 was not assessed as a biomarker as circulating levels are very low, and when this study was conducted, available immunoassays did not reliably detect this protein .
The biomarker analysis results of STRATOS 1 were used for two purposes: to determine whether any of the biomarkers predicted a greater benefit with tralokinumab treatment; and to identify the threshold value for any predictive biomarker that would distinguish subgroups of participants with an enhanced benefit. These findings were then tested in STRATOS 2. Here, we describe the analyses applied in the identification of the best biomarker candidate and threshold value determined from STRATOS 1. The results of the primary analyses of STRATOS 1 and 2 in the biomarker-identified subgroup of participants have been published separately.
Methods and results
The STRATOS 1 and STRATOS 2 clinical trials
STRATOS 1 and 2 were both multicentre, randomised, double-blind, parallel-group, placebo-controlled Phase III clinical trials (Fig. 1). The two trials were conducted during an overlapping period, with the start and end dates staggered to allow for sequential analysis. They each had a 4–6-week run-in period, a 52-week treatment period and follow-up visits at Weeks 56 and 72 [18, 23]. In STRATOS 1, 1,207 participants were randomised 2:1:2:1 to receive either 300 mg tralokinumab or placebo subcutaneously (SC) every 2 weeks (Q2W), or 300 mg tralokinumab or placebo SC every 4 weeks (Q4W). In STRATOS 2, 856 participants were randomised 1:1 to receive either 300 mg tralokinumab or placebo SC Q2W .
The primary objective of STRATOS 1 was to investigate the effect of tralokinumab Q2W on the AAER up to Week 52 compared with placebo in an unselected all-comers population. The STRATOS 2 primary objective was originally to evaluate the effect of tralokinumab on the AAER in both an all-comers and a biomarker-positive population, but was amended to investigate only the biomarker-positive population as defined by the biomarker analysis of STRATOS 1. The analysis of the all-comers population was redefined as a secondary objective. Key secondary measures for STRATOS 1 and 2 were percentage change from baseline to Week 52 in pre-bronchodilator FEV1 and absolute change from baseline to Week 52 in scores of Asthma Control Questionnaire 6-item version (ACQ-6), Standardised Asthma Quality of Life Questionnaire for 12 years and older (AQLQ) and Asthma Symptom Score [18, 23].
The biomarker analysis plan for STRATOS 1
The biomarker analysis plan for STRATOS 1 had two objectives:
To assess the relationship between continuous baseline values for the five identified biomarkers, AAER and treatment as the basis for identifying the biomarker with potential properties to predict the treatment effect of tralokinumab.
To determine the most appropriate threshold for the biomarker identified as having the best potential predictive properties of enhanced treatment effect.
The biomarker analyses of STRATOS 1 were based on the Full Analysis Set (FAS), defined as all randomised participants (irrespective of baseline biomarker concentration, the ‘all-comers’) who received any investigational product, regardless of protocol adherence and/or premature investigational product discontinuation or delay. The definition of the biomarker-positive subgroup, participants in the FAS with baseline biomarker concentrations equal to or greater than the identified threshold cut-off, was determined prior to unblinding of STRATOS 2. These biomarker analyses were focused on comparing the tralokinumab Q2W data with placebo data. For this purpose, the Q2W and Q4W placebo arms, which were well balanced in terms of demographic characteristics such as age, sex, race and ethnicity and had comparable lung function at baseline, were combined. Results for the tralokinumab Q4W arm (vs. combined placebo data) were used to support the Q2W findings. All analyses and the covariates used in each model were pre-specified in the STRATOS 1 statistical analysis plan, which was shared with the FDA. Analyses were conducted using either SAS software (version 9.4; SAS Institute Inc., Cary, NC) or R (version 3.2.4 [https://www.r-project.org/]).
The STRATOS 1 biomarker analysis plan was based on an understanding that a single statistical analysis would not be suitable for addressing the two objectives, instead requiring multiple approaches. Consequently, the plan was developed using various statistical approaches to answer four separate questions, as described in detail below. Once the biomarker analysis plan was developed, multiple realistic scenarios with different effect sizes and biomarker interactions were simulated in order to assess whether the full statistical methodology was able to detect known predictive signals, as well as to refine our ability to interpret the results and practice the decision-making process. These blinded scenario simulations were carried out prior to the read-out of STRATOS 1 by a statistician who was not otherwise involved in the analysis. The simulated data were based on modelled relationships between biomarkers, other covariates and exacerbations, developed using baseline data from STRATOS 1. All of the analyses outlined below were run using the simulated data and interpreted by the blinded clinical team, the results of which were used to improve the decision-making process for moving forward with particular biomarkers and subgroups. The scenarios considered potential differences in placebo rate, all-comers effect (i.e. the overall treatment effect) and various relationships between biomarkers and exacerbations. These simulation exercises confirmed the ability of the methodology to support adequate identification of biomarker-positive subgroups and, in turn, helped to overcome the difficulties in interpreting the results.
Question 1: are baseline values of the five biomarkers predictive of treatment effect?
Before the predictive properties of the five candidate biomarkers were assessed, their distribution within the STRATOS 1 population and relationship with known potential risk factors for asthma exacerbations were assessed using descriptive statistics (Table 1). These potential risk factors included geographical region, number of exacerbations in the year prior to trial entry, body mass index, smoking status, inhaled corticosteroid (ICS) dosage, sex and age. Baseline concentrations of candidate biomarkers were similar across the treatment groups, but median concentrations of blood eosinophils (≈200 cells/μl) and FeNO (≈20.3 ppb) were relatively low for a severe asthma population. Clear relationships were found between biomarker concentrations and some non-biomarker covariates. Greater baseline FeNO, eosinophil and, to some degree, periostin concentrations were associated with a greater number of previous exacerbations, suggesting that these biomarkers were prognostic to some extent (Additional file 1: Figure S1). Greater baseline periostin concentrations were found in participants from the Asia/Pacific region compared with other regions, while greater baseline periostin and DPP-4 concentrations were observed in adolescents (Additional file 1: Figures S2 and S3).
To investigate the potential biomarker predictive properties of the five biomarkers expressed as continuous variables, graphs were created to present the relationship between AAER and baseline biomarker concentration (Fig. 2). For these graphs, negative binomial models were used to assess treatment effect (measured as AAER) with covariates of treatment group, geographical region, age and number of exacerbations in the previous year. The log of each participant’s corresponding follow-up time was used in the models as an offset variable to adjust for participants having different exposure times during which asthma exacerbations occurred. These graphs demonstrated greater exacerbation rates in the placebo group with increasing baseline concentrations of FeNO, periostin and eosinophils, suggesting a prognostic relationship. They also showed that the exacerbation rate did not increase with greater baseline concentrations of these biomarkers in the tralokinumab treatment group, suggesting a predictive relationship.
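As an illustrative sketch only, the negative binomial model with a follow-up-time offset described above can be written as follows. The symbols (Y_i for the exacerbation count, t_i for follow-up time in years, κ for the dispersion parameter and the β coefficients) are notation assumed here, not taken from the statistical analysis plan, and the coding of age and region (continuous or categorical) is left open.

```latex
\begin{align}
Y_i &\sim \mathrm{NegBin}(\mu_i, \kappa), \\
\log \mu_i &= \log t_i + \beta_0 + \beta_{\mathrm{trt}}\,\mathrm{trt}_i
  + \boldsymbol{\beta}_{\mathrm{region}}^{\top}\,\mathbf{region}_i
  + \beta_{\mathrm{age}}\,\mathrm{age}_i
  + \beta_{\mathrm{exac}}\,\mathrm{exac}_i, \\
\text{AAER reduction versus placebo} &= 1 - \exp\left(\beta_{\mathrm{trt}}\right).
\end{align}
```

Because log t_i enters with a fixed coefficient of 1, exp(β) terms are interpreted as ratios of annualised rates rather than of raw counts, which is how participants with different exposure times are made comparable.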
The biomarker predictive properties were investigated further using generalised additive models (GAMs), an extension of generalised linear models, with smoothing splines used to visualise potential relationships. In a GAM, some of the (log-)linear X terms are replaced with a fitted smooth function, f(X), to give a visual representation of the shape of f(biomarker) and therefore potential relationships in the data. The plots produced using GAMs visually supported the predictive and prognostic properties of baseline concentrations of FeNO, periostin and, to a lesser extent, eosinophils (Fig. 3).
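The substitution of a smooth term for a linear term can be sketched as below. This is a schematic under assumed notation (x_i for the baseline biomarker value, z_i for the remaining covariates, γ for their coefficients, t_i for follow-up time); the exact smooth specification used in the analysis is not stated in the text.

```latex
\begin{align}
\text{(log-)linear term:} \quad \log \mu_i &= \log t_i + \beta_0 + \beta_x\, x_i + \boldsymbol{\gamma}^{\top} \mathbf{z}_i, \\
\text{GAM:} \quad \log \mu_i &= \log t_i + \beta_0 + f(x_i) + \boldsymbol{\gamma}^{\top} \mathbf{z}_i,
\end{align}
```

where f is a smoothing spline estimated from the data. Plotting the fitted f against x shows whether the relationship between the biomarker and the exacerbation rate departs from a straight line on the log scale.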
Likelihood ratio tests were used to provisionally quantify the predictive properties of the continuous biomarkers by assessing the impact of biomarker-by-treatment interactions. Firstly, a negative binomial model that included all interaction terms for all five candidate biomarkers versus treatment was compared with a model without interaction terms. Secondly, separate models with and without treatment-by-biomarker interaction terms were compared for each individual candidate biomarker. These assessments provided exploratory interaction effects for each biomarker, but had low power and were only able to identify non-complex linear relationships. Noting these limitations, these analyses found nominally significant (p < 0.10) interaction effects for FeNO in both tralokinumab treatment groups and for periostin in the Q4W, but not the Q2W, group (Table 2).
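A minimal sketch of the likelihood ratio comparison for a single biomarker is given below, assuming that the model with the treatment-by-biomarker interaction has exactly one more parameter than the model without it (one degree of freedom). The function name and the log-likelihood values are hypothetical and are not STRATOS results; the joint test of all five interactions would need a chi-square distribution with more degrees of freedom, which this sketch does not cover.

```python
import math


def lrt_pvalue_1df(loglik_reduced, loglik_full):
    """Likelihood ratio test for one extra parameter (1 degree of freedom).

    Returns the test statistic and its p-value.
    """
    stat = max(2.0 * (loglik_full - loglik_reduced), 0.0)
    # For 1 df, the chi-square survival function has the closed form
    # erfc(sqrt(x / 2)), so no external statistics library is needed.
    p_value = math.erfc(math.sqrt(stat / 2.0))
    return stat, p_value


# Hypothetical log-likelihoods, for illustration only.
stat, p = lrt_pvalue_1df(loglik_reduced=-1502.4, loglik_full=-1500.9)
print(f"LR statistic = {stat:.2f}, p = {p:.3f}")
```

With these made-up inputs, the interaction would meet a nominal p < 0.10 threshold but not p < 0.05, which illustrates why such tests were treated as exploratory.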
In addition to investigating the biomarkers as continuous variables, Forest plots were used to show the AAER reduction with tralokinumab versus placebo for the five biomarkers both within each quartile group (Fig. 4a), and in biomarker-high and -low subgroups defined by cumulative cut-offs based on quartiles (Fig. 4b). These data were estimated using negative binomial models that, in addition to treatment group, biomarker group, treatment*biomarker group and time on study, included covariates with which the outcome was likely to correlate, such as geographical region, age and previous number of exacerbations in the past year; these covariates were also included in the model used for the primary analysis of STRATOS 2. Within-quartile grouping indicated that the treatment effect was greatest at high baseline concentrations of both FeNO and periostin (Fig. 4a), with a similar pattern demonstrated with increasing cut-offs of FeNO (Fig. 4b). This suggested that FeNO and periostin were potentially predictive of treatment response to tralokinumab.
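The within-quartile comparison can be illustrated with crude rates, as in the sketch below. This is a simplification: the analysis described above used covariate-adjusted negative binomial models, whereas this example divides total events by total follow-up time. The tuple layout, function names and toy data are assumptions made for illustration.

```python
import statistics


def crude_aaer(rows):
    """Events per participant-year; rows are (biomarker, treated, events, years)."""
    return sum(r[2] for r in rows) / sum(r[3] for r in rows)


def reduction_by_quartile(rows):
    """Crude AAER reduction (1 - rate ratio) within each biomarker quartile."""
    q1, q2, q3 = statistics.quantiles([r[0] for r in rows], n=4)
    edges = [float("-inf"), q1, q2, q3, float("inf")]
    reductions = []
    for lo, hi in zip(edges[:-1], edges[1:]):
        group = [r for r in rows if lo < r[0] <= hi]
        active = crude_aaer([r for r in group if r[1]])
        placebo = crude_aaer([r for r in group if not r[1]])
        reductions.append(1.0 - active / placebo)
    return reductions


# Toy data, for illustration only: placebo events rise with the biomarker,
# events on active treatment do not (a "predictive" pattern).
rows = []
for i in range(16):
    treated = i % 2 == 0
    events = 1 if treated else 1 + i // 4
    rows.append((10.0 * (i + 1), treated, events, 1.0))

for q, red in enumerate(reduction_by_quartile(rows), start=1):
    print(f"Quartile {q}: crude AAER reduction = {red:.0%}")
```

In this toy data set, the crude reduction grows from the lowest to the highest quartile, which is the pattern a Forest plot by quartile is designed to reveal.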
Question 2: is the choice of a biomarker-positive subgroup defensible?
The collective evidence from the above analyses identified FeNO and periostin as biomarkers that were prognostic and potentially predictive of response to tralokinumab. The Subgroup Identification based on Differential Effect Search (SIDES) algorithm [24, 25] was used to further support the predictive properties of these candidate biomarkers and to identify the respective cut-off values for an enhanced response to tralokinumab.
SIDES recursively partitions specific areas of the covariate space associated with treatment benefit using a treatment effect–based splitting criterion in order to identify the best split for each covariate [24, 25] (Fig. 5). In the search algorithm, a negative binomial model was used to estimate the treatment effect, which included treatment group as a covariate, as well as the log of each participant’s corresponding follow-up time as an offset variable. When further assessing the identified subgroups, age, geographical region and number of previous exacerbations were included as covariates to match the model that was to be used in the primary analysis of STRATOS 2. Because AAER reduction, the primary outcome measure of tralokinumab treatment effect, was a count variable and the SIDES package available at the time of these analyses did not allow for the modelling of count data via negative binomial models, a bespoke package was developed in collaboration with I Lipkovich for the analysis of STRATOS 1 and 2 (it should be noted that the latest version of SIDES allows for count data modelling).
In order to restrict how complex the subgroups identified by SIDES could be, it was pre-specified that the prevalence of a biomarker cut-off was required to be at least 30% in the study population and subgroups could only be based on one of the candidate biomarkers. Additional SIDES analyses were conducted in which subgroups could be identified using either multiple biomarkers or non-biomarker covariates to aid understanding of how the biomarkers influenced the effect of tralokinumab. The results of the SIDES analysis were presented in Forest plots, which confirmed the potential predictive properties of FeNO and periostin and identified baseline cut-off values of > 32.3 ppb and > 27.4 ng/ml, respectively (Fig. 6). The FeNO biomarker subgroup identified through SIDES had a slightly better AAER reduction with tralokinumab versus placebo than the periostin subgroup (38% versus 31%).
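The idea of a cut-off search under a minimum-prevalence constraint can be sketched as below. This is not the SIDES algorithm and not the bespoke package used for STRATOS: it is a deliberately simplified single-biomarker scan that uses crude rates instead of a negative binomial model, with hypothetical function names and toy data.

```python
def crude_reduction(rows):
    """1 - (active rate / placebo rate); rows are (biomarker, treated, events, years)."""
    def rate(group):
        years = sum(r[3] for r in group)
        return sum(r[2] for r in group) / years if years > 0 else None

    active = rate([r for r in rows if r[1]])
    placebo = rate([r for r in rows if not r[1]])
    if active is None or not placebo:
        return None  # rate ratio undefined
    return 1.0 - active / placebo


def best_cutoff(rows, min_prevalence=0.30):
    """Scan observed biomarker values and keep the 'greater than cut-off'
    subgroup with the largest crude reduction, subject to the subgroup
    containing at least min_prevalence of all participants."""
    best = None
    for cut in sorted({r[0] for r in rows}):
        subgroup = [r for r in rows if r[0] > cut]
        if len(subgroup) < min_prevalence * len(rows):
            continue
        reduction = crude_reduction(subgroup)
        if reduction is not None and (best is None or reduction > best[1]):
            best = (cut, reduction)
    return best


# Toy data, for illustration only (not STRATOS values).
rows = []
for i in range(16):
    treated = i % 2 == 0
    events = 1 if treated else 1 + i // 4
    rows.append((10.0 * (i + 1), treated, events, 1.0))

cut, reduction = best_cutoff(rows)
print(f"Best cut-off: > {cut:g}, crude AAER reduction = {reduction:.0%}")
```

The prevalence constraint matters because, without it, the scan tends to drift towards very small subgroups in which extreme rate ratios arise by chance.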
The SIDES-identified biomarker cut-off values were confirmed using robustness and sensitivity analyses; for example, assessing the effects of minor parameter modifications and removing the restriction on subgroup size (see Additional file 2 for further details). The certainty of the identified cut-offs was tested by bootstrapping the data (i.e. a number of bootstrap sample populations were created by sampling with replacement data from the STRATOS 1 study) and then re-running SIDES on each bootstrap sample, resulting in a range of cut-offs (across 50 evenly distributed splits) for each biomarker. The resulting plots identified how many times the cut-off values were chosen for each biomarker and whether the subgroups chosen (with greater observed efficacy) were above or below the cut-off value. These results were then compared with the ‘best’ cut-off identified by the initial SIDES analysis. The comparison showed a greater degree of certainty (i.e. less variability) with FeNO than with periostin (Fig. 7), which while exploratory, may reflect the uncertainty in our understanding of the roles of various cytokines and biomarkers in the pathophysiology of severe asthma.
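The bootstrap check can be sketched as follows: resample participants with replacement, rerun the subgroup search on each resample and tally which cut-offs are selected. The search function here is a simplified crude-rate stand-in for SIDES, and all names, the number of resamples and the toy data are assumptions for illustration.

```python
import random
from collections import Counter


def best_split(rows, min_prevalence=0.30):
    """Simplified stand-in for one subgroup search: return (cut-off, crude
    AAER reduction) for the best 'biomarker > cut-off' subgroup, or None.
    rows are (biomarker, treated, events, years) tuples."""
    best = None
    for cut in sorted({r[0] for r in rows}):
        sub = [r for r in rows if r[0] > cut]
        if len(sub) < min_prevalence * len(rows):
            continue
        act_years = sum(r[3] for r in sub if r[1])
        pbo_years = sum(r[3] for r in sub if not r[1])
        pbo_events = sum(r[2] for r in sub if not r[1])
        if act_years == 0 or pbo_years == 0 or pbo_events == 0:
            continue  # rate ratio undefined in this subgroup
        act_rate = sum(r[2] for r in sub if r[1]) / act_years
        reduction = 1.0 - act_rate / (pbo_events / pbo_years)
        if best is None or reduction > best[1]:
            best = (cut, reduction)
    return best


def bootstrap_cutoffs(rows, n_boot=200, seed=0):
    """Count how often each cut-off is selected across bootstrap resamples."""
    rng = random.Random(seed)
    counts = Counter()
    for _ in range(n_boot):
        sample = [rng.choice(rows) for _ in rows]  # sample with replacement
        result = best_split(sample)
        if result is not None:
            counts[result[0]] += 1
    return counts


# Toy data, for illustration only (not STRATOS values).
rows = []
for i in range(16):
    treated = i % 2 == 0
    events = 1 if treated else 1 + i // 4
    rows.append((10.0 * (i + 1), treated, events, 1.0))

counts = bootstrap_cutoffs(rows)
for cut, n in counts.most_common(3):
    print(f"cut-off > {cut:g} selected in {n} resamples")
```

A tally concentrated on a narrow range of cut-offs suggests a stable threshold, whereas a flat or scattered tally suggests that the selected value depends heavily on which participants happened to be sampled.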
A permutation approach was used to assess the likelihood of recording the observed AAER reduction by chance in any subgroup when no true connection between biomarker and treatment existed. The biomarker variables were randomly permuted against participant-level data (treatment, exacerbation history, etc.) to remove the biomarker effects and leave only the overall treatment effect. These data were then run through SIDES to identify the subgroup (defined by any of the five candidate biomarkers) with the best treatment effect. The process was repeated 500 times to give a distribution of permuted ‘best subgroup’ effects by chance that were then compared with the results obtained in the initial SIDES analysis for each biomarker (Fig. 8). Based on this analysis, the median best AAER reduction observed by chance was estimated to be 33%. The effect observed in the main SIDES analysis for the FeNO subgroup (38% reduction) was greater than this value, although still within the distribution of ‘chance’ results; in contrast, the effect observed in the periostin subgroup (31% reduction) was slightly lower. This analysis provided support for choosing FeNO over periostin as the biomarker to assess further.
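The permutation procedure can be illustrated with the sketch below, which shuffles biomarker values across participants, reruns a subgroup search and collects the best reduction found in each permuted data set. The search is again a simplified crude-rate stand-in for SIDES that considers one biomarker only; function names and toy data are hypothetical.

```python
import random
import statistics


def best_split(rows, min_prevalence=0.30):
    """Simplified stand-in for one subgroup search: return (cut-off, crude
    AAER reduction) for the best 'biomarker > cut-off' subgroup, or None.
    rows are (biomarker, treated, events, years) tuples."""
    best = None
    for cut in sorted({r[0] for r in rows}):
        sub = [r for r in rows if r[0] > cut]
        if len(sub) < min_prevalence * len(rows):
            continue
        act_years = sum(r[3] for r in sub if r[1])
        pbo_years = sum(r[3] for r in sub if not r[1])
        pbo_events = sum(r[2] for r in sub if not r[1])
        if act_years == 0 or pbo_years == 0 or pbo_events == 0:
            continue  # rate ratio undefined in this subgroup
        act_rate = sum(r[2] for r in sub if r[1]) / act_years
        reduction = 1.0 - act_rate / (pbo_events / pbo_years)
        if best is None or reduction > best[1]:
            best = (cut, reduction)
    return best


def permutation_null(rows, n_perm=500, seed=0):
    """Best-subgroup reductions after breaking the biomarker-outcome link."""
    rng = random.Random(seed)
    biomarkers = [r[0] for r in rows]
    null = []
    for _ in range(n_perm):
        shuffled = biomarkers[:]
        rng.shuffle(shuffled)
        # Treatment, events and follow-up stay together; only the biomarker moves.
        permuted = [(b,) + r[1:] for b, r in zip(shuffled, rows)]
        result = best_split(permuted)
        if result is not None:
            null.append(result[1])
    return null


# Toy data, for illustration only (not STRATOS values).
rows = []
for i in range(16):
    treated = i % 2 == 0
    events = 1 if treated else 1 + i // 4
    rows.append((10.0 * (i + 1), treated, events, 1.0))

observed = best_split(rows)[1]
null = permutation_null(rows)
share = sum(x >= observed for x in null) / len(null)
print(f"observed = {observed:.0%}, median by chance = {statistics.median(null):.0%}")
print(f"share of permutations at least as large as observed = {share:.2f}")
```

Because the search always reports its best subgroup, the chance distribution is centred well above the overall treatment effect, which is why an observed subgroup effect needs to be judged against this distribution and not against zero.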
Alongside the evaluation of the secondary endpoints in the biomarker subgroups (described below), further analyses were conducted to assess the treatment effect on AAER in subgroups defined by FeNO cut-off values ranging from 30 to 40 ppb (Table 3). This was done using a negative binomial model with treatment group, geographical region, age, number of exacerbations in the previous year, treatment*biomarker group interaction and periostin group at baseline as covariates. Based on this analysis, a threshold of FeNO ≥37 ppb provided the best AAER reduction with tralokinumab treatment. Similar further analyses were not conducted for periostin following the assessment of the key secondary endpoints (described below) using the cut-off value identified by SIDES (> 27.4 ng/ml).
Question 3: is there consistency of predictive effect across key secondary efficacy endpoints?
To evaluate further the choice of biomarker and threshold, key secondary endpoints in STRATOS 1 (percentage change from baseline in FEV1, and absolute changes from baseline in ACQ-6 score, AQLQ score and Asthma Symptom Score) were analysed using repeated measures models for the FeNO- and periostin-defined subgroups. Nominally significant improvements versus placebo in all key secondary endpoints – except for Asthma Symptom Score – were observed in the FeNO ≥37 ppb subgroup (Table 4); similar results were observed in the subgroup with FeNO ≥32.3 ppb (data not shown). In contrast, there was no consistent enhancement of treatment effect in the periostin > 27.4 ng/ml subgroup (Table 5).
The combined observations obtained through the above statistical methods supported the choice of FeNO as the preferred predictive biomarker with the threshold of FeNO ≥37 ppb. An overall comparison of the findings with FeNO ≥37 ppb and periostin > 27.4 ng/ml is shown in Table 6.
Question 4: is there a safety signal in the chosen biomarker subgroup?
Adverse event (AE) reporting was evaluated for the FeNO ≥37 ppb and FeNO < 37 ppb subgroups. Reporting rates of overall AEs, serious AEs (SAEs) and AEs leading to study drug discontinuation were similar for the two subgroups. Reporting rates of individual AEs were also similar for the two subgroups (data reported elsewhere).
Asthma is common, affecting around 339 million people worldwide; up to 10% of these individuals have severe disease [27, 28]. People with severe asthma, despite established standards of care, experience diminished health-related quality of life, acute asthma exacerbations with frequent emergency room visits and hospitalisations, and thereby consume the majority of asthma-related healthcare resources [29,30,31]. There remains a significant unmet clinical need for these individuals, which has been partly met by the recent development of biologics. However, the five biologics currently approved for the treatment of asthma are not effective in all people with severe asthma, and rely on biomarkers to identify the individuals who are most likely to benefit from their use. These biomarkers are total serum IgE for omalizumab and blood eosinophil counts for benralizumab, mepolizumab, reslizumab and dupilumab. The reason for the differences in the indicated patient profiles for these biologics is that asthma is a heterogeneous disease; there are different underlying mechanisms of airway inflammation driving disease and, thus, affecting treatment response. The Phase II trials with tralokinumab illustrate this point. Effect was not found in the all-comer populations in these trials, but enhanced benefit was observed in subgroups of participants who had evidence of IL-13 axis activation [12, 13]. Unfortunately, the Phase II trials did not identify the biomarker with the best predictive properties for tralokinumab efficacy. The Phase III clinical development programme, which included the pivotal trials STRATOS 1 and 2, was therefore designed to first evaluate whether tralokinumab was effective in the all-comers severe asthma population, and second to determine whether there was a biomarker that identified a subgroup with an enhanced treatment benefit with tralokinumab.
We have presented the statistical methods employed in the biomarker analyses of the tralokinumab Phase III clinical trial, STRATOS 1. The aim of these analyses was to identify the biomarker and associated cut-off value most likely to define a biomarker-positive participant subgroup with an enhanced tralokinumab treatment effect. Based on the totality of evidence, FeNO with a cut-off of ≥37 ppb was considered the best option.
Biomarker-positive subgroup identification is extremely complex, with a wide variety of approaches available. As the number of biomarkers believed to predict tralokinumab treatment effect is small, machine learning methods that identify variables by relative influence, such as random forest, virtual twins and gradient boosting models, were not appropriate. These methods primarily identify and rank potential biomarkers from a very large pool of candidates, but do not provide estimates of treatment effects or suggested cut-offs. Bayesian model approaches were rejected as they are typically used for the analysis of pre-specified subgroups and also require the specification of a prior distribution, the choice of which can have a substantial impact on the result [42, 43]. Instead, we used a structured approach relying on multiple statistical methods. The introduction of the SIDES algorithm into this structured approach was useful for several reasons. The SIDES algorithm is reproducible and intuitive, with the advantage that the outputs are easily interpretable. Most importantly for our needs, it identifies potential predictive biomarkers while simultaneously determining the cut-off values for defining subgroups and allowing explicit control of subgroup complexity. The search methodology of SIDES can incorporate covariate-adjusted estimates of treatment effect in subgroups and is less restrictive than tree-based algorithms, allowing evaluation of multiple overlapping subgroups. Finally, as we have demonstrated here, the method can be easily adapted to new types of data. In comparison, classical methods such as interaction tests have low power, only measure linear contributions, would suffer from a greater degree of multiplicity and do not provide biomarker cut-off values. The flexibility of SIDES allowed for the exploration of various options and for an increased understanding of how biomarkers impact the treatment effect of tralokinumab.
Of the five biomarkers we tested, the two identified as most likely to predict enhanced tralokinumab treatment effect were FeNO and periostin. Concentrations of both of these biomarkers are directly related to IL-13 axis activation. High concentrations of FeNO are associated with elevated type-2 inflammation, an increased risk of asthma exacerbations [45, 46] and steroid insensitivity in people with asthma. Nitric oxide is produced in the airways by inducible NO synthase, an enzyme that is upregulated by IL-13. FeNO has previously been investigated as a biomarker in clinical trials of biologics for the treatment of asthma, often as a surrogate biomarker of eosinophilic inflammation, and has demonstrated predictive properties for improved treatment responses [14, 50,51,52,53,54]. Whilst the best FeNO cut-off we identified using SIDES was > 32.3 ppb, upon further investigation of the tralokinumab effect on AAER reduction and secondary endpoints we established the subgroup defined by a cut-off of ≥37 ppb as the best choice. This was because, based on the dataset in hand, it predicted the greatest treatment effect with tralokinumab in STRATOS 1, despite the prevalence of participants with baseline FeNO ≥37 ppb in STRATOS 1 and 2 being lower than the minimum we had originally planned for (24% [285/1,202] and 27% [229/837] vs. 30%, respectively). The potential added benefit of tralokinumab was considered to outweigh this decrease in prevalence. In addition, the ≥37 ppb cut-off was similar to the value of > 35 ppb established in the classification of a subgroup of participants with particularly poor asthma outcomes. Interestingly, in the STRATOS 2 trial, we observed a greater effect on AAER with tralokinumab compared with placebo in the FeNO-high subgroup than in the all-comers population, but this effect was neither statistically significant nor clinically meaningful.
Potential reasons why statistical significance was not achieved are discussed elsewhere, but may include a number of factors. For example, in addition to the low prevalence of FeNO-high participants (27%), there was a lack of opportunity to enrich the study population for a FeNO-high subgroup or to allow for stabilisation of baseline FeNO concentrations prior to randomisation. This was due to the staggered design of the STRATOS 1 and 2 trials, as FeNO was not identified as a predictive biomarker until after STRATOS 2 had enrolled participants. Further, the smaller treatment effect in terms of reduction in AAER with tralokinumab, as observed in the all-comers population in STRATOS 2, compared with STRATOS 1, may have limited the potential treatment effect in the FeNO-high subgroup. Finally, as is often done when searching for a subgroup, we selected the most appropriate subgroup from a number of potential candidates. Even though we attempted to discount for it, it is possible that we observed a random high exacerbation rate reduction within the FeNO-high subgroup in STRATOS 1.
Periostin is a matricellular protein secreted by airway epithelia in response to IL-4 and IL-13 and is involved in the development and persistence of allergic inflammation. It can induce transforming growth factor-β–mediated collagen secretion from fibroblasts, which contributes to fibrosis in bronchial asthma [20, 57], and can facilitate eosinophil migration to sites of type-2 inflammation. Periostin (cut-off ≥50 ng/ml) was investigated as a predictive biomarker for therapeutic effect (measured as reductions in the exacerbation rate) in Phase III trials of the anti–IL-13 mAb lebrikizumab for the treatment of uncontrolled asthma, but proved inconsistent. Our analysis of STRATOS 1, using a lower cut-off value of > 27.4 ng/ml, was also able to identify a treatment effect in a periostin-high subgroup. However, this effect was less than that seen with FeNO, and was only seen for asthma exacerbations, not for the secondary endpoints based on lung function and quality of life assessments. An important limitation of periostin was the observed variation in baseline concentrations by region and age: greater baseline concentrations were found in participants from the Asia/Pacific region and in adolescents than in other groups, which would have complicated potential use of periostin to guide personalised treatment with tralokinumab in routine practice. In contrast to the findings in relation to periostin and to those from a previous Phase IIb trial, DPP-4 levels were not shown to be predictive of response to tralokinumab treatment. This further highlights not only the complexity of type-2 inflammation in severe asthma, but also that the role of IL-13 in severe asthma exacerbations may be limited.
There are important strengths of our analysis. It was rigorously developed and tested using simulation exercises prior to implementation for the analysis of STRATOS 1 results. The consistency of findings across the multiple statistical methods used reassured us that the choice of FeNO with the threshold of ≥37 ppb was reasonable. One of the main limitations of analyses that aim to identify participant subgroups is the large number of individuals required. For example, powering a trial to identify a 10-unit difference in treatment effect between two subgroups (of equal size) rather than powering the trial for a 10-unit treatment effect in an unselected population would require four times the number of participants. The STRATOS 1 population was not large enough to fully assess the predictive properties of the assessed biomarkers because of the sample size this would have required. As the trials were run largely in parallel, by the time FeNO was identified as the potentially predictive biomarker in STRATOS 1, STRATOS 2 had completed recruitment, precluding enrichment of the population of that trial with a FeNO-high subgroup. In accordance with the STRATOS 2 statistical analysis plan and based on the expected effect and sample size for the selected FeNO-high subgroup (estimated from the STRATOS 1 data), the testing strategy used in STRATOS 2 was adjusted to increase power by allowing the FeNO-high subgroup (which comprised 27% [229/837] of the participants in STRATOS 2) to be tested using the full allocated alpha. Finally, the innovative nature of SIDES could have affected the clinical team’s ability to interpret the outputs accurately and in a timely manner. To help prevent this, the simulation and interpretation exercises we conducted were used in part to familiarise the clinical teams with interpretation of the data.
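The fourfold sample-size figure quoted above follows from how the variance of the estimate grows when a trial is split into subgroups. The sketch below reproduces it under simplifying assumptions that are not taken from the STRATOS design: a continuous, normally distributed outcome with a common standard deviation, 1:1 randomisation, two subgroups of equal size and a two-sided normal-approximation test. The standard deviation of 25 units, the alpha and power values, and the function names are illustrative only.

```python
from statistics import NormalDist


def _z_total(alpha, power):
    """Sum of the normal quantiles used in the standard sample-size formula."""
    nd = NormalDist()
    return nd.inv_cdf(1 - alpha / 2) + nd.inv_cdf(power)


def total_n_for_overall_effect(delta, sigma, alpha=0.05, power=0.8):
    """Total N to detect a treatment effect `delta` in an unselected population.

    With N/2 participants per arm, Var(effect estimate) = 4 * sigma^2 / N.
    """
    return 4 * sigma ** 2 * _z_total(alpha, power) ** 2 / delta ** 2


def total_n_for_subgroup_difference(delta, sigma, alpha=0.05, power=0.8):
    """Total N to detect a difference `delta` in treatment effect between
    two equal-sized subgroups (a treatment-by-subgroup interaction).

    Each subgroup has N/4 participants per arm, so each subgroup effect has
    variance 8 * sigma^2 / N and their difference has variance 16 * sigma^2 / N.
    """
    return 16 * sigma ** 2 * _z_total(alpha, power) ** 2 / delta ** 2


n_overall = total_n_for_overall_effect(delta=10, sigma=25)
n_interaction = total_n_for_subgroup_difference(delta=10, sigma=25)
ratio = n_interaction / n_overall
print(f"overall effect: N = {n_overall:.0f}")
print(f"subgroup difference: N = {n_interaction:.0f}")
print(f"ratio = {ratio:.1f}")
```

The ratio does not depend on the assumed standard deviation, alpha or power, because these terms cancel; only the fourfold increase in the variance of the estimate matters. For count outcomes such as exacerbation rates the exact figures would differ, but the same reasoning applies.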
Identifying a biomarker for predicting the treatment effect of a biologic for use in severe asthma is a challenge. We describe a rigorous approach using multiple statistical methods to identify the biomarker that most effectively identified a subgroup with an enhanced tralokinumab treatment effect in STRATOS 1. SIDES was applied as one of the components of this biomarker analysis plan and provided important insights into the predictive properties of the five potential biomarkers. Simulation and interpretation exercises allowed us to confirm that the methods used would be able to detect the signals required, as well as to refine our ability to interpret the results and practise the decision-making process. Using data from the STRATOS 1 trial, our analyses identified FeNO at a cut-off of ≥37 ppb as the best option for predicting enhanced treatment effect to be tested in the STRATOS 2 trial. To support this finding, additional analyses were performed, including robustness and sensitivity checks to mitigate false discovery, overfitting and overoptimistic belief in the chosen subgroup. However, findings from STRATOS 2, in terms of the effect of tralokinumab on AAER compared with placebo, were not sufficient to support future development of anti–IL-13 therapy with tralokinumab for severe asthma. This further emphasises the complexity of subgroup identification in the severe asthma population.
Availability of data and materials
The data that support the findings of this study and the bespoke SIDES package that was developed to include count data are available from the authors upon written request.
Abbreviations
- AAER: Annualised asthma exacerbation rate
- ACQ-6: Asthma Control Questionnaire (6-item)
- AQLQ(S)+12: Standardised Asthma Quality of Life Questionnaire for 12 years and older
- FAS: Full Analysis Set
- FeNO: Fractional exhaled nitric oxide
- FEV1: Forced expiratory volume in 1 s
- GAM: Generalised additive models
- Q2W: Every 2 weeks
- Q4W: Every 4 weeks
- SAE: Serious adverse event
- SIDES: Subgroup Identification based on Differential Effect Search
Corren J. Role of interleukin-13 in asthma. Curr Allergy Asthma Rep. 2013;13:415–20.
Zhu Z, Homer RJ, Wang Z, Chen Q, Geba GP, Wang J, Zhang Y, Elias JA. Pulmonary expression of interleukin-13 causes inflammation, mucus hypersecretion, subepithelial fibrosis, physiologic abnormalities, and eotaxin production. J Clin Invest. 1999;103:779–88.
Grunig G, Warnock M, Wakil AE, Venkayya R, Brombacher F, Rennick DM, Sheppard D, Mohrs M, Donaldson DD, Locksley RM, Corry DB. Requirement for IL-13 independently of IL-4 in experimental asthma. Science. 1998;282:2261–3.
Wills-Karp M, Luyimbazi J, Xu X, Schofield B, Neben TY, Karp CL, Donaldson DD. Interleukin-13: central mediator of allergic asthma. Science. 1998;282:2258–61.
Saha SK, Berry MA, Parker D, Siddiqui S, Morgan A, May R, Monk P, Bradding P, Wardlaw AJ, Pavord ID, Brightling CE. Increased sputum and bronchial biopsy IL-13 expression in severe asthma. J Allergy Clin Immunol. 2008;121:685–91.
Berry MA, Parker D, Neale N, Woodman L, Morgan A, Monk P, Bradding P, Wardlaw AJ, Pavord ID, Brightling CE. Sputum and bronchial submucosal IL-13 expression in asthma and eosinophilic bronchitis. J Allergy Clin Immunol. 2004;114:1106–9.
Naseer T, Minshall EM, Leung DY, Laberge S, Ernst P, Martin RJ, Hamid Q. Expression of IL-12 and IL-13 mRNA in asthma and their modulation in response to steroid therapy. Am J Respir Crit Care Med. 1997;155:845–51.
Kotsimbos TC, Ernst P, Hamid QA. Interleukin-13 and interleukin-4 are coexpressed in atopic asthma. Proc Assoc Am Physicians. 1996;108:368–73.
Humbert M, Durham SR, Kimmitt P, Powell N, Assoufi B, Pfister R, Menz G, Kay AB, Corrigan CJ. Elevated expression of messenger ribonucleic acid encoding IL-13 in the bronchial mucosa of atopic and nonatopic subjects with asthma. J Allergy Clin Immunol. 1997;99:657–65.
May RD, Monk PD, Cohen ES, Manuel D, Dempsey F, Davis NH, Dodd AJ, Corkill DJ, Woods J, Joberty-Candotti C, et al. Preclinical development of CAT-354, an IL-13 neutralizing antibody, for the treatment of severe uncontrolled asthma. Br J Pharmacol. 2012;166:177–93.
Popovic B, Breed J, Rees DG, Gardener MJ, Vinall LMK, Kemp B, Spooner J, Keen J, Minter R, Uddin F, et al. Structural characterisation reveals mechanism of IL-13 neutralising monoclonal antibody tralokinumab as inhibition of binding to IL-13Rα1 and IL-13Rα2. J Mol Biol. 2017;429:208–19.
Piper E, Brightling C, Niven R, Oh C, Faggioni R, Poon K, She D, Kell C, May RD, Geba GP, Molfino NA. A phase II placebo-controlled study of tralokinumab in moderate-to-severe asthma. Eur Respir J. 2013;41:330–8.
Brightling CE, Chanez P, Leigh R, O'Byrne PM, Korn S, She D, May RD, Streicher K, Ranade K, Piper E. Efficacy and safety of tralokinumab in patients with severe uncontrolled asthma: a randomised, double-blind, placebo-controlled, phase 2b trial. Lancet Respir Med. 2015;3:692–701.
Hanania NA, Noonan M, Corren J, Korenblat P, Zheng Y, Fischer SK, Cheu M, Putnam WS, Murray E, Scheerens H, et al. Lebrikizumab in moderate-to-severe asthma: pooled data from two randomised placebo-controlled studies. Thorax. 2015;70:748–56.
Hanania NA, Korenblat P, Chapman KR, Bateman ED, Kopecky P, Paggiaro P, Yokoyama A, Olsson J, Gray S, Holweg CT, et al. Efficacy and safety of lebrikizumab in patients with uncontrolled asthma (LAVOLTA I and LAVOLTA II): replicate, phase 3, randomised, double-blind, placebo-controlled trials. Lancet Respir Med. 2016;4:781–96.
Wenzel SE. Asthma phenotypes: the evolution from clinical to molecular approaches. Nat Med. 2012;18:716–25.
Panettieri R, Wang M, Braddock M, Bowen K, Colice G. Tralokinumab for the treatment of severe, uncontrolled asthma: the ATMOSPHERE clinical development program. Immunotherapy. 2018;10:473–90.
Panettieri RA Jr, Brightling C, Sjobring U, Péterffy A, Tornling G, Daoud SZ, Ranade K, Hollis S, Colice G. STRATOS 1 and 2: considerations in clinical trial design for a fully human monoclonal antibody in severe asthma. Clin Invest (Lond). 2015;5:701–11.
Shiobara T, Chibana K, Watanabe T, Arai R, Horigane Y, Nakamura Y, Hayashi Y, Shimizu Y, Takemasa A, Ishii Y. Dipeptidyl peptidase-4 is highly expressed in bronchial epithelial cells of untreated asthma and it increases cell proliferation along with fibronectin production in airway constitutive cells. Respir Res. 2016;17:28.
Takayama G, Arima K, Kanaji T, Toda S, Tanaka H, Shoji S, McKenzie AN, Nagai H, Hotokebuchi T, Izuhara K. Periostin: a novel component of subepithelial fibrosis of bronchial asthma downstream of IL-4 and IL-13 signals. J Allergy Clin Immunol. 2006;118:98–104.
Medrek SK, Parulekar AD, Hanania NA. Predictive biomarkers for asthma therapy. Curr Allergy Asthma Rep. 2017;17:69.
Cai F, Hornauer H, Peng K, Schofield CA, Scheerens H, Morimoto AM. Bioanalytical challenges and improved detection of circulating levels of IL-13. Bioanalysis. 2016;8:323–32.
Panettieri R, Sjobring U, Péterffy A, Wessman P, Bowen K, Piper E, Colice G, Brightling C. Tralokinumab for severe, uncontrolled asthma (STRATOS 1 and STRATOS 2): two randomised, double-blind, placebo-controlled, phase 3 clinical trials. Lancet Respir Med. 2018;6(7):511–25.
Lipkovich I, Dmitrienko A, Denne J, Enas G. Subgroup identification based on differential effect search--a recursive partitioning method for establishing response to treatment in patient subpopulations. Stat Med. 2011;30:2601–21.
Lipkovich I, Dmitrienko A. Strategies for identifying predictive biomarkers and subgroups with enhanced treatment effect in clinical trials using SIDES. J Biopharm Stat. 2014;24:130–53.
The Global Asthma Report 2018 [http://www.globalasthmareport.org/Global%20Asthma%20Report%202018.pdf].
Chung KF, Wenzel SE, Brozek JL, Bush A, Castro M, Sterk PJ, Adcock IM, Bateman ED, Bel EH, Bleecker ER, et al. International ERS/ATS guidelines on definition, evaluation and treatment of severe asthma. Eur Respir J. 2014;43:343–73.
Global strategy for asthma management and prevention [http://www.ginasthma.org].
Chipps BE, Zeiger RS, Borish L, Wenzel SE, Yegin A, Hayden ML, Miller DP, Bleecker ER, Simons FE, Szefler SJ, et al. Key findings and clinical implications from the epidemiology and natural history of asthma: outcomes and treatment regimens (TENOR) study. J Allergy Clin Immunol. 2012;130:332–42 e310.
Shaw DE, Sousa AR, Fowler SJ, Fleming LJ, Roberts G, Corfield J, Pandis I, Bansal AT, Bel EH, Auffray C, et al. Clinical and inflammatory characteristics of the European U-BIOPRED adult severe asthma cohort. Eur Respir J. 2015;46:1308–21.
Sullivan SD, Rasouliyan L, Russo PA, Kamath T, Chipps BE, TENOR Study Group. Extent, patterns, and burden of uncontrolled disease in severe or difficult-to-treat asthma. Allergy. 2007;62:126–33.
XOLAIR® (omalizumab): highlights of prescribing information [http://www.gene.com/download/pdf/xolair_prescribing.pdf].
FASENRA® (benralizumab): highlights of prescribing information [https://www.azpicentral.com/fasenra/fasenra_pi.pdf].
NUCALA® (mepolizumab): highlights of prescribing information [https://www.gsksource.com/pharma/content/dam/GlaxoSmithKline/US/en/Prescribing_Information/Nucala/pdf/NUCALA-PI-PIL.PDF].
CINQAIR® (reslizumab): highlights of prescribing information [http://www.accessdata.fda.gov/drugsatfda_docs/label/2016/761033lbl.pdf].
DUPIXENT® (dupilumab): highlights of prescribing information [https://www.accessdata.fda.gov/drugsatfda_docs/label/2018/761055s007lbl.pdf].
Godar M, Blanchetot C, de Haard H, Lambrecht BN, Brusselle G. Personalized medicine with biologics for severe type 2 asthma: current status and future prospects. MAbs. 2018;10:34–45.
Breiman L. Random forests. Mach Learn. 2001;45:5–32.
Foster JC, Taylor JM, Ruberg SJ. Subgroup identification from randomized clinical trial data. Stat Med. 2011;30:2867–80.
Hastie T, Tibshirani R, Friedman J. Elements of statistical learning: data mining, inference, and prediction. 2nd ed. New York: Springer-Verlag; 2009.
Lipkovich I, Dmitrienko A, D’Agostino RB Sr. Tutorial in biostatistics: data-driven subgroup identification and analysis in clinical trials. Stat Med. 2017;36:136–96.
Jones HE, Ohlssen DI, Neuenschwander B, Racine A, Branson M. Bayesian models for subgroup analysis in clinical trials. Clin Trials. 2011;8:129–43.
Millen BA, Dmitrienko A, Song G. Bayesian assessment of the influence and interaction conditions in multipopulation tailoring clinical trials. J Biopharm Stat. 2014;24:94–109.
Modena BD, Tedrow JR, Milosevic J, Bleecker ER, Meyers DA, Wu W, Bar-Joseph Z, Erzurum SC, Gaston BM, Busse WW, et al. Gene expression in relation to exhaled nitric oxide identifies novel asthma phenotypes with unique biomolecular pathways. Am J Respir Crit Care Med. 2014;190:1363–72.
Saito J, Gibeon D, Macedo P, Menzies-Gow A, Bhavsar PK, Chung KF. Domiciliary diurnal variation of exhaled nitric oxide fraction for asthma control. Eur Respir J. 2014;43:474–84.
Horváth I, Barnes PJ, Loukides S, Sterk PJ, Hogman M, Olin AC, Amann A, Antus B, Baraldi E, Bikov A, et al. A European Respiratory Society technical standard: exhaled biomarkers in lung disease. Eur Respir J. 2017;49:1–26.
Hirano T, Matsunaga K, Sugiura H, Minakata Y, Koarai A, Akamatsu K, Ichikawa T, Furukawa K, Ichinose M. Persistent elevation of exhaled nitric oxide and modification of corticosteroid therapy in asthma. Respir Investig. 2013;51:84–91.
Alderton WK, Cooper CE, Knowles RG. Nitric oxide synthases: structure, function and inhibition. Biochem J. 2001;357:593–615.
Chibana K, Trudeau JB, Mustovich AT, Hu H, Zhao J, Balzar S, Chu HW, Wenzel SE. IL-13 induced increases in nitrite levels are primarily driven by increases in inducible nitric oxide synthase as compared with effects on arginases in human primary bronchial epithelial cells. Clin Exp Allergy. 2008;38:936–46.
Pavord ID, Korn S, Howarth P, Bleecker ER, Buhl R, Keene ON, Ortega H, Chanez P. Mepolizumab for severe eosinophilic asthma (DREAM): a multicentre, double-blind, placebo-controlled trial. Lancet. 2012;380:651–9.
Castro M, Wenzel SE, Bleecker ER, Pizzichini E, Kuna P, Busse WW, Gossage DL, Ward CK, Wu Y, Wang B, et al. Benralizumab, an anti-interleukin 5 receptor alpha monoclonal antibody, versus placebo for uncontrolled eosinophilic asthma: a phase 2b randomised dose-ranging study. Lancet Respir Med. 2014;2:879–90.
Hanania NA, Wenzel S, Rosen K, Hsieh HJ, Mosesova S, Choy DF, Lal P, Arron JR, Harris JM, Busse W. Exploring the effects of omalizumab in allergic asthma: an analysis of biomarkers in the EXTRA study. Am J Respir Crit Care Med. 2013;187:804–11.
Corren J, Parnes JR, Wang L, Mo M, Roseti SL, Griffiths JM, van der Merwe R. Tezepelumab in adults with uncontrolled asthma. N Engl J Med. 2017;377:936–46.
Wenzel S, Castro M, Corren J, Maspero J, Wang L, Zhang B, Pirozzi G, Sutherland ER, Evans RR, Joish VN, et al. Dupilumab efficacy and safety in adults with uncontrolled persistent asthma despite use of medium-to-high-dose inhaled corticosteroids plus a long-acting beta2 agonist: a randomised double-blind placebo-controlled pivotal phase 2b dose-ranging trial. Lancet. 2016;388:31–44.
Dweik RA, Sorkness RL, Wenzel S, Hammel J, Curran-Everett D, Comhair SA, Bleecker E, Busse W, Calhoun WJ, Castro M, et al. Use of exhaled nitric oxide measurement to identify a reactive, at-risk phenotype among patients with asthma. Am J Respir Crit Care Med. 2010;181:1033–41.
Conway SJ, Izuhara K, Kudo Y, Litvin J, Markwald R, Ouyang G, Arron JR, Holweg CT, Kudo A. The role of periostin in tissue remodeling across health and disease. Cell Mol Life Sci. 2014;71:1279–88.
Sidhu SS, Yuan S, Innes AL, Kerr S, Woodruff PG, Hou L, Muller SJ, Fahy JV. Roles of epithelial cell-derived periostin in TGF-beta activation, collagen production, and collagen gel elasticity in asthma. Proc Natl Acad Sci U S A. 2010;107:14170–5.
Blanchard C, Mingler MK, McBride M, Putnam PE, Collins MH, Chang G, Stringer K, Abonia JP, Molkentin JD, Rothenberg ME. Periostin facilitates eosinophil tissue infiltration in allergic lung and esophageal responses. Mucosal Immunol. 2008;1:289–96.
Yusuf S, Wittes J, Probstfield J, Tyroler HA. Analysis and interpretation of treatment effects in subgroups of patients in randomized clinical trials. JAMA. 1991;266:93–8.
Brookes ST, Whitely E, Egger M, Smith GD, Mulheran PA, Peters TJ. Subgroup analyses in randomized trials: risks of subgroup-specific analyses; power and sample size for the interaction test. J Clin Epidemiol. 2004;57:229–36.
The authors thank the healthcare providers, research staff, patients and caregivers who participated in STRATOS 1 and STRATOS 2. The authors would like to acknowledge Amy Yellen-Shaw, PhD (Audubon PM Associates, Inc. [Fort Washington, PA]), who assisted with development of the STRATOS 1 Biomarker Exploration Report, Bohdana Ratitch, PhD (IQVIA [Montreal, QC, Canada]), who conducted the simulations to evaluate SIDES for count data, both funded by AstraZeneca (Cambridge, UK), Fredrik Öhrn, PhD (AstraZeneca [Mölndal, Sweden]), who conducted the blinded scenario simulations, and Sally Hollis, MSc (AstraZeneca [Macclesfield, UK] at the time of study start and currently Phastar [Macclesfield, UK]), who developed the strategy for using two sequential trials to identify and then test a predictive biomarker. Medical writing support was provided by Sophie Walton, MSc (QXV Comms [Macclesfield, UK], an Ashfield Company, part of UDG Healthcare plc), funded by AstraZeneca (Cambridge, UK), in accordance with Good Publication Practice (GPP3) guidelines (http://www.ismpp.org/gpp3).
This study was sponsored by AstraZeneca. AstraZeneca was involved in the study design, analysis and interpretation of the data, and in the writing of this manuscript.
Ethics approval and consent to participate
STRATOS 1 and 2 were conducted in accordance with the Declaration of Helsinki and the International Conference on Harmonisation Guidance for Good Clinical Practice. Independent ethics committee approval of the protocols was obtained at all participating centres and all participants provided written informed consent. For those considered to be minors (as per local law), the participant’s legal guardian also provided written informed consent. The protocols for STRATOS 1 and 2 can be accessed at https://astrazenecagrouptrials.pharmacm.com/.
Consent for publication
Competing interests
MG, DJS, MH, KB, PW and GC are all employees of AstraZeneca, the sponsor of STRATOS 1 and STRATOS 2, and KB owns AstraZeneca shares. IL was an employee of IQVIA (formerly Quintiles) at the time of the study and provided consulting services per AstraZeneca’s contract with Quintiles; he is currently an employee of Eli Lilly.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figure S1. Relationship between biomarker values and number of exacerbations in the previous year in the STRATOS 1 all-comers population (full analysis set). Figure S2. Relationship between biomarker values and region in the STRATOS 1 all-comers population (full analysis set). Figure S3. Relationship between biomarker values and age categories in the STRATOS 1 all-comers population (full analysis set). (DOCX 1067 kb)
Parameter modifications applied to the SIDES algorithm. (DOCX 17 kb)
Cite this article
Gottlow, M., Svensson, D.J., Lipkovich, I. et al. Application of structured statistical analyses to identify a biomarker predictive of enhanced tralokinumab efficacy in phase III clinical trials for severe, uncontrolled asthma. BMC Pulm Med 19, 129 (2019). https://doi.org/10.1186/s12890-019-0889-4
- Predictive biomarker
- SIDES (subgroup identification based on differential effect search)
- STRATOS 1
- STRATOS 2
- Subgroup identification
- Tralokinumab