Sunday, September 15, 2013

‘As treated’ versus ‘As randomized’ analysis

When we design a non-inferiority trial to support the product registration, the regulatory agencies may suggest the statistical analysis using ‘as treated’ population. In FDA’s Guidance for Industry Non-Inferiority Clinical Trials, it suggested the ‘as-treated’ analysis for primary efficacy endpoint:
“Intent-to-treat (ITT) analyses in superiority trials are nonetheless preferred because they protect against the kinds of bias that might be associated with early departure from the study. In non-inferiority trials, many kinds of problems fatal to a superiority trial, such as non-adherence, misclassification of the primary endpoint, or measurement problems more generally (i.e., “noise”), or many dropouts who must be assessed as part of the treated group, can bias toward no treatment difference (success) and undermine the validity of the trial, creating apparent non-inferiority where it did not really exist. Although an “as-treated” analysis is therefore often suggested as the primary analysis for NI studies, there are also significant concerns with the possibility of informative censoring in an as-treated analysis. It is therefore important to conduct both ITT and as-treated analyses in NI studies. Differences in results using the two analyses will need close examination.”
The term “as treated” means that when we do analysis/summaries, the treatment assignment is based on the actual treatment the patients receive, not the treatment the patients are supposed to receive. The concept of “as treated” should be explained in comparison with “as randomized” – a key component for the concept of “Intention-to-Treat” principle.

Typically, ‘Intention-to-Treat’ population can be simply defined as all patients who are randomized. However, the formal definition of the “Intention-to-Treat” population contains several other concepts:

The formal definition of the "Intention to Treat" is usually referred to the one suggested by Fisher, LD et al. in “Intention to treat in clinical trials in Statistical Issues in Drug Research and Development. Edited by Peace KE (1990)”. Intention to Treat population “Includes all randomized patients in the groups to which they were randomly assigned, regardless of their adherence with the entry criteria, regardless of the treatment they actually received, and regardless of subsequent withdrawal from treatment or deviation from the protocol”.

In other words, ITT analysis includes every subject who is randomized according to randomized treatment assignment. It ignores noncompliance, protocol deviations, withdrawal, and anything that happens after randomization. ITT analysis is usually described as “once randomized, always analyzed”. ITT analysis is referred as ‘as randomized’ - the opposite of the term ‘as treated’.

In majority of cases, if all randomized subjects receive their allocated treatments and if there is no randomization error, the ‘as treated’ analysis will be the same as 'as randomized' analysis.

If some randomized subjects do not receive the randomly alloccated treatment and if there are randomization errors, ‘as treated’ population will be different from ITT population or 'as randomized' population.

The example below can be used to illustrate the difference between the ‘as treated’ and ‘as randomized’. This is an open label, randomized study to compare the treatment effect of r-ProUK (intra-artery prourokinase + heparin) with control (heparin alone) on acute ischemia stroke - PROACT II Study. Eligible subjects were randomized to r-proUK or Control in 2:1 ratio. 180 subjects were randomized (121 subjects to r-proUK group and 59 to Control group). The problem was that in subjects who were randomized to r-proUK group, 13 subjects never received the treatment and in subjects who were randomized to control group, 5 subjects received the wrong treatment (they were supposed to receive control treatment, but instead received r-proUK). 


In this case, to strictly follow the ITT principle, the ITT analysis will include all 121 subjects in r-proUK group and all 59 subjects in Control group - purely based on the randomization, not based on whether or not the randomized subject receive the actual treatment or receive the wrong treatment. The ITT analysis was indeed used in the paper. However, since the PROACT II study was a proof of concept study, the analysis based on 'as treated' population should be used as well. 

With 'as treated' analysis, r-proUK group would have 121 subjects and Control group would have 54 subjects (see the comparison and the calculation below). 

The Number of Subjects for Analysis

r-proUK
Control
As Randomized
121
59
As Treated
113
(121 randomized subjects - 13 subjects who did not receive treatment + 5 subjects who were randomized to Control group, but received r-proUK)
54
(59 randomized subjects – 5 subjects who were randomized to Control, but received r-proUK)

Typically, the safety analysis should always be based on the 'as treated' population since it really reflects the safety of patients under each treatment.

Monday, September 02, 2013

Fixed covariate, time dependent covariate, time varying covariate, and post-randomization covariate

A covariate is a variable that is possibly predictive of the outcome under study. A covariate may be of direct interest or it may be a confounding or interacting variable. Typically, when we use the term of ‘covariate’, we refer to the fixed covariate – a variable whose value will not change over the time. The fixed covariates include the demographic information (gender, age, race, …) and baseline characteristics (weight, height, baseline measure,…). Since ‘covariate’ typically means ‘fixed covariate’, we often drop the term ‘fixed’.

In contrary to the fixed covariate, time depedent covariate or time varying covariate (exact the same thing) refers to a covariate that is not necessarily constant through the whole study and has different values at different time points.  

There are many examples of the time dependent or time varying covariate in clinical trials or observational studies. For instance, if one wishes to examine the link between area of residence and cancer, this would be complicated by the fact that study subjects move from one area to another. The area of residency could then be introduced in the statistical model as a time-varying covariate.

When we analyze the CT lung densitometry data, we need to deal with the measurement of the lung volume – a reflection of the inspiration level. The CT densitometry measure is negatively correlated with the effort of the inspiration level. If we inhale more air into the lung, the lung density measure will be smaller and vice versa. In order for quantitative CT lung density to be valid endpoint, the corresponding measure of the lung volume (inspiration level) needs to considered and adjusted. If the outcome measure of CT lung density is measured at different times, there will also be corresponding measures of lung volumes at various times. The lung volume measure is considered as a time dependent covariate.

In clinical trials, the timing of the randomization is a critical point. The covariate can be separated into two groups: covariate measured or existed prior to the randomization and covariate measured after the randomization. The fixed covariates are measured or known prior to the randomization. The covariate measured after the randomization is called ‘post-randomization covariate’. The post-randomization covariate is usually also the time-dependent covariate.

When the time dependent covariate exists, we may be very tempted to include the covariate in the statistical analysis model. However, such approach could be biased if the time dependent covariate involves measures post-randomization.

A paper by Chen et al ‘A Note on Postrandomization Adjustment of Covariates” explained the issue:
Examples include adjusting rescue therapies to explore the underlying difference of treatments which would have been observed in the absence of rescue medication, adjusting the level of patient compliance to estimate the treatment effect at the same compliance level, adjusting a postrandomization biomarker to evaluate the surrogating status of the biomarker to the clinical endpoint, and adjusting an intermediate endpoint or an endpoint other than the primary one to explore the therapeutic mechanism of one intervention.”
In its “ Guideline on adjustment for baseline covariates”, EMA explicitly opposes the use of adjustment for post-baseline covariates. In Section 4.2.5 (covariates affected by the treatment allocation), it states:
“A covariate that may be affected by the treatment allocation (for example, a covariate measured after randomisation such as duration of treatment, level of compliance or use of rescue medication) should not normally be included in the primary analysis of a confirmatory trial. When a covariate is affected by the treatment either through direct causation or through association with another factor, the adjustment may hide or exaggerate the treatment effect. It therefore makes the treatment effect difficult to interpret. However, such covariates (e.g. duration of treatment) might be included in secondary (exploratory) analyses and might offer the sponsor useful insights during the drug development process. Alternatively, subgroup analyses might offer similar insights.”
 The main concern for using the post-randomization covariates is the potential effect of treatment on the covariate measures. The covariates could become outcome measures or dependent variables instead of the independent variables. The post-randomization covariate measures may also be biased if there is inadequate blinding of the treatment allocation. In reality, it is not possible to defend that a post-randomization covariate is really a covariate and not impacted by the treatment. It is also difficult to confirm whether or not if the treatment effect of the experimental drug is mediated through the post-randomization or time-dependent covariates.

FDA does not have any guidance similar to EMA’s guidance on the use of post-randomization covariates. However, it is very rare to see the use of post-randomization covariates in the analysis of the primary efficacy endpoint in NDA or BLA submissions. There may be cases where post-randomization covariates are used in exploratory or ad-hoc analyses or used in analyses for non-randomized, observational studies.  

In sNDA ODAC Briefing document for Proscar, post hoc analysis was performed to include the post-randomization covariates. However, the briefing documents also commented “Adjustment for post-randomization variables (cores and volume) that are affected by treatment can introduce confounding bias and complicate causal interpretation.”

It has been noted that the time-dependent covaraites are more commonly used in survival analysis. See paper by Dr Lin “TIME-DEPENDENT COVARIATES IN THE COX PROPORTIONAL-HAZARDS REGRESSION MODEL


Dealing with the time-dependent variable is a challenging statistical issue. The fundamental statistical problem is how to properly adjust for post-randomization variables, since, if the treatment has an effect on post-randomization variables, standard adjustment by regression modelling is susceptible to selection bias. The current proposal is to utilize Causal-Effects approach.  Dai et al had a pretty good introduction on this topic in their paper “Partially hidden Markov model for time-varying principal stratification in HIV prevention trials“. For details about the concept of Causal-Effects, the report of ”From Neurons to Neighborhoods: The Science of Early Childhood Development” by National Academy Press is a pretty good resource.