Tuesday, August 01, 2023

Mediation Analysis vs. Landmark Analysis for Clinical Trial Data

Mediation analysis and Landmark analysis are two valuable statistical tools in different contexts, providing insights into different aspects of data analysis. Both methods can be utilized to investigate if a surrogate endpoint or short-term measure can predict the long-term clinical endpoint or to investigate if short-term measures can be mediators for the long-term clinical endpoint. Both mediation analysis and landmark analysis are useful tools for post-hoc, exploratory analyses, but not the primary analysis method for the primary efficacy endpoint. 

Mediation analysis is a statistical method used to explore and understand the mechanism or process through which an independent variable influences a dependent variable. It helps to determine whether the effect of an independent variable on a dependent variable is mediated (i.e., transmitted through) one or more intermediate variables, often referred to as mediators.

In mediation analysis, the main focus is on understanding the causal chain of relationships between variables. It involves three key components:
  • Independent variable (X): The variable that is hypothesized to influence the dependent variable directly or indirectly through one or more mediators.
  • Mediator (M): The variable(s) that mediates the relationship between the independent variable and the dependent variable. It explains how and why the independent variable affects the dependent variable.
  • Dependent variable (Y): The variable that is influenced by the independent variable either directly or indirectly through the mediators.
The analysis aims to estimate the direct effect of the independent variable on the dependent variable and the indirect effect mediated through the mediator(s). Several statistical techniques can be used for mediation analysis, such as regression-based methods (e.g., ordinary least squares regression) or more advanced methods like structural equation modeling.

In our published paper "Contemporary risk scores predict clinicalworsening in pulmonary arterial hypertension - Ananalysis of FREEDOM-EV",  mediation analysis was used to test the hypothesis that improvements in risk score (a surrogate endpoint) contributed to reduced likelihood for clinical worsening (a long-term clinical endpoint). 
 
In a paper by Blette et al, "Is low-risk status a surrogate outcome in pulmonary arterialhypertension? An analysis of three randomised trials ", the mediation analysis was used to investigate surrogacy. The author stated:
"we performed a mediation analysis considering each candidate surrogate as an intermediate outcome. To avoid inconsistent mediation (ie, when direct and indirect effects cancel each other out, the direct effect is even larger than the total effect, or other situations that can result in a negative proportion mediated), we first empirically tested four criteria to justify the mediation analysis. Next, Cox proportional hazards models were fit for the clinical worsening and survival outcomes conditional on each candidate surrogate separately, as well as treatment, corresponding baseline risk score, and a fixed effect variable with three levels for trial membership (to allow for similarity within each trial). Results from these models were combined with parallel models that did not condition on the candidate surrogates, but otherwise conditioned on the same set of variables to perform the difference method for mediation, estimating the total effect and direct effects of treatment on clinical worsening and survival, as well as the indirect effects through each candidate surrogate. The proportion of the effect mediated through each surrogate risk score was estimated, along with 95% CIs via a bootstrap procedure."
In a previous post, the mediation analysis was discussed. "Mediation analysis and SAS CAUSALMED procedure"

Landmark analysis, also known as landmark survival analysis, is a statistical method commonly used in survival analysis to investigate the impact of time-dependent variables on the occurrence of an event of interest. It allows for the assessment of time-varying effects in longitudinal studies or clinical trials where the values of variables may change over time.

In landmark analysis, the follow-up time is divided into predefined intervals or "landmarks," and the analysis is performed separately for each landmark. The key steps in landmark analysis are as follows:

  • Define landmarks: Choose specific time points of interest during the follow-up period.
  • Create landmark cohorts: At each landmark, divide the study population into subgroups based on the status of the time-dependent variable(s) of interest.
  • Analyze survival outcomes: Estimate survival probabilities or hazard rates for each subgroup defined by the landmark cohorts.
  • Compare survival outcomes: Compare the survival outcomes between the different subgroups to assess the impact of the time-dependent variable(s) on the event occurrence.

Landmark analysis allows researchers to capture time-varying effects and observe how the relationship between variables changes over the course of a study. It is particularly useful when analyzing data with time-dependent covariates, treatment interventions, or changes in exposure levels over time.

In a paper by McLaughlin et al "Pulmonary Arterial Hypertension-Related Morbidity Is Prognostic for Mortality", the landmark analysis was used to assess the impact of morbidity events on the risk of subsequent mortality.

In a paper by Eisenstein et al "Clopidogrel use and long-term clinical outcomes after drug-eluting stent implantation" , landmark analyses were performed to explore the association of extended clopidogrel use and long-term clinical outcomes of patients receiving drug eluting stents (DES) and bare-metal stents (BMS) for treatment of coronary artery disease.

The tutorial paper "Landmark Analysis at the 25-Year Landmark Point" by Dr Dafni is a good reference for Landmark analysis. 

Mediation analysis and Landmark analysis differ in their goals, focus, and types of data they analyze:

Goals: Mediation analysis aims to understand the mechanism or process through which an independent variable influences a dependent variable, exploring direct and indirect effects. Landmark analysis, on the other hand, focuses on assessing time-varying effects and understanding how the occurrence of an event is influenced by time-dependent variables.

Focus: Mediation analysis emphasizes identifying mediators that explain the relationship between an independent variable and a dependent variable. Landmark analysis focuses on investigating the impact of time-dependent variables on survival outcomes or event occurrences.


Data: Mediation analysis typically requires cross-sectional or longitudinal data with variables measured at different time points. It is applicable to both continuous and categorical variables. Landmark analysis is commonly used in survival analysis, analyzing time-to-event data, and requires longitudinal data with time-dependent variables.

Both mediation analysis and landmark analysis are valuable statistical tools in different contexts, providing insights into different aspects of data analysis. Here is a table to compare the mediation analysis and the landmark analysis generated by ChatGPT, but I added the last row for implementing the mediation and landmark analyses. 

Aspect

Mediation Analysis

Landmark Analysis

Statistical Method

Investigates relationships and effects between variables

Analyzes time-varying effects and event occurrences

Time Dependency

Considers temporal aspect of data

Accounts for time-varying effects and changing values over time

Causal Inference

Aims to understand causal chain of relationships

Examines impact of time-dependent variables on event outcomes

Focus

Identifying mediators that explain relationships

Assessing time-dependent variables and event occurrences

Variables

Independent, mediator, and dependent variables

Time-dependent variables influencing event occurrences

Data Type

Cross-sectional or longitudinal with measured variables

Longitudinal data with time-dependent variables

Analytical Steps

Estimating direct and indirect effects using regression

Dividing follow-up into landmarks, comparing survival outcomes

Research Questions

Mechanisms and processes of variable influence

Impact of time-dependent variables on events or survival

Implementation




In SAS, Procedure CASUALMED allows you to estimate direct and indirect effects using different mediation models. It supports various regression-based mediation approaches, including Sobel, bootstrapping, and Bayesian estimation.

In R, to conduct mediation analysis, the most commonly used package is "mediation." This package provides a comprehensive set of functions to estimate direct and indirect effects in mediation models. It supports various mediation methods, including the causal steps approach, bootstrapping, and structural equation modeling (SEM).

In SAS, Procedures LIFETEST, PHREG, and LIFEREG can all be used. You can divide the follow-up time into landmark intervals and analyze survival outcomes for different subgroups defined by the landmarks

In R, you can utilize the "survival" package, which is widely used for survival analysis. The "survival" package provides functions to handle time-to-event data, perform survival analysis, and estimate survival probabilities. You can divide the follow-up time into landmarks and analyze survival outcomes for different landmark cohorts.


Personally, I prefer the mediation analysis to the landmark analysis. In the landmark analysis, the subjects who did not reach the landmark timepoint were excluded from the analysis and the analyses are performed on a subset of the overall population - sort of principle stratum. The endpoint measure at the landmark timepoint and whether or not the subjects reach the landmark timepoint itself is meaningful. Excluding it from the analysis is against the intention-to-treatment principle and may cause biases. 

No comments: