Investigating non-inferiority or equivalence in time-to-event data under non-proportional hazards

Kathrin Möllenhoff; Achim Tresch

doi:10.1007/s10985-023-09589-5

Investigating non-inferiority or equivalence in time-to-event data under non-proportional hazards

Lifetime Data Anal. 2023 Jul;29(3):483-507. doi: 10.1007/s10985-023-09589-5. Epub 2023 Jan 28.

Authors

Kathrin Möllenhoff¹, Achim Tresch^{2

3

4}

Affiliations

¹ Mathematical Institute, Heinrich Heine University, 40225, Düsseldorf, Germany. kathrin.moellenhoff@hhu.de.
² Institute of Medical Statistics and Computational Biology, Faculty of Medicine, University of Cologne, Cologne, Germany.
³ CEDAD, University of Cologne, Cologne, Germany.
⁴ Center for Data and Simulation Science, University of Cologne, Cologne, Germany.

Abstract

The classical approach to analyze time-to-event data, e.g. in clinical trials, is to fit Kaplan-Meier curves yielding the treatment effect as the hazard ratio between treatment groups. Afterwards, a log-rank test is commonly performed to investigate whether there is a difference in survival or, depending on additional covariates, a Cox proportional hazard model is used. However, in numerous trials these approaches fail due to the presence of non-proportional hazards, resulting in difficulties of interpreting the hazard ratio and a loss of power. When considering equivalence or non-inferiority trials, the commonly performed log-rank based tests are similarly affected by a violation of this assumption. Here we propose a parametric framework to assess equivalence or non-inferiority for survival data. We derive pointwise confidence bands for both, the hazard ratio and the difference of the survival curves. Further we propose a test procedure addressing non-inferiority and equivalence by directly comparing the survival functions at certain time points or over an entire range of time. Once the model's suitability is proven the method provides a noticeable power benefit, irrespectively of the shape of the hazard ratio. On the other hand, model selection should be carried out carefully as misspecification may cause type I error inflation in some situations. We investigate the robustness and demonstrate the advantages and disadvantages of the proposed methods by means of a simulation study. Finally, we demonstrate the validity of the methods by a clinical trial example.

Keywords: Equivalence; Non-inferiority; Non-proportional hazards; Survival analysis; Time-to-event data.

MeSH terms

Computer Simulation
Humans
Proportional Hazards Models
Research Design*
Sample Size
Survival Analysis
Time Factors