An Evaluation of an Algorithm for the Selection of Flexible Survival Models for Cancer Immunotherapies: Pass or Fail?

Nicholas R Latimer; Kurt Taylor; Anthony J Hatswell; Sophia Ho; Gabriel Okorogheye; Clara Chen; Inkyu Kim; John Borrill; David Bertwistle

doi:10.1007/s40273-024-01429-0

An Evaluation of an Algorithm for the Selection of Flexible Survival Models for Cancer Immunotherapies: Pass or Fail?

Pharmacoeconomics. 2024 Sep 20. doi: 10.1007/s40273-024-01429-0. Online ahead of print.

Authors

Nicholas R Latimer^{1

2}, Kurt Taylor³, Anthony J Hatswell^{3

4}, Sophia Ho⁵, Gabriel Okorogheye⁵, Clara Chen⁶, Inkyu Kim⁶, John Borrill⁵, David Bertwistle⁵

Affiliations

¹ Delta Hat Limited, Bramley House, Bramley Road, Nottingham, NG10 3SX, UK. nlatimer@deltahat.com.
² University of Sheffield, Sheffield, UK. nlatimer@deltahat.com.
³ Delta Hat Limited, Bramley House, Bramley Road, Nottingham, NG10 3SX, UK.
⁴ Department of Statistical Science, University College London, London, UK.
⁵ Bristol Myers Squibb, Uxbridge, London, UK.
⁶ Bristol Myers Squibb, Lawrenceville, NJ, USA.

PMID: 39302594
DOI: 10.1007/s40273-024-01429-0

Abstract

Background and objective: Accurately extrapolating survival beyond trial follow-up is essential in a health technology assessment where model choice often substantially impacts estimates of clinical and cost effectiveness. Evidence suggests standard parametric models often provide poor fits to long-term data from immuno-oncology trials. Palmer et al. developed an algorithm to aid the selection of more flexible survival models for these interventions. We assess the usability of the algorithm, identify areas for improvement and evaluate whether it effectively identifies models capable of accurate extrapolation.

Methods: We applied the Palmer algorithm to the CheckMate-649 trial, which investigated nivolumab plus chemotherapy versus chemotherapy alone in patients with gastroesophageal adenocarcinoma. We evaluated the algorithm's performance by comparing survival estimates from identified models using the 12-month data cut to survival observed in the 48-month data cut.

Results: The Palmer algorithm offers a systematic procedure for model selection, encouraging detailed analyses and ensuring that crucial stages in the selection process are not overlooked. In our study, a range of models were identified as potentially appropriate for extrapolating survival, but only flexible parametric non-mixture cure models provided extrapolations that were plausible and accurately predicted subsequently observed survival. The algorithm could be improved with minor additions around the specification of hazard plots and setting out plausibility criteria.

Conclusions: The Palmer algorithm provides a systematic framework for identifying suitable survival models, and for defining plausibility criteria for extrapolation validity. Using the algorithm ensures that model selection is based on explicit justification and evidence, which could reduce discordance in health technology appraisals.