Predictive modeling of treatment resistant depression using data from STAR*D and an independent clinical study

Zhi Nie; Srinivasan Vairavan; Vaibhav A Narayan; Jieping Ye; Qingqin S Li

doi:10.1371/journal.pone.0197268

Predictive modeling of treatment resistant depression using data from STAR*D and an independent clinical study

PLoS One. 2018 Jun 7;13(6):e0197268. doi: 10.1371/journal.pone.0197268. eCollection 2018.

Authors

Zhi Nie^{1

2}, Srinivasan Vairavan^{3

4}, Vaibhav A Narayan^{3

4}, Jieping Ye^{1

2}, Qingqin S Li^{3

4}

Affiliations

¹ Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, United States of America.
² Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, MI, United States of America.
³ Neuroscience Therapeutic Area, Janssen Research & Development, LLC, Pennington, NJ, United States of America.
⁴ Research Information Technology, Janssen Research & Development, LLC, Pennington, NJ, United States of America.

Abstract

Identification of risk factors of treatment resistance may be useful to guide treatment selection, avoid inefficient trial-and-error, and improve major depressive disorder (MDD) care. We extended the work in predictive modeling of treatment resistant depression (TRD) via partition of the data from the Sequenced Treatment Alternatives to Relieve Depression (STAR*D) cohort into a training and a testing dataset. We also included data from a small yet completely independent cohort RIS-INT-93 as an external test dataset. We used features from enrollment and level 1 treatment (up to week 2 response only) of STAR*D to explore the feature space comprehensively and applied machine learning methods to model TRD outcome at level 2. For TRD defined using QIDS-C16 remission criteria, multiple machine learning models were internally cross-validated in the STAR*D training dataset and externally validated in both the STAR*D testing dataset and RIS-INT-93 independent dataset with an area under the receiver operating characteristic curve (AUC) of 0.70-0.78 and 0.72-0.77, respectively. The upper bound for the AUC achievable with the full set of features could be as high as 0.78 in the STAR*D testing dataset. Model developed using top 30 features identified using feature selection technique (k-means clustering followed by χ2 test) achieved an AUC of 0.77 in the STAR*D testing dataset. In addition, the model developed using overlapping features between STAR*D and RIS-INT-93, achieved an AUC of > 0.70 in both the STAR*D testing and RIS-INT-93 datasets. Among all the features explored in STAR*D and RIS-INT-93 datasets, the most important feature was early or initial treatment response or symptom severity at week 2. These results indicate that prediction of TRD prior to undergoing a second round of antidepressant treatment could be feasible even in the absence of biomarker data.

Trial registration: ClinicalTrials.gov NCT00021528.

Publication types

Clinical Trial
Multicenter Study
Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't

MeSH terms

Adult
Antidepressive Agents / administration & dosage*
Databases, Factual*
Depressive Disorder, Major* / drug therapy
Depressive Disorder, Major* / physiopathology
Drug Resistance*
Female
Humans
Machine Learning*
Male
Models, Biological
Predictive Value of Tests
Risk Factors

Substances

Antidepressive Agents

Associated data

ClinicalTrials.gov/NCT00021528

Grants and funding

N01MH90003/MH/NIMH NIH HHS/United States