High-dimensional genomic feature selection with the ordered stereotype logit model

Brief Bioinform. 2022 Nov 19;23(6):bbac414. doi: 10.1093/bib/bbac414.

Abstract

For many high-dimensional genomic and epigenomic datasets, the outcome of interest is ordinal. While these ordinal outcomes are often thought of as the observed cutpoints of some latent continuous variable, some ordinal outcomes are truly discrete and are comprised of the subjective combination of several factors. The nonlinear stereotype logistic model, which does not assume proportional odds, was developed for these 'assessed' ordinal variables. It has previously been extended to the frequentist high-dimensional feature selection setting, but the Bayesian framework provides some distinct advantages in terms of simultaneous uncertainty quantification and variable selection. Here, we review the stereotype model and Bayesian variable selection methods and demonstrate how to combine them to select genomic features associated with discrete ordinal outcomes. We compared the Bayesian and frequentist methods in terms of variable selection performance. We additionally applied the Bayesian stereotype method to an acute myeloid leukemia RNA-sequencing dataset to further demonstrate its variable selection abilities by identifying features associated with the European LeukemiaNet prognostic risk score.

Trial registration: ClinicalTrials.gov NCT00003190 NCT00085124 NCT00651261 NCT00006363 NCT00742625 NCT00416598 NCT01253070 NCT01420926.

Keywords: acute myeloid leukemia; hierarchical model; ordinal response; variable selection.

Publication types

  • Review
  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bayes Theorem
  • Genomics*
  • Logistic Models
  • Risk Factors

Associated data

  • ClinicalTrials.gov/NCT00003190
  • ClinicalTrials.gov/NCT00085124
  • ClinicalTrials.gov/NCT00651261
  • ClinicalTrials.gov/NCT00006363
  • ClinicalTrials.gov/NCT00742625
  • ClinicalTrials.gov/NCT00416598
  • ClinicalTrials.gov/NCT01253070
  • ClinicalTrials.gov/NCT01420926