Microbiome Data Enhances Predictive Models of Lung Function in People With Cystic Fibrosis

J Infect Dis. 2021 Jun 16;223(12 Suppl 2):S246-S256. doi: 10.1093/infdis/jiaa655.

Abstract

Background: Microbiome sequencing has brought increasing attention to the polymicrobial context of chronic infections. However, clinical microbiology continues to focus on canonical human pathogens, which may overlook informative, but nonpathogenic, biomarkers. We address this disconnect in lung infections in people with cystic fibrosis (CF).

Methods: We collected health information (lung function, age, and body mass index [BMI]) and sputum samples from a cohort of 77 children and adults with CF. Samples were collected during a period of clinical stability and 16S rDNA sequenced for airway microbiome compositions. We use ElasticNet regularization to train linear models predicting lung function and extract the most informative features.

Results: Models trained on whole-microbiome quantitation outperformed models trained on pathogen quantitation alone, with or without the inclusion of patient metadata. Our most accurate models retained key pathogens as negative predictors (Pseudomonas, Achromobacter) along with established correlates of CF disease state (age, BMI, CF-related diabetes). In addition, our models selected nonpathogen taxa (Fusobacterium, Rothia) as positive predictors of lung health.

Conclusions: These results support a reconsideration of clinical microbiology pipelines to ensure the provision of informative data to guide clinical practice.

Keywords: cystic fibrosis; machine learning; microbiome.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Adolescent
  • Adult
  • Bacteria / classification
  • Bacteria / genetics
  • Bacteria / isolation & purification
  • Cystic Fibrosis / microbiology*
  • Cystic Fibrosis / physiopathology*
  • Humans
  • Lung / microbiology
  • Lung / physiology
  • Male
  • Microbiota*
  • Middle Aged
  • Models, Biological
  • Predictive Value of Tests
  • RNA, Ribosomal, 16S / genetics
  • Sputum / microbiology
  • Young Adult

Substances

  • RNA, Ribosomal, 16S