Leveraging non-targeted metabolite profiling via statistical genomics

PLoS One. 2013;8(2):e57667. doi: 10.1371/journal.pone.0057667. Epub 2013 Feb 28.

Abstract

One of the challenges of systems biology is to integrate multiple sources of data in order to build a cohesive view of the system of study. Here we describe the mass spectrometry based profiling of maize kernels, a model system for genomic studies and a cornerstone of the agroeconomy. Using a network analysis, we can include 97.5% of the 8,710 features detected from 210 varieties into a single framework. More conservatively, 47.1% of compounds detected can be organized into a network with 48 distinct modules. Eigenvalues were calculated for each module and then used as inputs for genome-wide association studies. Nineteen modules returned significant results, illustrating the genetic control of biochemical networks within the maize kernel. Our approach leverages the correlations between the genome and metabolome to mutually enhance their annotation and thus enable biological interpretation. This method is applicable to any organism with sufficient bioinformatic resources.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Genome-Wide Association Study
  • Genomics / methods*
  • Linear Models
  • Mass Spectrometry
  • Metabolomics / methods*
  • Molecular Sequence Annotation
  • Phenotype
  • Plant Extracts / genetics
  • Polymorphism, Single Nucleotide
  • Zea mays / genetics

Substances

  • Plant Extracts

Grants and funding

Funding was provided to IRB and OAH by the US Department of Agriculture’s Agricultural Research Service and by the National Science Foundation (IOS #1126950). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.