Gene expression profiling of human breast tissue samples using SAGE-Seq

Genome Res. 2010 Dec;20(12):1730-9. doi: 10.1101/gr.108217.110. Epub 2010 Nov 2.

Abstract

We present a powerful application of ultra high-throughput sequencing, SAGE-Seq, for the accurate quantification of normal and neoplastic mammary epithelial cell transcriptomes. We develop data analysis pipelines that allow the mapping of sense and antisense strands of mitochondrial and RefSeq genes, the normalization between libraries, and the identification of differentially expressed genes. We find that the diversity of cancer transcriptomes is significantly higher than that of normal cells. Our analysis indicates that transcript discovery plateaus at 10 million reads/sample, and suggests a minimum desired sequencing depth around five million reads. Comparison of SAGE-Seq and traditional SAGE on normal and cancerous breast tissues reveals higher sensitivity of SAGE-Seq to detect less-abundant genes, including those encoding for known breast cancer-related transcription factors and G protein-coupled receptors (GPCRs). SAGE-Seq is able to identify genes and pathways abnormally activated in breast cancer that traditional SAGE failed to call. SAGE-Seq is a powerful method for the identification of biomarkers and therapeutic targets in human disease.

Publication types

  • Comparative Study
  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Analysis of Variance
  • Base Sequence
  • Bayes Theorem
  • Breast / cytology*
  • Breast Neoplasms / metabolism*
  • Epithelial Cells / metabolism*
  • Female
  • Gene Expression Profiling / methods*
  • Gene Library
  • Humans
  • Molecular Sequence Data
  • Sensitivity and Specificity
  • Sequence Analysis, DNA / methods*

Associated data

  • GEO/GSE24491