Complex rearrangements and oncogene amplifications revealed by long-read DNA and RNA sequencing of a breast cancer cell line

Genome Res. 2018 Aug;28(8):1126-1135. doi: 10.1101/gr.231100.117. Epub 2018 Jun 28.

Abstract

The SK-BR-3 cell line is one of the most important models for HER2+ breast cancers, which affect one in five breast cancer patients. SK-BR-3 is known to be highly rearranged, although much of the variation is in complex and repetitive regions that may be underreported. Addressing this, we sequenced SK-BR-3 using long-read single molecule sequencing from Pacific Biosciences and develop one of the most detailed maps of structural variations (SVs) in a cancer genome available, with nearly 20,000 variants present, most of which were missed by short-read sequencing. Surrounding the important ERBB2 oncogene (also known as HER2), we discover a complex sequence of nested duplications and translocations, suggesting a punctuated progression. Full-length transcriptome sequencing further revealed several novel gene fusions within the nested genomic variants. Combining long-read genome and transcriptome sequencing enables an in-depth analysis of how SVs disrupt the genome and sheds new light on the complex mechanisms involved in cancer genome evolution.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Breast Neoplasms / genetics*
  • Breast Neoplasms / pathology
  • Female
  • Gene Amplification / genetics*
  • Gene Rearrangement / genetics*
  • Genome, Human
  • Genomic Structural Variation
  • High-Throughput Nucleotide Sequencing
  • Humans
  • MCF-7 Cells
  • Oncogenes / genetics*
  • Receptor, ErbB-2 / genetics
  • Repetitive Sequences, Nucleic Acid / genetics
  • Transcriptome / genetics

Substances

  • ERBB2 protein, human
  • Receptor, ErbB-2