Most B cell precursor acute lymphoblastic leukemia (BCP ALL) can be classified into known major genetic subtypes, while a substantial proportion of BCP ALL remains poorly characterized in relation to its underlying genomic abnormalities. We therefore initiated a large-scale international study to reanalyze and delineate the transcriptome landscape of 1,223 BCP ALL cases using RNA sequencing. Fourteen BCP ALL gene expression subgroups (G1 to G14) were identified. Apart from extending eight previously described subgroups (G1 to G8 associated with MEF2D fusions, TCF3-PBX1 fusions, ETV6-RUNX1-positive/ETV6-RUNX1-like, DUX4 fusions, ZNF384 fusions, BCR-ABL1/Ph-like, high hyperdiploidy, and KMT2A fusions), we defined six additional gene expression subgroups: G9 was associated with both PAX5 and CRLF2 fusions; G10 and G11 with mutations in PAX5 (p.P80R) and IKZF1 (p.N159Y), respectively; G12 with IGH-CEBPE fusion and mutations in ZEB2 (p.H1038R); and G13 and G14 with TCF3/4-HLF and NUTM1 fusions, respectively. In pediatric BCP ALL, subgroups G2 to G5 and G7 (51 to 65/67 chromosomes) were associated with low-risk, G7 (with ≤50 chromosomes) and G9 were intermediate-risk, whereas G1, G6, and G8 were defined as high-risk subgroups. In adult BCP ALL, G1, G2, G6, and G8 were associated with high risk, while G4, G5, and G7 had relatively favorable outcomes. This large-scale transcriptome sequence analysis of BCP ALL revealed distinct molecular subgroups that reflect discrete pathways of BCP ALL, informing disease classification and prognostic stratification. The combined results strongly advocate that RNA sequencing be introduced into the clinical diagnostic workup of BCP ALL.
Keywords: BCP ALL; RNA-seq; gene fusion; gene mutation; subtypes.