The value of primary transcripts to the clinical and non-clinical genomics community: Survey results and roadmap for improvements

Joannella Morales; Aoife C McMahon; Jane Loveland; Emily Perry; Adam Frankish; Sarah Hunt; Irina M Armean; Paul Flicek; Fiona Cunningham

doi:10.1002/mgg3.1786

The value of primary transcripts to the clinical and non-clinical genomics community: Survey results and roadmap for improvements

Mol Genet Genomic Med. 2021 Dec;9(12):e1786. doi: 10.1002/mgg3.1786. Epub 2021 Aug 26.

Authors

Joannella Morales¹, Aoife C McMahon¹, Jane Loveland¹, Emily Perry¹, Adam Frankish¹, Sarah Hunt¹, Irina M Armean¹, Paul Flicek¹, Fiona Cunningham¹

Affiliation

¹ European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom.

Abstract

Background: Variant interpretation is dependent on transcript annotation and remains time consuming and challenging. There are major obstacles for historical data reuse and for interpretation of new variants. First, both RefSeq and Ensembl/GENCODE produce transcript sets in common use, but there is currently no easy way to translate between the two. Second, the resources often used for variant interpretation (e.g. ClinVar, gnomAD, UniProt) do not use the same transcript set, nor default transcript or protein sequence.

Method: Ensembl ran a survey in 2018 to sample attitudes to choosing one default transcript per locus, and to gather data on reference sequences used by the scientific community. This was publicised on the Ensembl and UCSC genome browsers, by email and on social media.

Results: The survey had 788 responses from 32 different countries, the results of which we report here.

Conclusions: We present our roadmap to create an effective default set of transcripts for resources, and for reporting interpretation of clinical variants.

Keywords: default transcript; survey; transcript annotation; variant interpretation.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Animals
Biomarkers*
Computational Biology* / methods
Databases, Genetic
Genomics* / methods
Humans
RNA, Messenger / genetics*
Software
Web Browser

Substances

Biomarkers
RNA, Messenger

Abstract

Publication types

MeSH terms

Substances

Grants and funding