Genomic analyses of intricate interaction of TE-lncRNA overlapping genes with miRNAs in human diseases

Genes Genomics. 2024 Aug 31. doi: 10.1007/s13258-024-01547-1. Online ahead of print.

Abstract

Background: Transposable elements (TEs) are known to be inserted into genome to create transcript isoforms or to generate long non-coding RNA (lncRNA) sequences. The insertion of TEs generates a gene protein sequence within the genome, but also provides a microRNA (miRNA) regulatory region.

Objective: To determine the effect of gene sequence changes caused by TE insertion on miRNA binding and to investigate the formation of an overlapping lncRNA that represses it.

Methods: The distribution of overlapping regions between exons and TE regions with lncRNA was examined using the Bedtools. miRNAs that can bind to those overlapping regions were identified through the miRDB web program. For TE-lncRNA overlapping genes, bioinformatic analysis was conducted using DAVID web database. Differential expression analysis was conducted using data from the GEO dataset and TCGA.

Results: Most TEs were distributed more frequently in untranslated regions than open reading frames. There were 30 annotated TE-lncRNA overlapping genes with same strand that could bind to the same miRNA. As a result of identifying the association between these 30 genes and diseases, TGFB2, FCGR2A, DCTN5, and IFI6 were associated with breast cancer, and HMGCS1, FRMD4A, EDNRB, and SNCA were associated with Alzheimer's disease. Analysis of the GEO and TCGA data showed that the relevant expression of miR-891a and miR-28, which bind to the TE overlapping region of DCTN5 and HMGCS1, decreased.

Conclusion: This study indicates that the interaction between TE-lncRNA overlapping genes and miRNAs can affect disease progression.

Keywords: Bioinformatics; Human disease; Transposable elements; long non-coding RNA; microRNA.