Retrotransposons in Plant Genomes: Structure, Identification, and Classification through Bioinformatics and Machine Learning
- PMID: 31390781
- PMCID: PMC6696364
- DOI: 10.3390/ijms20153837
Abstract
Transposable elements (TEs) are genomic units able to move within the genome of virtually all organisms. Because of their repetitive nature and high structural diversity, identifying and classifying TEs in sequenced genomes remains a challenge. Although TEs were initially regarded as "junk DNA", they have been shown to play key roles in chromosome structure, gene expression and regulation, and adaptation and evolution. Highly reliable annotation of these elements is therefore crucial to better understand genome function and evolution. To date, many bioinformatics tools have been developed for TE detection and classification, but problems remain, such as the reliability, precision, and speed of the analyses. Machine learning and deep learning are families of algorithms that can make automatic predictions and decisions in a wide variety of scientific applications. They have been tested in bioinformatics and, more specifically, for TE classification, with encouraging results. In this review, we discuss important aspects of TEs, such as their structure, their importance in the evolution and architecture of the host, and their current classifications and nomenclatures. We also address current methods for identifying and classifying TEs, along with their limitations.
Keywords: bioinformatics; classification; deep learning; detection; function; machine learning; retrotransposons; structure; transposable elements.
Conflict of interest statement
The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.