Analysis of selection in protein-coding sequences accounting for common biases
- PMID: 33479739
- DOI: 10.1093/bib/bbaa431
Analysis of selection in protein-coding sequences accounting for common biases
Abstract
The evolution of protein-coding genes is usually driven by selective processes, which favor some evolutionary trajectories over others, optimizing the subsequent protein stability and activity. The analysis of selection in this type of genetic data is broadly performed with the metric nonsynonymous/synonymous substitution rate ratio (dN/dS). However, most of the well-established methodologies to estimate this metric make crucial assumptions, such as lack of recombination or invariable codon frequencies along genes, which can bias the estimation. Here, we review the most relevant biases in the dN/dS estimation and provide a detailed guide to estimate this metric using state-of-the-art procedures that account for such biases, along with illustrative practical examples and recommendations. We also discuss the traditional interpretation of the estimated dN/dS emphasizing the importance of considering complementary biological information such as the role of the observed substitutions on the stability and function of proteins. This review is oriented to help evolutionary biologists that aim to accurately estimate selection in protein-coding sequences.
Keywords: dN/dS estimation; dN/dS interpretation; codon evolution; codon frequencies; molecular adaptation; recombination.
© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Similar articles
-
The influence of heterogeneous codon frequencies along sequences on the estimation of molecular adaptation.Bioinformatics. 2020 Jan 15;36(2):430-436. doi: 10.1093/bioinformatics/btz558. Bioinformatics. 2020. PMID: 31304972
-
The influence of selection for protein stability on dN/dS estimations.Genome Biol Evol. 2014 Oct 28;6(10):2956-67. doi: 10.1093/gbe/evu223. Genome Biol Evol. 2014. PMID: 25355808 Free PMC article.
-
Why time matters: codon evolution and the temporal dynamics of dN/dS.Mol Biol Evol. 2014 Jan;31(1):212-31. doi: 10.1093/molbev/mst192. Epub 2013 Oct 14. Mol Biol Evol. 2014. PMID: 24129904 Free PMC article.
-
Incorporation of transition to transversion ratio and nonsense mutations, improves the estimation of the number of synonymous and non-synonymous sites in codons.DNA Res. 2022 Jun 25;29(4):dsac023. doi: 10.1093/dnares/dsac023. DNA Res. 2022. PMID: 35920776 Free PMC article.
-
Coding sequence evolution.Curr Opin Genet Dev. 1999 Dec;9(6):637-41. doi: 10.1016/s0959-437x(99)00034-9. Curr Opin Genet Dev. 1999. PMID: 10607619 Review.
Cited by
-
In Silico Analysis: Genome-Wide Identification, Characterization and Evolutionary Adaptations of Bone Morphogenetic Protein (BMP) Gene Family in Homo sapiens.Mol Biotechnol. 2024 Nov;66(11):3336-3356. doi: 10.1007/s12033-023-00944-3. Epub 2023 Nov 1. Mol Biotechnol. 2024. PMID: 37914865
-
Molecular Evolution of SARS-CoV-2 during the COVID-19 Pandemic.Genes (Basel). 2023 Feb 4;14(2):407. doi: 10.3390/genes14020407. Genes (Basel). 2023. PMID: 36833334 Free PMC article. Review.
-
Evolutionary History of TOPIIA Topoisomerases in Animals.J Mol Evol. 2022 Apr;90(2):149-165. doi: 10.1007/s00239-022-10048-2. Epub 2022 Feb 14. J Mol Evol. 2022. PMID: 35165762
-
ProteinEvolverABC: coestimation of recombination and substitution rates in protein sequences by approximate Bayesian computation.Bioinformatics. 2021 Dec 22;38(1):58-64. doi: 10.1093/bioinformatics/btab617. Bioinformatics. 2021. PMID: 34450622 Free PMC article.
-
Evolution of Transcriptomes in Early-Generation Hybrids of the Apomictic Ranunculus auricomus Complex (Ranunculaceae).Int J Mol Sci. 2022 Nov 10;23(22):13881. doi: 10.3390/ijms232213881. Int J Mol Sci. 2022. PMID: 36430360 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources