Statistical inference of the generation probability of T-cell receptors from sequence repertoires
- PMID: 22988065
- PMCID: PMC3479580
- DOI: 10.1073/pnas.1212755109
Abstract
Stochastic rearrangement of germline V-, D-, and J-genes to create variable coding sequence for certain cell surface receptors is at the origin of immune system diversity. This process, known as "VDJ recombination", is implemented via a series of stochastic molecular events involving gene choices and random nucleotide insertions between, and deletions from, genes. We use large sequence repertoires of the variable CDR3 region of human CD4+ T-cell receptor beta chains to infer the statistical properties of these basic biochemical events. Because any given CDR3 sequence can be produced in multiple ways, the probability distribution of hidden recombination events cannot be inferred directly from the observed sequences; we therefore develop a maximum likelihood inference method to achieve this end. To separate the properties of the molecular rearrangement mechanism from the effects of selection, we focus on nonproductive CDR3 sequences in T-cell DNA. We infer the joint distribution of the various generative events that occur when a new T-cell receptor gene is created. We find a rich picture of correlation (and absence thereof), providing insight into the molecular mechanisms involved. The generative event statistics are consistent between individuals, suggesting a universal biochemical process. Our probabilistic model predicts the generation probability of any specific CDR3 sequence by the primitive recombination process, allowing us to quantify the potential diversity of the T-cell repertoire and to understand why some sequences are shared between individuals. We argue that the use of formal statistical inference methods, of the kind presented in this paper, will be essential for quantitative understanding of the generation and evolution of diversity in the adaptive immune system.
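The ambiguity described in the abstract, where one observed sequence is compatible with several hidden recombination scenarios, can be illustrated with a toy expectation-maximization (EM) sketch. This is not the authors' implementation. The gene sequences, the limits `MAX_DEL` and `MAX_INS`, the independence of gene choice, deletions, and insertions, and the uniform inserted-nucleotide probability of 0.25 are all simplifying assumptions made for illustration; the abstract itself describes inferring a joint distribution with correlations.

```python
import math
from collections import defaultdict

# Hypothetical toy germline "genes" (not real TRBV sequences).
GENES = {"V1": "ACGTAC", "V2": "ACGTTG"}
MAX_DEL = 3  # assumed maximum number of 3' deletions
MAX_INS = 3  # assumed maximum number of inserted nucleotides


def scenarios(seq):
    """Enumerate every hidden (gene, deletions, insertions) event yielding seq."""
    out = []
    for name, germ in GENES.items():
        for d in range(MAX_DEL + 1):
            kept = germ[:len(germ) - d]
            n_ins = len(seq) - len(kept)
            if 0 <= n_ins <= MAX_INS and seq.startswith(kept):
                out.append((name, d, n_ins))
    return out


def scenario_prob(scenario, params):
    """Probability of one scenario; inserted bases are assumed uniform."""
    gene, d, n_ins = scenario
    return (params["gene"][gene] * params["del"][d]
            * params["ins"][n_ins] * 0.25 ** n_ins)


def p_gen(seq, params):
    """Generation probability: sum over all scenarios compatible with seq."""
    return sum(scenario_prob(s, params) for s in scenarios(seq))


def uniform_params():
    """Starting point for EM: uniform over each event type."""
    return {
        "gene": {g: 1 / len(GENES) for g in GENES},
        "del": {d: 1 / (MAX_DEL + 1) for d in range(MAX_DEL + 1)},
        "ins": {n: 1 / (MAX_INS + 1) for n in range(MAX_INS + 1)},
    }


def log_likelihood(seqs, params):
    return sum(math.log(p_gen(s, params)) for s in seqs)


def em_step(seqs, params):
    """One EM iteration: posterior-weighted counts, then renormalization."""
    counts = {k: defaultdict(float) for k in params}
    for seq in seqs:
        scen = scenarios(seq)
        weights = [scenario_prob(s, params) for s in scen]
        total = sum(weights)
        if total == 0:
            continue  # sequence not explainable under the toy model
        for (gene, d, n_ins), w in zip(scen, weights):
            post = w / total  # E-step: posterior of this scenario
            counts["gene"][gene] += post
            counts["del"][d] += post
            counts["ins"][n_ins] += post
    new_params = {}
    for k in params:  # M-step: normalize expected counts
        z = sum(counts[k].values())
        new_params[k] = {key: counts[k][key] / z for key in params[k]}
    return new_params


# Made-up observations, used only to exercise the functions above.
observed = ["ACGTAC", "ACGTTG", "ACGTA", "ACGTG", "ACGG"]
params = uniform_params()
for _ in range(20):
    params = em_step(observed, params)
print(log_likelihood(observed, params))
```

In this sketch, `p_gen` sums over every compatible scenario, which is why the event statistics cannot simply be read off by counting the most obvious alignment of each sequence. The EM update instead distributes each observed sequence across its candidate scenarios in proportion to their current probability.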
Conflict of interest statement
The authors declare no conflict of interest.
Similar articles
- Inferring processes underlying B-cell repertoire diversity. Philos Trans R Soc Lond B Biol Sci. 2015 Sep 5;370(1676):20140243. doi: 10.1098/rstb.2014.0243. PMID: 26194757. Free PMC article.
- Insights into immune system development and function from mouse T-cell repertoires. Proc Natl Acad Sci U S A. 2017 Feb 28;114(9):2253-2258. doi: 10.1073/pnas.1700241114. Epub 2017 Feb 14. PMID: 28196891. Free PMC article.
- repgenHMM: a dynamic programming tool to infer the rules of immune receptor generation from sequence data. Bioinformatics. 2016 Jul 1;32(13):1943-51. doi: 10.1093/bioinformatics/btw112. Epub 2016 Feb 26. PMID: 27153709. Free PMC article.
- Predicting the spectrum of TCR repertoire sharing with a data-driven model of recombination. Immunol Rev. 2018 Jul;284(1):167-179. doi: 10.1111/imr.12665. PMID: 29944757. Free PMC article. Review.
- The Bayesian optimist's guide to adaptive immune receptor repertoire analysis. Immunol Rev. 2018 Jul;284(1):148-166. doi: 10.1111/imr.12664. PMID: 29944760. Free PMC article. Review.
Cited by
- Amino acids at position 5 in the peptide/MHC binding region of a public virus-specific TCR are completely inter-changeable without loss of function. Eur J Immunol. 2022 Nov;52(11):1819-1828. doi: 10.1002/eji.202249975. Epub 2022 Oct 19. PMID: 36189878. Free PMC article.
- Quantifying changes in the T cell receptor repertoire during thymic development. Elife. 2023 Jan 20;12:e81622. doi: 10.7554/eLife.81622. PMID: 36661220. Free PMC article.
- A Framework for Annotation of Antigen Specificities in High-Throughput T-Cell Repertoire Sequencing Studies. Front Immunol. 2019 Sep 26;10:2159. doi: 10.3389/fimmu.2019.02159. eCollection 2019. PMID: 31616409. Free PMC article.
- Origin of Public Memory B Cell Clones in Fish After Antiviral Vaccination. Front Immunol. 2018 Sep 27;9:2115. doi: 10.3389/fimmu.2018.02115. eCollection 2018. PMID: 30319606. Free PMC article.
- Distinctive properties of identical twins' TCR repertoires revealed by high-throughput sequencing. Proc Natl Acad Sci U S A. 2014 Apr 22;111(16):5980-5. doi: 10.1073/pnas.1319389111. Epub 2014 Apr 7. PMID: 24711416. Free PMC article.