Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2018 Jul;284(1):148-166.
doi: 10.1111/imr.12664.

The Bayesian optimist's guide to adaptive immune receptor repertoire analysis

Affiliations
Review

The Bayesian optimist's guide to adaptive immune receptor repertoire analysis

Branden J Olson et al. Immunol Rev. 2018 Jul.

Abstract

Probabilistic modeling is fundamental to the statistical analysis of complex data. In addition to forming a coherent description of the data-generating process, probabilistic models enable parameter inference about given datasets. This procedure is well developed in the Bayesian perspective, in which one infers probability distributions describing to what extent various possible parameters agree with the data. In this paper, we motivate and review probabilistic modeling for adaptive immune receptor repertoire data then describe progress and prospects for future work, from germline haplotyping to adaptive immune system deployment across tissues. The relevant quantities in immune sequence analysis include not only continuous parameters such as gene use frequency but also discrete objects such as B-cell clusters and lineages. Throughout this review, we unravel the many opportunities for probabilistic modeling in adaptive immune receptor analysis, including settings for which the Bayesian approach holds substantial promise (especially if one is optimistic about new computational methods). From our perspective, the greatest prospects for progress in probabilistic modeling for repertoires concern ancestral sequence estimation for B-cell receptor lineages, including uncertainty from germline genotype, rearrangement, and lineage development.

Keywords: Bayesian inference; high-throughput sequencing; likelihood models; repertoire analysis.

PubMed Disclaimer

Conflict of interest statement

Conflict of interest

We declare no conflict of interest.

Figures

Figure 1:
Figure 1:
Immune repertoires are hierarchically structured, here illustrated by the hierarchy for B cell receptor sequences. We benefit by considering the whole hierarchy that contributes to our observable sequences rather than one sequence at a time. For example, by considering all the reads at once one can infer a personal germline set, which then informs the per-read annotation. By learning lots of personal germline sets one can infer population-level germline trends.
Figure 2:
Figure 2:
Human height and immune receptor formation can both be modeled using probabilistic methods. Right panel modified (with permission) from (4).
Figure 3:
Figure 3:
Statisticians and biologists have typically approached adaptive immune receptor research in two parallel tracks. Statisticians (upper path) treat data as given and perform model criticism based on numerical estimates of how well models fit the data. Biologists (lower path) formulate their models mechanistically and generate data in targeted experiments to directly test their model. An integrated approach (dashed arrows) has biologists using parameter estimates to formulate mechanistic models, and statisticians using the results of targeted experiments to formulate statistical models. Ideally, the distinction between these two classes of models would evaporate, although mechanistic models are not always readily fit using statistical means.

Similar articles

Cited by

References

    1. Sompayrac LM. How the immune system works. Hoboken, New Jersey: John Wiley & Sons; 2011.
    1. Robins H Immunosequencing: applications of immune repertoire deep sequencing. Curr Opin Immunol. 2013. October;25(5):646–652. Available from: 10.1016/j.coi.2013.09.017. - DOI - PubMed
    1. Yaari G, Kleinstein SH. Practical guidelines for B-cell receptor repertoire sequencing analysis. Genome Med. 2015;7(1):121 Available from: 10.1186/s13073-015-0243-2. - DOI - PMC - PubMed
    1. Murugan A, Mora T, Walczak AM, Callan CG Jr. Statistical inference of the generation probability of T-cell receptors from sequence repertoires. Proc Natl Acad Sci U S A. 2012. October;109(40):16161–16166. Available from: 10.1073/pnas.1212755109. - DOI - PMC - PubMed
    1. Elhanati Y, Sethna Z, Marcou Q, Callan CG Jr, Mora T, Walczak AM. Inferring processes underlying B-cell repertoire diversity. Philos Trans R Soc Lond B Biol Sci. 2015. September;370(1676). Available from: 10.1098/rstb.2014.0243. - DOI - PMC - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources