Neopepsee: accurate genome-level prediction of neoantigens by harnessing sequence and amino acid immunogenicity information
- PMID: 29360924
- DOI: 10.1093/annonc/mdy022
Neopepsee: accurate genome-level prediction of neoantigens by harnessing sequence and amino acid immunogenicity information
Abstract
Background: Tumor-specific mutations form novel immunogenic peptides called neoantigens. Neoantigens can be used as a biomarker predicting patient response to cancer immunotherapy. Although a predicted binding affinity (IC50) between peptide and major histocompatibility complex class I is currently used for neoantigen prediction, large number of false-positives exist.
Materials and methods: We developed Neopepsee, a machine-learning-based neoantigen prediction program for next-generation sequencing data. With raw RNA-seq data and a list of somatic mutations, Neopepsee automatically extracts mutated peptide sequences and gene expression levels. We tested 14 immunogenicity features to construct a machine-learning classifier and compared with the conventional methods based on IC50 regarding sensitivity and specificity. We tested Neopepsee on independent datasets from melanoma, leukemia, and stomach cancer.
Results: Nine of the 14 immunogenicity features that are informative and inter-independent were used to construct the machine-learning classifiers. Neopepsee provides a rich annotation of candidate peptides with 87 immunogenicity-related values, including IC50, expression levels of neopeptides and immune regulatory genes (e.g. PD1, PD-L1), matched epitope sequences, and a three-level (high, medium, and low) call for neoantigen probability. Compared with the conventional methods, the performance was improved in sensitivity and especially two- to threefold in the specificity. Tests with validated datasets and independently proven neoantigens confirmed the improved performance in melanoma and chronic lymphocytic leukemia. Additionally, we found sequence similarity in proteins to known pathogenic epitopes to be a novel feature in classification. Application of Neopepsee to 224 public stomach adenocarcinoma datasets predicted ∼7 neoantigens per patient, the burden of which was correlated with patient prognosis.
Conclusions: Neopepsee can detect neoantigen candidates with less false positives and be used to determine the prognosis of the patient. We expect that retrieval of neoantigen sequences with Neopepsee will help advance research on next-generation cancer immunotherapies, predictive biomarkers, and personalized cancer vaccines.
Comment in
-
Computational prediction of neoantigens: do we need more data or new approaches?Ann Oncol. 2018 Apr 1;29(4):799-801. doi: 10.1093/annonc/mdy070. Ann Oncol. 2018. PMID: 29481589 No abstract available.
Similar articles
-
Machine learning methods and harmonized datasets improve immunogenic neoantigen prediction.Immunity. 2023 Nov 14;56(11):2650-2663.e6. doi: 10.1016/j.immuni.2023.09.002. Epub 2023 Oct 9. Immunity. 2023. PMID: 37816353
-
Population-level distribution and putative immunogenicity of cancer neoepitopes.BMC Cancer. 2018 Apr 13;18(1):414. doi: 10.1186/s12885-018-4325-6. BMC Cancer. 2018. PMID: 29653567 Free PMC article.
-
pTuneos: prioritizing tumor neoantigens from next-generation sequencing data.Genome Med. 2019 Oct 30;11(1):67. doi: 10.1186/s13073-019-0679-x. Genome Med. 2019. PMID: 31666118 Free PMC article.
-
Cancer Neoantigens: Challenges and Future Directions for Prediction, Prioritization, and Validation.Front Oncol. 2022 Mar 3;12:836821. doi: 10.3389/fonc.2022.836821. eCollection 2022. Front Oncol. 2022. PMID: 35311072 Free PMC article. Review.
-
Identifying neoantigens for use in immunotherapy.Mamm Genome. 2018 Dec;29(11-12):714-730. doi: 10.1007/s00335-018-9771-6. Epub 2018 Aug 24. Mamm Genome. 2018. PMID: 30167844 Free PMC article. Review.
Cited by
-
Developing Vaccines in Pancreatic Adenocarcinoma: Trials and Tribulations.Curr Oncol. 2024 Aug 23;31(9):4855-4884. doi: 10.3390/curroncol31090361. Curr Oncol. 2024. PMID: 39329989 Free PMC article. Review.
-
Tumor Neoepitope-Based Vaccines: A Scoping Review on Current Predictive Computational Strategies.Vaccines (Basel). 2024 Jul 24;12(8):836. doi: 10.3390/vaccines12080836. Vaccines (Basel). 2024. PMID: 39203962 Free PMC article. Review.
-
Transformers meets neoantigen detection: a systematic literature review.J Integr Bioinform. 2024 Jul 4;21(2):20230043. doi: 10.1515/jib-2023-0043. eCollection 2024 Jun 1. J Integr Bioinform. 2024. PMID: 38960869 Free PMC article.
-
IMPROVE: a feature model to predict neoepitope immunogenicity through broad-scale validation of T-cell recognition.Front Immunol. 2024 Apr 3;15:1360281. doi: 10.3389/fimmu.2024.1360281. eCollection 2024. Front Immunol. 2024. PMID: 38633261 Free PMC article.
-
NeoAgDT: optimization of personal neoantigen vaccine composition by digital twin simulation of a cancer cell population.Bioinformatics. 2024 May 2;40(5):btae205. doi: 10.1093/bioinformatics/btae205. Bioinformatics. 2024. PMID: 38614133 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials
Miscellaneous