Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2024 Jan 25:14:1258083.
doi: 10.3389/fgene.2023.1258083. eCollection 2023.

Artificial intelligence and database for NGS-based diagnosis in rare disease

Affiliations
Review

Artificial intelligence and database for NGS-based diagnosis in rare disease

Yee Wen Choon et al. Front Genet. .

Abstract

Rare diseases (RDs) are rare complex genetic diseases affecting a conservative estimate of 300 million people worldwide. Recent Next-Generation Sequencing (NGS) studies are unraveling the underlying genetic heterogeneity of this group of diseases. NGS-based methods used in RDs studies have improved the diagnosis and management of RDs. Concomitantly, a suite of bioinformatics tools has been developed to sort through big data generated by NGS to understand RDs better. However, there are concerns regarding the lack of consistency among different methods, primarily linked to factors such as the lack of uniformity in input and output formats, the absence of a standardized measure for predictive accuracy, and the regularity of updates to the annotation database. Today, artificial intelligence (AI), particularly deep learning, is widely used in a variety of biological contexts, changing the healthcare system. AI has demonstrated promising capabilities in boosting variant calling precision, refining variant prediction, and enhancing the user-friendliness of electronic health record (EHR) systems in NGS-based diagnostics. This paper reviews the state of the art of AI in NGS-based genetics, and its future directions and challenges. It also compare several rare disease databases.

Keywords: artificial intelligence; data science; diagnosis; machine learning; next-generation sequencing; rare disease.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

FIGURE 1
FIGURE 1
The use of sequencing in precision medicine.
FIGURE 2
FIGURE 2
A framework of WES/WGS data analysis from individual patients with rare diseases. (GATK-Genome Analysis Toolkit, VQSR-Variant Quality Score Recalibration, BWA-Burrows-Wheeler Alignment, SAMtools-Sequence Alignment/Map tools, VEP-Ensembl Variant Effect Predictor, HGC-Hierarchical Graph-based Clustering, GTEx-Genotype Tissue Expression, IGV-Integrative Genomics Viewer).
FIGURE 3
FIGURE 3
General workflow for NGS data analysis.

Similar articles

References

    1. Abul-Husn N. S., Kenny E. E. (2019). Personalized medicine and the power of electronic health records. Cell 177 (1), 58–69. 10.1016/j.cell.2019.02.039 - DOI - PMC - PubMed
    1. Amberger J. S., Bocchini C. A., Schiettecatte F., Scott A. F., Hamosh A. (2015). OMIM.org: online Mendelian Inheritance in Man (OMIM®), an online catalog of human genes and genetic disorders. Nucleic acids Res. 43 (D1), D789–D798. 10.1093/nar/gku1205 - DOI - PMC - PubMed
    1. Amorim B. R., Santos P. A. C. D., Lima C. L. D., Andia D. C., Mazzeu J. F., Acevedo A. C. (2019). “Protocols for genetic and epigenetic studies of rare diseases affecting dental tissues,” in Odontogenesis (New York, NY: Humana Press; ), 453–492. - PubMed
    1. Anzar I., Sverchkova A., Stratford R., Clancy T. (2019). NeoMutate: an ensemble machine learning framework for the prediction of somatic mutations in cancer. BMC Med. genomics 12 (1), 63–14. 10.1186/s12920-019-0508-5 - DOI - PMC - PubMed
    1. Austin C. P., Cutillo C. M., Lau L. P., Jonker A. H., Rath A., Julkowska D., et al. (2018). Future of rare diseases research 2017–2027: an IRDiRC perspective. Clin. Transl. Sci. 11 (1), 21–27. 10.1111/cts.12500 - DOI - PMC - PubMed

Grants and funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This work was sponsored by the ASPIRE, the technology program management pillar of Abu Dhabi’s Advanced Technology Research Council (ATRC), via the ASPIRE Precision Medicine Research Institute Abu Dhabi (ASPIREPMRIAD) award grant number VRI-20‐10. The United Arab Emirates University also supported this work through the Research Start‐up Program (Grant # 12M109).

LinkOut - more resources