Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Jun;1864(6):129545.
doi: 10.1016/j.bbagen.2020.129545. Epub 2020 Feb 10.

Machine learning and ligand binding predictions: A review of data, methods, and obstacles

Affiliations

Machine learning and ligand binding predictions: A review of data, methods, and obstacles

Sally R Ellingson et al. Biochim Biophys Acta Gen Subj. 2020 Jun.

Abstract

Computational predictions of ligand binding is a difficult problem, with more accurate methods being extremely computationally expensive. The use of machine learning for drug binding predictions could possibly leverage the use of biomedical big data in exchange for time-intensive simulations. This paper reviews current trends in the use of machine learning for drug binding predictions, data sources to develop machine learning algorithms, and potential problems that may lead to overfitting and ungeneralizable models. A few popular datasets that can be used to develop virtual high-throughput screening models are characterized using spatial statistics to quantify potential biases. We can see from evaluating some common benchmarks that good performance correlates with models with high-predicted bias scores and models with low bias scores do not have much predictive power. A better understanding of the limits of available data sources and how to fix them will lead to more generalizable models that will lead to novel drug discovery.

Keywords: Drug binding; Drug discovery; Machine learning; Overfitting.

PubMed Disclaimer

Similar articles

Cited by

Publication types

LinkOut - more resources