Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jun 29;12(7):919.
doi: 10.3390/biom12070919.

A Physics-Guided Neural Network for Predicting Protein-Ligand Binding Free Energy: From Host-Guest Systems to the PDBbind Database

Affiliations

A Physics-Guided Neural Network for Predicting Protein-Ligand Binding Free Energy: From Host-Guest Systems to the PDBbind Database

Sahar Cain et al. Biomolecules. .

Abstract

Calculation of protein-ligand binding affinity is a cornerstone of drug discovery. Classic implicit solvent models, which have been widely used to accomplish this task, lack accuracy compared to experimental references. Emerging data-driven models, on the other hand, are often accurate yet not fully interpretable and also likely to be overfitted. In this research, we explore the application of Theory-Guided Data Science in studying protein-ligand binding. A hybrid model is introduced by integrating Graph Convolutional Network (data-driven model) with the GBNSR6 implicit solvent (physics-based model). The proposed physics-data model is tested on a dataset of 368 complexes from the PDBbind refined set and 72 host-guest systems. Results demonstrate that the proposed Physics-Guided Neural Network can successfully improve the "accuracy" of the pure data-driven model. In addition, the "interpretability" and "transferability" of our model have boosted compared to the purely data-driven model. Further analyses include evaluating model robustness and understanding relationships between the physical features.

Keywords: binding free energy; graph convolutional network; implicit solvent model.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Workflow of the proposed PGNN model. The sample input structure, the complex of the BIR domain of MLIAP and GDC0152, is selected from the PDBbind refined set [58]. After featurization, the structure-based features along with the adjacency matrix A enter the network. The results of the first network iteration are the model variable M. The output of the physics-based model is vector P, which is concatenated with M. The resulting vector goes through several iterations before entering the final dense layer.
Figure 2
Figure 2
Distribution of ΔΔG values of PDBbind refined set vs. our sample dataset.
Figure 3
Figure 3
Validation loss (val loss) and training loss (train loss) per epoch for GraphConv, AtomicConv (data-driven) and PGNN (hybrid) models on the PDBbind dataset. (a) GraphConv model; (b) AtomicConv model; (c) PGNN model.
Figure 4
Figure 4
Validation loss (val loss) and training loss (train loss) per epoch for the GraphConv model (data-driven) and the PGNN model (hybrid) on the host–guest dataset. (a) GraphConv model; (b) PGNN model.
Figure 5
Figure 5
Error in calculating ΔΔG values on data with added noise compared to the original data.
Figure 6
Figure 6
Correlation heatmap between the physics–based features of PDBbind structures.

Similar articles

Cited by

References

    1. Du X., Li Y., Xia Y.L., Ai S.M., Liang J., Sang P., Ji X.L., Liu S.Q. Insights into protein–ligand interactions: Mechanisms, models, and methods. Int. J. Mol. Sci. 2016;17:144. doi: 10.3390/ijms17020144. - DOI - PMC - PubMed
    1. Woo H.J., Roux B. Calculation of absolute protein–ligand binding free energy from computer simulations. Proc. Natl. Acad. Sci. USA. 2005;102:6825–6830. doi: 10.1073/pnas.0409005102. - DOI - PMC - PubMed
    1. Jorgensen W.L. The Many Roles of Computation in Drug Discovery. Science. 2004;303:1813–1818. doi: 10.1126/science.1096361. - DOI - PubMed
    1. Mobley D.L., Gilson M.K. Predicting binding free energies: Frontiers and benchmarks. Annu. Rev. Biophys. 2017;46:531–558. doi: 10.1146/annurev-biophys-070816-033654. - DOI - PMC - PubMed
    1. de Ruiter A., Oostenbrink C. Advances in the calculation of binding free energies. Curr. Opin. Struct. Biol. 2020;61:207–212. doi: 10.1016/j.sbi.2020.01.016. - DOI - PubMed

Publication types

Grants and funding

This research was partially funded by the National Science Foundation (NSF) Grant No. 2136095 to N.F.

LinkOut - more resources