Deep reinforcement learning for de novo drug design

doi:10.1126/sciadv.aap7885

. 2018 Jul 25;4(7):eaap7885.

doi: 10.1126/sciadv.aap7885. eCollection 2018 Jul.

Deep reinforcement learning for de novo drug design

Mariya Popova^{1

2

3}, Olexandr Isayev¹, Alexander Tropsha¹

Affiliations

¹ Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC 27599, USA.
² Moscow Institute of Physics and Technology, Dolgoprudny, Moscow 141700, Russia.
³ Skolkovo Institute of Science and Technology, Moscow 143026, Russia.

PMID: 30050984
PMCID: PMC6059760
DOI: 10.1126/sciadv.aap7885

Deep reinforcement learning for de novo drug design

Mariya Popova et al. Sci Adv. 2018.

. 2018 Jul 25;4(7):eaap7885.

doi: 10.1126/sciadv.aap7885. eCollection 2018 Jul.

Authors

Mariya Popova^{1

2

3}, Olexandr Isayev¹, Alexander Tropsha¹

Affiliations

¹ Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC 27599, USA.
² Moscow Institute of Physics and Technology, Dolgoprudny, Moscow 141700, Russia.
³ Skolkovo Institute of Science and Technology, Moscow 143026, Russia.

PMID: 30050984
PMCID: PMC6059760
DOI: 10.1126/sciadv.aap7885

Abstract

We have devised and implemented a novel computational strategy for de novo design of molecules with desired properties termed ReLeaSE (Reinforcement Learning for Structural Evolution). On the basis of deep and reinforcement learning (RL) approaches, ReLeaSE integrates two deep neural networks-generative and predictive-that are trained separately but are used jointly to generate novel targeted chemical libraries. ReLeaSE uses simple representation of molecules by their simplified molecular-input line-entry system (SMILES) strings only. Generative models are trained with a stack-augmented memory network to produce chemically feasible SMILES strings, and predictive models are derived to forecast the desired properties of the de novo-generated compounds. In the first phase of the method, generative and predictive models are trained separately with a supervised learning algorithm. In the second phase, both models are trained jointly with the RL approach to bias the generation of new chemical structures toward those with the desired physical and/or biological properties. In the proof-of-concept study, we have used the ReLeaSE method to design chemical libraries with a bias toward structural complexity or toward compounds with maximal, minimal, or specific range of physical properties, such as melting point or hydrophobicity, or toward compounds with inhibitory activity against Janus protein kinase 2. The approach proposed herein can find a general use for generating targeted chemical libraries of novel compounds optimized for either a single desired property or multiple properties.

PubMed Disclaimer

Figures

**Fig. 1. The workflow of deep RL algorithm for generating new SMILES strings of compounds with the desired properties.**
(A) Training step of the generative Stack-RNN. (B) Generator step of the generative Stack-RNN. During training, the input token is a character in the currently processed SMILES string from the training set. The model outputs the probability vector p_Θ(a_t|s_{t − 1}) of the next character given a prefix. Vector of parameters Θ is optimized by cross-entropy loss function minimization. In the generator regime, the input token is a previously generated character. Next, character a_t is sampled randomly from the distribution p_Θ(a_t| s_{t − 1}). (C) General pipeline of RL system for novel compound generation. (D) Scheme of predictive model. This model takes a SMILES string as an input and provides one real number, which is an estimated property value, as an output. Parameters of the model are trained by l₂-squared loss function minimization.

**Fig. 2. A sample of molecules produced by the generative model.**

**Fig. 3. Performance of the generative model G, with and without stack-augmented memory.**
(A) Internal diversity of generated libraries. (B) Similarity of the generated libraries to the training data set from the ChEMBL database.

**Fig. 4. Property distributions for RL-optimized versus baseline generator model.**
(A) Melting temperature. (B) JAK2 inhibition. (C) Partition coefficient. (D) Number of benzene rings. (E) Number of substituents.

**Fig. 5. Evolution of generated structures as chemical substructure reward increases.**
(A) Reward proportional to the total number of small group substituents. (B) Reward proportional to the number of benzene rings.

**Fig. 6. Examples of Stack-RNN cells with interpretable gate activations.**
Color coding corresponds to GRU cells with hyperbolic tangent tanh activation function, where dark blue corresponds to the activation function value of −1 and red describes the value of the activation function of 1; the numbers in the range between −1 and 1 are colored using a cool-warm color map.

**Fig. 7. Clustering of generated molecules by t-SNE.**
Molecules are colored on the basis of the predicted properties by the predictive model P, with values shown by the color bar on the right. (A and C) Examples of the generated molecules randomly picked from matches with ZINC database and property values predicted by the predictive model P. (A) Partition coefficient, logP. (B) Melting temperature, T_m (^°C); examples show generated molecules with lowest and highest predicted T_m. (C) JAK2 inhibition, predicted pIC₅₀.

See this image and copyright information in PMC

Cited by

Network-principled deep generative models for designing drug combinations as graph sets.
Karimi M, Hasanzadeh A, Shen Y. Karimi M, et al. Bioinformatics. 2020 Jul 1;36(Suppl_1):i445-i454. doi: 10.1093/bioinformatics/btaa317. Bioinformatics. 2020. PMID: 32657357 Free PMC article.
Accelerated antimicrobial discovery via deep generative models and molecular dynamics simulations.
Das P, Sercu T, Wadhawan K, Padhi I, Gehrmann S, Cipcigan F, Chenthamarakshan V, Strobelt H, Dos Santos C, Chen PY, Yang YY, Tan JPK, Hedrick J, Crain J, Mojsilovic A. Das P, et al. Nat Biomed Eng. 2021 Jun;5(6):613-623. doi: 10.1038/s41551-021-00689-x. Epub 2021 Mar 11. Nat Biomed Eng. 2021. PMID: 33707779
Predicting chemical structure using reinforcement learning with a stack-augmented conditional variational autoencoder.
Kim H, Ko S, Kim BJ, Ryu SJ, Ahn J. Kim H, et al. J Cheminform. 2022 Dec 9;14(1):83. doi: 10.1186/s13321-022-00666-9. J Cheminform. 2022. PMID: 36494855 Free PMC article.
Deep learning of pharmacogenomics resources: moving towards precision oncology.
Chiu YC, Chen HH, Gorthi A, Mostavi M, Zheng S, Huang Y, Chen Y. Chiu YC, et al. Brief Bioinform. 2020 Dec 1;21(6):2066-2083. doi: 10.1093/bib/bbz144. Brief Bioinform. 2020. PMID: 31813953 Free PMC article.
Critical assessment of computational tools for prokaryotic and eukaryotic promoter prediction.
Zhang M, Jia C, Li F, Li C, Zhu Y, Akutsu T, Webb GI, Zou Q, Coin LJM, Song J. Zhang M, et al. Brief Bioinform. 2022 Mar 10;23(2):bbab551. doi: 10.1093/bib/bbab551. Brief Bioinform. 2022. PMID: 35021193 Free PMC article.

See all "Cited by" articles

References

1. Gil Y., Greaves M., Hendler J., Hirsh H., Amplify scientific discovery with artificial intelligence. Science 346, 171–172 (2014). - PubMed
1. Krittanawong C., Zhang H., Wang Z., Aydar M., Kitai T., Artificial intelligence in precision cardiovascular medicine. J. Am. Coll. Cardiol. 69, 2657–2664 (2017). - PubMed
1. Chockley K., Emanuel E., The end of radiology? Three threats to the future practice of radiology. J. Am. Coll. Radiol. 13, 1415–1420 (2016). - PubMed
1. Altae-Tran H., Ramsundar B., Pappu A. S., Pande V., Low data drug discovery with one-shot learning. ACS Cent. Sci. 3, 283–293 (2017). - PMC - PubMed
1. Gawehn E., Hiss J. A., Schneider G., Deep learning in drug discovery. Mol. Inform. 35, 3–14 (2016). - PubMed

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions
Actions

Grants and funding

T32 GM067553/GM/NIGMS NIH HHS/United States

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database
- scite Smart Citations

[1] Gil Y., Greaves M., Hendler J., Hirsh H., Amplify scientific discovery with artificial intelligence. Science 346, 171–172 (2014). - PubMed

[2] Gil Y., Greaves M., Hendler J., Hirsh H., Amplify scientific discovery with artificial intelligence. Science 346, 171–172 (2014). - PubMed

[3] Krittanawong C., Zhang H., Wang Z., Aydar M., Kitai T., Artificial intelligence in precision cardiovascular medicine. J. Am. Coll. Cardiol. 69, 2657–2664 (2017). - PubMed

[4] Krittanawong C., Zhang H., Wang Z., Aydar M., Kitai T., Artificial intelligence in precision cardiovascular medicine. J. Am. Coll. Cardiol. 69, 2657–2664 (2017). - PubMed

[5] Chockley K., Emanuel E., The end of radiology? Three threats to the future practice of radiology. J. Am. Coll. Radiol. 13, 1415–1420 (2016). - PubMed

[6] Chockley K., Emanuel E., The end of radiology? Three threats to the future practice of radiology. J. Am. Coll. Radiol. 13, 1415–1420 (2016). - PubMed

[7] Altae-Tran H., Ramsundar B., Pappu A. S., Pande V., Low data drug discovery with one-shot learning. ACS Cent. Sci. 3, 283–293 (2017). - PMC - PubMed

[8] Altae-Tran H., Ramsundar B., Pappu A. S., Pande V., Low data drug discovery with one-shot learning. ACS Cent. Sci. 3, 283–293 (2017). - PMC - PubMed

[9] Gawehn E., Hiss J. A., Schneider G., Deep learning in drug discovery. Mol. Inform. 35, 3–14 (2016). - PubMed

[10] Gawehn E., Hiss J. A., Schneider G., Deep learning in drug discovery. Mol. Inform. 35, 3–14 (2016). - PubMed

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Deep reinforcement learning for de novo drug design

Affiliations

Deep reinforcement learning for de novo drug design

Authors

Affiliations

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources