Abstract
Generating the appropriate protonation states of drug-like molecules in solution is important for success in both ligand- and structure-based virtual screening. Screening collections of millions of compounds requires a method for determining tautomers and their energies that is sufficiently rapid, accurate, and comprehensive. To maximise enrichment, the lowest energy tautomers must be determined from heterogeneous input, without over-enumerating unfavourable states. While computationally expensive, the density functional theory (DFT) method M06-2X/aug-cc-pVTZ(-f) [PB-SCRF] provides accurate energies for enumerated model tautomeric systems. The empirical Hammett–Taft methodology can very rapidly extrapolate substituent effects from model systems to drug-like molecules via the relationship between pKT and pKa. Combining the two complementary approaches transforms the tautomer problem from a scientific challenge to one of engineering scale-up, and avoids issues that arise due to the very limited number of measured pKT values, especially for the complicated heterocycles often favoured by medicinal chemists for their novelty and versatility. Several hundreds of pre-calculated tautomer energies and substituent pKa effects are tabulated in databases for use in structural adjustment by the program Epik, which treats tautomers as a subset of the larger problem of the protonation states in aqueous ensembles and their energy penalties. Accuracy and coverage is continually improved and expanded by parameterizing new systems of interest using DFT and experimental data. Recommendations are made for how to best incorporate tautomers in molecular design and virtual screening workflows.
Similar content being viewed by others
References
Comer J, Tam K (2001) In: Testa B, van de Waterbeemd H, Folkers G, Guy R (eds) Pharmacokinetic optimization in drug research: biological, physicochemical, and computational strategies. Wiley, Weinheim, pp 275–304
Pospisil P, Ballmer P, Scapozza L, Folkers G (2003) Tautomerism in computer-aided drug design. J Recept Signal Transduct Res 23(4):361–371
Rester U (2008) From virtuality to reality—virtual screening in lead discovery and lead optimization: a medicinal chemistry perspective. Curr Opin Drug Discov Dev 11(4):559–568
Shelley JC, Cholleti A, Frye LL, Greenwood JR, Timlin MR, Uchimaya M (2007) Epik: a software program for pK(a) prediction and protonation state generation for drug-like molecules. J Comput Aided Mol Des 21(12):681–691
Epik, v2.0 (2009) Schrödinger, Inc., New York
LigPrep, v2.3 (2009) Schrödinger, Inc., New York
Milletti F, Storchi L, Sforna G, Cross S, Cruciani G (2009) Tautomer enumeration and stability prediction for virtual screening on large chemical databases. J Chem Inf Model 49(1):68–75
Oellien F, Cramer J, Beyer C, Ihlenfeldt WD, Selzer PM (2006) The impact of tautomer forms on pharmacophore-based virtual screening. J Chem Inf Model 46(6):2342–2354
Haranczyk M, Gutowski M (2007) Quantum mechanical energy-based screening of combinatorially generated library of tautomers. TauTGen: a tautomer generator program. J Chem Inf Model 47(2):686–694
Pisklak M, Maciejewska D, Herold F, Wawer I (2003) Solid state structure of coumarin anticoagulants: warfarin and sintrom. 13C CPMAS NMR and GIAO DFT calculations. J Mol Struct 649(1–2):169–176
Zhang J, Yang PL, Gray NS (2009) Targeting cancer with small molecule kinase inhibitors. Nat Rev Cancer 9(1):28–39
Pauling L (1960) The nature of the chemical bond, 3rd edn. Cornell University Press, Ithaca
Elguero J, Katritzky AR, Marzin C, Linda P (1976) The tautomerism of heterocycles. Academic Press, New York
Rauhut G (2002) Recent advances in computing heteroatom-rich five and six-membered ring systems. Adv Heterocycl Chem 81:2–85
Bryantsev VS, Diallo MS, van Duin ACT, Goddard III WA (2009) Evaluation of B3LYP, X3LYP, and M06-class density functionals for predicting the binding energies of neutral, protonated, and deprotonated water clusters. J Chem Theory Comput 5(4):1016–1026
Harding ME, Metzroth T, Gauss J, Auer AA (2008) Parallel calculation of CCSD and CCSD(T) analytic first and second derivatives. J Chem Theory Comput 4(1):64–74
Fabian WMF (1991) Tautomeric equilibria of heterocyclic molecules. A test of the semiempirical AM1 and MNDO-PM3 methods. J Comput Chem 12(1):17–35
Stewart JJP (2007) Optimization of parameters for semiempirical methods V: modification of NDDO approximations and application to 70 elements. J Mol Model 13(12):1173–1213
Tirado-Rives J, Jorgensen WL (2008) Performance of B3LYP density functional methods for a large set of organic molecules. J Chem Theory Comput 4(2):297–306
Cramer CJ, Truhlar DG (1993) Correlation and solvation effects on heterocyclic equilibria in aqueous solution. J Am Chem Soc 115(19):8810–8817
Zhao Y, Truhlar DG (2007) The M06 suite of density functionals for main group thermochemistry, kinetics, noncovalent interactions, excited states, and transition elements: two new functionals and systematic testing of four M06 functionals and twelve other functionals. Theor Chem Acc 120:215–241
Jaguar, v7.6 (2009) Schrödinger, Inc., New York
Zhao Y, Truhlar DG (2008) Exploring the limit of accuracy of the global hybrid meta density functional for main-group thermochemistry, kinetics, and noncovalent interactions. J Chem Theory Comput 4(11):1849–1868
Frydenvang K, Greenwood JR, Vogensen SB, Brehm L (2002) Structural features of ATPA and Thio-ATPA—potent and selective GluR5 receptor agonists. Crystal structure determinations and quantum chemical calculations. Struct Chem 13(5–6):479–490
Ahlquist MSG, Kozuch S, Shaik S, Tanner DA, Norrby P-O (2006) On the performance of continuum solvation models for the solvation energy of small anions. Organometallics 25(1):45–47
Nielsen PA, Jaroszewski JW, Norrby PO, Liljefors T (2001) An NMR and ab initio quantum chemical study of acid-base equilibria for conformationally constrained acidic alpha-amino acids in aqueous solution. J Am Chem Soc 123(9):2003–2006
Nielsen PA, Jaroszewski JW, Norrby PO, Liljefors T (2002) Conformational analysis of cyclic acidic alpha-amino acids in aqueous solution—an evaluation of different continuum hydration models. Unpublished data and PhD thesis, University of Copenhagen
Marenich AV, Cramer CJ, Truhlar DG (2009) Performance of SM6, SM8, and SMD on the SAMPL1 test set for the prediction of small-molecule solvation free energies. J Phys Chem B 113(14):4538–4543
Katritzki AR, Lagowski J (1963) Adv Heterocycl Chem 1:339
Katritzky AR, Øksne S, Boulton AJ (1962) The tautomerism of heteroaromatic compounds with five-membered rings—III: further isoxazol-5-ones. Tetrahedron 18(6):777–790
Le HT, Lamb JG, Franklin MR (1998) Drug metabolizing enzyme induction by benzoquinolines, acridine, and quinacrine; tricyclic aromatic molecules containing a single heterocyclic nitrogen. J Biochem Toxicol 11(6):297–303
Harvey RG (1991) Aromatic hydrocarbons: chemistry and carcinogenicity. Cambridge University Press, Cambridge
Perrin DD, Dempsy B, Sergeant EP (1981) pKa prediction for organic acids and bases. Chapman and Hall, London
Gordon A, Katritzky AR, Roy SK (1968) Tautomeric pyridines. Part X. Effects of substituents on pyridone–hydroxypyridine equilibria and pyridone basicity. J Chem Soc B 1968:556–561
ACD/PhysChem Suite, v12.0 (2009) Advanced Chemistry Development, Inc., Toronto
Pallas pKalc Net., v2.0 (2009) Compudrug International Inc., Sedona
Hilal S, Karickhoff SW, Carreira LA (1995) A rigorous test for SPARC’s chemical reactivity models: estimation of more than 4300 ionization pKa’s. Quant Struct Act Relatsh 14:348–355
Liao C, Nicklaus MC (2009) Comparison of nine programs predicting pK(a) values of pharmaceutical substances. J Chem Inf Model 49(12):2801–2812
Klicic J, Friesner RA, Liu S-Y, Guida WC (2002) Accurate prediction of acidity constants in aqueous solution via density functional theory and self-consistent reaction field methods. J Phys Chem A 106(7):1327–1335
Martin YC (2009) Let’s not forget tautomers. J Comput-Aided Mol Des 23(10):673–704
Guimaraes CR, Cardozo M (2008) MM-GB/SA rescoring of docking poses in structure-based lead optimization. J Chem Inf Model 48(5):958–970
Fogolari F, Brigo A, Molinari H (2003) Protocol for MM/PBSA molecular dynamics simulations of proteins. Biophys J 85(1):159–166
Furet P, Meyer T, Strauss A, Raccuglia S, Rondeau JM (2002) Structure-based design and protein X-ray analysis of a protein kinase inhibitor. Bioorg Med Chem Lett 12(2):221–224
Repasky M (2009) Enhancing Glide Enrichment Using Epik Ionization and Tautomeric State Penalties. Schrodinger Quaterly Newsletter, August
Prime-X, v1.6 (2009) Schrödinger, Inc., New York
Brandstetter H, Grams F, Glitz D, Lang A, Huber R, Bode W, Krell HW, Engh RA (2001) The 1.8-A crystal structure of a matrix metalloproteinase 8-barbiturate inhibitor complex reveals a previously unobserved mechanism for collagenase substrate recognition. J Biol Chem 276(20):17405–17412
Temperini C, Cecchi A, Scozzafava A, Supuran CT (2009) Carbonic anhydrase inhibitors. Comparison of chlorthalidone and indapamide X-ray crystal structures in adducts with isozyme II: when three water molecules and the keto-enol tautomerism make the difference. J Med Chem 52(2):322–328
Yan X, Hollis T, Svinth M, Day P, Monzingo AF, Milne GW, Robertus JD (1997) Structure-based identification of a ricin inhibitor. J Mol Biol 266(5):1043–1049
Kalliokoski T, Salo HS, Lahtela-Kakkonen M, Poso A (2009) The effect of ligand-based tautomer and protomer prediction on structure-based virtual screening. J Chem Inf Model 49(12):2742–2748
Glide, v5.5 (2009) Schrödinger, Inc., New York
Rejnek J, Hobza P (2007) Hydrogen-bonded nucleic acid base pairs containing unusual base tautomers: complete basis set calculations at the MP2 and CCSD(T) levels. J Phys Chem B 111(3):641–645
Shukla M, Leszcynski J (2008) Radiation induced molecular phenomena in nucleic acids: a comprehensive theoretical and experimental analysis. Springer, Berlin
Canvas, v1.2 (2009) Schrödinger, Inc., New York
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Greenwood, J.R., Calkins, D., Sullivan, A.P. et al. Towards the comprehensive, rapid, and accurate prediction of the favorable tautomeric states of drug-like molecules in aqueous solution. J Comput Aided Mol Des 24, 591–604 (2010). https://doi.org/10.1007/s10822-010-9349-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10822-010-9349-1