Abstract
Over the past two decades, solvent mapping has emerged as a useful tool for identifying hot spots within binding sites on proteins for drug-like molecules and suggesting properties of potential binders. While the experimental technique requires solving multiple crystal structures of a protein in different solvents, computational solvent mapping allows for fast analysis of a protein for potential binding sites and their druggability. Recent advances in genomics, systems biology and interactomics provide a multitude of potential targets for drug development and solvent mapping can provide useful information to help prioritize targets for drug discovery projects. Here, we review various approaches to computational solvent mapping, highlight some key advances and provide our opinion on future directions in the field.
Papers of special note have been highlighted as: • of interest; •• of considerable interest
References
- 1 . Probing hot spots at protein-ligand binding sites: a fragment-based approach using biophysical methods. J. Med. Chem. 49(16), 4992–5000 (2006).
- 2 . Discovering high-affinity ligands for proteins: SAR by NMR. Science 274(5292), 1531–1534 (1996).
- 3 . Druggability indices for protein targets derived from NMR-based screening data. J. Med. Chem. 48(7), 2518–2525 (2005).
- 4 . Predicting protein druggability. Drug Discov. Today 10(23–24), 1675–1682 (2005).
- 5 . FTSite: high accuracy detection of ligand binding sites on unbound protein structures. Bioinformatics 28(2), 286–287 (2012).
- 6 Two classes of p38α MAP kinase inhibitors having a common diphenylether core but exhibiting divergent binding modes. Bioorg. Med. Chem. Lett. 15(23), 5274–5279 (2005).
- 7 Discovery of a novel class of non-ATP site DFG-out state p38 inhibitors utilizing computationally assisted virtual fragment-based drug design (vFBDD). Bioorg. Med. Chem. Lett. 21(23), 7155–7165 (2011).
- 8 Rational design of potent sialidase-based inhibitors of influenza virus replication. Nature 363(6428), 418–423 (1993).
- 9 A study of the active site of influenza virus sialidase: an approach to the rational design of novel anti-influenza drugs. J. Med. Chem. 39(2), 388–391 (1996).
- 10 . The multi-copy simultaneous search methodology: a fundamental tool for structure-based drug design. J. Comput. Aided Mol. Des. 23(8), 475–489 (2009).• Thorough review of multiple copy simultaneous search and its applications.
- 11 . Automated generation of MCSS‐derived pharmacophoric DOCK site points for searching multiconformation databases. Proteins 51(2), 189–202 (2003).
- 12 . Use of MCSS to design small targeted libraries: application to picornavirus ligands. J. Am. Chem. Soc. 123(51), 12758–12769 (2001).
- 13 . Computational methods for functional site identification suggest a substrate access channel in transaldolase. Genome Inform. 17(1), 13–22 (2006).
- 14 Novel druggable hot spots in avian influenza neuraminidase h5n1 revealed by computational solvent mapping of a reduced and representative receptor ensemble. Chem. Biol. Drug Des. 71(2), 106–116 (2008).
- 15 . Exploring the binding site structure of the PPAR gamma ligand-binding domain by computational solvent mapping. Biochemistry 44(4), 1193–1209 (2005).
- 16 . Hot spot analysis for driving the development of hits into leads in fragment-based drug discovery. J. Chem. Inf. Model 52(1), 199–209 (2011).
- 17 How proteins bind macrocycles. Nat. Chem. Biol. 10(9), 723–731 (2014).
- 18 FTMap ftmap.bu.edu.
- 19 FTFlex ftflex.bu.edu.
- 20 National Biomedical Computation Resource nbcr.ucsd.edu.
- 21 Druggability Suite for VMD prody.csb.pitt.edu/drugui/.
- 22 . Site-identification by Ligand Competitive Saturation (SILCS) assisted pharmacophore modeling. J. Comput. Aided Mol. Des. 28(5), 491–507 (2014).
- 23 Using ligand‐mapping simulations to design a ligand selectively targeting a cryptic surface pocket of polo‐like kinase 1. Angew. Chem. 124(40), 10225–10228 (2012).
- 24 . The use of chlorobenzene as a probe molecule in molecular dynamics simulations. J. Chem. Inf. Model 54(7), 1821–1827 (2014).
- 25 . Hydrophobic binding hot spots of bcl-xl protein–protein interfaces by cosolvent molecular dynamics simulation. ACS Med. Chem. Lett. 2(4), 280–284 (2011).
- 26 An experimental approach to mapping the binding surfaces of crystalline proteins. J. Phys. Chem. 100(7), 2605–2611 (1996).
- 27 . Locating and characterizing binding sites on proteins. Nat. Biotechnol. 14(5), 595–599 (1996).•• Description of multiple solvent crystal structures and important ideas and observations behind the method.
- 28 . Proteins in organic solvents. Curr. Opin. Struct. Biol. 11(6), 761–764 (2001).
- 29 . Analysis of the binding surfaces of proteins. Med. Res. Rev. 19(4), 321–331 (1999).
- 30 . Locating interaction sites on proteins: the crystal structure of thermolysin soaked in 2% to 100% isopropanol. Proteins 37(4), 628–640 (1999).
- 31 . Fragment-based drug discovery. J. Med. Chem. 47(14), 3463–3482 (2004).
- 32 . Discovering novel ligands for macromolecules using X-ray crystallographic screening. Nat. Biotechnol. 18(10), 1105–1108 (2000).
- 33 Multiple solvent crystal structures: probing binding sites, plasticity and hydration. J. Mol. Bio. 357(5), 1471–1482 (2006).
- 34 . Protein modeling: what happened to the "protein structure gap"? Structure 21(9), 1531–1540 (2013).
- 35 . A computational procedure for determining energetically favorable binding sites on biologically important macromolecules. J. Med. Chem. 28(7), 849–857 (1985).• Initial description of GRID.
- 36 . Functionality maps of binding sites: a multiple copy simultaneous search method. Proteins 11(1), 29–34 (1991).
- 37 . Experimental and computational mapping of the binding surface of a crystalline protein. Protein Eng. 14(1), 47–59 (2001).•• Key comparison of experimental solvent mapping methods to computational solvent mapping using GRID and multiple copy simultaneous search.
- 38 . HOOK: a program for finding novel molecular architectures that satisfy the chemical and steric requirements of a macromolecule binding site. Proteins 19(3), 199–221 (1994).
- 39 . A decade of fragment-based drug design: strategic advances and lessons learned. Nat. Rev. Drug Discov. 6(3), 211–219 (2007).
- 40 . New hydrogen-bond potentials for use in determining energetically favorable binding sites on molecules of known structure. J. Med. Chem. 32(5), 1083–1094 (1989).
- 41 . Further development of hydrogen bond functions for use in determining energetically favorable binding sites on molecules of known structure. 1. Ligand probe groups with the ability to form two hydrogen bonds. J. Med. Chem. 36(1), 140–147 (1993).
- 42 . Characterization of protein-binding sites and ligands using molecular interaction fields. In: Comprehensive Medicinal Chemistry II. Taylor JB, Triggle DJ (Eds). London, Elsevier, 237–253 (2007).
- 43 . Hydrogen bonding interactions of covalently bonded fluorine atoms: from crystallographic data to a new angular function in the GRID force field. J. Med. Chem. 47(21), 5114–5125 (2004).
- 44 . A search for specificity in DNA-drug interactions. J. Mol. Graph. 12, 116–129 (1994).
- 45 . Enhanced sampling in molecular dynamics: use of the time-dependent Hartree approximation for a simulation of carbon monoxide diffusion through myoglobin. J. Am. Chem. Soc. 112(25), 9161–9175 (1990).
- 46 . MCSS functionality maps for a flexible protein. Proteins 37(4), 512–529 (1999).
- 47 . Functionality maps of the ATP binding site of DNA gyrase B: generation of a consensus model of ligand binding. J. Med. Chem. 47(18), 4373–4390 (2004).
- 48 . Computational combinatorial ligand design: application to human alpha-thrombin. J. Comput. Aided Mol. Des. 10(5), 372–396 (1996).
- 49 . Predicting fragment binding poses using a combined MCSS MM-GBSA approach. J. Chem. Inf. Model 51(5), 1092–1105 (2011).
- 50 . Combining solvent thermodynamic profiles with functionality maps of the Hsp90 binding site to predict the displacement of water molecules. J. Chem. Inf. Model 53(10), 2571–2586 (2013).
- 51 . Simulated annealing of chemical potential: a general procedure for locating bound waters. Application to the study of the differential hydration propensities of the major and minor grooves of DNA. J. Am. Chem. Soc. 118(35), 8493–8494 (1996).
- 52 , Sarnoff Corporation. Computational protein probing to identify binding sites. US6735530 (2004).
- 53 . Grand canonical monte carlo simulation of ligand−protein binding. J. Chem. Inf. Model 46(1), 231–242 (2006).
- 54 . Diverse fragment clustering and water exclusion identify protein hot spots. J. Am. Chem. Soc. 133(28), 10740–10743 (2011).
- 55 . Exploring potential solvation sites of proteins by multistart local minimization. In: Optimization in Computational Chemistry and Molecular Biology. Springer, Boston, MA, USA, 243–261 (2000).
- 56 Fragment-based identification of druggable “hot spots” of proteins using Fourier domain correlation techniques. Bioinformatics 25(5), 621–627 (2009).• Initial description of the FTMap/Atlas method and comparison to experimental solvent mapping and known ligands.
- 57 . Determination of atomic desolvation energies from the structures of crystallized proteins. J. Mol. Bio. 267(3), 707–726 (1997).
- 58 . Algorithms for computational solvent mapping of proteins. Proteins 51(3), 340–351 (2003).
- 59 . Improved mapping of protein binding sites. J. Comput. Aided Mol. Des. 17(2–4), 173–186 (2003).
- 60 . Computational mapping identifies the binding sites of organic solvents on proteins. Proc. Natl Acad. Sci. USA 99(7), 4290–4295 (2002).
- 61 . A systematic study of low-resolution recognition in protein–protein complexes. Proc. Natl Acad. Sci. USA 96(15), 8477–8482 (1999).
- 62 . Identification of substrate binding sites in enzymes by computational solvent mapping. J. Mol. Bio. 332(5), 1095–1113 (2003).
- 63 . Exploring the binding sites of the haloalkane dehalogenase DhlA from Xanthobacter autotrophicus GJ10. Biochemistry 46(32), 9239–9249 (2007).
- 64 . Computational solvent mapping reveals the importance of local conformational changes for broad substrate specificity in mammalian cytochromes P450. Biochemistry 45(31), 9393–9407 (2006).
- 65 . PIPER: an FFT‐based protein docking program with pairwise potentials. Proteins 65(2), 392–406 (2006).
- 66 . DARS (Decoys As the Reference State) potentials for protein–protein docking. Biophys. J. 95(9), 4217–4227 (2008).
- 67 Structural conservation of druggable hot spots in protein–protein interfaces. Proc. Natl Acad. Sci. USA 108(33), 13528–13533 (2011).
- 68 Reversing chemoresistance by small molecule inhibition of the translation initiation complex eIF4F. Proc. Natl Acad. Sci. USA 108(3), 1046–1051 (2011).
- 69 . Relationship between hot spot residues and ligand binding hot spots in protein–protein interfaces. J. Chem. Inf. Model 52(8), 2236–2244 (2012).
- 70 Minimal ensembles of side chain conformers for modeling protein–protein interactions. Proteins 80(2), 591–601 (2012).
- 71 . FTFlex: accounting for binding site flexibility to improve fragment-based identification of druggable hot spots. Bioinformatics 29(9), 1218–1219 (2013).
- 72 Computational mapping reveals dramatic effect of Hoogsteen breathing on duplex DNA reactivity with formaldehyde. Nucl. Acids Res. 40(16), 7644–7652 (2012).
- 73 . An integral equation to describe the solvation of polar molecules in liquid water. J. Phys. Chem. B. 101(39), 7821–7826 (1997).
- 74 . Three-dimensional density profiles of water in contact with a solute of arbitrary shape: a RISM approach. Chem. Phys. Lett. 290(1–3), 237–244 (1998).
- 75 . Ligand mapping on protein surfaces by the 3D-RISM theory: toward computational fragment-based drug design. J. Am. Chem. Soc. 131(34), 12430–12440 (2009).
- 76 . Binding site detection and druggability index from first principles. J. Med. Chem. 52(8), 2363–2371 (2009).
- 77 . Full protein flexibility is essential for proper hot-spot mapping. J. Am. Chem. Soc. 133(2), 200–202 (2010).
- 78 . Druggability assessment of allosteric proteins by dynamics simulations in the presence of probe molecules. J. Chem. Theory Comput. 8(7), 2435–2447 (2012).
- 79 . Computational fragment-based binding site identification by ligand competitive saturation. PLoS Comput. Biol. 5(7), e1000435 (2009).
- 80 . Reproducing crystal binding modes of ligand functional groups using site-identification by Ligand Competitive Saturation (SILCS) simulations. J. Chem. Inf. Model 51(4), 877–896 (2011).
- 81 . Inclusion of multiple fragment types in the site-identification by Ligand Competitive Saturation (SILCS) approach. J. Chem. Inf. Model 53(12), 3384–3398 (2013).
- 82 . Sampling of organic solutes in aqueous and heterogeneous environments using oscillating excess chemical potentials in grand canonical-like monte carlo-molecular dynamics simulations. J. Chem. Theory Comput. 10(6), 2281–2290 (2014).
- 83 . Improving protocols for protein mapping through proper comparison to crystallography data. J. Chem. Inf. Model 53(2), 391–402 (2013).
- 84 . Parameter choice matters: validating probe parameters for use in mixed-solvent simulations. J. Chem. Inf. Model 54(8), 2190–2199 (2014).
- 85 . From induced fit to conformational selection: a continuum of binding mechanism controlled by the timescale of conformational transitions. Biophys. J. 98(6), L15–L17 (2010).
- 86 . Induced fit, conformational selection and independent dynamic segments: an extended view of binding events. Trends Biochem. Sci. 35(10), 539–546 (2010).
- 87 . Response. Science 257(5068), 412–413 (1992).
- 88 Binding of small molecules to an adaptive protein–protein interface. Proc. Natl Acad. Sci. USA 100(4), 1603–1608 (2003).
- 89 . Potent small-molecule binding to a dynamic hot spot on IL-2. J. Am. Chem. Soc. 125(50), 15280–15281 (2003).
- 90 . Robust identification of binding hot spots using continuum electrostatics: application to hen egg-white lysozyme. J. Am. Chem. Soc. 133(51), 20668–20671 (2011).• Comparison of MixMD, grand canonical monte carlo and ftmap for finding hot spots on hen egg-white lysozyme.
- 91 Crystal structure of ABT-737 complexed with Bcl-xL: implications for selectivity of antagonists of the Bcl-2 family. Cell Death Differ. 14(9), 1711–1713 (2007).
- 92 . Identification of potential small molecule binding pockets on Rho family GTPases. PLoS ONE 7(7), e40809 (2012).
- 93 . LowModeMD–implicit low-mode velocity filtering applied to conformational search of macrocycles and protein loops. J. Chem. Inf. Model 50(5), 792–800 (2010).
- 94 . Exploring experimental sources of multiple protein conformations in structure-based drug design. J. Am. Chem. Soc. 129(26), 8225–8235 (2007).
- 95 . Evidence of conformational selection driving the formation of ligand binding sites in protein–protein interfaces. PLoS Comput. Biol. 10(10), e1003872 (2014).
- 96 . A molecular dynamics ensemble-based approach for the mapping of druggable binding sites. In: Computational Drug Discovery and Design. Baron R (Ed.). Springer New York, USA, 3–12 (2012).
- 97 . Novel naphthalene-based inhibitors of trypanosoma brucei rna editing ligase 1. PLoS Negl. Trop. Dis. 4(8),
doi:10.1371/journal.pntd.0000803 (2010) (Epub ahead of print). - 98 Computational identification of a transiently open L1/S3 pocket for reactivation of mutant p53. Nat. Comms. 4(1407),
doi:10.1038/ncomms2361 (2013) (Epub ahead of print). - 99 . Molecular simulations of aromatase reveal new insights into the mechanism of ligand binding. J. Chem. Inf. Model 53(8), 2047–2056 (2013).
- 100 . Mapping the druggable allosteric space of G-protein coupled receptors: a fragment-based molecular dynamics approach. Chem. Biol. Drug Des. 76(3), 201–217 (2010).
- 101 Novel allosteric sites on Ras for lead generation. PLoS ONE 6(10), e25711 (2011).
- 102 . Multistructural hot spot characterization with FTProd. Bioinformatics 29(3), 393–394 (2013).
- 103 . Method for including the dynamic fluctuations of a protein in computer-aided drug design. J. Phys. Chem. A. 103(49), 10213–10219 (1999).
- 104 . Druggable protein interaction sites are more predisposed to surface pocket formation than the rest of the protein surface. PLoS Comput. Biol. 9(3), e1002951 (2013).
- 105 . Balancing target flexibility and target denaturation in computational fragment‐based inhibitor discovery. J. Comput. Chem. 33(23), 1880–1891 (2012).
- 106 . Binding hot spots and amantadine orientation in the influenza a virus M2 proton channel. Biophys. J. 97(10), 2846–2853 (2009).
- 107 . Dynamic ligand design and combinatorial optimization: designing inhibitors to endothiapepsin. Proteins 40(2), 258–289 (2000).
- 108 . An automated method for dynamic ligand design. Proteins 23(4), 472–490 (1995).
- 109 . CLIX: a search algorithm for finding novel ligands capable of binding proteins of known three‐dimensional structure. Proteins 12(1), 31–41 (1992).
- 110 . Pharmacophore‐based molecular docking to account for ligand flexibility. Proteins 51(2), 172–188 (2003).
- 111 Developing a dynamic pharmacophore model for HIV-1 integrase. J. Med. Chem. 43(11), 2100–2114 (2000).
- 112 . Comparison of shape-matching and docking as virtual screening tools. J. Med. Chem. 50(1), 74–82 (2007).
- 113 . FRED pose prediction and virtual screening accuracy. J. Chem. Inf. Model 51(3), 578–596 (2011).
- 114 CHARMM: the biomolecular simulation program. J. Comput. Chem. 30(10), 1545–1614 (2009).
- 115 FTMAP: extended protein mapping with user-selected probe molecules. Nucl. Acids Res. 40 w271–w275 (2012).
- 116 . A comprehensive analytical treatment of continuum electrostatics. J. Phys. Chem. 100(5), 1578–1599 (1996).
- 117 . Computational analysis of protein hot spots. ACS Med. Chem. Lett. 1(3), 125–129 (2010).