Summary
Substructural fragments are proposed as a simple and safe way to encode molecular structures in a matrix containing the occurrence of fragments of a given type. The knowledge retrieved from QSPR modelling can also be stored in that matrix in addition to the information about fragments. Complex supramolecular systems (using special bond types) and chemical reactions (represented as Condensed Graphs of Reactions, CGR) can be treated similarly. The efficiency of fragments as descriptors has been demonstrated in QSPR studies of aqueous solubility for a diverse set of organic compounds as well as in the analysis of thermodynamic parameters for hydrogen-bonding in some supramolecular complexes. It has also been shown that CGR may be an interesting opportunity to perform similarity searches for chemical reactions. The relationship between the density of information in descriptors/knowledge matrices and the robustness of QSPR models is discussed.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
Faulon J.-L., Churchwell C.J., Visco D.P. Jr., (2003) J. Chem. Inf. Comput. Sci. 43: 721
Rucker G., Rucker C., (2001) J. Chem. Inf. Comput. Sci. 41: 1457
Churchwell C.J., Rinoul M.D., Martin S., Visco D.P. Jr., Kotu A., Larson R.S., Sillerud L.O., Brown D.C., Faulon J.-L., (2004) J. Mol. Graph. Mod. 22: 263
Klopman G., Tu M., (1999) J. Med. Chem. 42: 992
Solov’ev V.P., Varnek A., Wipff G., (2000) J. Chem. Inf. Comput. Sci. 40: 847
Klopman G., Zhu H., (2001) J. Chem. Inf. Comput. Sci. 41: 439
Varnek A., Wipff G., Solov’ev V.P., (2001) Solvent Extr. Ion Exch. 19: 791
Artemenko N.V., Baskin I.I., Palyulin V.A., Zefirov N.S., (2001) Doklady Chem. 381: 317
Varnek A., Wipff G., Solov’ev V.P., Solotnov A.F., (2002) J. Chem. Inf. Comput. Sci. 42: 812
Zefirov N.S., Palyulin V.A., (2002) J. Chem. Inf. Comput. Sci. 42: 1112
Solov’ev V.P., Varnek A., (2003) J. Chem. Inf. Comput. Sci. 43: 1703
Varnek, A., Fourches, D., Solov’ev, V.P., Baulin, V.E., Turanov, A. and Katritzky, A.R., J. Chem. Inf. Comput. Sci., 44 (2004) 1365
Katritzky A.R., Fara D.C., Yang H., Karelson M., Suzuki T., Solov’ev V.P., Varnek A., (2004) J. Chem. Inf. Comput. Sci. 44: 529
Clark M., (2005) J. Chem. Inf. Comput. Sci. 42: 30
Faulon J.-L., Collins M.J., Carr R.D., (2004) J. Chem. Inf. Comput. Sci. 44: 427
Faulon J.-L., Visco D.P., Jr. Pophale R.S., (2003) J. Chem. Inf. Comput. Sci. 43: 707
Baskin I., Skvortsova M., Stankevich I., Zefirov N., (1995) J. Chem. Inf. Comput. Sci. 35: 527
Skvortsova M., Baskin I., Skvortsova L., Palyulin V., Stankevich I., Zefirov N., (1999) Theochem: J. Mol. Struct. 466: 211
Trepalin S.V., Gerasimenko V.A., Kozyukov A.V., Savchuk N. and Ph. Ivaschenko A.A., J. Chem. Inf. Comput. Sci., 42 (2002) 249
Mavrovouniotis M.L., (1990) Biotech. Bioeng. 36: 1070
Mavrovouniotis M.L., (1991) J. Biol. Chem. 266: 14440
Meylan W.M., Howard P.H., (1995) J. Pharm. Sci. 84: 83
Hansch C. and Leo A., Exploring QSAR Fundamentals and Applications in Chemistry and Biology. ACS Prof. Ref. Book, Washington, 1995, 557 pp
Klopman G., Ding C., Macina O.T., (1997) J. Chem. Inf. Comput. Sci. 37: 569
Wang R., Fu Y., Lai L., (1997) J. Chem. Inf. Comput. Sci. 37: 615
Golbraikh A., Tropsha A., (2003) J. Chem. Inf. Comput. Sci. 43: 144
Zheng W., Tropsha A., (2000) J. Chem. Inf. Comput. Sci. 40: 185
Dalby A., Nourse J.G., Hounshell W.D., Gushurst A.K.I., Grier D.L., Leland B.A., Laufer J., (1992) J. Chem. Inf. Comput. Sci. 32: 244
Hanser T., Jauffret P., Marchaland J.F., Ellermann L., Gruber E., Kaufmann G., Am. Inst. Phys., Conference Proceeding no. 330, 1995, 575
Jauffret P., Vogel H., Schildknecht S. and Kaufmann G., Learning synthetic knowledge from reaction databases: dealing with experimental conditions. Pub Informatics Ltd. Tetbury (Eng.), 2000
Fujita S., (1986) J. Chem. Inf. Comput. Sci. 26: 205
Fujita S., (1987) J. Chem. Inf. Comput. Sci. 27: 99
http://infochimie.u-strasbg.fr/recherche/isida/index.php
Korn G.A. and Korn T.M., Mathematical Handbook for Scientists and Engineers, 2nd Edition. McGraw-Hill Book Co, New York, 1968
Golub G.H., Reinsch C., (1970) Numer. Math., 14: 403
Muller P.H., Neumann P. and Storm R., Tafeln der mathematischen Statistik VEB Fachbuchverlag: Leipzig, 1979, 280 pp
Hou T., Xia K., Zhang W., Xu X., (2004) J. Chem. Inf. Comput. Sci. 44: 266
Bergström C., Wassvik C., Norinder U., Luthman K., Artursson P., (2004) J. Chem. Inf. Comp. Sci. 44: 1477
Yan A., Gasteiger J., (2003) J. Chem. Inf. Comput. Sci. 43: 429
Delaney J., (2004) J. Chem. Inf. Comput. Sci. 44: 1000
Butina D., Gola J., (2003) Chem. Inf. Comput. Sci. 43: 837
Cheng A., Merz K., (2003) J. Med. Chem. 46: 3572
Engkvist O., Wrede P., (2002) J. Chem. Inf. Comput. Sci. 42: 1247
Klopman G., Zhu H., (2001) J. Chem. Inf. Comput. Sci. 41: 439
Lipinski C., Lombardo F., Dominy B., Feeney P., (2001) Adv. Drug Delivery Rev. 46: 3
Catana C., Gao H., Orrenius C., Stouten P., (2005) J. Chem. Inf. Comp. Sci. 45: 170
Delaney J.S., (2005) Drug Discovery Today 10: 289
Yaffe D., Cohen Y., Espinosa G., Arenas A., Giralt F., (2001) J. Chem. Inf. Comput. Sci. 41: 1177
Ran Y., Jain N., Yalkowsky S., (2001) J. Chem. Inf. Comput. Sci. 41: 1208
McElroy N., Jurs P., (2001) J. Chem. Inf. Comput. Sci. 41: 1237
Tetko I., Tanchuk V., Kasheva T., Villa A., (2001) J. Chem. Inf. Comput. Sci. 41: 1488
Raevsky O.A., Solov’ev V.P. and Grigor’ev V.Y., VINITI Deposited Doc. No. 1001–V88 (1988) 83 pp
Drago R.S., Ferris D.C., Wong N., (1990) J. Am. Chem. Soc. 112: 8953
Drago R.S., Dadmun A.P., Vogel G.C., (1993) Inorg. Chem. 32: 2473
Abraham M.H., (1993) Chem. Soc. Rev. 22: 73
Abraham M.H., Grellier P.L., Prior D.V., Taft R.W., Morris J.J., Taylor P.J., Laurence C., Berthelot M., Doherty R.M., et al., (1988) J. Am. Chem. Soc. 110: 8534
Abraham M.H., Platts J.A., (2001) J. Org. Chem. 66: 3484
Raevskii O.A., Grigor’ev V.Y., Solov’ev V.P., Martynov I.V., (1988) Doklady Akademii Nauk SSSR, 299: 1433 Phys. Chem
Raevskii O.A., Grigor’ev V.Y., Solov’ev V.P., Martynov I.V., (1988) Doklady Akademii Nauk SSSR, 298: 1166 Phys. Chem
Raevskii O.A., Grigor’ev V.Y., Solov’ev V.P., (1989) Khimiko-Farmatsevticheskii Zhurnal, 23: 1294
Raevsky O.A., (1997) J. Phys. Org. Chem., 10: 405
http://www.novalyst.com/
Acknowledgement
GDR PARIS and GDRE SupraChem are acknowledged for support. FH thanks Novalyst Discovery for a PhD fellowship.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Varnek, A., Fourches, D., Hoonakker, F. et al. Substructural fragments: an universal language to encode reactions, molecular and supramolecular structures. J Comput Aided Mol Des 19, 693–703 (2005). https://doi.org/10.1007/s10822-005-9008-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10822-005-9008-0