Abstract
Optimal descriptors based on the simplified molecular input line entry system (SMILES) have been utilized in modeling of carcinogenicity. Carcinogenicity of 401 compounds has been modeled by means of balance of correlations for the training (n = 170) and calibration (n = 170) sets. The obtained models were evaluated with an external test set (n = 61). Comparison of models based on the balance of correlations and models which were obtained on the basis of the total training set (i.e., both training and calibration sets as the united training set) has shown that the balance of correlations improves the statistical quality for the external test set.
Article PDF
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
References
Richardson SD, Plewa MJ, Wagner ED et al (2007) Occurrence, genotoxicity, and carcinogenicity of regulated and emerging disinfection by-products in drinking water: a review and roadmap for research. Mutat Res-Rew Mutat 636: 178–242
Helguera AM, González MP, Cordeiro MNDS et al (2007) Quantitative structure carcinogenicity relationship for detecting structural alerts in nitroso-compounds. Toxicol Appl Pharm 221: 189–202
Gini G, Lorenzini M, Benfenati E et al (1999) Predictive carcinogenicity: a model for aromatic compounds, with nitrogen-containing substituents, based on molecular descriptors using an artificial neural network. J Chem Inf Comput Sci 39: 1076–1080
Vracko M (1997) A study of structure-carcinogenic potency relationship with artificial neural networks. The using of descriptors related to geometrical and electronic structures. J Chem Inf Comput Sci 37: 1037–1043
Contrera JF, Kruhlak NL, Matthews EJ et al (2007) Comparison of MC4PC and MDL-QSAR rodent carcinogenicity predictions and the enhancement of predictive performance by combining QSAR models. Regul Toxicol Pharm 49: 172–182
Richard AM (1994) Application of SAR methods to non- congeneric data bases associated with carcinogenicity and mutagenicity: issues and approaches. Mutat Res 305: 73–97
Ashby J (1994) Two million rodent carcinogens? The role of SAR and QSAR in their detection. Mutat Fund-Mol M 305: 3–12
Kazius J, McGuire R, Bursi RJ (2005) Derivation and validation of toxicophores for mutagenicity prediction. J Med Chem 48: 312–320
Benigni R, Netzeva T, Benfenati E et al (2007) The expanding role of predictive toxicology: an update on the (Q)SAR models for mutagens and carcinogens. J Environ Sci Heal Part C 25: 53–97
Weininger D (1988) SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules. J Chem Inf Comput Sci 28: 31–36
Weininger D, Weininger A, Weininger JL (1989) SMILES. 2. Algorithm for generation of unique SMILES notation. J Chem Inf Comput Sci 29: 97–101
Weininger D (1990) SMILES. 3. DEPICT. Graphical depiction of chemical structures. J Chem Inf Comput Sci 30: 237–243
ACD/ChemSketch Freeware (2007) version 11.00, Advanced Chemistry Development, Inc., Toronto, ON, Canada, http://www.acdlabs.com
Randic M, Dobrowolski JC (1998) Optimal molecular connectivity descriptors for nitrogen-containing molecules. Int J Quant Chem 70: 1209–1215
Randic M, Basak SC (2000) Construction of high-quality structure–property–activity regressions: the boiling points of sulfides. J Chem Inf Comput Sci 40: 899–905
Toropov AA, Toropova AP (1998) Optimization of correlation weights of the local graph invariants: use of the enthalpies of formation of complex compounds for the QSPR modeling. Russ J Coord Chem 24: 81–85
Vidal D, Thormann M, Pons M. (2005) LINGO, an efficient holographic text based method to calculate biophysical properties and intermolecular similarities. J Chem Inf Model 45: 386–393
Toropov AA, Benfenati E. (2007) Optimisation of correlation weights of SMILES invariants for modelling oral quail toxicity. Eur J Med Chem 42: 606–613
Toropov AA, Toropova AP, Mukhamedzhanova D et al (2005) Simplified molecular input line entry system (SMILES) as an alternative for constructing quantitative structure–property relationships (QSPR). Indian J Chem Sec A 44: 1545–1552
Toropov AA, Toropova AP (2003) QSPR modeling of alkanes properties based on graph of atomic orbitals. J Mol Struct (Theochem) 637: 1–10
Toropov AA, Benfenati E (2006) Correlation weighting of valence shells in QSAR analysis of toxicity. Bioorg Med Chem 14: 3923–3928
Raska I Jr, Toropov A (2006) Comparison of QSPR models of octanol/water partition coefficient for vitamins and non vitamins. Eur J Med Chem 41: 1271–1278
Helguera AM, Cordeiro MNDS, Perez MAC et al (2008) QSAR modeling of the rodent carcinogenicity of nitrocompounds. Bioorg Med Chem 16: 3395–3407
Toropov AA, Rasulev BF, Leszczynski J (2007) QSAR modeling of acute toxicity for nitrobenzene derivatives towards rats: comparative analysis by MLRA and optimal descriptors . QSAR Comb Sci 26: 686–693
Golbraikh A, Tropsha A (2002) Beware of q2!. J Mol Graph Model 20: 269–276
Johnson SR (2008) The trouble with QSAR (or how i learned to stop worrying and embrace fallacy). J Chem Inf Model 48: 25–26
Mayer J, Cheeseman MA, Twaroski ML (2008) Structure–activity relationship analysis tools: validation and applicability in predicting carcinogens. Regul Toxicol Pharm 50: 50–58
Morales AH, Duchowicz PR, Pérez MÁC et al (2006) Application of the replacement method as a novel variable selection strategy in QSAR. 1. Carcinogenic potential. Chemom Intell Lab 81: 180–187
Andrzejewski P, Kasprzyk-Hordern B, Nawrocki J (2005) The hazard of N-nitrosodimethylamine (NDMA) formation during water disinfection with strong oxidants. Desalination 176: 37–45
Duchowicz PR, Mercader AG, Fernández FM et al (2008) . Chemom Intell Lab 90: 97–107
Casanola-Martın GM, Marrero-Ponce Y, Hassan Khan MT et al (2007) TOMOCOMD-CARDD descriptors-based virtual screening of tyrosinase inhibitors: evaluation of different classification model combinations using bond-based linear indices. Bioorg Med Chem 15: 1483–1503
Toropov AA, Rasulev BF, Leszczynski J (2008) QSAR modeling of acute toxicity by balance of correlations. Bioorg Med Chem 16: 5999–6008
Roy PP, Leonard JT, Roy K (2008) Exploring the impact of size of training sets for the development of predictive QSAR models. Chemom Intell Lab 90: 31–42
Author information
Authors and Affiliations
Corresponding author
Electronic Supplementary Material
The Below is the Electronic Supplementary Material.
Rights and permissions
About this article
Cite this article
Toropov, A.A., Toropova, A.P., Benfenati, E. et al. QSAR modelling of carcinogenicity by balance of correlations. Mol Divers 13, 367–373 (2009). https://doi.org/10.1007/s11030-009-9113-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11030-009-9113-4