Regression model (regression)
Open in:QDB ExplorerQDB Predictor
Name | Type | n |
R2 |
σ |
---|---|---|---|---|
Training set i | training | 35 | 0.824 | 0.421 |
Testing set (inside AD) i | testing | 344 | N/A | N/A |
Testing set (outside AD) | testing | 25 | N/A | N/A |
Regression model (regression)
Open in:QDB ExplorerQDB Predictor
Name | Type | n |
R2 |
σ |
---|---|---|---|---|
Training set i | training | 35 | 0.820 | 0.425 |
Testing set (inside AD) | testing | 344 | N/A | N/A |
Testing set (outside AD) | testing | 25 | N/A | N/A |
Regression model (regression)
Open in:QDB ExplorerQDB Predictor
Name | Type | n |
R2 |
σ |
---|---|---|---|---|
Training set i | training | 35 | 0.804 | 0.445 |
Testing set (inside AD) | testing | 328 | N/A | N/A |
Testing set (outside AD) | testing | 41 | N/A | N/A |
Regression model ensemble (regression)
Open in:QDB ExplorerQDB Predictor
Name | Type | n |
R2 |
σ |
---|---|---|---|---|
Training set i | training | 35 | 0.865 | 0.373 |
Testing set (inside AD) | testing | 337 | N/A | N/A |
Testing set (outside AD) | testing | 32 | N/A | N/A |
When using this QDB archive, please cite (see details) it together with the original article:
Kahn, I. Data for: QSAR Modeling is not “Push a Button and Find a Correlation”: A Case Study of Toxicity of (Benzo-)triazoles on Algae. QsarDB repository, QDB.182. 2016. https://doi.org/10.15152/QDB.182
Gramatica, P.; Cassani, S.; Roy, P. P.; Kovarich, S.; Yap, C. W.; Papa, E. QSAR Modeling is not “Push a Button and Find a Correlation”: A Case Study of Toxicity of (Benzo-)triazoles on Algae. Mol. Inform. 2012, 31, 817–835. https://doi.org/10.1002/minf.201200075
Title: | Gramatica, P.; Cassani, S.; Roy, P. P.; Kovarich, S.; Yap, C. W.; Papa, E. QSAR Modeling is not “Push a Button and Find a Correlation”: A Case Study of Toxicity of (Benzo-)triazoles on Algae. Molecular Informatics 2012, 31, 817–835. |
Abstract: | A case study of toxicity of (benzo)triazoles ((B)TAZs) to the algae Pseudokirchneriella subcapitata is used to discuss some problems and solutions in QSAR modeling, particularly in the environmental context. The relevance of data curation (not only of experimental data, but also of chemical structures and input formats for the calculation of molecular descriptors), the crucial points of QSAR model validation and the potential application for new chemicals (internal robustness, exclusion of chance correlation, external predictivity, applicability domain) are described, while developing MLR-OLS models based on molecular descriptors, calculated by various QSAR software tools (commercial DRAGON, free PaDEL-Descriptor and QSPR-THESAURUS). Additionally, the utility of consensus models is highlighted. This work summarizes a methodology for a rigorous statistical approach to obtain reliable QSAR predictions, also for a large number of (B)TAZs in the ECHA preregistration list of REACH (even if starting from limited experimental data availability), and has evidenced some ambiguities and discrepancies related to SMILES notations from different databases; furthermore it highlighted some general problems related to QSAR model generation and was useful in the implementation of the PaDEL-Descriptor software. |
URI: | http://hdl.handle.net/10967/182
http://dx.doi.org/10.15152/QDB.182 |
Date: | 2016-08-29 |
Name | Description | Format | Size | View |
---|---|---|---|---|
2012MI817.qdb.zip | QSARs for algal toxicity of (B)TAZ | application/zip | 388.6Kb | View/ |