Schultz, T.W.; Netzeva, T.I.; Cronin, M.T.D. Selection of data sets for qsars: analyses of tetrahymena toxicity from aromatic compounds. SAR QSAR Environ. Res. 2003, 14, 1, 59–81.

QsarDB Repository

QDB archive DOI: 10.15152/QDB.66

QsarDB content

Property pIGC50: 40-h Tetrahymena toxicity as log(1/IGC50) [log(L/mmol)]

2: All compounds

Regression model (regression)

Name      Type      n    R2     σ
Training  training  385  0.860  0.273

3: Training compounds (n = 10)

Regression model (regression)

Name      Type      n   R2     σ
Training  training  10  0.977  0.215

4: Training compounds (n = 15)

Regression model (regression)

Name      Type      n   R2     σ
Training  training  15  0.960  0.240

5: Training compounds (n = 20)

Regression model (regression)

Name      Type      n   R2     σ
Training  training  20  0.957  0.235

6: Training compounds (n = 25)

Regression model (regression)

Name      Type      n   R2     σ
Training  training  25  0.945  0.248

7: Training compounds (n = 30)

Regression model (regression)

Name      Type      n   R2     σ
Training  training  30  0.927  0.302

8: Training compounds (n = 35)

Regression model (regression)

Name      Type      n   R2     σ
Training  training  35  0.920  0.293

9: Training compounds (n = 40)

Regression model (regression)

Name      Type      n   R2     σ
Training  training  40  0.913  0.297

10: Training compounds (n = 45)

Regression model (regression)

Name      Type      n   R2     σ
Training  training  45  0.914  0.287

11: Training compounds (n = 50)

Regression model (regression)

Name      Type      n   R2     σ
Training  training  50  0.911  0.292

Citing

When using this QDB archive, please cite it together with the original article:

  • Ruusmann, V. Data for: Selection of data sets for qsars: analyses of tetrahymena toxicity from aromatic compounds. QsarDB repository, QDB.66. 2012. http://dx.doi.org/10.15152/QDB.66

  • Schultz, T. W.; Netzeva, T. I.; Cronin, M. T. D. Selection of data sets for qsars: analyses of tetrahymena toxicity from aromatic compounds. SAR QSAR Environ. Res. 2003, 14, 59–81. http://dx.doi.org/10.1080/1062936021000058782

Metadata

dc.date.accessioned: 2012-05-23T16:03:03Z
dc.date.available: 2012-05-23T16:03:03Z
dc.date.issued: 2012-05-23
dc.identifier.uri: http://hdl.handle.net/10967/66
dc.identifier.uri: http://dx.doi.org/10.15152/QDB.66
dc.description.abstract: The aim of this investigation was to develop a strategy for the formulation of a valid ecotoxicological-based QSAR while, at the same time, minimizing the required number of toxicological data points. Two chemical selection approaches, distance-based optimality and K Nearest Neighbor (KNN), were used to examine the impact of the number of compounds used in the training and testing phases of QSAR development (i.e. diversity and representivity, respectively) on the predictivity (i.e. external validation) of the QSAR. Regression-based QSARs for the ecotoxic potency for population growth impairment of aromatic compounds (benzenes) to the aquatic ciliate Tetrahymena pyriformis were developed based on descriptors for chemical hydrophobicity and electrophilicity. A ratio of one compound in the training set to three in the test set was applied. The results indicate that from a known chemical universe, in this case 385 derivatives, robust QSARs of equal quality may be developed from a small number of diverse compounds, validated by a representative test set. As a conservative recommendation, it is suggested that there should be a minimum of 10 observations for each variable in a QSAR.
dc.publisher: Villu Ruusmann
dc.rights: Attribution 4.0 International
dc.rights.uri: http://creativecommons.org/licenses/by/4.0/
dc.title: Schultz, T.W.; Netzeva, T.I.; Cronin, M.T.D. Selection of data sets for qsars: analyses of tetrahymena toxicity from aromatic compounds. SAR QSAR Environ. Res. 2003, 14, 1, 59–81.
qdb.property.endpoint: 6. Other (Acute toxicity to ciliate protozoa)
qdb.property.species: Tetrahymena pyriformis
qdb.descriptor.application: ClogP 1.0.0
qdb.descriptor.application: MOPAC 93
bibtex.entry: article
bibtex.entry.author: Schultz, T. W.
bibtex.entry.author: Netzeva, T. I.
bibtex.entry.author: Cronin, M. T. D.
bibtex.entry.doi: 10.1080/1062936021000058782
bibtex.entry.journal: SAR QSAR Environ. Res.
bibtex.entry.number: 1
bibtex.entry.pages: 59–81
bibtex.entry.title: Selection of data sets for qsars: analyses of tetrahymena toxicity from aromatic compounds
bibtex.entry.volume: 14
bibtex.entry.year: 2003
qdb.model.type: Regression model (regression)
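
The abstract's conservative recommendation of at least 10 observations per variable can be checked against the training-set sizes listed under "QsarDB content". The sketch below is illustrative only: the statistics are copied from the model listings on this page, but the descriptor count of two (one for hydrophobicity, one for electrophilicity) is an assumption inferred from the abstract, and the function names are hypothetical.

```python
# (n, R2, sigma) for each model ID, copied from the training rows on this page.
MODELS = {
    2: (385, 0.860, 0.273),
    3: (10, 0.977, 0.215),
    4: (15, 0.960, 0.240),
    5: (20, 0.957, 0.235),
    6: (25, 0.945, 0.248),
    7: (30, 0.927, 0.302),
    8: (35, 0.920, 0.293),
    9: (40, 0.913, 0.297),
    10: (45, 0.914, 0.287),
    11: (50, 0.911, 0.292),
}

# Assumption: each regression uses two descriptors (hydrophobicity and
# electrophilicity); the page itself does not state the descriptor count.
N_DESCRIPTORS = 2

# From the abstract: a minimum of 10 observations for each variable.
MIN_OBS_PER_VARIABLE = 10


def meets_recommendation(n, n_descriptors=N_DESCRIPTORS,
                         minimum=MIN_OBS_PER_VARIABLE):
    """Return True if n training compounds give at least `minimum` per descriptor."""
    return n >= minimum * n_descriptors


def models_meeting_recommendation(models=MODELS):
    """List the model IDs whose training-set size satisfies the recommendation."""
    return [model_id
            for model_id, (n, _r2, _sigma) in sorted(models.items())
            if meets_recommendation(n)]


if __name__ == "__main__":
    for model_id, (n, r2, sigma) in sorted(MODELS.items()):
        status = "ok" if meets_recommendation(n) else "below recommendation"
        print(f"model {model_id}: n={n}, R2={r2:.3f}, sigma={sigma:.3f} ({status})")
```

Under the two-descriptor assumption, only the n = 10 and n = 15 training sets fall below the recommended minimum; a different descriptor count would shift that threshold accordingly.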


Files in this item

Name               Description  Format           Size
713745890.qdb.zip  n/a          application/zip  22.97Kb

Files associated with this item are distributed under Creative Commons license.
