Logistic regression (classification)
Open in:QDB ExplorerQDB Predictor
Name | Type | n | Accuracy |
---|---|---|---|
Training set | training | 1685 | 0.916 |
Evaluation set | validation | 3882 | 0.884 |
When using this QDB archive, please cite (see details) it together with the original article:
Garcia-Sosa, A. T.; Maran, U. Data for: Combined Naïve Bayesian, chemical fingerprints, and molecular docking classifiers to model and predict androgen receptor binding activity data for environmentally- and health-sensitive substances. QsarDB repository, QDB.235. 2021. https://doi.org/10.15152/QDB.235
Garcia-Sosa, A. T.; Maran, U. Combined Naïve Bayesian, chemical fingerprints, and molecular docking classifiers to model and predict androgen receptor binding activity data for environmentally- and health-sensitive substances. Int. J. Mol. Sci. 2021, 22, 6695. https://doi.org/10.3390/ijms22136695
Title: | Garcia-Sosa, A. T.; Maran, U. Combined Naïve Bayesian, chemical fingerprints, and molecular docking classifiers to model and predict androgen receptor binding activity data for environmentally- and health-sensitive substances. Int. J. Mol. Sci. 2021, 22, 6695. |
Abstract: | Many chemicals that enter the environment, food chain, and the human body can disrupt androgen-dependent pathways and mimic hormones and therefore, may be responsible for multiple diseases from reproductive to tumor. Thus, modeling and predicting androgen binding activity is an important area of research. The aim of the current study was to find a method or combination of methods to predict compounds that can bind to and/or disrupt the androgen receptor, and thereby guide decision making and further analysis. A stepwise procedure proceeded from analysis of protein structures from human, chimp, and rat, followed by docking and subsequent ligand- and statistics based techniques that improved classification gradually. The best methods used multivariate logistic regression of combinations of chimpanzee protein structural docking scores, extended connectivity fingerprints, and Naive Bayesians of known binders and non-binders. Combination or consensus methods included data from a variety of procedures to improve the final model accuracy. |
URI: | http://hdl.handle.net/10967/235
http://dx.doi.org/10.15152/QDB.235 |
Date: | 2021-06-22 |
Name | Description | Format | Size | View |
---|---|---|---|---|
newQDBroundedPreds.zip | Article Garcia-Sosa and Maran | application/zip | 1.552Mb | View/ |