Random forest (classification)
Name | Type | n | Accuracy |
---|---|---|---|
Training set | training | 350 | 1.000 |
Out of bag | internal validation | 350 | 0.760 |
Test set | external validation | 1327 | 0.789 |
Evaluation set | external validation | 5033 | 0.818 |
Random forest (classification)
Name | Type | n | Accuracy |
---|---|---|---|
Training set | training | 64 | 1.000 |
Out of bag | internal validation | 64 | 0.734 |
Test set | external validation | 1613 | 0.653 |
Evaluation set | external validation | 5074 | 0.683 |
Random forest (classification)
Name | Type | n | Accuracy |
---|---|---|---|
Training set | training | 378 | 1.000 |
Out of bag | internal validation | 378 | 0.706 |
Test set | external validation | 1299 | 0.766 |
Evaluation set | external validation | 4740 | 0.767 |
When using this QDB archive, please cite it together with the original article:
Piir, G.; Sild, S.; Maran, U. Data for: Interpretable machine learning for the identification of estrogen receptor agonists, antagonists, and binders. QsarDB repository, QDB.259. 2023. https://doi.org/10.15152/QDB.259
Piir, G.; Sild, S.; Maran, U. Interpretable machine learning for the identification of estrogen receptor agonists, antagonists, and binders. Chemosphere 2024, 347, 140671. https://doi.org/10.1016/j.chemosphere.2023.140671
Title: | Piir, G.; Sild, S.; Maran, U. Interpretable machine learning for the identification of estrogen receptor agonists, antagonists, and binders. Chemosphere 2024, 347, 140671. |
Abstract: | An abnormal hormonal activity or exposure to endocrine-disrupting chemicals (EDCs) can cause endocrine system malfunction. Among the many interactions EDCs can affect is the disruption of estrogen signalling, which can lead to adverse health effects such as cancer, osteoporosis, neurodegenerative diseases, cardiovascular disease, insulin resistance, and obesity. Knowing which chemical can act as an EDC is a significant advantage and a practical necessity. New Approach Methodologies (NAM) computational models offer a quick and cost-effective solution for preliminary hazard assessment of chemicals without animal testing. Therefore, a machine learning approach was used to investigate the relationships between estrogen receptor (ER) activity and chemical structure to identify chemicals that can interact with ER. For this purpose, the consolidated in vitro assay data from ToxCast/Tox21 projects was used for developing Random Forest classification models for ER binding, agonists, and antagonists. The overall classification prediction accuracy reaches up to 82%, depending on whether the model predicted agonists, antagonists, or compounds that bind to the active site. Given the imbalance in endocrine disruption data, the derived models are good candidates for deprioritising chemicals and reducing animal testing. The interpretation of theoretical molecular descriptors of the models was consistent with the molecular interactions known in the ligand binding pocket. The estimated class probabilities enabled the analysis of the applicability domain of the developed models and the assessment of the predictions’ reliability, followed by the guidelines for interpreting prediction results. The models are openly accessible and usable at QsarDB.org according to the FAIR (Findable, Accessible, Interoperable, Reusable) principles. |
URI: | http://hdl.handle.net/10967/259; http://dx.doi.org/10.15152/QDB.259 |
Date: | 2023-09-14 |
Funding: | This work was supported by the Ministry of Education and Research, Republic of Estonia, through Estonian Research Council (grant number PRG1509), Ministry of Climate, Republic of Estonia (grant 4-4/22/19), Ministry of Social Affairs, Republic of Estonia (grant 3-4/1593-1) and European Union through Horizon Europe Framework Programme project Partnership for the Assessment of Risks from Chemicals (PARC, grant 101057014). |
Name | Description | Format | Size |
---|---|---|---|
Estrogen.RF.qdb.zip | Models for estrogen activity | application/zip | 5.822Mb |
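
The abstract notes that estimated class probabilities were used to analyse the applicability domain and to assess the reliability of predictions. The following is a minimal sketch of that general idea, not the authors' procedure: it assumes the class probability is taken as the fraction of trees voting for a class, and the names `class_probabilities`, `predict_with_reliability`, the labels `"active"`/`"inactive"`, and the `cutoff` value of 0.7 are all hypothetical choices made for illustration. The article itself should be consulted for the actual interpretation guidelines.

```python
from collections import Counter


def class_probabilities(tree_votes):
    """Return {label: fraction of trees voting for that label}.

    Assumption: probability is estimated as a simple vote fraction.
    """
    if not tree_votes:
        raise ValueError("tree_votes must contain at least one vote")
    counts = Counter(tree_votes)
    total = len(tree_votes)
    return {label: n / total for label, n in counts.items()}


def predict_with_reliability(tree_votes, cutoff=0.7):
    """Return (predicted label, its probability, reliable flag).

    The cutoff is a hypothetical threshold, not a value from the article.
    """
    probs = class_probabilities(tree_votes)
    label, prob = max(probs.items(), key=lambda kv: kv[1])
    return label, prob, prob >= cutoff


# Hypothetical votes from a 10-tree forest for one compound.
votes = ["active"] * 9 + ["inactive"] * 1
print(predict_with_reliability(votes))
```

A prediction whose top class probability falls below the chosen cutoff would be treated as less reliable under this sketch, for example a compound on which the trees split 6 to 4.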