Random forest (classification)
Open in:QDB ExplorerQDB Predictor
Name | Type | n | Accuracy |
---|---|---|---|
toxicity_class.train | training | 524 | 0.979 |
toxicity_class.test | external validation | 131 | 0.802 |
toxicity_class.oob | internal validation | 524 | 0.781 |
When using this QDB archive, please cite (see details) it together with the original article:
Kotli, M.; Piir, G.; Maran, U. Data for: Pesticide effect on earthworm lethality via interpretable machine learning. QsarDB repository, QDB.258. 2023. https://doi.org/10.15152/QDB.258
Kotli, M.; Piir, G.; Maran, U. Pesticide effect on earthworm lethality via interpretable machine learning. J. Hazard Mater. 2023, 461, https://doi.org/10.1016/j.jhazmat.2023.132577
Title: | Kotli, M.; Piir, G.; Maran, U. Pesticide effect on earthworm lethality via interpretable machine learning. J. Hazard Mater. 2023, 461, 132577 |
Abstract: | Earthworms are among the most important animals (invertebrates) for soil health. Many chemical substances released into nature for agricultural development, such as pesticides, may have unwanted effects on those organisms. However, it is essential to understand the extent of the impact of chemicals on soil health first and then make the proper decisions for regulatory or commercial purposes. We hypothesize that there is an expressible quantitative structure-activity relationship (QSAR) between the structure of pesticide compounds and the acute toxicity effect of earthworm species Eisenia fetida. The description of this relationship allows for a better assessment of the impact of chemicals on the said earthworm. To describe this relationship, a dataset of chemicals was collected from open-access sources to develop a mathematical model. A novel approach, combining genetic algorithm and Bayesian optimization, was used to select structural features into the model and to optimize model parameters. The final QSAR classification model was created with the Random Forest algorithm and exhibited good prediction Accuracy of 0.78 on training set and 0.80 on test set. The model representation follows FAIR principles and is available on QsarDB.org. |
URI: | http://hdl.handle.net/10967/258
http://dx.doi.org/10.15152/QDB.258 |
Date: | 2023-08-18 |
Funding: | This work was supported by the Ministry of Education and Research, Republic of Estonia, through the Estonian Research Council (grant number PRG1509), Ministry of Environment, Republic of Estonia (grant 4-4/22/19), Ministry of Social Affairs, Republic of Estonia (grant 3-4/1593-1) and European Union through Horizon Europe Health Framework Programme project "Partnership for the Assessment of Risks from Chemicals (Grant ID 101057014)". |
Name | Description | Format | Size | View |
---|---|---|---|---|
final_archive.zip | Main article data and model | application/zip | 903.8Kb | View/ |