<?xml version="1.0" encoding="UTF-8"?>
<feed xmlns="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
<title>PARC models by UT</title>
<link href="http://hdl.handle.net/10967/271" rel="alternate"/>
<subtitle>PARC</subtitle>
<id>http://hdl.handle.net/10967/271</id>
<updated>2026-04-16T17:42:27Z</updated>
<dc:date>2026-04-16T17:42:27Z</dc:date>
<entry>
<title>Kotli, M.; Piir, G.; Maran, U. Predictive Modeling of Pesticides Reproductive Toxicity in Earthworms Using Interpretable Machine-Learning Techniques on Imbalanced Data. ACS Omega 2025, 10, 4732–4744.</title>
<link href="http://hdl.handle.net/10967/263" rel="alternate"/>
<author>
<name/>
</author>
<id>http://hdl.handle.net/10967/263</id>
<updated>2025-02-20T19:27:42Z</updated>
<published>2024-10-09T12:03:43Z</published>
<summary type="text">The earthworm is a key indicator species in soil ecosystems. This makes the reproductive toxicity of chemical compounds to earthworms a desired property of determination and makes computational models necessary for descriptive and predictive purposes. Thus, the aim was to develop an advanced Quantitative Structure–Activity Relationship modeling approach for this complex property with imbalanced data. The approach integrated gradient-boosted decision trees as classifiers with a genetic algorithm for feature selection and Bayesian optimization for hyperparameter tuning. An additional goal was to analyze and interpret, using SHAP values, the structural features encoded by the molecular descriptors that contribute to pesticide toxicity and nontoxicity, the most notable of which are solvation entropy and a number of hydrolyzable bonds. The final model was constructed as a stacked ensemble of models and combined the strengths of the individual models. Evaluation of this model with an external test set of 147 compounds demonstrated a well-defined applicability domain and sufficient predictive capabilities with a Balanced Accuracy of 77%. The model representation follows FAIR principles and is available on QsarDB.org.
</summary>
<dc:date>2024-10-09T12:03:43Z</dc:date>
</entry>
<entry>
<title>Piir, G.; Sild, S.; Maran, U. Interpretable machine learning for the identification of estrogen receptor agonists, antagonists, and binders. Chemosphere 2024, 347, 140671.</title>
<link href="http://hdl.handle.net/10967/259" rel="alternate"/>
<author>
<name/>
</author>
<id>http://hdl.handle.net/10967/259</id>
<updated>2025-05-30T10:19:47Z</updated>
<published>2023-09-14T13:44:47Z</published>
<summary type="text">An abnormal hormonal activity or exposure to endocrine-disrupting chemicals (EDCs) can cause endocrine system malfunction. Among the many interactions EDCs can affect is the disruption of estrogen signalling, which can lead to adverse health effects such as cancer, osteoporosis, neurodegenerative diseases, cardiovascular disease, insulin resistance, and obesity. Knowing which chemical can act as an EDC is a significant advantage and a practical necessity. New Approach Methodologies (NAM) computational models offer a quick and cost-effective solution for preliminary hazard assessment of chemicals without animal testing. Therefore, a machine learning approach was used to investigate the relationships between estrogen receptor (ER) activity and chemical structure to identify chemicals that can interact with ER. For this purpose, the consolidated in vitro assay data from ToxCast/Tox21 projects was used for developing Random Forest classification models for ER binding, agonists, and antagonists. The overall classification prediction accuracy reaches up to 82%, depending on whether the model predicted agonists, antagonists, or compounds that bind to the active site. Given the imbalance in endocrine disruption data, the derived models are good candidates for deprioritising chemicals and reducing animal testing. The interpretation of theoretical molecular descriptors of the models was consistent with the molecular interactions known in the ligand binding pocket. The estimated class probabilities enabled the analysis of the applicability domain of the developed models and the assessment of the predictions’ reliability, followed by the guidelines for interpreting prediction results. The models are openly accessible and usable at QsarDB.org according to the FAIR (Findable, Accessible, Interoperable, Reusable) principles.
</summary>
<dc:date>2023-09-14T13:44:47Z</dc:date>
</entry>
<entry>
<title>Kotli, M.; Piir, G.; Maran, U. Pesticide effect on earthworm lethality via interpretable machine learning. J. Hazard Mater. 2023, 461, 132577</title>
<link href="http://hdl.handle.net/10967/258" rel="alternate"/>
<author>
<name/>
</author>
<id>http://hdl.handle.net/10967/258</id>
<updated>2025-01-29T15:05:10Z</updated>
<published>2023-08-18T08:26:20Z</published>
<summary type="text">Earthworms are among the most important animals (invertebrates) for soil health. Many   chemical   substances   released   into   nature   for   agricultural   development, such as pesticides, may have unwanted effects on those organisms. However, it is essential to understand the extent of the impact of chemicals on soil health first and then make the proper decisions for regulatory or commercial purposes. We   hypothesize   that   there   is   an   expressible   quantitative   structure-activity relationship (QSAR) between the structure of pesticide compounds and the acute toxicity   effect   of   earthworm   species  Eisenia   fetida. The   description   of   this relationship allows for a better assessment of the impact of chemicals on the said earthworm.   To   describe   this   relationship,   a   dataset   of   chemicals   was   collected from open-access sources to develop a mathematical model. A novel approach, combining   genetic   algorithm   and   Bayesian   optimization,   was   used   to   select structural features into the model and to optimize model parameters. The final QSAR   classification   model   was   created   with   the   Random   Forest   algorithm   and exhibited good prediction Accuracy of 0.78 on training set and 0.80 on test set. The model representation follows FAIR principles and is available on QsarDB.org.
</summary>
<dc:date>2023-08-18T08:26:20Z</dc:date>
</entry>
</feed>
