<?xml version="1.0" encoding="UTF-8"?>
<feed xmlns="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
<title>University of Tartu, Institute of Chemistry, Molecular Technology (Estonia)</title>
<link href="http://hdl.handle.net/10967/1" rel="alternate"/>
<subtitle>MolTech, UTARTU</subtitle>
<id>http://hdl.handle.net/10967/1</id>
<updated>2026-04-16T16:04:46Z</updated>
<dc:date>2026-04-16T16:04:46Z</dc:date>
<entry>
<title>Akinola, L. K.; Uzairu, A.; Shallangwa, G. A.; Abechi, S. E. Development of binary classification models for grouping hydroxylated polychlorinated biphenyls into active and inactive thyroid hormone receptor agonists. SAR and QSAR in Environmental Research 2023, 34, 267–284.</title>
<link href="http://hdl.handle.net/10967/269" rel="alternate"/>
<author>
<name/>
</author>
<id>http://hdl.handle.net/10967/269</id>
<updated>2025-05-30T15:24:49Z</updated>
<published>2025-05-07T13:27:28Z</published>
<summary type="text">Some adverse effects of hydroxylated polychlorinated biphenyls (OH-PCBs) in humans are presumed to be initiated via thyroid hormone receptor (TR) binding. Due to the trial-and-error approach adopted for OH-PCB selection in previous studies, experiments designed to test the TR binding hypothesis mostly utilized inactive OH-PCBs, leading to considerable waste of time, effort and other material resources. In this paper, linear discriminant analysis (LDA) and binary logistic regression (LR) were used to develop classification models to group OH-PCBs into active and inactive TR agonists using radial distribution function (RDF) descriptors as predictor variables. The classifications made by both LDA and LR models on the training set compounds resulted in an accuracy of 84.3%, sensitivity of 72.2% and specificity of 90.9%. The areas under the ROC curves, constructed with the training set data, were found to be 0.872 and 0.880 for LDA and LR models, respectively. External validation of the models revealed that 76.5% of the test set compounds were correctly classified by both LDA and LR models. These findings suggest that the two models reported in this paper are good and reliable for classifying OH-PCB congeners into active and inactive TR agonists.
</summary>
<dc:date>2025-05-07T13:27:28Z</dc:date>
</entry>
<entry>
<title>Toots, K. M.; Sild, S.; Leis, J.; Acree, W. E.; Maran, U. A multicomponent QSPR approach to describe and predict gas-ionic liquid distribution of organic solutes using machine learning. J. Mol. Liq. 2025, 436, 128184.</title>
<link href="http://hdl.handle.net/10967/266" rel="alternate"/>
<author>
<name/>
</author>
<id>http://hdl.handle.net/10967/266</id>
<updated>2025-07-29T15:03:36Z</updated>
<published>2025-03-17T11:45:47Z</published>
<summary type="text">Ionic liquids are known as green solvents, which makes accurate prediction of gas–ionic liquid partition coefficients (log K) important from the perspective of various industrial applications. A gas–ionic liquid is a multicomponent system, but is usually modelled by the structural properties of one component, the solute. The integration of structural descriptors of all three components, solute, cation, and anion, into a single computational model has not been achieved. To do this, a machine learning approach was applied to a large collected dataset consisting of 6,531 experimental log K values, including data series for 170 solutes and 138 ionic liquids. The Multiple Linear Regression (MLR) and Random Forest Regression (RF) approaches were compared, both of which applied stepwise forward descriptor selection. The best MLR model achieved a cross-validated coefficient of determination (Rcv2) of 0.795 and an external validation coefficient of determination (R2) of 0.801, while the RF model demonstrated significant increase in performance with cross-validated Rcv2 of 0.965 and external validation R2 of 0.957. The descriptors included in the models showed that the description and prediction of log K is significantly improved when structural properties of all three components of the system (solute, cation, and anion) are taken into account. When comparing the  linear and non-linear RF models, the presence of molecular descriptors of different components was significantly increased in the latter. The molecular descriptors in the models highlighted the roles of dispersion forces, dipolar interactions, and hydrogen bonding in solute–ionic liquid partitioning. The study provides thoroughly-analyzed predictive models for estimating gas–ionic liquid partition coefficients and provides structure-level insights into solute–ionic-liquid interactions, facilitating the rational design of ionic liquids and expanding the range of solutes for various applications.
</summary>
<dc:date>2025-03-17T11:45:47Z</dc:date>
</entry>
<entry>
<title>Käärik, M.; Krjukova, N.; Maran, U.; Oja, M.; Piir, G.; Leis, J. Nanomaterial texture-based machine learning of ciprofloxacin adsorption on nanoporous carbon. Int. J. Mol. Sci. 2024, 25, 11696.</title>
<link href="http://hdl.handle.net/10967/265" rel="alternate"/>
<author>
<name/>
</author>
<id>http://hdl.handle.net/10967/265</id>
<updated>2025-05-30T15:24:03Z</updated>
<published>2024-10-25T07:41:49Z</published>
<summary type="text">Drug substances in water bodies and groundwater have become a significant threat to the surrounding environment. This study focuses on the ability of the nanoporous carbon materials to remove ciprofloxacin from aqueous solutions under specific experimental conditions and on the development of the mathematical model that would allow describing the molecular interactions of the adsorption process and calculating the adsorption capacity of the material. Thus, based on the adsorption measurements of the 87 carbon materials, it was found that, depending on the porosity and pore size distribution, adsorption capacity values varied between 55 and 495 mg g-1. For a more detailed analysis of the effects of different carbon textures and pores characteristics, a Quantitative nano-Structure-Property Relationship (QnSPR) was developed to describe and predict the ability of a nanoporous carbon material to remove ciprofloxacin from aqueous solutions. The adsorption capacity of potential nanoporous carbon-based adsorbents for the removal of ciprofloxacin was shown to be sufficiently accurately described by a three-parameter multi-linear QnSPR equation (R² = 0.70). This description was achieved only with parameters describing the texture of the carbon material such as specific surface area (Sdft) and pore size fractions of 1.1-1.2 nm (&#119881;&#119873;2[1.1−1.2]) and 3.3-3.4 nm (&#119881;&#119873;2[3.3−3.4]) for pores
</summary>
<dc:date>2024-10-25T07:41:49Z</dc:date>
</entry>
<entry>
<title>Kotli, M.; Piir, G.; Maran, U. Predictive Modeling of Pesticides Reproductive Toxicity in Earthworms Using Interpretable Machine-Learning Techniques on Imbalanced Data. ACS Omega 2025, 10, 4732–4744.</title>
<link href="http://hdl.handle.net/10967/263" rel="alternate"/>
<author>
<name/>
</author>
<id>http://hdl.handle.net/10967/263</id>
<updated>2025-02-20T19:27:42Z</updated>
<published>2024-10-09T12:03:43Z</published>
<summary type="text">The earthworm is a key indicator species in soil ecosystems. This makes the reproductive toxicity of chemical compounds to earthworms a desired property of determination and makes computational models necessary for descriptive and predictive purposes. Thus, the aim was to develop an advanced Quantitative Structure–Activity Relationship modeling approach for this complex property with imbalanced data. The approach integrated gradient-boosted decision trees as classifiers with a genetic algorithm for feature selection and Bayesian optimization for hyperparameter tuning. An additional goal was to analyze and interpret, using SHAP values, the structural features encoded by the molecular descriptors that contribute to pesticide toxicity and nontoxicity, the most notable of which are solvation entropy and a number of hydrolyzable bonds. The final model was constructed as a stacked ensemble of models and combined the strengths of the individual models. Evaluation of this model with an external test set of 147 compounds demonstrated a well-defined applicability domain and sufficient predictive capabilities with a Balanced Accuracy of 77%. The model representation follows FAIR principles and is available on QsarDB.org.
</summary>
<dc:date>2024-10-09T12:03:43Z</dc:date>
</entry>
</feed>
