Toots, K. M.; Sild, S.; Leis, J.; Acree, W. E.; Maran, U. A multicomponent QSPR approach to describe and predict gas-ionic liquid distribution of organic solutes using machine learning. J. Mol. Liq. 2025, 436, 128184.

QsarDB Repository

Toots, K. M.; Sild, S.; Leis, J.; Acree, W. E.; Maran, U. A multicomponent QSPR approach to describe and predict gas-ionic liquid distribution of organic solutes using machine learning. J. Mol. Liq. 2025, 436, 128184.

QDB archive DOI: 10.15152/QDB.266   DOWNLOAD

QsarDB content

Property logK: Gas-ionic liquid partition coefficient

MLR: Multiple Linear Regression QSAR model for the gas-ionic liquid partition coefficient of organic solutes

Regression model (regression)

Open in:QDB ExplorerQDB Predictor

NameTypen

R2

σ

MLR training predictionstraining52240.7960.468
MLR holdout predictionsexternal validation13070.8010.470
RF: Random Forest Regression QSAR model for gas-ionic liquid partition coefficient of organic solutes

Random forest (regression)

Open in:QDB ExplorerQDB Predictor

NameTypen

R2

σ

RF training predictionstraining52240.9960.070
RF holdout predictionsexternal validation13070.9570.219

Citing

When using this QDB archive, please cite (see details) it together with the original article:

  • Toots, K. M.; Sild, S.; Leis, J.; Acree, W. E.; Maran, U. Data for: A multicomponent QSPR approach to describe and predict gas-ionic liquid distribution of organic solutes using machine learning. QsarDB repository, QDB.266. 2025. https://doi.org/10.15152/QDB.266

  • Toots, K. M.; Sild, S.; Leis, J.; Acree, W. E.; Maran, U. A multicomponent QSPR approach to describe and predict gas-ionic liquid distribution of organic solutes using machine learning. J. Mol. Liq. 2025, 436, 128184. https://doi.org/10.1016/j.molliq.2025.128184

Metadata

Show full item record

Title: Toots, K. M.; Sild, S.; Leis, J.; Acree, W. E.; Maran, U. A multicomponent QSPR approach to describe and predict gas-ionic liquid distribution of organic solutes using machine learning. J. Mol. Liq. 2025, 436, 128184.
Abstract:Ionic liquids are known as green solvents, which makes accurate prediction of gas–ionic liquid partition coefficients (log K) important from the perspective of various industrial applications. A gas–ionic liquid is a multicomponent system, but is usually modelled by the structural properties of one component, the solute. The integration of structural descriptors of all three components, solute, cation, and anion, into a single computational model has not been achieved. To do this, a machine learning approach was applied to a large collected dataset consisting of 6,531 experimental log K values, including data series for 170 solutes and 138 ionic liquids. The Multiple Linear Regression (MLR) and Random Forest Regression (RF) approaches were compared, both of which applied stepwise forward descriptor selection. The best MLR model achieved a cross-validated coefficient of determination (Rcv2) of 0.795 and an external validation coefficient of determination (R2) of 0.801, while the RF model demonstrated significant increase in performance with cross-validated Rcv2 of 0.965 and external validation R2 of 0.957. The descriptors included in the models showed that the description and prediction of log K is significantly improved when structural properties of all three components of the system (solute, cation, and anion) are taken into account. When comparing the linear and non-linear RF models, the presence of molecular descriptors of different components was significantly increased in the latter. The molecular descriptors in the models highlighted the roles of dispersion forces, dipolar interactions, and hydrogen bonding in solute–ionic liquid partitioning. The study provides thoroughly-analyzed predictive models for estimating gas–ionic liquid partition coefficients and provides structure-level insights into solute–ionic-liquid interactions, facilitating the rational design of ionic liquids and expanding the range of solutes for various applications.
URI:http://hdl.handle.net/10967/266
http://dx.doi.org/10.15152/QDB.266
Date:2025-03-17
Funding:The authors are grateful for support from the Ministry of Education and Research, Republic of Estonia, through the Estonian Research Council (grant number PRG1509).


Files in this item

NameDescriptionFormatSizeView
article4.ziparticle4application/zip7.624MbView/Open
Files associated with this item are distributed
under Creative Commons license.

This item appears in the following Collection(s)

Show full item record