Regression model (regression)
Open in:QDB ExplorerQDB Predictor
Name | Type | n |
R2 |
σ |
---|---|---|---|---|
Training set | training | 81 | 0.670 | 0.823 |
Validation set | external validation | 42 | 0.785 | 0.842 |
Tight test set | external validation | 100 | 0.508 | 0.936 |
Loose test set | external validation | 32 | 0.749 | 1.115 |
Test sets together | external validation | 132 | 0.659 | 0.980 |
Regression model (regression)
Open in:QDB ExplorerQDB Predictor
Name | Type | n |
R2 |
σ |
---|---|---|---|---|
Training set | training | 346 | 0.624 | 1.004 |
Validation set | external validation | 90 | 0.697 | 0.964 |
Tight test set | external validation | 100 | 0.521 | 0.933 |
Loose test set | external validation | 32 | 0.648 | 1.300 |
Test sets together | external validation | 132 | 0.602 | 1.031 |
Regression model (regression)
Open in:QDB ExplorerQDB Predictor
Name | Type | n |
R2 |
σ |
---|---|---|---|---|
Training set | training | 346 | 0.671 | 0.939 |
Validation set | external validation | 90 | 0.785 | 0.813 |
Tight test set | external validation | 100 | 0.520 | 0.962 |
Loose test set | external validation | 32 | 0.794 | 1.010 |
Test sets together | external validation | 132 | 0.652 | 0.971 |
Regression model ensemble (regression)
Open in:QDB ExplorerQDB Predictor
Name | Type | n |
R2 |
σ |
---|---|---|---|---|
Training set i | training | 345 | 0.693 | 0.914 |
Tight test set | external validation | 100 | 0.571 | 0.861 |
Loose test set | external validation | 32 | 0.787 | 1.021 |
Test sets together | external validation | 132 | 0.694 | 0.898 |
When using this QDB archive, please cite (see details) it together with the original article:
Oja, M.; Sild, S.; Piir, G.; Maran, U. Data for: Intrinsic aqueous solubility: mechanistically transparent data-driven modeling of drug substances. QsarDB repository, QDB.257. 2022. https://doi.org/10.15152/QDB.257
Oja, M.; Sild, S.; Piir, G.; Maran, U. Intrinsic aqueous solubility: mechanistically transparent data-driven modeling of drug substances. Pharmaceutics 2022, 14, 2248. https://doi.org/10.3390/pharmaceutics14102248
dc.contributor.other | This work was funded by the Ministry of Education and Research, Republic of Estonia through Estonian Research Council [grant number PRG1509] and the European Union European Regional Development Fund through Foundation Archimedes [grant number TK143, Centre of Excellence in Molecular Cell Engineering]. | |
dc.date.accessioned | 2022-10-12T13:54:19Z | |
dc.date.available | 2022-10-12T13:54:19Z | |
dc.date.issued | 2022-10-12 | |
dc.identifier.uri | http://hdl.handle.net/10967/257 | |
dc.identifier.uri | http://dx.doi.org/10.15152/QDB.257 | |
dc.description.abstract | Intrinsic aqueous solubility is a foundation property for understanding chemical, technological, pharmaceutical, and environmental behavior of drug substances. Despite years of solubility research, molecular structure-based prediction of the intrinsic aqueous solubility of drug substances is still under active investigation. This paper describes the authors’ systematic data-driven modelling in which two fit-for-purpose training data sets for intrinsic aqueous solubility were collected and curated, and three quantitative structure-property relationships were derived to make predictions for the most recent solubility challenge. All three models are performing well individually, while being mechanistically transparent and easy to understand. Molecular descriptors involved in the models are related to the following key steps in the solubility process: dissociation of the molecule from the crystal, formation of a cavity in the solvent, and insertion of the molecule into the solvent. A consensus modeling approach with these models remarkably improved prediction capability and reduced the number of strong outliers by more than two times. The performance and outliers of the second solubility challenge predictions were analyzed retrospectively. All developed models have been published in the QsarDB repository according to FAIR principles and can be used without restrictions for exploring, downloading, and predictions. | en_US |
dc.publisher | Mare Oja | |
dc.publisher | Sulev Sild | |
dc.publisher | Geven Piir | |
dc.publisher | Uko Maran | |
dc.rights | Attribution 4.0 International | * |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | * |
dc.title | Oja, M.; Sild, S.; Piir, G.; Maran, U. Intrinsic aqueous solubility: mechanistically transparent data-driven modeling of drug substances. Pharmaceutics 2022, 14, 2248. | |
qdb.property.endpoint | 1. Physical Chemical Properties 1.3. Water solubility | en_US |
qdb.descriptor.application | DRAGON 6.0.40 | en_US |
qdb.descriptor.application | RDKit 2016.03.05 | en_US |
qdb.descriptor.application | XLOGS 1.0 | en_US |
qdb.descriptor.application | PaDEL-Descriptor 2.21 | en_US |
qdb.prediction.application | CODESSA Pro 1.0 | en_US |
qdb.prediction.application | scikit-learn 0.18 | en_US |
qdb.prediction.application | R 3.5.3 | en_US |
bibtex.entry | article | en_US |
bibtex.entry.author | Oja, Mare | |
bibtex.entry.author | Sild, Sulev | |
bibtex.entry.author | Piir, Geven | |
bibtex.entry.author | Maran, Uko | |
bibtex.entry.doi | 10.3390/pharmaceutics14102248 | |
bibtex.entry.journal | Pharmaceutics | en_US |
bibtex.entry.number | 10 | |
bibtex.entry.pages | 2248 | |
bibtex.entry.title | Intrinsic aqueous solubility: mechanistically transparent data-driven modeling of drug substances | en_US |
bibtex.entry.volume | 14 | |
bibtex.entry.year | 2022 | |
qdb.model.type | Regression model (regression) | en_US |
qdb.model.type | Regression model ensemble (regression) | en_US |
qdb.descriptor.calculation | M1 | |
qdb.descriptor.calculation | M2 | |
qdb.descriptor.calculation | M3 | |
qdb.descriptor.calculation | M_cons |
Name | Description | Format | Size | View |
---|---|---|---|---|
2022P2248.qdb.zip | Models for intrinsic water solubility | application/zip | 194.3Kb | View/ |