Piir, G.; Sild, S.; Spilioti, E.; Nikolopoulou, D.; Katsanou, E.; Langezaal, I.; Maran, U. Classification of Thyroid Peroxidase (TPO) Inhibitors Using Transfer Learning with SMILES Embeddings. Chemical Research in Toxicology 2026.

QsarDB Repository

Piir, G.; Sild, S.; Spilioti, E.; Nikolopoulou, D.; Katsanou, E.; Langezaal, I.; Maran, U. Classification of Thyroid Peroxidase (TPO) Inhibitors Using Transfer Learning with SMILES Embeddings. Chemical Research in Toxicology 2026.

QDB archive DOI: 10.15152/QDB.272   DOWNLOAD

QsarDB content

Property TPO_inhibition: Thyroid peroxidase inhibition

Table5: The final model

Random forest (classification)

Open in:QDB ExplorerQDB Predictor

NameTypenAccuracy
Training settraining103681.000
Out of bag setinternal validation103680.799
Test setexternal validation240230.793
Validation setexternal validation10180.753

Citing

When using this QDB archive, please cite (see details) it together with the original article:

  • Piir, G. Data for: Classification of Thyroid Peroxidase (TPO) Inhibitors Using Transfer Learning with SMILES Embeddings. QsarDB repository, QDB.272. 2026. https://doi.org/10.15152/QDB.272

  • Piir, G.; Sild, S.; Spilioti, E.; Nikolopoulou, D.; Katsanou, E.; Langezaal, I.; Maran, U. Classification of Thyroid Peroxidase (TPO) Inhibitors Using Transfer Learning with SMILES Embeddings. Chemical Research in Toxicology 2026.

Metadata

Show simple item record

dc.date.accessioned2026-05-27T17:25:57Z
dc.date.available2026-05-27T17:25:57Z
dc.date.issued2026-05-27
dc.identifier.urihttp://hdl.handle.net/10967/272
dc.identifier.urihttp://dx.doi.org/10.15152/QDB.272
dc.description.abstractThyroid hormones (THs) regulate many processes in mammals and, therefore, affect every organ in the body. Thyroid peroxidase (TPO) is an essential enzyme for the successful biosynthesis of THs. Although TPO inhibition is a well-documented molecular initiating event (MIE) in thyroid hormone system disruption adverse outcome pathways (AOPs), experimental methods and computational models to assess TPO activity are lacking. Efficient computational new approach methodologies (NAMs) are a viable solution for identifying TPO inhibitors from a large pool of agrochemicals. The aim of this study was to investigate the suitability of SMILES embeddings generated using a specialized language model (SLM) based on a pretrained deep neural network (DNN) for applying a transfer learning approach in the development of quantitative structure−activity relationships for classifying TPO inhibitors. Traditional theoretical molecular descriptors were used for comparison. Two different molecular descriptor sets resulted in Random Forest (RF) models that performed similarly on the training and test sets, while the sensitivity for the external validation set was substantially different between the two models (0.788 vs 0.490). Comparison of the predictions with the TPO inhibition data of the chemicals assessed by EFSA and EU-NETVAL laboratories showed good agreement. At the same time, analysis of experimental data from other sources showed some conflicting estimates. This suggests that further and more precise studies are needed for some compounds. This study advances in silico methodologies by implementing transfer learning for QSAR modeling from text representations (e.g., SMILES) using the pretrained Bidirectional Encoder Representations from Transformers (BERT) architecture. While traditional QSAR approach relies on molecular descriptors, this evaluation shows that model-generated SMILES embeddings can expand the applicability domain, indicating a more robust representation of structural information compared to traditional molecular descriptors.en_US
dc.publisherGeven Piir
dc.rightsAttribution 4.0 International*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/*
dc.titlePiir, G.; Sild, S.; Spilioti, E.; Nikolopoulou, D.; Katsanou, E.; Langezaal, I.; Maran, U. Classification of Thyroid Peroxidase (TPO) Inhibitors Using Transfer Learning with SMILES Embeddings. Chemical Research in Toxicology 2026.
qdb.property.endpoint4. Human health effects 4.18. Endocrine Activityen_US
qdb.descriptor.applicationMolBERTen_US
qdb.prediction.applicationR 4.5.1en_US
bibtex.entryarticleen_US
bibtex.entry.authorPiir, Geven
bibtex.entry.authorSild, Sulev
bibtex.entry.authorSpilioti, Eliana
bibtex.entry.authorNikolopoulou, Dimitra
bibtex.entry.authorKatsanou, Effrosyni
bibtex.entry.authorLangezaal, Ingrid
bibtex.entry.authorMaran, Uko
bibtex.entry.journalChemical Research in Toxicologyen_US
bibtex.entry.titleClassification of Thyroid Peroxidase (TPO) Inhibitors Using Transfer Learning with SMILES Embeddingsen_US
bibtex.entry.year2026
qdb.model.typeRandom forest (classification)en_US


Files in this item

NameDescriptionFormatSizeView
2026CRT.qdb.zipThyroid peroxidase inhibition of agrochemicalsapplication/zip274.1MbView/Open
TPO_modelling.7zData and scriptUnknown191.7MbView/Open
Files associated with this item are distributed
under Creative Commons license.

This item appears in the following Collection(s)

Show simple item record