105 Downloads (Pure)

Abstract

Drug-induced liver injury (DILI) presents a multifaceted challenge, influenced by interconnected biological mechanisms. Current DILI datasets are characterized by small sizes and high imbalance, posing difficulties in learning robust representations and accurate modeling. To address these challenges, we trained a multi-modal multi-task model integrating preclinical histopathologies, biochemistry (blood markers), and clinical DILI-related adverse drug reactions (ADRs). Leveraging pretrained BERT models, we extracted representations covering a broad chemical space, facilitating robust learning in both frozen and fine-tuned settings. To address imbalanced data, we explored weighted Binary Cross-Entropy (w-BCE) and weighted Focal Loss (w-FL) . Our results demonstrate that the frozen BERT model consistently enhances performance across all metrics and modalities with weighted loss functions compared to their non-weighted counterparts. However, the efficacy of fine-tuning BERT varies across modalities, yielding inconclusive results. In summary, the incorporation of BERT features with weighted loss functions demonstrates advantages, while the efficacy of fine-tuning remains uncertain.

Original languageEnglish
Title of host publicationAI in Drug Discovery - 1st International Workshop, AIDD 2024, Held in Conjunction with ICANN 2024, Proceedings
EditorsDjork-Arné Clevert, Michael Wand, Jürgen Schmidhuber, Kristína Malinovská, Igor V. Tetko
PublisherSpringer
Pages82-97
Number of pages16
ISBN (Electronic)978-3-031-72381-0
ISBN (Print)978-3-031-72380-3
DOIs
Publication statusPublished - 2025
MoE publication typeA4 Conference publication
EventInternational Workshop on AI in Drug Discovery - Lugano, Switzerland
Duration: 19 Sept 202419 Sept 2024
Conference number: 1

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
PublisherSpringer
Volume14894 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Workshop

WorkshopInternational Workshop on AI in Drug Discovery
Abbreviated titleAIDD
Country/TerritorySwitzerland
CityLugano
Period19/09/202419/09/2024

Funding

The authors acknowledge financial support from the European Union’s Horizon 2020 research and innovation program under the Marie Skłodowska-Curie grant agreement No 956832, “Advanced Machine learning for Innovative Drug Discovery” (AIDD).

Keywords

  • BERT
  • DILI
  • Focal loss
  • Toxicity

Fingerprint

Dive into the research topics of 'Balancing Imbalanced Toxicity Models : Using MolBERT with Focal Loss'. Together they form a unique fingerprint.

Cite this