Abstract

Several generative models with elaborate training and sampling procedures have been proposed to accelerate structure-based drug design (SBDD); however, their empirical performance turns out to be suboptimal. We seek to better understand this phenomenon from both theoretical and empirical perspectives. Since most of these models apply graph neural networks (GNNs), one may suspect that they inherit the representational limitations of GNNs. We analyze this aspect, establishing the first such results for protein-ligand complexes. A plausible counterview may attribute the underperformance of these models to their excessive parameterizations, inducing expressivity at the expense of generalization. We investigate this possibility with a simple metric-aware approach that learns an economical surrogate for affinity to infer an unlabelled molecular graph and optimizes for labels conditioned on this graph and molecular properties. The resulting model achieves state-of-the-art results using 100x fewer trainable parameters and affords up to 1000x speedup. Collectively, our findings underscore the need to reassess and redirect the existing paradigm and efforts for SBDD. Code is available at https://github.com/rafalkarczewski/SimpleSBDD.

Original languageEnglish
Title of host publicationProceedings of the 28th International Conference on Artificial Intelligence and Statistics (AISTATS) 2025
PublisherJMLR
Pages3187-3195
Number of pages9
Publication statusPublished - 2025
MoE publication typeA4 Conference publication
EventInternational Conference on Artificial Intelligence and Statistics - Splash Beach Resort, Mai Khao, Thailand
Duration: 3 May 20255 May 2025
Conference number: 28
https://aistats.org/aistats2025/

Publication series

NameProceedings of Machine Learning Research
PublisherJMLR
Volume258
ISSN (Print)2640-3498

Conference

ConferenceInternational Conference on Artificial Intelligence and Statistics
Abbreviated titleAISTATS
Country/TerritoryThailand
CityMai Khao
Period03/05/202505/05/2025
Internet address

Funding

This work was supported by the Finnish Center for Artificial Intelligence (FCAI) under Flagship R5 (award 15011052). VG also acknowledges the support from Saab-WASP (grant 411025), Academy of Finland (grant 342077), and the Jane and Aatos Erkko Foundation (grant 7001703). RK thanks Paulina Karczewska for her help with preparing figures.

Fingerprint

Dive into the research topics of 'What Ails Generative Structure-based Drug Design: Expressivity is Too Little or Too Much?'. Together they form a unique fingerprint.
  • -: Finnish Center for Artificial Intelligence

    Garg, V. (Principal investigator)

    01/01/202331/12/2026

    Project: Academy of Finland: Other research funding

  • HEALED/Garg: Human-steered next-generation machine learning for reviving drug design

    Garg, V. (Principal investigator), Bickford Smith, F. (Project Member), Mirzaie, N. (Project Member), Verma, Y. (Project Member), Andrzejewski, M. (Project Member), Jain, E. (Project Member), Cunha, T. (Project Member), Laabid, N. (Project Member), Ghaznavi, M. (Project Member), Soltan Mohammadi, N. (Project Member) & Hemmann, L. (Project Member)

    01/09/202131/08/2025

    Project: RCF Academy Project

Cite this