Abstract
Structural annotation of small molecules in biological samples remains a key bottleneck in untargeted metabolomics, despite rapid progress in predictive methods and tools during the past decade. Liquid chromatography–tandem mass spectrometry, one of the most widely used analysis platforms, can detect thousands of molecules in a sample, the vast majority of which remain unidentified even with best-of-class methods. Here we present LC-MS2Struct, a machine learning framework for structural annotation of small-molecule data arising from liquid chromatography–tandem mass spectrometry (LC-MS2) measurements. LC-MS2Struct jointly predicts the annotations for a set of mass spectrometry features in a sample, using a novel structured prediction model trained to optimally combine the output of state-of-the-art MS2 scorers and observed retention orders. We evaluate our method on a dataset covering all publicly available reversed-phase LC-MS2 data in the MassBank reference database, including 4,327 molecules measured using 18 different LC conditions from 16 contributors, greatly expanding the chemical analytical space covered in previous multi-MS2 scorer evaluations. LC-MS2Struct obtains significantly higher annotation accuracy than earlier methods and improves the annotation accuracy of state-of-the-art MS2 scorers by up to 106%. The use of stereochemistry-aware molecular fingerprints improves prediction performance, which highlights limitations in existing approaches and has strong implications for future computational LC-MS2 developments.
Original language | English |
---|---|
Pages (from-to) | 1224–1237 |
Number of pages | 27 |
Journal | Nature Machine Intelligence |
Volume | 4 |
Issue number | 12 |
DOIs | |
Publication status | Published - Dec 2022 |
MoE publication type | A1 Journal article-refereed |
Fingerprint
Dive into the research topics of 'Joint structural annotation of small molecules using liquid chromatography retention order and tandem mass spectrometry data'. Together they form a unique fingerprint.
Projects
- 2 Finished
- MAGITICS: Machine learning for digItal diagnostics of antimicrobial resistance
  Rousu, J. (Principal investigator), Bach, E. (Project Member), Huusari, R. (Project Member), Szedmak, S. (Project Member) & Xiang, W. (Project Member)
  01/01/2020 → 31/12/2023
  Project: Academy of Finland: Other research funding
- Machine Learning for Computational Metabolomics
  Rousu, J. (Principal investigator), Brouard, C. (Project Member), Huusari, R. (Project Member), Bach, E. (Project Member) & Sabzevari, M. (Project Member)
  01/09/2017 → 31/08/2021
  Project: Academy of Finland: Other research funding