Liquid-chromatography retention order prediction for metabolite identification

Research output: Contribution to journalArticle

Details

Original languageEnglish
Pages (from-to)875-883
Number of pages9
JournalBioinformatics
Volume34
Issue number17
StatePublished - 1 Sep 2018
MoE publication typeA1 Journal article-refereed
EventEuropean Conference on Computational Biology - Athens, Greece
Duration: 8 Sep 201812 Sep 2018
Conference number: 17

Researchers

Research units

  • Friedrich Schiller University Jena

Abstract

Motivation: Liquid Chromatography (LC) followed by tandem Mass Spectrometry (MS/MS) is one of the predominant methods for metabolite identification. In recent years, machine learning has started to transform the analysis of tandem mass spectra and the identification of small molecules. In contrast, LC data is rarely used to improve metabolite identification, despite numerous published methods for retention time prediction using machine learning.

Results: We present a machine learning method for predicting the retention order of molecules; that is, the order in which molecules elute from the LC column. Our method has important advantages over previous approaches: We show that retention order is much better conserved between instruments than retention time. To this end, our method can be trained using retention time measurements from different LC systems and configurations without tedious pre-processing, significantly increasing the amount of available training data. Our experiments demonstrate that retention order prediction is an effective way to learn retention behaviour of molecules from heterogeneous retention time data. Finally, we demonstrate how retention order prediction and MS/MS-based scores can be combined for more accurate metabolite identifications when analyzing a complete LC-MS/MS run.

    Research areas

  • TANDEM MASS-SPECTRA, TIME PREDICTION, FRAGMENTATION, METABOLOMICS, PERFORMANCE, REGRESSION, DATABASES

ID: 28288980