Computing Atom Mappings for Biochemical Reactions without Subgraph Isomorphism

Markus Heinonen*, Sampsa Lappalainen, Taneli Mielikäinen, Juho Rousu

*Corresponding author for this work

Research output: Contribution to journalArticleScientificpeer-review

Abstract

The ability to trace the fate of individual atoms through the metabolic pathways is needed in many applications of systems biology and drug discovery. However, this information is not immediately available from the most common metabolome studies and needs to be separately acquired. Automatic discovery of correspondence of atoms in biochemical reactions is called the "atom mapping problem.'' We suggest an efficient approach for solving the atom mapping problem exactly-finding mappings of minimum edge edit distance. The algorithm is based on A* search equipped with sophisticated heuristics for pruning the search space. This approach has clear advantages over the commonly used heuristic approach of iterative maximum common subgraph (MCS) algorithm: we explicitly minimize an objective function, and we produce solutions that typically require less manual curation. The two methods are similar in computational resource demands. We compare the performance of the proposed algorithm against several alternatives on data obtained from the KEGG LIGAND and RPAIR databases: greedy search, bi-partite graph matching, and the MCS approach. Our experiments show that alternative approaches often fail in finding mappings with minimum edit distance.

Original languageEnglish
Pages (from-to)43-58
Number of pages16
JournalJOURNAL OF COMPUTATIONAL BIOLOGY
Volume18
Issue number1
DOIs
Publication statusPublished - Jan 2011
MoE publication typeA1 Journal article-refereed

Keywords

  • algorithms
  • biochemical networks
  • CHEMICAL STRUCTURES
  • METABOLIC FLUXES
  • SYSTEMS BIOLOGY
  • IN-SILICO
  • ALGORITHM
  • INFORMATION
  • GRAPHS
  • ENERGIES
  • PATHWAYS
  • DATABASE

Cite this