3D Genome Reconstruction from Partially Phased Hi-C Data

Diego Cifuentes, Jan Draisma, Oskar Henriksson, Annachiara Korchmaros, Kaie Kubjas*

*Corresponding author for this work

Research output: Contribution to journalArticleScientificpeer-review

38 Downloads (Pure)

Abstract

The 3-dimensional (3D) structure of the genome is of significant importance for many cellular processes. In this paper, we study the problem of reconstructing the 3D structure of chromosomes from Hi-C data of diploid organisms, which poses additional challenges compared to the better-studied haploid setting. With the help of techniques from algebraic geometry, we prove that a small amount of phased data is sufficient to ensure finite identifiability, both for noiseless and noisy data. In the light of these results, we propose a new 3D reconstruction method based on semidefinite programming, paired with numerical algebraic geometry and local optimization. The performance of this method is tested on several simulated datasets under different noise levels and with different amounts of phased data. We also apply it to a real dataset from mouse X chromosomes, and we are then able to recover previously known structural features.

Original languageEnglish
Article number33
Pages (from-to)1-30
Number of pages30
JournalBulletin of Mathematical Biology
Volume86
Issue number4
DOIs
Publication statusPublished - Apr 2024
MoE publication typeA1 Journal article-refereed

Keywords

  • 13P25
  • 14P05
  • 3D genome organization
  • 65H14
  • 90C90
  • 92-08
  • 92E10
  • Applied algebraic geometry
  • Diploid organisms
  • Hi-C
  • Numerical algebraic geometry

Fingerprint

Dive into the research topics of '3D Genome Reconstruction from Partially Phased Hi-C Data'. Together they form a unique fingerprint.

Cite this