Login / Signup

Phasing Diploid Genome Assembly Graphs with Single-Cell Strand Sequencing.

Mir HenglinMaryam GhareghaniWilliam T HarveyDavid PorubskySergey KorenEvan E EichlerPeter EbertTobias Marschall
Published in: bioRxiv : the preprint server for biology (2024)
Haplotype information is crucial for biomedical and population genetics research. However, current strategies to produce de-novo haplotype-resolved assemblies often require either difficult-to-acquire parental data or an intermediate haplotype-collapsed assembly. Here, we present Graphasing, a workflow which synthesizes the global phase signal of Strand-seq with assembly graph topology to produce chromosome-scale de-novo haplotypes for diploid genomes. Graphasing readily integrates with any assembly workflow that both outputs an assembly graph and has a haplotype assembly mode. Graphasing performs comparably to trio-phasing in contiguity, phasing accuracy, and assembly quality, outperforms Hi-C in phasing accuracy, and generates human assemblies with over 18 chromosome-spanning haplotypes.
Keyphrases
  • single cell
  • electronic health record
  • rna seq
  • endothelial cells
  • healthcare
  • machine learning
  • big data
  • health information
  • induced pluripotent stem cells
  • dna methylation