Login / Signup

Chromosome-level scaffolding of haplotype-resolved assemblies using Hi-C data without reference genomes.

Xiaofei ZengZili YiXing-Tan ZhangYuhui DuYu LiZhiqing ZhouSijie ChenHuijie ZhaoSai YangYibin WangGuoan Chen
Published in: Nature plants (2024)
Scaffolding is crucial for constructing most chromosome-level genomes. The high-throughput chromatin conformation capture (Hi-C) technology has become the primary scaffolding strategy due to its convenience and cost-effectiveness. As sequencing technologies and assembly algorithms advance, constructing haplotype-resolved genomes is increasingly preferred because haplotypes can provide additional genetic information on allelic and non-allelic variations. ALLHiC is a widely used allele-aware scaffolding tool designed for this purpose. However, its dependence on chromosome-level reference genomes and a higher chromosome misassignment rate still impede the unravelling of haplotype-resolved genomes. Here we present HapHiC, a reference-independent allele-aware scaffolding tool with superior performance on chromosome assignment as well as contig ordering and orientation. In addition, we provide new insights into the challenges in allele-aware scaffolding by conducting comprehensive analyses on various adverse factors. Finally, with the help of HapHiC, we constructed the haplotype-resolved allotriploid genome for Miscanthus × giganteus, an important lignocellulosic bioenergy crop.
Keyphrases