Login / Signup

SubPhaser: a robust allopolyploid subgenome phasing method based on subgenome-specific k-mers.

Kai-Hua JiaZhao-Xuan WangLongxin WangGuang-Yuan LiWei ZhangXiao-Ling WangFang-Ji XuSi-Qian JiaoShan-Shan ZhouHui LiuYong-Peng MaGuiqi BiWei ZhaoYousry A El-KassabyIlga PorthGuowei LiRen-Gang ZhangJian-Feng Mao
Published in: The New phytologist (2022)
With advanced sequencing technology, dozens of complex polyploid plant genomes have been characterized. However, for many polyploid species, their diploid ancestors are unknown or extinct, making it impossible to unravel the subgenomes and genome evolution directly. We developed a novel subgenome-phasing algorithm, SubPhaser, specifically designed for a neoallopolyploid or a homoploid hybrid. SubPhaser first searches for the subgenome-specific sequence (k-mer), then assigns homoeologous chromosomes into subgenomes, and further provides tools to annotate and investigate specific sequences. SubPhaser works well on neoallopolyploids and homoploid hybrids containing subgenome-specific sequences like wheat, but fails on autopolyploids lacking subgenome-specific sequences like alfalfa, indicating that SubPhaser can phase neoallopolyploid/homoploid hybrids with high accuracy, sensitivity and performance. This highly accurate, highly sensitive, ancestral data free chromosome phasing algorithm, SubPhaser, offers significant application value for subgenome phasing in neoallopolyploids and homoploid hybrids, and for the subsequent exploration of genome evolution and related genetic/epigenetic mechanisms.
Keyphrases
  • machine learning
  • gene expression
  • copy number
  • electronic health record
  • coronavirus disease
  • amino acid
  • solid phase extraction