Login / Signup

High-quality genome assembly enables prediction of allele-specific gene expression in hybrid poplar.

Tian-Le ShiKai-Hua JiaYu-Tao BaoShuai NieXue-Chan TianXue-Mei YanZhao-Yang ChenZhi-Chao LiShi-Wei ZhaoHai-Yao MaYe ZhaoXiang LiRen-Gang ZhangJing GuoWei ZhaoEl-Kassaby YousryNiels A MüllerYves Van de PeerXiao-Ru WangNathaniel Robert StreetIlga M PorthXinmin AnJian-Feng Mao
Published in: Plant physiology (2024)
Poplar (Populus) is a well-established model system for tree genomics and molecular breeding, and hybrid poplar is widely used in forest plantations. However, distinguishing its diploid homologous chromosomes is difficult, complicating advanced functional studies on specific alleles. In this study, we applied a trio-binning design and PacBio High-Fidelity long-read sequencing to obtain haplotype-phased telomere-to-telomere genome assemblies for the two parents of the well-studied F1 hybrid "84K" (Populus alba × P. tremula var. glandulosa). Almost all chromosomes, including the telomeres and centromeres, were completely assembled for each haplotype subgenome apart from two small gaps on one chromosome. By incorporating information from these haplotype assemblies and extensive RNA-seq data, we analyzed gene expression patterns between the two subgenomes and alleles. Transcription bias at the subgenome level was not uncovered, but extensive expression differences were detected between alleles. We developed machine-learning (ML) models to predict allele-specific expression (ASE) with high accuracy and identified underlying genome features most highly influencing ASE. One of our models with 15 predictor variables achieved 77% accuracy on the training set and 74% accuracy on the testing set. ML models identified gene body CHG methylation, sequence divergence, and transposon occupancy both upstream and downstream of alleles as important factors for ASE. Our haplotype-phased genome assemblies and ML strategy highlight an avenue for functional studies in Populus and provide additional tools for studying ASE and heterosis in hybrids.
Keyphrases