Login / Signup

Chromosome-level genome assemblies from two sandalwood species provide insights into the evolution of the Santalales.

Zhou HongDan PengLuke R TembrockXuezhu LiaoDaping XuXiaojing LiuZhi-Qiang Wu
Published in: Communications biology (2023)
Sandalwood is one of the most expensive woods in the world and is well known for its long-lasting and distinctive aroma. In our study, chromosome-level genome assemblies for two sandalwood species (Santalum album and Santalum yasi) were constructed by integrating NGS short reads, RNA-seq, and Hi-C libraries with PacBio HiFi long reads. The S. album and S. yasi genomes were both assembled into 10 pseudochromosomes with a length of 229.59 Mb and 232.64 Mb, containing 21,673 and 22,816 predicted genes and a repeat content of 28.93% and 29.54% of the total genomes, respectively. Further analyses resolved a Santalum-specific whole-genome triplication event after divergence from ancestors of the Santalales lineage Malania, yet due to dramatic differences in transposon content, the Santalum genomes were only one-sixth the size of the Malania oleifera genome. Examination of RNA-seq data revealed a suite of genes that are differentially expressed in haustoria and might be involved in host hemiparasite interactions. The two genomes presented here not only provide an important comparative dataset for studying genome evolution in early diverging eudicots and hemiparasitic plants but will also hasten the application of conservation genomics for a lineage of trees recovering from decades of overexploitation.
Keyphrases
  • single cell
  • rna seq
  • genome wide
  • copy number
  • dna methylation
  • machine learning
  • big data
  • genome wide identification
  • data analysis