Login / Signup

A High-Quality Reference Genome Assembly of Prinsepia uniflora (Rosaceae).

Lei ZhangChaopan ZhangYajing AnQiang ZhuMingcheng Wang
Published in: Genes (2023)
This study introduces a meticulously constructed genome assembly at the chromosome level for the Rosaceae family species Prinsepia uniflora , a traditional Chinese medicinal herb. The final assembly encompasses 1272.71 megabases (Mb) distributed across 16 pseudochromosomes, boasting contig and super-scaffold N50 values of 2.77 and 79.32 Mb, respectively. Annotated within this genome is a substantial 875.99 Mb of repetitive sequences, with transposable elements occupying 777.28 Mb, constituting 61.07% of the entire genome. Our predictive efforts identified 49,261 protein-coding genes within the repeat-masked assembly, with 45,256 (91.87%) having functional annotations, 5127 (10.41%) demonstrating tandem duplication, and 2373 (4.82%) classified as transcription factor genes. Additionally, our investigation unveiled 3080 non-coding RNAs spanning 0.51 Mb of the genome sequences. According to our evolutionary study, P. uniflora underwent recent whole-genome duplication following its separation from Prunus salicina . The presented reference-level genome assembly and annotation for P. uniflora will significantly facilitate the in-depth exploration of genomic information pertaining to this species, offering substantial utility in comparative genomics and evolutionary analyses involving Rosaceae species.
Keyphrases
  • genome wide
  • transcription factor
  • dna methylation
  • copy number
  • gene expression
  • optical coherence tomography
  • small molecule
  • mass spectrometry
  • liquid chromatography
  • bioinformatics analysis