Login / Signup

High-quality wild barley genome assemblies and annotation with Nanopore long reads and Hi-C sequencing data.

Rui PanHaifei HuYuhui XiaoLe XuYanhao XuKai OuyangChengdao LiTianhua HeWenying Zhang
Published in: Scientific data (2023)
Wild barley, from "Evolution Canyon (EC)" in Mount Carmel, Israel, are ideal models for cereal chromosome evolution studies. Here, the wild barley EC_S1 is from the south slope with higher daily temperatures and drought, while EC_N1 is from the north slope with a cooler climate and higher relative humidity, which results in a differentiated selection due to contrasting environments. We assembled a 5.03 Gb genome with contig N50 of 3.53 Mb for wild barley EC_S1 and a 5.05 Gb genome with contig N50 of 3.45 Mb for EC_N1 using 145 Gb and 160.0 Gb Illumina sequencing data, 295.6 Gb and 285.35 Gb Nanopore sequencing data and 555.1 Gb and 514.5 Gb Hi-C sequencing data, respectively. BUSCOs and CEGMA evaluation suggested highly complete assemblies. Using full-length transcriptome data, we predicted 39,179 and 38,373 high-confidence genes in EC_S1 and EC_N1, in which 93.6% and 95.2% were functionally annotated, respectively. We annotated repetitive elements and non-coding RNAs. These two wild barley genome assemblies will provide a rich gene pool for domesticated barley.
Keyphrases
  • genome wide
  • electronic health record
  • single cell
  • big data
  • rna seq
  • climate change
  • copy number
  • gene expression
  • genetic diversity
  • single molecule
  • data analysis
  • high frequency
  • transcription factor
  • deep learning