Chromosome-level assembly of the water buffalo genome surpasses human and goat genomes in sequence contiguity.
Wai Yee Low, Rick Tearle, Derek M Bickhart, Benjamin D Rosen, Sarah B Kingan, Thomas Swale, Françoise Thibaud-Nissen, Terence D Murphy, Rachel Young, Lucas Lefevre, David A Hume, Andrew Collins, Paolo Ajmone Marsan, Timothy P L Smith, John Lewis Williams
Published in: Nature Communications (2019)
Rapid innovation in sequencing technologies and improvements in assembly algorithms have enabled the creation of highly contiguous mammalian genomes. Here we report a chromosome-level assembly of the water buffalo (Bubalus bubalis) genome using single-molecule sequencing and chromatin conformation capture data. PacBio Sequel reads, with a mean length of 11.5 kb, helped to resolve repetitive elements and generate sequence contiguity. All five B. bubalis sub-metacentric chromosomes were correctly scaffolded with centromeres spanned. Although the index animal was partly inbred, 58% of the genome was haplotype-phased by FALCON-Unzip. This new reference genome improves the contig N50 of the previous short-read-based buffalo assembly more than a thousand-fold and contains only 383 gaps. It surpasses the human and goat references in sequence contiguity and facilitates the annotation of hard-to-assemble gene clusters such as the major histocompatibility complex (MHC).
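The contiguity claim above rests on contig N50: the length of the contig at which, going from longest to shortest, the running total first reaches half of the total assembled length. The minimal Python sketch below illustrates that standard definition. It is not the authors' pipeline; the function name `n50` and the toy contig lengths are hypothetical and are not values from the paper.

```python
from typing import Iterable


def n50(contig_lengths: Iterable[int]) -> int:
    """Return the N50 of a set of contig lengths.

    Contigs are sorted from longest to shortest; N50 is the length of the
    contig at which the running total first reaches half the assembly size.
    """
    lengths = sorted(contig_lengths, reverse=True)
    if not lengths:
        raise ValueError("at least one contig length is required")
    total = sum(lengths)
    running = 0
    for length in lengths:
        running += length
        # Compare 2 * running with total to stay in integer arithmetic.
        if 2 * running >= total:
            return length
    raise AssertionError("unreachable: the running total always reaches half")


# Toy lengths for illustration only (hypothetical, not data from the paper).
toy_contigs = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_contigs))
```

Fewer, longer contigs push N50 up, so an assembly with only a few hundred gaps would be expected to score far higher on this metric than a fragmented short-read assembly of the same genome.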
Keyphrases
- single molecule
- genome wide
- endothelial cells
- copy number
- single cell
- induced pluripotent stem cells
- DNA methylation
- atomic force microscopy
- gene expression
- pluripotent stem cells
- DNA damage
- transcription factor
- amino acid
- high frequency
- deep learning
- electronic health record
- RNA-seq
- big data
- oxidative stress
- quantum dots
- high speed