Login / Signup

Improved reference genome for the domestic horse increases assembly contiguity and composition.

Theodore S KalbfleischEdward S RiceMichael S DePriestBrian P WalenzMatthew S HestandJoris R VermeeschBrendan L O ConnellIan T FiddesAlisa O VershininaNedda F SaremiJessica L PetersenCarrie J FinnoRebecca R BelloneMolly E McCueSamantha A BrooksErnest BaileyLudovic OrlandoRichard E GreenDonald C MillerDouglas F AntczakJames N MacLeod
Published in: Communications biology (2018)
Recent advances in genomic sequencing technology and computational assembly methods have allowed scientists to improve reference genome assemblies in terms of contiguity and composition. EquCab2, a reference genome for the domestic horse, was released in 2007. Although of equal or better quality compared to other first-generation Sanger assemblies, it had many of the shortcomings common to them. In 2014, the equine genomics research community began a project to improve the reference sequence for the horse, building upon the solid foundation of EquCab2 and incorporating new short-read data, long-read data, and proximity ligation data. Here, we present EquCab3. The count of non-N bases in the incorporated chromosomes is improved from 2.33 Gb in EquCab2 to 2.41 Gb in EquCab3. Contiguity has also been improved nearly 40-fold with a contig N50 of 4.5 Mb and scaffold contiguity enhanced to where all but one of the 32 chromosomes is comprised of a single scaffold.
Keyphrases
  • electronic health record
  • big data
  • genome wide
  • single cell
  • quality improvement
  • healthcare
  • single molecule
  • mental health
  • dna methylation
  • data analysis
  • machine learning
  • amino acid