An 8.22 Mb Assembly and Annotation of the Alpaca (Vicugna pacos) Y Chromosome.

Matthew J Jevit, Brian W Davis, Caitlin Castaneda, Andrew E Hillhouse, Rytis Juras, Vladimir A Trifonov, Ahmed Tibary, Jorge C Pereira, Malcolm Andrew Ferguson-Smith, Terje Raudsepp
Published in: Genes (2021)
Its unique evolutionary dynamics and complex structure make the Y chromosome the most diverse and least understood region in the mammalian genome, despite its indisputable role in sex determination, development, and male fertility. Here we present the first contig-level annotated draft assembly for the alpaca (Vicugna pacos) Y chromosome, based on hybrid assembly of short- and long-read sequence data from flow-sorted Y. The flow-sorted Y was also used for cDNA selection, providing a Y-enriched testis transcriptome for annotation. The final assembly of 8.22 Mb comprised 4.5 Mb of male-specific Y (MSY) and 3.7 Mb of the pseudoautosomal region. In MSY, we annotated 15 X-degenerate genes and two novel transcripts, but no transposed sequences. Two MSY genes, HSFY and RBMY, are multicopy. The pseudoautosomal boundary is located between SHROOM2 and HSFY. Comparative analysis shows that the small and cytogenetically distinct alpaca Y shares most of its MSY sequences with the larger dromedary and Bactrian camel Y chromosomes. Most alpaca X-degenerate genes are also shared with other mammalian MSYs, though WWC3Y is Y-specific only in alpaca/camels and the horse. The partial alpaca Y assembly is a starting point for further expansion and will have applications in the study of camelid populations and male biology.
Keyphrases
  • genome wide
  • copy number
  • rna seq
  • dna methylation
  • genome wide identification
  • bioinformatics analysis
  • gene expression
  • machine learning
  • single cell
  • big data
  • artificial intelligence
  • data analysis