Login / Signup

Genome assembly and annotation of the tambaqui ( Colossoma macropomum ): an emblematic fish of the Amazon River Basin.

Alexandre Wagner Silva HilsdorfMarcela Uliano-SilvaLuiz Lehmann CoutinhoHoracio MontenegroVera Maria Fonseca Almeida-ValDanillo Pinhal
Published in: GigaByte (Hong Kong, China) (2021)
Colossoma macropomum , known as "tambaqui", is the largest Characiformes fish in the Amazon River Basin and a leading species in Brazilian aquaculture and fisheries. Good quality meat and excellent adaptability to culture systems are some of its remarkable farming features. To support studies into the genetics and genomics of the tambaqui, we have produced the first high-quality genome for the species. We combined Illumina and PacBio sequencing technologies to generate a reference genome, assembled with 39× coverage of long reads and polished to a consensus quality value (QV) of 36 with 130× coverage of short reads. The genome was assembled into 1269 scaffolds (a total of 1,221,847,006 bases), with a scaffold N50 size of 40 Mb, where 93% of all assembled bases were placed in the largest 54 scaffolds corresponding to the diploid karyotype of the tambaqui. Furthermore, the NCBI Annotation Pipeline annotated genes, pseudogenes, and non-coding transcripts using the RefSeq database as evidence, guaranteeing a high-quality annotation. A Genome Data Viewer for the tambaqui was produced, which will benefit groups interested in exploring the unique genomic features of the species. The availability of a highly accurate genome assembly for tambaqui provides the foundation for the discovery of novel ecological and evolutionary insights, and is a helpful resource for aquaculture.
Keyphrases
  • genome wide
  • single cell
  • healthcare
  • gene expression
  • emergency department
  • high resolution
  • mass spectrometry
  • quality improvement
  • copy number
  • electronic health record
  • high throughput
  • artificial intelligence