Login / Signup

A phased genome of the highly heterozygous 'Texas' almond uncovers patterns of allele-specific expression linked to heterozygous structural variants.

Raúl CastaneraCarlos de TomásValentino RuggieriCarlos M VicientIban EduardoMaría José AranzanaPere ArúsJosep M Casacuberta
Published in: Horticulture research (2024)
The vast majority of traditional almond varieties are self-incompatible, and the level of variability of the species is very high, resulting in a high-heterozygosity genome. Therefore, information on the different haplotypes is particularly relevant to understand the genetic basis of trait variability in this species. However, although reference genomes for several almond varieties exist, none of them is phased and has genome information at the haplotype level. Here, we present a phased assembly of genome of the almond cv. Texas. This new assembly has 13% more assembled sequence than the previous version of the Texas genome and has an increased contiguity, in particular in repetitive regions such as the centromeres. Our analysis shows that the 'Texas' genome has a high degree of heterozygosity, both at SNPs, short indels, and structural variants level. Many of the SVs are the result of heterozygous transposable element insertions, and in many cases, they also contain genic sequences. In addition to the direct consequences of this genic variability on the presence/absence of genes, our results show that variants located close to genes are often associated with allele-specific gene expression, which highlights the importance of heterozygous SVs in almond.
Keyphrases
  • genome wide
  • copy number
  • dna methylation
  • gene expression
  • early onset
  • poor prognosis
  • high frequency
  • healthcare
  • binding protein
  • transcription factor
  • amino acid