Login / Signup

Comparative analysis and classification of highly divergent mouse rDNA units based on their intergenic spacer (IGS) variability.

Jung-Hyun KimRamaiah NagarajaAlexey Y OgurtsovVladimir N NoskovMikhail LiskovykhHee-Sheung LeeYutaro HoriTakehiko KobayashiKent HunterDavid SchlessingerNatalay KouprinaSvetlana A ShabalinaVladimir Larionov
Published in: NAR genomics and bioinformatics (2024)
Ribosomal DNA (rDNA) repeat units are organized into tandem clusters in eukaryotic cells. In mice, these clusters are located on at least eight chromosomes and show extensive variation in the number of repeats between mouse genomes. To analyze intra- and inter-genomic variation of mouse rDNA repeats, we selectively isolated 25 individual rDNA units using Transformation-Associated Recombination (TAR) cloning. Long-read sequencing and subsequent comparative sequence analysis revealed that each full-length unit comprises an intergenic spacer (IGS) and a ∼13.4 kb long transcribed region encoding the three rRNAs, but with substantial variability in rDNA unit size, ranging from ∼35 to ∼46 kb. Within the transcribed regions of rDNA units, we found 209 variants, 70 of which are in external transcribed spacers (ETSs); but the rDNA size differences are driven primarily by IGS size heterogeneity, due to indels containing repetitive elements and some functional signals such as enhancers. Further evolutionary analysis categorized rDNA units into distinct clusters with characteristic IGS lengths; numbers of enhancers; and presence/absence of two common SNPs in promoter regions, one of which is located within promoter (p)RNA and may influence pRNA folding stability. These characteristic features of IGSs also correlated significantly with 5'ETS variant patterns described previously and associated with differential expression of rDNA units. Our results suggest that variant rDNA units are differentially regulated and open a route to investigate the role of rDNA variation on nucleolar formation and possible associations with pathology.
Keyphrases
  • transcription factor
  • single cell
  • single molecule
  • machine learning
  • deep learning
  • type diabetes
  • high frequency
  • induced apoptosis
  • metabolic syndrome
  • adipose tissue
  • cell proliferation
  • nucleic acid
  • wild type