Paralog transcriptional differentiation in the D. melanogaster-specific gene family Sdic across populations and spermatogenesis stages.
Bryan D CliftonImtiyaz HariyaniAshlyn KimuraFangning LuoAlvin NguyenJosé M RanzPublished in: Communications biology (2023)
How recently originated gene copies become stable genomic components remains uncertain as high sequence similarity of young duplicates precludes their functional characterization. The tandem multigene family Sdic is specific to Drosophila melanogaster and has been annotated across multiple reference-quality genome assemblies. Here we show the existence of a positive correlation between Sdic copy number and total expression, plus vast intrastrain differences in mRNA abundance among paralogs, using RNA-sequencing from testis of four strains with variable paralog composition. Single cell and nucleus RNA-sequencing data expose paralog expression differentiation in meiotic cell types within testis from third instar larva and adults. Additional RNA-sequencing across synthetic strains only differing in their Y chromosomes reveal a tissue-dependent trans-regulatory effect on Sdic: upregulation in testis and downregulation in male accessory gland. By leveraging paralog-specific expression information from tissue- and cell-specific data, our results elucidate the intraspecific functional diversification of a recently expanded tandem gene family.
Keyphrases
- single cell
- copy number
- rna seq
- poor prognosis
- genome wide
- mitochondrial dna
- high throughput
- drosophila melanogaster
- binding protein
- escherichia coli
- cell proliferation
- electronic health record
- signaling pathway
- gene expression
- healthcare
- dna methylation
- big data
- bone marrow
- cell therapy
- microbial community
- mesenchymal stem cells
- deep learning
- health information
- germ cell
- oxidative stress
- heat shock protein
- antibiotic resistance genes