Login / Signup

An assessment of the genomic structural variation landscape in Sub-Saharan African populations.

Stephen TollmanEmma WienerLaura CottinoGerrit BothaOscar Asanya NyangiriHarry NoyesAnnette MacLeodDavid JakuboskyClement AdebamowoPhillip AwadallaGuida LandouréMogomotsi MatshabaEnock MatovuMichelle RamsayGustave SimoMartin Chitolongo SimuunzaCaroline T TiemessenAmbroise WonkamVenesa SahibdeenAmanda KrauseScott Hazelhurst
Published in: Research square (2024)
Structural variants are responsible for a large part of genomic variation between individuals and play a role in both common and rare diseases. Databases cataloguing structural variants notably do not represent the full spectrum of global diversity, particularly missing information from most African populations. To address this representation gap, we analysed 1,091 high-coverage African genomes, 545 of which are public data sets, and 546 which have been analysed for structural variants for the first time. Variants were called using five different tools and datasets merged and jointly called using SURVIVOR. We identified 67,795 structural variants throughout the genome, with 10,421 genes having at least one variant. Using a conservative overlap in merged data, 6,414 of the structural variants (9.5%) are novel compared to the Database of Genomic Variants. This study contributes to knowledge of the landscape of structural variant diversity in Africa and presents a reliable dataset for potential applications in population genetics and health-related research.
Keyphrases
  • copy number
  • genome wide
  • healthcare
  • big data
  • single cell
  • gene expression
  • electronic health record
  • social media
  • rna seq
  • climate change
  • transcription factor