Login / Signup

A blended genome and exome sequencing method captures genetic variation in an unbiased, high-quality, and cost-effective manner.

Toni A BoltzBenjamin B ChuCalwing LiaoJulia M SealockRobert YeLerato MajaraJack M FuSusan ServiceLingyu ZhanSarah E MedlandSinéad B ChapmanSimone RubinacciMatthew DeFeliceJonna L GrimsbyTamrat AbebeMelkam AlemayehuFred K AshabaElizabeth G AtkinsonTim B BigdeliAmanda B BradwayHarrison BrandLori B ChibnikAbebaw FekaduMichael GatzenBizu GelayeStella GichuruMarissa L GildeaToni C HillHailiang HuangKalyn M HubbardWilfred E InjeraRoxanne JamesMoses JolobaChristopher KachulisPhillip R KalmbachKamulegeya RogersGabriel KigenSoyeon KimNastassja KoenEdith Kamaru KwobahJoseph KyebuzibwaSeungmo LeeNiall J LennonPenelope A LindEsteban A Lopera MayaJohnstone MakaleSerghei MangulJustin McMahonPierre MowlemHenry MusinguziRehema M MwemaNoeline NakasujjaCarter P NewmanLethukuthula L NkambuleConor R O'NeilAna Maria OlivaresCatherine M OlsenLinnet OngeriSophie J ParsaAdele PretoriusRaj RamesarFaye L ReaganChiara SabattiJacquelyn A SchneiderWelelta ShiferawAnne StevensonErik StrickerRocky E StroudJessie TangDavid C WhitemanMary T YohannesMingrui YuKai Yuannull nullDickens AkenaLukoye AtwoliSymon M KariukiKarestan C KoenenCharles R J C NewtonDan J SteinSolomon TeferraZukiswa ZingelaCarlos N PatoMichele T PatoCarlos López-JaramilloNelson FreimerRoel A OphoffLoes M Olde LoohuisMichael E TalkowskiBenjamin M NealeDaniel P HowriganAlicia R Martin
Published in: bioRxiv : the preprint server for biology (2024)
We deployed the Blended Genome Exome (BGE), a DNA library blending approach that generates low pass whole genome (1-4x mean depth) and deep whole exome (30-40x mean depth) data in a single sequencing run. This technology is cost-effective, empowers most genomic discoveries possible with deep whole genome sequencing, and provides an unbiased method to capture the diversity of common SNP variation across the globe. To evaluate this new technology at scale, we applied BGE to sequence >53,000 samples from the Populations Underrepresented in Mental Illness Associations Studies (PUMAS) Project, which included participants across African, African American, and Latin American populations. We evaluated the accuracy of BGE imputed genotypes against raw genotype calls from the Illumina Global Screening Array. All PUMAS cohorts had R 2 concordance ≥95% among SNPs with MAF≥1%, and never fell below ≥90% R 2 for SNPs with MAF<1%. Furthermore, concordance rates among local ancestries within two recently admixed cohorts were consistent among SNPs with MAF≥1%, with only minor deviations in SNPs with MAF<1%. We also benchmarked the discovery capacity of BGE to access protein-coding copy number variants (CNVs) against deep whole genome data, finding that deletions and duplications spanning at least 3 exons had a positive predicted value of ∼90%. Our results demonstrate BGE scalability and efficacy in capturing SNPs, indels, and CNVs in the human genome at 28% of the cost of deep whole-genome sequencing. BGE is poised to enhance access to genomic testing and empower genomic discoveries, particularly in underrepresented populations.
Keyphrases