Mosaic deletion patterns of the human antibody heavy chain gene locus shown by Bayesian haplotyping.
Moriah GidoniOmri SnirAyelet PeresPazit PolakIda LindemanIvana MikocziovaVikas Kumar SarnaKnut E A LundinChristopher ClouserFrancois VigneaultAndrew M CollinsLudvig M SollidGur YaariPublished in: Nature communications (2019)
Analysis of antibody repertoires by high-throughput sequencing is of major importance in understanding adaptive immune responses. Our knowledge of variations in the genomic loci encoding immunoglobulin genes is incomplete, resulting in conflicting VDJ gene assignments and biased genotype and haplotype inference. Haplotypes can be inferred using IGHJ6 heterozygosity, observed in one third of the people. Here, we propose a robust novel method for determining VDJ haplotypes by adapting a Bayesian framework. Our method extends haplotype inference to IGHD- and IGHV-based analysis, enabling inference of deletions and copy number variations in the entire population. To test this method, we generated a multi-individual data set of naive B-cell repertoires, and found allele usage bias, as well as a mosaic, tiled pattern of deleted IGHD and IGHV genes. The inferred haplotypes may have clinical implications for genetic disease predispositions. Our findings expand the knowledge that can be extracted from antibody repertoire sequencing data.
Keyphrases
- copy number
- genome wide
- mitochondrial dna
- dna methylation
- single cell
- high throughput sequencing
- healthcare
- immune response
- genome wide identification
- electronic health record
- big data
- gene expression
- machine learning
- toll like receptor
- genome wide analysis
- transcription factor
- hiv infected
- inflammatory response
- artificial intelligence
- antiretroviral therapy
- pluripotent stem cells
- deep learning
- bioinformatics analysis