Evidence of rare misassemblies in the bovine reference genome revealed by population genetic metrics.
Saber QanbariRobert D SchnabelDörte WittenburgPublished in: Animal genetics (2022)
Creation of the bovine reference assembly paved the way to develop the high-throughput genotyping arrays of the single nucleotide polymorphisms (SNPs) based on the available map coordinates that facilitated major advances in gene mapping and selection programs. The assembly flaws, however, may cause false results in the downstream gene mapping studies. The most recent bovine reference genome (ARS-UCD1.2) is built on the long-read sequences that provides improved quality and continuity. By applying population genetic metrics in this study, we aimed to evaluate the map coordinates to which SNP markers were assigned. We employed a three-step approach by combining the recombination and linkage disequilibrium analyses to test if the markers fit into the assigned physical map coordinates. We applied the method to the bovine 50k array in a large pedigree of Holstein cattle and revealed a panel of 65 candidate markers, most of which were re-located either on a different chromosome or re-mapped as far as several million base pairs away on the same chromosome. This list of candidates accounts for 0.1% of the SNPs in the widely used 50k genotyping array and we foresee a reasonably larger set of markers being misplaced in the BovineHD 700K BeadChip. We suggest pre-removal of the candidate misplaced markers to reduce false signals in association mapping studies.
Keyphrases
- genome wide
- high density
- copy number
- high throughput
- dna methylation
- high resolution
- public health
- single cell
- physical activity
- mass spectrometry
- gene expression
- dna damage
- case control
- men who have sex with men
- transcription factor
- quality improvement
- dna repair
- genetic diversity
- human immunodeficiency virus
- antiretroviral therapy