Use of overlapping DNA pools to discern genetic differences despite pooling error.
John W KeeleTara G McDaneldLarry Alexander KuehnPublished in: Journal of animal science (2023)
Genotyping pools of commercial cattle and individual seedstock animals may reveal hidden relationships between sectors enabling use of commercial data for genetic evaluation. However, commercial data capture may be compromised by inexact pool formation. We aimed to estimate the concordance between distances or genomic covariance among pooling allele frequencies (PAF) of DNA pools comprised of 100 animals with 0 or 50 % overlap of animals in common between pools. Cattle lung samples were collected from a commercial beef processing plant on a single day. Six pools of 100 animals each were constructed so that overlap between pools was 0 or 50 %. Two pools of all 200 animals were constructed to estimate PAF for all 200 animals. Frozen lung tissue (0.01 g) from each animal was weighed into a tube containing a pool; there were 2 pools of 200 animals each and 6 pools of 100 animals each. Every contribution of an individual animal was an independent measurement to insure independence of pooling errors. Lung samples were kept on dried ice during the pooling process to keep them from thawing. The 8 pools were then assayed for approximately 100,000 single nucleotide polymorphisms (SNP). Pooling allele frequency for each SNP and pool was based on the relative intensity of the 2 dyes used to detect the alleles rather than genotype calls which are not tractable from pooling data. Euclidean distances and genomic relationships among the PAF for the 8 pools were estimated and distances were tested for concordance with pool overlap using permutation-based analysis of distance. Distances among pools were concordant with the planned overlap of animals shared between pools (P = 0.0024); pool overlap accounted for 70 % of the variation and pooling error accounted for 30 %. Pools containing 100 animals with no overlap were the most distant from one another and pools with 50% overlap were the least distant. This work shows that we can discern differences in distance between pairs of overlapping DNA pools sharing 0 and 50 % of the animals. Genomic correlations among non-overlapping pools indicated that non-overlapping pool pairs did not share many related animals because genomic correlations were near zero for these pairs. On the other hand, one pair of non-overlapping pools likely contained related animals between pools because the correlation was 0.21. Pools sharing 50 % overlap ranged in genomic relationship between 0.21 to 0.39 (n=12).