The Updated Mouse Universal Genotyping Array Bioinformatic Pipeline Improves Genetic QC in Laboratory Mice.
Matthew W Blanchard, John Sebastian Sigmon, Jennifer Brennan, Chidima Ahulamibe, Michelle E Allen, Ralph S Baric, Timothy A Bell, Joeseph Farrington, Dominic Ciavatta, Marta Cruz Cisneros, Madison Drushal, Martin T Ferris, Rebecca Fry, Christiann Gaines, Bin Gu, Mark T Heise, Richard Austin Hodges, Tal Kafri, Rachel Lynch, Terry Magnuson, Darla Miller, Caroline E Y Murphy, David Truong Nguyen, Kelsey E Noll, Megan Proulx, Chris Sassetti, Ginger D Shaw, Jeremy M Simon, Clare M Smith, Myrek Styblo, Lisa Tarantino, Joyce Woo, Fernando Pardo Manuel de Villena
Published in: bioRxiv: the preprint server for biology (2024)
The MiniMUGA genotyping array is a popular tool for genetic QC of laboratory mice and for genotyping samples from most types of experimental crosses involving laboratory strains, particularly reduced complexity crosses. The content of the production version of the MiniMUGA array is fixed; however, there is an opportunity to improve the array's performance and the usefulness of the associated report by leveraging the thousands of samples genotyped since the initial description of MiniMUGA in 2020. Here we report our efforts to update and improve marker annotation, increase the number and reliability of the consensus genotypes for inbred strains, and increase the number of constructs that can be reliably detected with MiniMUGA. In addition, we have implemented key changes in the informatics pipeline to identify and quantify the contribution of specific genetic backgrounds to the makeup of a given sample, remove arbitrary thresholds, include the Y chromosome and mitochondrial genome in the ideogram, and make the detection of commercially available substrains based on diagnostic alleles more robust. Finally, we have changed the layout of the report to simplify interpretation and make the analysis more complete, and we have added a table summarizing the ideogram. We believe that these changes will be of general interest to the mouse research community and will be instrumental in our goal of improving the rigor and reproducibility of mouse-based biomedical research.