Analysis pipelines for cancer genome sequencing in mice.
Sebastian Lange, Thomas Engleitner, Sebastian Mueller, Roman Maresch, Maximilian Zwiebel, Laura González-Silva, Günter Schneider, Ruby Banerjee, Fengtang Yang, George S Vassiliou, Mathias J Friedrich, Dieter Saur, Ignacio Varela, Roland Rad
Published in: Nature Protocols (2020)
Mouse models of human cancer have transformed our ability to link genetics, molecular mechanisms and phenotypes. Both reverse and forward genetics in mice are currently gaining momentum through advances in next-generation sequencing (NGS). Methodologies to analyze sequencing data were, however, developed for humans and hence do not account for species-specific differences in genome structures and experimental setups. Here, we describe standardized computational pipelines specifically tailored to the analysis of mouse genomic data. We present novel tools and workflows for the detection of different alteration types, including single-nucleotide variants (SNVs), small insertions and deletions (indels), copy-number variations (CNVs), loss of heterozygosity (LOH) and complex rearrangements, such as those seen in chromothripsis. Workflows have been extensively validated and cross-compared using multiple methodologies. We also give step-by-step guidance on the execution of individual analysis types, provide advice on data interpretation and make the complete code available online. The protocol takes 2 to 7 days, depending on the desired analyses.
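To make the idea of copy-number detection concrete, the sketch below shows one generic way a per-bin copy-number signal can be derived from read counts in a tumor sample and a matched normal sample. It is not the authors' pipeline: the function name `log2_copy_ratios`, the binning, and the `pseudocount` parameter are assumptions chosen for illustration only.

```python
import math


def log2_copy_ratios(tumor_counts, normal_counts, pseudocount=0.5):
    """Return per-bin log2(tumor/normal) ratios.

    Hypothetical helper for illustration. Each sample is scaled by its
    total read count so that differences in sequencing depth cancel out.
    The pseudocount avoids log(0) for empty bins.
    """
    if len(tumor_counts) != len(normal_counts):
        raise ValueError("tumor and normal must cover the same bins")
    tumor_total = sum(tumor_counts)
    normal_total = sum(normal_counts)
    if tumor_total == 0 or normal_total == 0:
        raise ValueError("each sample needs at least one read")

    ratios = []
    for t, n in zip(tumor_counts, normal_counts):
        # Fraction of each library that falls into this bin.
        t_frac = (t + pseudocount) / tumor_total
        n_frac = (n + pseudocount) / normal_total
        ratios.append(math.log2(t_frac / n_frac))
    return ratios


if __name__ == "__main__":
    # Made-up counts for four genomic bins; the third bin is enriched in the tumor.
    tumor = [100, 100, 200, 100]
    normal = [100, 100, 100, 100]
    for i, r in enumerate(log2_copy_ratios(tumor, normal)):
        print(f"bin {i}: log2 ratio = {r:+.3f}")
```

Values near zero suggest no change relative to the normal sample, positive values suggest a gain and negative values a loss. A real workflow would add steps this sketch omits, such as GC-content correction, segmentation and adjustment for tumor purity.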
Keyphrases
- copy number
- mitochondrial dna
- genome wide
- electronic health record
- papillary thyroid
- dna methylation
- big data
- mouse model
- squamous cell
- randomized controlled trial
- single cell
- endothelial cells
- squamous cell carcinoma
- high fat diet induced
- lymph node metastasis
- health information
- type diabetes
- machine learning
- data analysis
- insulin resistance
- skeletal muscle
- metabolic syndrome
- childhood cancer
- young adults