Login / Signup

Forensic efficiency evaluation of a mtDNA whole genome sequencing system constructed with long fragment amplification strategy on DNA nanoball sequencing platform.

Man ChenChong ChenNing LiYuerong SuWei CuiYan HuangMeiming CaiBofeng Zhu
Published in: Forensic science international. Genetics (2024)
Mitochondrial DNA (mtDNA) is an important genetic marker for degraded biological sample identification, maternal pedigree tracing, and population genetic structure study owing to its characteristics of high copy number, anti-degradable ring structure, and maternal inheritance. Whole mtDNA genome sequencing is an optimal method for the analysis of mtDNA polymorphism and heterogeneity because it allows for the comprehensive use of maternal genetic information. However, because of lacking quantitative evaluations for sequencing data, the scientific interpretation standards for mtDNA sequencing results of the previously used sequencing systems are often different, and false positive or false negative results are prone to occur when faced with the interference of nuclear genomic DNA, or the heterogeneities of mtDNA sequence and structure. In this study, we evaluated a novel mtDNA whole genome sequencing system using long fragment amplification strategy on the DNA nanoball (DNB) sequencing platform. This system demonstrated high sequencing quality and specific mtDNA sequencing efficiencies on positive control DNA and FTA bloodstain samples, as the average Q20 and Q30 values of the corresponding samples were 97.17 % and 91.93 %; 97.37 % and 92.48 %, respectively. The mean mapping percentages for the reference sequences of whole genome DNA (wgDNA), mtDNA, and nuclear genomic DNA (ngDNA) in the corresponding samples were 99.98 %, 99.97 %, 0.03 %, and 99.91 %, 99.40 %, 0.60 %; respectively. The average error calling rates for the bases A, C, G, and T of the whole mtDNA genome were 0.2519 %, 0.2550 %, 0.2906 %; and 0.2392 %, respectively. The efficacy of heteroplasmy identification was assessed using a set of theoretical sites with predetermined rates. These sites were created by combining the samples with known mtDNA haplotypes in certain proportions. The absolute errors between observed and theoretical heteroplasmy values were 89.59 %, 74.68 %, 50.20 %, 12.65 %, 8.31 %, and 4.85 %, while the theoretical heteroplasmy values were 5 %, 10 %, 20 %, 80 %, 90 %, and 95 %, respectively. The absolute error exhibited relative stability when the mtDNA sequencing depth exceeded 500×. Furthermore, the system sequencing efficiency was also confirmed among different kinds of samples, and these samples included natural samples (e.g., peripheral blood samples preserved on FTA cards for 2 and 11 years, and on filter paper for 6 and 9 years), degraded samples, sensitivity samples, samples derived from various bodily fluids, and maternal pedigree samples. In summary, the whole mtDNA genome sequencing system used for forensic identification demonstrated high performance in analyzing mtDNA sequence information, and showed significant prospects for forensic application and maternal genetic research.
Keyphrases