Assessing methylation detection for primary human tissue using Nanopore sequencing.
Rylee GennerStuart AkesonMelissa MeredithPilar Alvarez JerezLaksh MalikBreeana BakerAbigail Miano-Burkhardtnull nullBenedict PatenKimberley J BillingsleyCornelis BlauwendraatMiten JainPublished in: bioRxiv : the preprint server for biology (2024)
DNA methylation most commonly occurs as 5-methylcytosine (5-mC) in the human genome and has been associated with human diseases. Recent developments in single-molecule sequencing technologies (Oxford Nanopore Technologies (ONT) and Pacific Biosciences) have enabled readouts of long, native DNA molecules, including cytosine methylation. ONT recently upgraded their Nanopore sequencing chemistry and kits from R9 to the R10 version, which yielded increased accuracy and sequencing throughput. However the effects on methylation detection have not yet been documented. Here we performed a series of computational analyses to characterize differences in Nanopore-based 5mC detection between the ONT R9 and R10 chemistries. We compared 5mC calls in R9 and R10 for three human genome datasets: a cell line, a frontal cortex brain sample, and a blood sample. We performed an in-depth analysis on CpG islands and homopolymer regions, and documented high concordance for methylation detection among sequencing technologies. The strongest correlation was observed between Nanopore R10 and Illumina bisulfite technologies for cell line-derived datasets. Subtle differences in methylation datasets between technologies can impact analysis tools such as differential methylation calling software. Our findings show that comparisons can be drawn between methylation data from different Nanopore chemistries using guided hypotheses. This work will facilitate comparison among Nanopore data cohorts derived using different chemistries from large scale sequencing efforts, such as the NIH CARD Long Read Initiative.
Keyphrases
- single molecule
- dna methylation
- genome wide
- endothelial cells
- single cell
- atomic force microscopy
- living cells
- induced pluripotent stem cells
- gene expression
- pluripotent stem cells
- loop mediated isothermal amplification
- rna seq
- label free
- solid state
- functional connectivity
- machine learning
- real time pcr
- electronic health record
- big data
- artificial intelligence
- drug discovery