Nanopore Cas9-targeted sequencing enables accurate and simultaneous identification of transgene integration sites, their structure and epigenetic status in recombinant Chinese hamster ovary cells.
Klaus LeitnerKrishna MotheramgariNicole BorthNicolas MarxPublished in: Biotechnology and bioengineering (2023)
The integration of a transgene expression construct into the host genome is the initial step for the generation of recombinant cell lines used for biopharmaceutical production. The stability and level of recombinant gene expression in Chinese hamster ovary (CHO) can be correlated to the copy number, its integration site as well as the epigenetic context of the transgene vector. Also, undesired integration events, such as concatemers, truncated and inverted vector repeats, are impacting the stability of recombinant cell lines. Thus, to characterize cell clones and to isolate the most promising candidates it is crucial to obtain information on the site of integration, the structure of integrated sequence and the epigenetic status. Current sequencing techniques allow to gather this information separately but do not offer a comprehensive and simultaneous resolution. In this study, we present a fast and robust nanopore Cas9-targeted sequencing (nCats) pipeline to identify integration sites, the composition of the integrated sequence as well as its DNA methylation status in CHO cells that can be obtained simultaneously from the same sequencing run. A Cas9-enrichment step during library preparation enables targeted and directional nanopore sequencing with up to 724x median on-target coverage and up to 153 kb long reads. The data generated by nCats provides sensitive, detailed and correct information on the transgene integration sites and the expression vector structure, which could only be partly produced by traditional Targeted Locus Amplification-seq data. Moreover, with nCats the DNA methylation status can be analyzed from the same raw data without prior DNA amplification. This article is protected by copyright. All rights reserved.
Keyphrases
- dna methylation
- gene expression
- single cell
- genome wide
- copy number
- single molecule
- induced apoptosis
- crispr cas
- cancer therapy
- cell free
- rna seq
- poor prognosis
- mitochondrial dna
- genome editing
- cell cycle arrest
- high resolution
- signaling pathway
- stem cells
- machine learning
- drug delivery
- nucleic acid
- big data
- artificial intelligence
- solid state
- oxidative stress
- cell proliferation
- mass spectrometry
- endoplasmic reticulum stress
- deep learning
- circulating tumor cells