Inferring gene expression from cell-free DNA fragmentation profiles.
Mohammad Shahrokh Esfahani, Emily G Hamilton, Mahya Mehrmohamadi, Barzin Y Nabet, Stefan K Alig, Daniel A King, Chloé Beate Steen, Charles W Macaulay, Andre Schultz, Monica C Nesselbush, Joanne Soo, Joseph G Schroers-Martin, Binbin Chen, Michael S Binkley, Henning Stehr, Jacob J Chabon, Brian J Sworder, Angela B-Y Hui, Matthew J Frank, Everett J Moding, Chih Long Liu, Aaron M Newman, James M Isbell, Charles M Rudin, Bob T Li, David M Kurtz, Maximillian Diehn, Ash A Alizadeh
Published in: Nature Biotechnology (2022)
Profiling of circulating tumor DNA (ctDNA) in the bloodstream shows promise for noninvasive cancer detection. Chromatin fragmentation features have previously been explored to infer gene expression profiles from cell-free DNA (cfDNA), but current fragmentomic methods require high concentrations of tumor-derived DNA and provide limited resolution. Here we describe promoter fragmentation entropy as an epigenomic cfDNA feature that predicts RNA expression levels at individual genes. We developed 'epigenetic expression inference from cell-free DNA-sequencing' (EPIC-seq), a method that uses targeted sequencing of promoters of genes of interest. Profiling 329 blood samples from 201 patients with cancer and 87 healthy adults, we demonstrate classification of subtypes of lung carcinoma and diffuse large B cell lymphoma. Applying EPIC-seq to serial blood samples from patients treated with PD-(L)1 immune-checkpoint inhibitors, we show that gene expression profiles inferred by EPIC-seq are correlated with clinical response. Our results indicate that EPIC-seq could enable noninvasive, high-throughput tissue-of-origin characterization with diagnostic, prognostic and therapeutic potential.
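The abstract names promoter fragmentation entropy as the key feature but does not define it. As a rough illustration only, the sketch below assumes the feature resembles a Shannon entropy computed over the length distribution of cfDNA fragments overlapping a promoter window. The function name, the length bounds, and the absence of any normalization are all assumptions made here for illustration; the published method may differ in each of these respects.

```python
import math
from collections import Counter


def fragment_length_entropy(lengths, min_len=100, max_len=300):
    """Shannon entropy (in bits) of a fragment-length histogram.

    `lengths` is an iterable of cfDNA fragment lengths (bp) assumed to
    overlap one promoter window. `min_len` and `max_len` are hypothetical
    bounds chosen for illustration, not values taken from the paper.
    """
    kept = [n for n in lengths if min_len <= n <= max_len]
    if not kept:
        return 0.0  # no usable fragments: define entropy as zero
    counts = Counter(kept)
    total = len(kept)
    # H = sum over observed lengths of p * log2(1 / p)
    return sum((c / total) * math.log2(total / c) for c in counts.values())


# Identical fragment lengths carry no diversity.
print(fragment_length_entropy([167] * 10))            # 0.0
# Four equally frequent lengths give log2(4) = 2 bits.
print(fragment_length_entropy([150, 160, 170, 180]))  # 2.0
```

A higher value means the fragment lengths at that promoter are more diverse. Whether and how such a value tracks RNA expression is the question the paper evaluates, and this sketch makes no claim about it.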
Keyphrases
- circulating tumor
- genome wide
- single cell
- DNA methylation
- gene expression
- diffuse large B cell lymphoma
- high throughput
- RNA-seq
- genome wide identification
- cell free
- poor prognosis
- circulating tumor cells
- copy number
- transcription factor
- machine learning
- deep learning
- Epstein-Barr virus
- papillary thyroid
- single molecule
- binding protein
- genome wide analysis
- big data
- squamous cell carcinoma
- bioinformatics analysis