Quantification of mutant-allele expression at isoform level in cancer from RNA-seq data.
Wenjiang DengTian MouYudi PawitanTrung Nghia VuPublished in: NAR genomics and bioinformatics (2022)
Even though the role of DNA mutations in cancer is well recognized, current quantification of the RNA expression, performed either at gene or isoform level, typically ignores the mutation status. Standard methods for estimating allele-specific expression (ASE) consider gene-level expression, but the functional impact of a mutation is best assessed at isoform level. Hence our goal is to quantify the mutant-allele expression at isoform level. We have developed and implemented a method, named MAX, for quantifying mutant-allele expression given a list of mutations. For a gene of interest, a mutant reference is constructed by incorporating all possible mutant versions of the wild-type isoforms in the transcriptome annotation. The mutant reference is then used for the RNA-seq reads mapping, which in principle works similarly for any quantification tool. We apply an alternating EM algorithm to the read-count data from the mapping step. In a simulation study, MAX performs well against standard isoform-quantification methods. Also, MAX achieves higher accuracy than conventional gene-based ASE methods such as ASEP. An analysis of a real dataset of acute myeloid leukemia reveals a subgroup of NPM1-mutated patients responding well to a kinase inhibitor. Our findings indicate that quantification of mutant-allele expression at isoform level is feasible and has potential added values for assessing the functional impact of DNA mutations in cancers.
Keyphrases
- wild type
- poor prognosis
- rna seq
- acute myeloid leukemia
- binding protein
- gene expression
- clinical trial
- machine learning
- high resolution
- dna methylation
- newly diagnosed
- squamous cell carcinoma
- allogeneic hematopoietic stem cell transplantation
- big data
- prognostic factors
- transcription factor
- high density
- peripheral blood
- childhood cancer