Toward best practice in cancer mutation detection with whole-genome and whole-exome sequencing.
Wenming XiaoLuyao RenZhong ChenLi Tai FangYongmei ZhaoJustin LackMeijian GuanBin ZhuErich JaegerLiz KerriganThomas M BlomquistTiffany HungMarc SultanKenneth IdlerCharles LuAndreas SchererRebecca KuskoMalcolm MoosChunlin XiaoStephen T SherryOgan D AbaanWanqiu ChenXin ChenJessica NordlundUlrika LiljedahlRoberta MaestroMaurizio PolanoJiri DrabekPetr VojtaSulev KõksEne ReimannBindu Swapna MadalaTimothy R MercerChris MillerHoward JacobTiffany TruongAli MoshrefiAparna NatarajanAna GranatGary P SchrothRasika KalameghamEric PetersVirginie PetitjeanAshley WaltonTsai-Wei ShenKeyur TalsaniaCristobal Juan VeraKurt LangenbachMaryellen de MarsJennifer A HippJames C WilleyJing WangJyoti ShettyYuliya KrigaArati RaziuddinBao TranYuanting ZhengYing YuMargaret CamParthav JailwalaCu NguyenDaoud MeerzamanQingrong ChenChunhua YanBen ErnestUrvashi MehraRoderick V JensenWendell D JonesJian-Liang LiBrian N PapasMehdi PiroozniaYun-Ching ChenFayaz T SeifuddinZhipan LiXuelu LiuWolfgang ReschJingya WangLeihong WuGokhan YavasCorey MilesBaitang NingWeida TongChristopher E MasonEric DonaldsonSamir LababidiLouis M StaudtZivana TezakHuixiao HongCharles WangLe-Ming ShiPublished in: Nature biotechnology (2021)
Clinical applications of precision oncology require accurate tests that can distinguish true cancer-specific mutations from errors introduced at each step of next-generation sequencing (NGS). To date, no bulk sequencing study has addressed the effects of cross-site reproducibility, nor the biological, technical and computational factors that influence variant identification. Here we report a systematic interrogation of somatic mutations in paired tumor-normal cell lines to identify factors affecting detection reproducibility and accuracy at six different centers. Using whole-genome sequencing (WGS) and whole-exome sequencing (WES), we evaluated the reproducibility of different sample types with varying input amount and tumor purity, and multiple library construction protocols, followed by processing with nine bioinformatics pipelines. We found that read coverage and callers affected both WGS and WES reproducibility, but WES performance was influenced by insert fragment size, genomic copy content and the global imbalance score (GIV; G > T/C > A). Finally, taking into account library preparation protocol, tumor content, read coverage and bioinformatics processes concomitantly, we recommend actionable practices to improve the reproducibility and accuracy of NGS experiments for cancer mutation detection.
Keyphrases
- papillary thyroid
- squamous cell
- healthcare
- primary care
- copy number
- randomized controlled trial
- loop mediated isothermal amplification
- palliative care
- real time pcr
- lymph node metastasis
- single molecule
- squamous cell carcinoma
- gene expression
- childhood cancer
- mass spectrometry
- dna methylation
- young adults
- circulating tumor cells