Multi-Dimensional Fragmentomics Enables Early and Accurate Detection of Colorectal Cancer.
Yuepeng CaoNannan WangXuxiaochen WuWanxiangfu TangHua BaoChengshuai SiPeng ShaoDongzheng LiXin ZhouDongqin ZhuShanshan YangFufeng WangGuoqing SuKe WangQifan WangYao ZhangQiangcheng WangDongsheng YuQian JiangJun BaoLiu YangPublished in: Cancer research (2024)
Colorectal cancer (CRC) is frequently diagnosed in advanced stages, highlighting the need for developing approaches for early detection. Liquid biopsy using cell-free DNA (cfDNA) fragmentomics is a promising approach, but the clinical application is hindered by complexity and cost. This study aimed to develop an integrated model using cfDNA fragmentomics for accurate, cost-effective early-stage CRC detection. Plasma cfDNA was extracted and sequenced from a training cohort of 360 participants, including 176 CRC patients and 184 healthy controls. An ensemble stacked model comprising five machine learning models was employed to distinguish CRC patients from healthy controls using five cfDNA fragmentomic features. The model was validated in an independent cohort of 236 participants (117 CRC patients and 119 controls) and a prospective cohort of 242 participants (129 CRC patients and 113 controls). The ensemble stacked model showed remarkable discriminatory power between CRC patients and controls, outperforming all base models and achieving a high area under the ROC curve (AUC) of 0.986 in the validation cohort. It reached 94.88% sensitivity and 98% specificity for detecting CRC in the validation cohort, with sensitivity increasing as cancer progressed. The model also demonstrated consistently high accuracy in within-run and between-run tests and across various conditions in healthy individuals. In the prospective cohort, it achieved 91.47% sensitivity and 95.58% specificity. This integrated model capitalizes on the multiplex nature of cfDNA fragmentomics to achieve high sensitivity and robustness, offering significant promise for early CRC detection and broad patient benefit.