Synthetic lethal connectivity and graph transformer improve synthetic lethality prediction.
Kunjie FanBirkan GökbağShan TangShangjia LiYirui HuangLingling WangLijun ChengLang LiPublished in: Briefings in bioinformatics (2024)
Synthetic lethality (SL) has shown great promise for the discovery of novel targets in cancer. CRISPR double-knockout (CDKO) technologies can only screen several hundred genes and their combinations, but not genome-wide. Therefore, good SL prediction models are highly needed for genes and gene pairs selection in CDKO experiments. However, lack of scalable SL properties prevents generalizability of SL interactions to out-of-sample data, thereby hindering modeling efforts. In this paper, we recognize that SL connectivity is a scalable and generalizable SL property. We develop a novel two-step multilayer encoder for individual sample-specific SL prediction model (MLEC-iSL), which predicts SL connectivity first and SL interactions subsequently. MLEC-iSL has three encoders, namely, gene, graph, and transformer encoders. MLEC-iSL achieves high SL prediction performance in K562 (AUPR, 0.73; AUC, 0.72) and Jurkat (AUPR, 0.73; AUC, 0.71) cells, while no existing methods exceed 0.62 AUPR and AUC. The prediction performance of MLEC-iSL is validated in a CDKO experiment in 22Rv1 cells, yielding a 46.8% SL rate among 987 selected gene pairs. The screen also reveals SL dependency between apoptosis and mitosis cell death pathways.
Keyphrases
- genome wide
- cell death
- cell cycle arrest
- dna methylation
- induced apoptosis
- copy number
- high throughput
- genome wide identification
- squamous cell carcinoma
- oxidative stress
- mycobacterium tuberculosis
- white matter
- gene expression
- crispr cas
- multiple sclerosis
- resting state
- convolutional neural network
- signaling pathway
- machine learning
- deep learning
- artificial intelligence
- single cell
- genome editing