A Database of Lung Cancer-Related Genes for the Identification of Subtype-Specific Prognostic Biomarkers.
Yining LiuMin ZhaoHong QuPublished in: Biology (2023)
The molecular subtype is critical for accurate treatment and follow-up in patients with lung cancer; however, information regarding subtype-associated genes is dispersed among thousands of published studies. Systematic curation and cross-validation of the scientific literature would provide a solid foundation for comparative genetic studies of the major molecular subtypes of lung cancer. Here, we constructed a literature-based lung cancer gene database (LCGene). In the current release, we collected and curated 2507 unique human genes, including 2267 protein-coding and 240 non-coding genes from comprehensive manual examination of 10,960 PubMed article abstracts. Extensive annotations were added to aid identification of differentially expressed genes, potential gene editing sites, and non-coding gene regulation. For instance, we prepared 607 curated genes with CRISPR knockout information in 43 lung cancer cell lines. Further comparison of these implicated genes among different subtypes identified several subtype-specific genes with high mutational frequencies. Common tumor suppressors and oncogenes shared by lung adenocarcinoma and lung squamous cell carcinoma, for example, exhibited different mutational frequencies and prognostic features, suggesting the presence of subtype-specific biomarkers. Our retrospective analysis revealed 43 small cell lung cancer-specific genes. Moreover, 52 tumor suppressors and oncogenes shared by lung adenocarcinoma and squamous cell carcinoma confirmed the different molecular mechanisms of these two cancer subtypes. The subtype-based genetic differences, when combined, may provide insight into subtype-specific biomarkers for genetic testing.
Keyphrases
- genome wide
- bioinformatics analysis
- genome wide identification
- squamous cell carcinoma
- small cell lung cancer
- dna methylation
- genome wide analysis
- systematic review
- copy number
- single cell
- endothelial cells
- gene expression
- health information
- crispr cas
- small molecule
- electronic health record
- emergency department
- risk assessment
- wastewater treatment
- adverse drug
- protein protein
- amino acid
- climate change
- induced pluripotent stem cells