Short open reading frames (sORFs) and microproteins: an update on their identification and validation measures.
Alyssa Zi-Xin Leong, Pey Yee Lee, M Aiman Mohtar, Saiful Effendi Syafruddin, Yuh-Fen Pung, Teck Yew Low
Published in: Journal of biomedical science (2022)
A short open reading frame (sORF) comprises ≤ 300 bases and encodes a microprotein, or sORF-encoded protein (SEP), of ≤ 100 amino acids. Traditionally dismissed by genome annotation pipelines as meaningless noise, sORFs were shown to possess coding potential by ribosome profiling (RIBO-Seq), which unveiled sORF-based transcripts at various genomic locations. Nonetheless, there was initially little experimental evidence that the corresponding microproteins are stable and functional. With recent advancements in multi-omics, the identification, validation, and functional characterisation of sORFs and microproteins have become feasible. In this review, we discuss the history and development of the emerging research field of sORFs and microproteins. In particular, we focus on an array of bioinformatics and omics approaches used for predicting, sequencing, validating, and characterising these recently discovered entities. These strategies include RIBO-Seq, which detects sORF transcripts via ribosome footprints, and mass spectrometry (MS)-based proteomics for sequencing the resultant microproteins. Our discussion then extends to the functional characterisation of microproteins through CRISPR/Cas9 screens and protein-protein interaction (PPI) studies. We discuss not only detection methodologies but also the challenges and potential solutions in identifying and validating sORFs and their microproteins. The novelty of this review lies in its coverage of how the functional roles of microproteins are validated, which could contribute to the future landscape of microproteomics.
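To make the size definition concrete, the following is a minimal sketch of a naive sORF scan, not a method taken from the review. It assumes a canonical ATG start, the three standard stop codons, the forward strand only, and that the ≤ 300 base limit counts coding codons without the stop codon (so at most 100 codons). The function name `find_sorfs` and its parameters are hypothetical.

```python
# Illustrative sketch only: naive forward-strand sORF scan.
# Assumptions (not from the review): ATG start codons, standard stop codons,
# and a size limit counted in coding codons, excluding the stop codon.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_sorfs(seq, max_codons=100, min_codons=2):
    """Return sorted (start, end, n_codons) tuples for ATG..stop ORFs.

    start is the 0-based index of the A in ATG, end is the index just past
    the stop codon, and n_codons counts coding codons including the ATG.
    """
    seq = seq.upper()
    hits = []
    for frame in range(3):
        start = None  # position of the first ATG seen in the current ORF
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None:
                if codon == "ATG":
                    start = i
            elif codon in STOP_CODONS:
                n_codons = (i - start) // 3
                if min_codons <= n_codons <= max_codons:
                    hits.append((start, i + 3, n_codons))
                start = None  # resume searching after this stop codon
    return sorted(hits)


if __name__ == "__main__":
    # One ORF in frame 2: ATG AAA TGG TAG
    print(find_sorfs("CCATGAAATGGTAGCC"))  # [(2, 14, 3)]
```

A scan like this only lists sequence-level candidates. As the abstract notes, evidence of translation (RIBO-Seq) and of the resulting protein (MS-based proteomics) is still needed before a candidate is treated as a genuine sORF.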
Keyphrases
- single cell
- protein protein
- mass spectrometry
- rna seq
- high throughput
- crispr cas
- genome wide
- small molecule
- amino acid
- working memory
- minimally invasive
- liquid chromatography
- high resolution
- multiple sclerosis
- genome editing
- ms ms
- air pollution
- label free
- dna methylation
- risk assessment
- high performance liquid chromatography
- bioinformatics analysis
- real time pcr