MS/MS in silico subtraction-based proteomic profiling as an approach to facilitate disease gene discovery: application to lens development and cataract.
Sandeep AryalDeepti AnandFrancisco G HernandezBailey A T WeatherbeeHongzhan HuangAshok P ReddyPhillip A WilmarthLarry L DavidSalil A LachkePublished in: Human genetics (2019)
While the bioinformatics resource-tool iSyTE (integrated Systems Tool for Eye gene discovery) effectively identifies human cataract-associated genes, it is currently based on just transcriptome data, and thus, it is necessary to include protein-level information to gain greater confidence in gene prioritization. Here, we expand iSyTE through development of a novel proteome-based resource on the lens and demonstrate its utility in cataract gene discovery. We applied high-throughput tandem mass spectrometry (MS/MS) to generate a global protein expression profile of mouse lens at embryonic day (E)14.5, which identified 2371 lens-expressed proteins. A major challenge of high-throughput expression profiling is identification of high-priority candidates among the thousands of expressed proteins. To address this problem, we generated new MS/MS proteome data on mouse whole embryonic body (WB). WB proteome was then used as a reference dataset for performing "in silico WB-subtraction" comparative analysis with the lens proteome, which effectively identified 422 proteins with lens-enriched expression at ≥ 2.5 average spectral counts, ≥ 2.0 fold enrichment (FDR < 0.01) cut-off. These top 20% candidates represent a rich pool of high-priority proteins in the lens including known human cataract-linked genes and many new potential regulators of lens development and homeostasis. This rich information is made publicly accessible through iSyTE (https://research.bioinformatics.udel.edu/iSyTE/), which enables user-friendly visualization of promising candidates, thus making iSyTE a comprehensive tool for cataract gene discovery.
Keyphrases
- genome wide
- cataract surgery
- high throughput
- genome wide identification
- ms ms
- copy number
- small molecule
- tandem mass spectrometry
- dna methylation
- single cell
- endothelial cells
- high performance liquid chromatography
- ultra high performance liquid chromatography
- transcription factor
- magnetic resonance imaging
- liquid chromatography tandem mass spectrometry
- genome wide analysis
- binding protein
- healthcare
- poor prognosis
- molecular docking
- liquid chromatography
- simultaneous determination
- health information
- high resolution
- risk assessment
- induced pluripotent stem cells
- gene expression
- climate change
- rna seq
- long non coding rna
- peripheral blood
- gas chromatography
- mass spectrometry
- dual energy