Login / Signup

Semantic Signature: Comparative Interpretation of Gene Expression on a Semantic Space.

Jihun KimKeewon KimJu Han Kim
Published in: Computational and mathematical methods in medicine (2016)
Background. Interpretation of microarray data remains challenging because biological meaning should be extracted from enormous numeric matrices and be presented explicitly. Moreover, huge public repositories of microarray dataset are ready to be exploited for comparative analysis. This study aimed to provide a platform where essential implication of a microarray experiment could be visually expressed and various microarray datasets could be intuitively compared. Results. On the semantic space, gene sets from Molecular Signature Database (MSigDB) were plotted as landmarks and their relative distances were calculated by Lin's semantic similarity measure. By formal concept analysis, a microarray dataset was transformed into a concept lattice with gene clusters as objects and Gene Ontology terms as attributes. Concepts of a lattice were located on the semantic space reflecting semantic distance from landmarks and edges between concepts were drawn; consequently, a specific geographic pattern could be observed from a microarray dataset. We termed a distinctive geography shared by microarray datasets of the same category as "semantic signature." Conclusions. "Semantic space," a map of biological entities, could serve as a universal platform for comparative microarray analysis. When microarray data were displayed on the semantic space as concept lattices, "semantic signature," characteristic geography for a microarray experiment, could be discovered.
Keyphrases
  • gene expression
  • bioinformatics analysis
  • healthcare
  • genome wide
  • emergency department
  • dna methylation
  • high throughput
  • electronic health record
  • machine learning
  • palliative care
  • big data
  • artificial intelligence