Login / Signup

A data-driven dimensionality-reduction algorithm for the exploration of patterns in biomedical data.

Md Tauhidul IslamLei Xing
Published in: Nature biomedical engineering (2020)
Dimensionality reduction is widely used in the visualization, compression, exploration and classification of data. Yet a generally applicable solution remains unavailable. Here, we report an accurate and broadly applicable data-driven algorithm for dimensionality reduction. The algorithm, which we named 'feature-augmented embedding machine' (FEM), first learns the structure of the data and the inherent characteristics of the data components (such as central tendency and dispersion), denoises the data, increases the separation of the components, and then projects the data onto a lower number of dimensions. We show that the technique is effective at revealing the underlying dominant trends in datasets of protein expression and single-cell RNA sequencing, computed tomography, electroencephalography and wearable physiological sensors.
Keyphrases
  • electronic health record
  • machine learning
  • deep learning
  • big data
  • single cell
  • computed tomography
  • magnetic resonance imaging
  • rna seq
  • blood pressure
  • high resolution
  • quality improvement