Login / Signup

Jupyter and Galaxy: Easing entry barriers into complex data analyses for biomedical researchers.

Bjoern Andreas GrueningHelena RascheBoris Rebolledo-JaramilloCarl EberhardTorsten HouwaartJohn M ChiltonNate CoraorRolf BackofenJames TaylorAnton Nekrutenko
Published in: PLoS computational biology (2017)
What does it take to convert a heap of sequencing data into a publishable result? First, common tools are employed to reduce primary data (sequencing reads) to a form suitable for further analyses (i.e., the list of variable sites). The subsequent exploratory stage is much more ad hoc and requires the development of custom scripts and pipelines, making it problematic for biomedical researchers. Here, we describe a hybrid platform combining common analysis pathways with the ability to explore data interactively. It aims to fully encompass and simplify the "raw data-to-publication" pathway and make it reproducible.
Keyphrases
  • electronic health record
  • big data
  • high throughput
  • machine learning