PyGMQL: scalable data extraction and analysis for heterogeneous genomic datasets.
Luca Nanni, Pietro Pinoli, Arif Canakoglu, Stefano Ceri. Published in: BMC Bioinformatics (2019)
PyGMQL is an effective and innovative tool for supporting tertiary data extraction and analysis pipelines. We demonstrate the expressiveness and performance of PyGMQL through a sequence of biological data analysis scenarios of increasing complexity, which highlight reproducibility, expressive power and scalability.