Login / Signup

FastqPuri: high-performance preprocessing of RNA-seq data.

Paula Pérez-RubioClaudio LottazJulia C Engelmann
Published in: BMC bioinformatics (2019)
FastqPuri is a new tool which covers all aspects of short read sequence data preprocessing. It was designed for RNA-seq data to meet the needs for fast preprocessing of fastq data to allow transcript and gene counting, but it is suitable to process any short read sequencing data of which high sequence quality is needed, such as for genome assembly or SNV (single nucleotide variant) detection. FastqPuri is most flexible in filtering undesired biological sequences by offering two approaches to optimize speed and memory usage dependent on the total size of the potential contaminating sequences. FastqPuri is available at https://github.com/jengelmann/FastqPuri . It is implemented in C and R and licensed under GPL v3.
Keyphrases
  • rna seq
  • single cell
  • electronic health record
  • big data
  • single molecule
  • genome wide
  • dna methylation
  • copy number
  • genetic diversity
  • genome wide identification