Complete pipeline for Oxford Nanopore Technology amplicon sequencing (ONT-AmpSeq): from pre-processing to creating an operational taxonomic unit table.
Patrick Skov SchacksenStine Karstenskov ØstergaardMathias Helmer EskildsenJeppe Lund NielsenPublished in: FEBS open bio (2024)
Amplicon sequencing has long served as a robust method for characterising microbial communities, and despite inherent resolution limitations, it remains a preferred technique, offering cost- and time-effective insights into bacterial compositions. Here, we introduce ONT-AmpSeq, a user-friendly pipeline designed for processing amplicon sequencing data generated from Oxford Nanopore Technology (ONT) devices. Our pipeline enables efficient creation of taxonomically annotated operational taxonomic unit (OTU) tables from ONT sequencing data, with the flexibility to multiplex amplicons on the same barcode. The pipeline encompasses six main steps-statistics, quality filtering, alignment, clustering, polishing, and taxonomic classification-integrating various state-of-the-art software tools. We provide a detailed description of each step, along with performance tests and robustness evaluations using both test data and a ZymoBIOMICS ® Microbial Community Standard mock community dataset. Our results demonstrate the ability of ONT-AmpSeq to effectively process ONT amplicon data, offering valuable insights into microbial community composition. Additionally, we discuss the influence of polishing tools on taxonomic insight and the impact of taxonomic annotation methods on the derived microbial composition. Overall, ONT-AmpSeq represents a comprehensive solution for analysing ONT amplicon sequencing data, facilitating streamlined and reliable microbial community analysis. The pipeline, along with test data, is freely available for public use.