Species detection and delineation in the marine planktonic diatoms Chaetoceros and Bacteriastrum through metabarcoding: making biological sense of haplotype diversity.

Chetan C Gaonkar, Roberta Piredda, Diana Sarno, Adriana Zingone, Marina Montresor, Wiebe H C F Kooistra
Published in: Environmental Microbiology (2020)
High-throughput sequencing (HTS) metabarcoding is commonly applied to assess phytoplankton diversity. Usually, haplotypes are grouped into operational taxonomic units (OTUs) through clustering, whereby the resulting number of OTUs depends on the chosen similarity thresholds. We applied, instead, a phylogenetic approach to infer taxa among 18S rDNA V4-metabarcode haplotypes gathered from 48 time-series samples, using the marine planktonic diatoms Chaetoceros and Bacteriastrum as a test case. The 73 recovered taxa comprised both solitary haplotypes and polytomies, each polytomy composed of a highly abundant, dominant haplotype and one to several minor, peripheral haplotypes. The solitary and dominant haplotypes usually matched reference sequences, enabling species assignment of taxa. We hypothesise that the super-abundance of reads in dominant haplotypes results from the homogenization effect of concerted evolution. Reads of populous peripheral haplotypes and dominant haplotypes show comparable distribution patterns over the sample dates, suggesting that they are part of the same population. Many taxa revealed marked seasonality, with closely related ones generally showing distinct periodicity, whereas others occur year-round. Phylogenies inferred from metabarcode haplotypes enable delineation of biologically meaningful taxa, whereas OTUs resulting from clustering algorithms often deviate markedly from such taxa.
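The threshold dependence of OTU clustering mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the greedy centroid procedure, the function names `identity` and `greedy_otus`, and the toy sequences and read counts are all assumptions made for illustration only.

```python
from typing import Dict, List


def identity(a: str, b: str) -> float:
    """Fraction of matching positions between two pre-aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to equal length")
    return sum(x == y for x, y in zip(a, b)) / len(a)


def greedy_otus(haplotypes: Dict[str, int], threshold: float) -> List[List[str]]:
    """Greedy centroid clustering (illustrative, not the study's method).

    Haplotypes are visited from most to least abundant. Each one joins the
    first OTU whose centroid it matches at >= threshold identity; otherwise
    it becomes the centroid of a new OTU.
    """
    otus: List[List[str]] = []  # element 0 of each OTU is its centroid
    for seq in sorted(haplotypes, key=haplotypes.get, reverse=True):
        for otu in otus:
            if identity(seq, otu[0]) >= threshold:
                otu.append(seq)
                break
        else:
            otus.append([seq])
    return otus


# Hypothetical toy data (sequence -> read count); not taken from the study.
toy = {
    "ACGTACGTAC": 1000,  # abundant haplotype
    "ACGTACGTAA": 12,    # 1 mismatch to the abundant haplotype
    "ACGTACGAAA": 5,     # 2 mismatches to the abundant haplotype
    "TTGTACGTCC": 300,   # 3 mismatches to the abundant haplotype
}

# The same haplotypes yield a different number of OTUs at each threshold.
otu_counts = {t: len(greedy_otus(toy, t)) for t in (0.95, 0.85, 0.75, 0.65)}
print(otu_counts)
```

On this toy input the four haplotypes collapse into four, three, two, or one OTU as the threshold is lowered, which is the arbitrariness that a phylogeny-based delineation is meant to avoid.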
Keyphrases
  • machine learning
  • single cell
  • rna seq