DROP: Molecular voucher database for identification of Drosophila parasitoids.
Chia-Hua LueMatthew L BuffingtonSonja SchefferMatthew LewisTyler A ElliottAmelia R I LindseyAmy C DriskellAnna JandovaMasahito T KimuraYves CartonRobert R KulaTodd A SchlenkeMariana MateosShubha GovindJulien VaraldiEmilio GuerrieriMassimo GiorginiXingeng WangKim HoelmerKent M DaanePaul K AbramNicholas A PardikesJoel J BrownMelanie ThierryMarylène PoiriéPaul GoldsteinScott E MillerW Daniel TraceyJeremy S DavisFrancis M JigginsBregje WertheimOwen T LewisJeff LeipsPhillip P A StaniczenkoJan HrčekPublished in: Molecular ecology resources (2021)
Molecular identification is increasingly used to speed up biodiversity surveys and laboratory experiments. However, many groups of organisms cannot be reliably identified using standard databases such as GenBank or BOLD due to lack of sequenced voucher specimens identified by experts. Sometimes a large number of sequences are available, but with too many errors to allow identification. Here, we address this problem for parasitoids of Drosophila by introducing a curated open-access molecular reference database, DROP (Drosophila parasitoids). Identifying Drosophila parasitoids is challenging and poses a major impediment to realize the full potential of this model system in studies ranging from molecular mechanisms to food webs, and in biological control of Drosophila suzukii. In DROP, genetic data are linked to voucher specimens and, where possible, the voucher specimens are identified by taxonomists and vetted through direct comparison with primary type material. To initiate DROP, we curated 154 laboratory strains, 856 vouchers, 554 DNA sequences, 16 genomes, 14 transcriptomes, and six proteomes drawn from a total of 183 operational taxonomic units (OTUs): 114 described Drosophila parasitoid species and 69 provisional species. We found species richness of Drosophila parasitoids to be heavily underestimated and provide an updated taxonomic catalogue for the community. DROP offers accurate molecular identification and improves cross-referencing between individual studies that we hope will catalyse research on this diverse and fascinating model system. Our effort should also serve as an example for researchers facing similar molecular identification problems in other groups of organisms.