Sequencing, assembly and annotation of the whole-insect genome of Lymantria dispar dispar, the European gypsy moth.
Michael E SparksFrancois Olivier HebertJ Spencer JohnstonRichard C HamelinMichel CussonRoger C LevesqueDawn E Gundersen-RindalPublished in: G3 (Bethesda, Md.) (2022)
The European gypsy moth, Lymantria dispar dispar (LDD), is an invasive insect and a threat to urban trees, forests and forest-related industries in North America. For use as a comparator with a previously published genome based on the LD652 pupal ovary-derived cell line, as well as whole-insect genome sequences obtained from the Asian gypsy moth subspecies L. dispar asiatica and L. dispar japonica, the whole-insect LDD genome was sequenced, assembled and annotated. The resulting assembly was 998 Mb in size, with a contig N50 of 662 Kb and a GC content of 38.8%. Long interspersed nuclear elements constitute 25.4% of the whole-insect genome, and a total of 11,901 genes predicted by automated gene finding encoded proteins exhibiting homology with reference sequences in the NCBI NR and/or UniProtKB databases at the most stringent similarity cutoff level (i.e., the gold tier). These results will be especially useful in developing a better understanding of the biology and population genetics of L. dispar and the genetic features underlying Lepidoptera in general.