Evaluation of the impact of Illumina error correction tools on de novo genome assembly.
Mahdi HeydariGiles MiclottePiet DemeesterYves Van de PeerJan FostierPublished in: BMC bioinformatics (2017)
We confirm that most EC tools reduce the number of errors in sequencing data without introducing many new errors. However, we found that many EC tools suffer from poor performance in certain sequence contexts such as regions with low coverage or regions that contain short repeated or low-complexity sequences. Reads overlapping such regions are often ill-corrected in an inconsistent manner, leading to breakpoints in the resulting assemblies that are not present in assemblies obtained from uncorrected data. Resolving this systematic flaw in future EC tools could greatly improve the applicability of such tools.