Login / Signup

Exploring the accuracy and limits of algorithms for localizing recombination breakpoints.

Shi CenDavid A Rasmussen
Published in: Molecular biology and evolution (2024)
Phylogenetic methods are widely used to reconstruct the evolutionary relationships among species and in dividuals. However, recombination can obscure ancestral relationships as individuals may inherit different regions of their genome from different ancestors. It is therefore often necessary to detect recombination events, locate recombination breakpoints and select recombination-free alignments prior to reconstructing phylogenetic trees. While many earlier studies examined the power of different methods to detect recombination, very few have examined the ability of these methods to accurately locate recombination breakpoints. In this study, we simulated genome sequences based on ancestral recombination graphs and explored the accuracy of three popular recombination detection methods: MaxChi, 3SEQ and GARD. The accuracy of inferred breakpoint locations was evaluated along with the key factors contributing to variation in accuracy across data sets. While many different genomic features contribute to the variation in performance across methods, the number of informative sites consistent with the pattern of inheritance between parent and recombinant child sequences always has the greatest contribution to accuracy. While partitioning sequence alignments based on identified recombination breakpoints can greatly decrease phylogenetic error, the quality of phylogenetic reconstructions depends very little on how breakpoints are chosen to partition the alignment. Our work sheds light on how different features of recombinant genomes affect the performance of recombination detection methods and suggests best practices for reconstructing phylogenies based on recombination-free alignments.
Keyphrases
  • dna repair
  • dna damage
  • genome wide
  • primary care
  • oxidative stress
  • mental health
  • machine learning
  • gene expression
  • big data
  • quality improvement
  • cell free
  • artificial intelligence
  • rna seq
  • quantum dots
  • image quality