Login / Signup

IDXL: Species Tree Inference Using Internode Distance and Excess Gene Leaf Count.

Sourya BhattacharyyaJayanta Mukherjee
Published in: Journal of molecular evolution (2017)
We propose an extension of the distance matrix methods NJst and ASTRID to infer species trees from incongruent gene trees having Incomplete Lineage Sorting. Both approaches consider the average internode distance (ID) between individual taxa pairs as the distance measure. The measure ID does not use the root of a tree, and thus may not always infer the relative position of a taxon with respect to the root. We define a novel distance measure excess gene leaf count (XL) between individual couplets. The XL measure is computed using the root of a tree. It is proved to be additive, and is shown to infer the relative order of divergence among individual couplets better. We propose a novel method IDXL which uses both the XL and ID measures for species tree construction. IDXL is shown to perform better than NJst and other distance matrix approaches for most of the biological and simulated datasets. Having the same computational complexity as NJst, IDXL can be applied for species tree inference on large-scale biological datasets.
Keyphrases
  • copy number
  • single cell
  • genome wide
  • rna seq
  • genetic diversity
  • peripheral blood
  • gene expression
  • transcription factor
  • magnetic resonance
  • diffusion weighted imaging