On the computational complexity of the maximum parsimony reconciliation problem in the duplication-loss-coalescence model

Algorithms Mol Biol. 2017 Mar 14:12:6. doi: 10.1186/s13015-017-0098-8. eCollection 2017.

Abstract

Background: Phylogenetic tree reconciliation is a widely-used method for inferring the evolutionary histories of genes and species. In the duplication-loss-coalescence (DLC) model, we seek a reconciliation that explains the incongruence between a gene and species tree using gene duplication, loss, and deep coalescence events. In the maximum parsimony framework, costs are associated with these event types and a reconciliation is sought that minimizes the total cost of the events required to map the gene tree onto the species tree.

Results: We show that this problem is NP-hard even for the special case of minimizing the number of duplications. We then show that the problem is APX-hard when both duplications and losses are considered, implying that no polynomial-time approximation scheme can exist for the problem unless P = NP.

Conclusions: These intractability results are likely to guide future research on algorithmic aspects of the DLC-reconciliation problem.

Keywords: APX-hardness; Duplication-loss-coalescence model; NP-hardness; Phylogenetic reconciliation.