Influence of Tree Topology Restrictions on the Complexity of Haplotyping with Missing Data
Haplotyping, also known as haplotype phase prediction, is the problem of predicting likely haplotypes from genotype data. One fast haplotyping method is based on an evolutionary model in which a perfect phylogenetic tree is sought that explains the observed data. Unfortunately, when data entries are missing, as is often the case in laboratory data, the resulting incomplete perfect phylogeny haplotyping problem (IPPH) is NP-complete, and no theoretical results are known concerning its approximability, its fixed-parameter tractability, or exact algorithms for it. Even radically simplified versions, such as the restriction to phylogenetic trees consisting of just two directed paths from a given root, are still NP-complete, although a fixed-parameter algorithm is known for that case. We show that such drastic and ad hoc simplifications are not necessary to make IPPH fixed-parameter tractable: we present the first theoretical analysis of an algorithm, which we develop in the course of the paper, that works for arbitrary instances of IPPH. On the negative side, we show that restricting the topology of perfect phylogenies does not always reduce the computational complexity: while the incomplete directed perfect phylogeny problem is well known to be solvable in polynomial time, we show that the same problem restricted to path topologies is NP-complete.
Keywords: Tree Topology, Node Label, Light Component, Mutation Tree, Tree Record
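To make the problem statement in the abstract concrete, the following is a minimal brute-force sketch of haplotyping with missing data under a directed perfect phylogeny model. It is not the algorithm developed in the paper, and it runs in exponential time. Several things here are assumptions for illustration only: the genotype encoding (0 and 1 for homozygous sites, 2 for heterozygous sites, `None` for a missing entry), the restriction to the directed variant with an all-zero root, and all function names. The phylogeny test uses the standard pairwise-column criterion: a binary matrix admits a directed perfect phylogeny exactly when no two columns contain all three of the row patterns (0,1), (1,0), and (1,1).

```python
from itertools import combinations, product

FORBIDDEN = {(0, 1), (1, 0), (1, 1)}

def admits_directed_perfect_phylogeny(haplotypes):
    """Pairwise-column test for a binary haplotype matrix (rows = haplotypes).

    Assumes an all-zero root: the matrix is admissible exactly when no pair
    of columns shows all three patterns (0,1), (1,0) and (1,1).
    """
    if not haplotypes:
        return True
    n_sites = len(haplotypes[0])
    for i, j in combinations(range(n_sites), 2):
        seen = {(h[i], h[j]) for h in haplotypes}
        if FORBIDDEN <= seen:
            return False
    return True

def explaining_pairs(genotype):
    """Yield every haplotype pair that explains one genotype row.

    Hypothetical encoding: 0/1 = homozygous, 2 = heterozygous, None = missing.
    """
    per_site = []
    for g in genotype:
        if g == 0:
            per_site.append([(0, 0)])
        elif g == 1:
            per_site.append([(1, 1)])
        elif g == 2:
            per_site.append([(0, 1), (1, 0)])
        else:  # missing entry: any combination of alleles is allowed
            per_site.append([(0, 0), (1, 1), (0, 1), (1, 0)])
    for choice in product(*per_site):
        yield tuple(a for a, _ in choice), tuple(b for _, b in choice)

def brute_force_ipph(genotypes):
    """Try all explanations; return one list of haplotype pairs or None."""
    candidates = [list(explaining_pairs(g)) for g in genotypes]
    for pairs in product(*candidates):
        haplotypes = [h for pair in pairs for h in pair]
        if admits_directed_perfect_phylogeny(haplotypes):
            return list(pairs)
    return None

if __name__ == "__main__":
    # The missing entry in the third genotype can only be filled with 1.
    print(brute_force_ipph([(1, 1), (0, 1), (1, None)]))
    # No explanation exists: the three forbidden patterns are forced.
    print(brute_force_ipph([(0, 1), (1, 0), (1, 1)]))
```

The search space grows exponentially with the number of heterozygous and missing entries, which is why the sketch is only useful for tiny instances and why the complexity results summarized in the abstract matter.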