Abstract
A current high-priority phase of human genomics involves the development of a full Haplotype Map of the human genome [23]. The map will be used in large-scale population screens to associate specific haplotypes with specific complex, genetically influenced diseases. A key problem, perhaps the bottleneck, is to computationally infer haplotype pairs from genotype data. This paper follows the talk given at the DIMACS Conference on SNPs and Haplotypes held in November 2002. It reviews several combinatorial approaches to the haplotype inference problem that we have investigated over the last several years, updates some of the work presented earlier, and discusses the current state of our work.
References
Bafna, V., Gusfield, D., Lancia, G., Yooseph, S.: Haplotyping as perfect phylogeny: A direct approach. Technical report, UC Davis, Department of Computer Science (2002)
Bafna, V., Gusfield, D., Lancia, G., Yooseph, S.: Haplotyping as perfect phylogeny: A direct approach. J. Computational Biology 10, 323–340 (2003)
Bixby, R.E., Wagner, D.K.: An almost linear-time algorithm for graph realization. Mathematics of Operations Research 13, 99–123 (1988)
Chung, R.H., Gusfield, D.: Empirical exploration of perfect phylogeny haplotyping and haplotypers. In: Warnow, T.J., Zhu, B. (eds.) COCOON 2003. LNCS, vol. 2697, pp. 5–19. Springer, Heidelberg (2003)
Chung, R.H., Gusfield, D.: Perfect phylogeny haplotyper: Haplotype inferral using a tree model. Bioinformatics 19(6), 780–781 (2003)
Clark, A.: Inference of haplotypes from PCR-amplified samples of diploid populations. Mol. Biol. Evol. 7, 111–122 (1990)
Clark, A., Weiss, K., Nickerson, D., et al.: Haplotype structure and population genetic inferences from nucleotide-sequence variation in human lipoprotein lipase. Am. J. Human Genetics 63, 595–612 (1998)
Daly, M., Rioux, J., Schaffner, S., Hudson, T., Lander, E.: High-resolution haplotype structure in the human genome. Nature Genetics 29, 229–232 (2001)
Donnelly, P.: Comments made in a lecture given at the DIMACS conference on Computational Methods for SNPs and Haplotype Inference (November 2002)
Eskin, E., Halperin, E., Karp, R.: Large scale reconstruction of haplotypes from genotype data. In: Proceedings of RECOMB 2003 (April 2003)
Eskin, E., Halperin, E., Karp, R.: Efficient reconstruction of haplotype structure via perfect phylogeny. Technical report, UC Berkeley, Computer Science Division, EECS (2002)
Fullerton, M., Clark, A., Sing, C., et al.: Apolipoprotein E variation at the sequence haplotype level: implications for the origin and maintenance of a major human polymorphism. Am. J. of Human Genetics, 881–900 (2000)
Gavril, F., Tamari, R.: An algorithm for constructing edge-trees from hypergraphs. Networks 13, 377–388 (1983)
Gusfield, D.: Efficient algorithms for inferring evolutionary history. Networks 21, 19–28 (1991)
Gusfield, D.: Algorithms on Strings, Trees and Sequences: Computer Science and Computational Biology. Cambridge University Press, Cambridge (1997)
Gusfield, D.: A practical algorithm for deducing haplotypes in diploid populations. In: Proceedings of 8th International Conference on Intelligent Systems in Molecular Biology, pp. 183–189. AAAI Press (2000)
Gusfield, D.: Inference of haplotypes from samples of diploid populations: complexity and algorithms. J. Computational Biology 8(3) (2001)
Gusfield, D.: Haplotyping as Perfect Phylogeny: Conceptual Framework and Efficient Solutions (Extended Abstract). In: Proceedings of RECOMB 2002: The Sixth Annual International Conference on Computational Biology, pp. 166–175 (2002)
Gusfield, D.: Haplotype inference by pure parsimony. In: Baeza-Yates, R., Chávez, E., Crochemore, M. (eds.) CPM 2003. LNCS, vol. 2676, pp. 144–155. Springer, Heidelberg (2003)
Gusfield, D., Eddhu, S., Langley, C.: Optimal, efficient reconstruction of phylogenetic networks with constrained recombination. J. Bioinformatics and Computational Biology (to appear)
Gusfield, D., Eddhu, S., Langley, C.: Efficient reconstruction of phylogenetic networks (of SNPs) with constrained recombination. In: Proceedings of 2nd CSB Bioinformatics Conference. IEEE Press, Los Alamitos (2003)
Gusfield, D., Eddhu, S., Langley, C.: The fine structure of galls in phylogenetic networks with recombination. Technical report, UC Davis, Department of Computer Science (2003)
Helmuth, L.: Genome research: Map of the human genome 3.0. Science 293(5530), 583–585 (2001)
Hubbell, E.: Personal Communication (August 2000)
Hudson, R.: Gene genealogies and the coalescent process. Oxford Survey of Evolutionary Biology 7, 1–44 (1990)
Lancia, G., Pinotti, C., Rizzi, R.: Haplotyping populations: Complexity and approximations. Technical report DIT-02-082, University of Trento (2002)
Lawler, E.L.: Combinatorial Optimization: Networks and Matroids. Holt, Rinehart and Winston (1976)
Lin, S., Cutler, D., Zwick, M., Chakravarti, A.: Haplotype inference in random population samples. Am. J. of Hum. Genet. 71, 1129–1137 (2003)
Niu, T., Qin, Z., Xu, X., Liu, J.S.: Bayesian haplotype inference for multiple linked single-nucleotide polymorphisms. Am. J. Hum. Genet. 70, 157–169 (2002)
Orzack, S., Gusfield, D., Olson, J., Nesbitt, S., Stanton, V.: Analysis and exploration of the use of rule-based algorithms and consensus methods for the inferral of haplotypes. Genetics 165, 915–928 (2003)
Orzack, S., Gusfield, D., Stanton, V.: The absolute and relative accuracy of haplotype inferral methods and a consensus approach to haplotype inferral. Abstract Nr 115 in Am. Society of Human Genetics (Supplement 2001)
Papadimitriou, C., Steiglitz, K.: Combinatorial Optimization: Algorithms and Complexity. Prentice-Hall, Englewood Cliffs (1982)
Ravi, R.: Personal Communication
Stephens, M., Smith, N., Donnelly, P.: A new statistical method for haplotype reconstruction from population data. Am. J. Human Genetics 68, 978–989 (2001)
Tavare, S.: Calibrating the clock: Using stochastic processes to measure the rate of evolution. In: Lander, E., Waterman, M. (eds.) Calculating the Secrets of Life. National Academy Press, Washington (1995)
Wang, L., Xu, L.: Haplotype inference by maximum parsimony. Bioinformatics 19, 1773–1780 (2003)
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gusfield, D. (2004). An Overview of Combinatorial Methods for Haplotype Inference. In: Istrail, S., Waterman, M., Clark, A. (eds) Computational Methods for SNPs and Haplotype Inference. RSNPsH 2002. Lecture Notes in Computer Science, vol 2983. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24719-7_2
DOI: https://doi.org/10.1007/978-3-540-24719-7_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21249-2
Online ISBN: 978-3-540-24719-7
eBook Packages: Springer Book Archive