Abstract
Pedigrees, or family trees, are graphs of family relationships that are used to study inheritance. A fundamental problem in computational biology is to find, for a pedigree with n individuals genotyped at every site, a set of Mendelian-consistent haplotypes that have the minimum number of recombinations. This is an \(\mathsf {NP}\)-hard problem and some pedigrees can have thousands of individuals and hundreds of thousands of sites.
This paper formulates this problem as a optimization on a graph and introduces a tailored algorithm with a running time of \(O(n^{(k+2)}m^{6k})\) for n individuals, m sites, and k recombinations. Since there are generally only 1-2 recombinations per chromosome in each meiosis, k is small enough to make this algorithm practically relevant.
Full Manuscript. Pre-print publication of the full manuscript is available at arXiv [10].
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Abecasis, G., Cherny, S., Cookson, W., Cardon, L.: Merlin-rapid analysis of dense genetic maps using sparse gene flow trees. Nat. Genet. 30, 97–101 (2002)
Browning, S., Browning, B.: On reducing the statespace of hidden Markov models for the identity by descent process. Theoret. Popul. Biol. 62(1), 1–8 (2002)
Coop, G., Wen, X., Ober, C., Pritchard, J.K., Przeworski, M.: High-resolution mapping of crossovers reveals extensive variation in fine-scale recombination patterns among humans. Science 319(5868), 1395–1398 (2008)
Doan, D.D., Evans, P.A.: Fixed-parameter algorithm for haplotype inferences on general pedigrees with small number of sites. In: Moulton, V., Singh, M. (eds.) WABI 2010. LNCS, vol. 6293, pp. 124–135. Springer, Heidelberg (2010)
Doan, D., Evans, P.: An FPT haplotyping algorithm on pedigrees with a small number of sites. Algorithms Mol. Biol. 6, 1–8 (2011)
Fishelson, M., Dovgolevsky, N., Geiger, D.: Maximum likelihood haplotyping for general pedigrees. Hum. Hered. 59, 41–60 (2005)
Geiger, D., Meek, C., Wexler, Y.: Speeding up HMM algorithms for genetic linkage analysis via chain reductions of the state space. Bioinformatics 25(12), i196 (2009)
Geiger, D., Meek, C., Wexler, Y.: Speeding up HMM algorithms for genetic linkage analysis via chain reductions of the state space. Bioinformatics 25(12), i196–i203 (2009)
Iverson, K.E.: A Programming Language. Wiley, New York (1962)
Kirkpatrick, B.: Haplotype inference for pedigrees with few recombinations. arXiv 1602.04270 (2016). http://arxiv.org/abs/1602.04270
Lauritzen, S.L., Sheehan, N.A.: Graphical models for genetic analysis. Stat. Sci. 18(4), 489–514 (2003)
Li, J., Jiang, T.: Computing the minimum recombinant haplotype configuration from incomplete genotype data on a pedigree by integer linear programming. J. Comput. Biol. 12(6), 719–739 (2005)
O’Connell, J., Gurdasani, D., et al.: A general approach for haplotype phasing across the full spectrum of relatedness. PLoS Genet 10(4), e1004234 (2014)
Pirola, Y., Bonizzoni, P., Jiang, T.: An efficient algorithm for haplotype inference on pedigrees with recombinations and mutations. IEEE/ACM Trans. Comput. Biol. Bioinform. 9(1), 12–25 (2012)
Risch, N., Merikangas, K.: The future of genetic studies of complex human diseases. Science 273(5281), 1516–1517 (1996)
Sobel, E., Lange, K.: Descent graphs in pedigree analysis: applications to haplotyping, location scores, and marker-sharing statistics. Am. J. Hum. Genet. 58(6), 1323–1337 (1996)
Steel, M., Hein, J.: Reconstructing pedigrees: a combinatorial perspective. J. Theoret. Biol. 240(3), 360–367 (2006)
Thornton, T., McPeek, M.: Case-control association testing with related individuals: a more powerful quasi-likelihood score test. Am. J. Hum. Genet. 81, 321–337 (2007)
Wang, W.B., Jiang, T.: Inferring haplotypes from genotypes on a pedigree with mutations, genotyping errors and missing alleles. J. Bioinform. Comput. Biol. 9, 339–365 (2011)
Xiao, J., Lou, T., Jiang, T.: An efficient algorithm for haplotype inference on pedigrees with a small number of recombinants. Algorithmica 62(3), 951–981 (2012)
Acknowledgments
BK thanks M. Mnich at the Cluster of Excellence, Saarland University, Saarbrücken, Germany for critical reading of the manuscript. BK thanks arXiv for pre-print publication of the full manuscript [10].
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Kirkpatrick, B. (2016). Haplotype Inference for Pedigrees with Few Recombinations. In: Bourgeois, A., Skums, P., Wan, X., Zelikovsky, A. (eds) Bioinformatics Research and Applications. ISBRA 2016. Lecture Notes in Computer Science(), vol 9683. Springer, Cham. https://doi.org/10.1007/978-3-319-38782-6_23
Download citation
DOI: https://doi.org/10.1007/978-3-319-38782-6_23
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-38781-9
Online ISBN: 978-3-319-38782-6
eBook Packages: Computer ScienceComputer Science (R0)