Abstract
We study the problem of finding the sequence of an unknown DNA fragment given the set of its k-long subsequences and a homologous sequence, namely a sequence that is similar to the target sequence. Such a sequence is available in some applications, e.g., when detecting single nucleotide polymorphisms. Pe’er and Shamir studied this problem and presented a heuristic algorithm for it. In this paper, we give an algorithm with provable performance: We show that under some assumptions, the algorithm can reconstruct a random sequence of length O(4k) with high probability. We also show that no algorithm can reconstruct sequences of length Ω(logk·4k).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Adleman, L.M.: Location sensitive sequencing of DNA. Technical report, University of Southern California (1998)
Arratia, R., Martin, D., Reinert, G., Waterman, M.S.: Poisson process approximation for sequence repeats, and sequencing by hybridization. J. of Computational Biology 3(3), 425–463 (1996)
Bains, W., Smith, G.C.: A novel method for nucleic acid sequence determination. J. Theor. Biology 135, 303–307 (1988)
Ben-Dor, A., Pe’er, I., Shamir, R., Sharan, R.: On the complexity of positional sequencing by hybridization. J. Theor. Biology 8(4), 88–100 (2001)
Błażewicz, J., Formanowicz, P., Glover, F., Kasprzak, M., Wȩglarz, J.: An improved tabu search algorithm for DNA sequencing with errors. In: Proc. 3rd Metaheuristics International Conference, pp. 69–75 (1999)
Błażewicz, J., Formanowicz, P., Guinand, F., Kasprzak, M.: A heuristic managing errors for DNA sequencing. Bioinformatics 18(5), 652–660 (2002)
Błażewicz, J., Formanowicz, P., Kasprzak, M., Markiewicz, W.T., Wȩglarz, J.: DNA sequencing with positive and negative errors. J. of Computational Biology 6(1), 113–123 (1999)
Błażewicz, J., Formanowicz, P., Kasprzak, M., Markiewicz, W.T., Wȩglarz, J.: Tabu search for dna sequencing with false negatives and false positives. European Journal of Operational Research 125, 257–265 (2000)
Błażewicz, J., Kaczmarek, J., Kasprzak, M., Markiewicz, W.T., Wȩglarz, J.: Sequential and parallel algorithms for DNA sequencing. CABIOS 13, 151–158 (1997)
Błażewicz, J., Kasprzak, M., Kuroczycki, W.: Hybrid genetic algorithm for DNA sequencing with errors. J. of Heuristics 8, 495–502 (2002)
Broude, S.D., Sano, T., Smith, C.S., Cantor, C.R.: Enhanced DNA sequencing by hybridization. Proc. Nat. Acad. Sci. USA 91, 3072–3076 (1994)
Drmanac, R., Labat, I., Brukner, I., Crkvenjakov, R.: Sequencing of megabase plus DNA by hybridization: theory of the method. Genomics 4, 114–128 (1989)
Dyer, M.E., Frieze, A.M., Suen, S.: The probability of unique solutions of sequencing by hybridization. J. of Computational Biology 1, 105–110 (1994)
Frieze, A., Preparata, F., Upfal, E.: Optimal reconstruction of a sequence from its probes. J. of Computational Biology 6, 361–368 (1999)
Frieze, M., Halldórsson, B.V.: Optimal sequencing by hybridization in rounds. J. of Computational Biology 9(2), 355–369 (2002)
Halperin, E., Halperin, S., Hartman, T., Shamir, R.: Handling long targets and errors in sequencing by hybridization. In: Proc. 6th Annual International Conference on Computational Molecular Biology (RECOMB 2002), pp. 176–185 (2002)
Hannenhalli, S., Pevzner, P.A., Lewis, H., Skiena, S.: Positional sequencing by hybridization. Computer Applications in the Biosciences 12, 19–24 (1996)
Heath, S.A., Preparata, F.P.: Enhanced sequence reconstruction with DNA microarray application. In: Wang, J. (ed.) COCOON 2001. LNCS, vol. 2108, pp. 64–74. Springer, Heidelberg (2001)
Heath, S.A., Preparata, F.P., Young, J.: Sequencing by hybridization using direct and reverse cooperating spectra. In: Proc. 6th Annual International Conference on Computational Molecular Biology (RECOMB 2002), pp. 186–193 (2002)
Leong, H.W., Preparata, F.P., Sung, W.K., Willy, H.: On the control of hybridization noise in DNA sequencing-by-hybridization. In: Guigó, R., Gusfield, D. (eds.) WABI 2002. LNCS, vol. 2452, pp. 392–403. Springer, Heidelberg (2002)
Lipshutz, R.J.: Likelihood DNA sequencing by hybridization. J. Biomolecular Structure and Dynamics 11, 637–653 (1993)
Lysov, Y., Floretiev, V., Khorlyn, A., Khrapko, K., Shick, V., Mirzabekov, A.: DNA sequencing by hybridization with oligonucleotides. Dokl. Acad. Sci. USSR 303, 1508–1511 (1988)
Margaritis, D., Skiena, S.: Reconstructing strings from substrings in rounds. In: Proc. 36th Symposium on Foundation of Computer Science (FOCS 1995), pp. 613–620 (1995)
Pe’er, I., Arbili, N., Shamir, R.: A computational method for resequencing long dna targets by universal oligonucleotide arrays. Proc. National Academy of Science USA 99, 15497–15500 (2002)
Pe’er, I., Shamir, R.: Spectrum alignment: Efficient resequencing by hybridization. In: Proc. 8th International Conference on Intelligent Systems in Molecular Biology (ISMB 2000), pp. 260–268 (2000)
Pevzner, P.A.: l-tuple DNA sequencing: Computer analysis. J. Biomolecular Structure and Dynamics 7, 63–73 (1989)
Pevzner, P.A., Lysov, Y.P., Khrapko, K.R., Belyavsky, A.V., Florentiev, V.L., Mirzabekov, A.D.: Improved chips for sequencing by hybridization. J. Biomolecular Structure and Dynamics 9, 399–410 (1991)
Preparata, F., Upfal, E.: Sequencing by hybridization at the information theory bound: an optimal algorithm. In: Proc. 4th Annual International Conference on Computational Molecular Biology (RECOMB 2000), pp. 88–100 (2000)
Shamir, R., Tsur, D.: Large scale sequencing by hybridization. J. of Computational Biology 9(2), 413–428 (2002)
Skiena, S., Snir, S.: Restricting SBH ambiguity via restriction enzymes. In: Guigó, R., Gusfield, D. (eds.) WABI 2002. LNCS, vol. 2452, pp. 404–417. Springer, Heidelberg (2002)
Skiena, S., Sundaram, G.: Reconstructing strings from substrings. J. of Computational Biology 2, 333–353 (1995)
Snir, S., Yeger-Lotem, E., Chor, B., Yakhini, Z.: Using restriction enzymes to improve sequencing by hybridization. Technical Report CS-2002-14, Technion, Haifa, Israel (2002)
Tsur, D.: Sequencing by hybridization with errors: Handling longer sequences. Manuscript (2003)
Tsur, D.: Sequencing by hybridization in few rounds. In: Benson, G., Page, R.D.M. (eds.) WABI 2003. LNCS (LNBI), vol. 2812, pp. 498–511. Springer, Heidelberg (2003) (to appear)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tsur, D. (2003). Bounds for Resequencing by Hybridization. In: Benson, G., Page, R.D.M. (eds) Algorithms in Bioinformatics. WABI 2003. Lecture Notes in Computer Science(), vol 2812. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39763-2_35
Download citation
DOI: https://doi.org/10.1007/978-3-540-39763-2_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20076-5
Online ISBN: 978-3-540-39763-2
eBook Packages: Springer Book Archive