Abstract
In this paper we approach the fragment assembly problem using the notion of minimal forbidden factors introduced in a previous paper. Denoting by M(w) the set of minimal forbidden factors of a word w, we first focus on evaluating the size of the elements of M(w) and on designing an algorithm to recover the word w from M(w). In particular, we prove that for a word w randomly generated by a memoryless source with identical symbol probabilities, the maximal length m(w) of the words in M(w) is logarithmic in the length of w, and that the reconstruction algorithm runs in linear time. These results have an interesting application to the fragment assembly problem, i.e., reconstructing a word w from a given set I of substrings (fragments). Indeed, under a suitable hypothesis on the set of fragments I, one can detect the elements of M(w) by looking at the minimal forbidden factors of the elements of I and then apply the reconstruction algorithm.
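To make the objects in the abstract concrete, the following is a minimal Python sketch, not the linear-time algorithm of the paper. It assumes the standard definition: a word v is a minimal forbidden factor of w if v is not a factor of w while both its longest proper prefix and its longest proper suffix are. The function names (`factors`, `minimal_forbidden_factors`, `reconstruct`) and the explicit `alphabet` argument are illustrative assumptions. The reconstruction is a naive breadth-first search that relies on the fact that the words avoiding every element of M(w) are exactly the factors of w, so the longest such word is w itself.

```python
def factors(w):
    """Return the set of all factors (substrings) of w, including the empty word."""
    return {w[i:j] for i in range(len(w)) for j in range(i + 1, len(w) + 1)} | {""}


def minimal_forbidden_factors(w, alphabet):
    """Naive computation of M(w) over a given alphabet (assumed known).

    v = x + b is minimal forbidden when x is a factor of w, v is not,
    and the suffix v[1:] is a factor of w.
    """
    fact = factors(w)
    mff = set()
    for x in fact:
        for b in alphabet:
            v = x + b
            if v not in fact and v[1:] in fact:
                mff.add(v)
    return mff


def reconstruct(mff, alphabet):
    """Naive reconstruction of w from M(w): longest word avoiding all of mff.

    Assumes mff really is M(w) for some finite word w; otherwise the set of
    avoiding words may be infinite and this loop may not terminate.
    """
    level = [""]      # all avoiding words of the current length
    longest = ""
    while level:
        longest = level[0]
        nxt = []
        for x in level:
            for b in alphabet:
                v = x + b
                # x already avoids mff, so only suffixes of v need checking
                if not any(v.endswith(f) for f in mff):
                    nxt.append(v)
        level = nxt
    return longest


def max_mff_length(w, alphabet):
    """m(w): the maximal length of a word in M(w) (0 if M(w) is empty)."""
    return max((len(v) for v in minimal_forbidden_factors(w, alphabet)), default=0)
```

For example, over the alphabet {a, b} the word abba has M(abba) = {aa, aba, bab, bbb}, so m(abba) = 3, and `reconstruct` applied to that set returns abba. The sketch runs in polynomial rather than linear time; it is meant only to illustrate the definitions.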
Partially supported by MURST projects: Modelli di calcolo innovativi: metodi sintattici e combinatori and Bioinformatica e Ricerca Genomica
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Mignosi, F., Restivo, A., Sciortino, M. (2002). Forbidden Factors and Fragment Assembly. In: Kuich, W., Rozenberg, G., Salomaa, A. (eds) Developments in Language Theory. DLT 2001. Lecture Notes in Computer Science, vol 2295. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46011-X_31
DOI: https://doi.org/10.1007/3-540-46011-X_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43453-5
Online ISBN: 978-3-540-46011-4
eBook Packages: Springer Book Archive