Forbidden Factors and Fragment Assembly

  • Conference paper
Developments in Language Theory (DLT 2001)

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2295)

Abstract

In this paper we approach the fragment assembly problem by using the notion of minimal forbidden factors introduced in a previous paper. Denoting by M(w) the set of minimal forbidden factors of a word w, we first focus on evaluating the size of the elements of M(w) and on designing an algorithm to recover the word w from M(w). Specifically, we prove that for a word w randomly generated by a memoryless source with identical symbol probabilities, the maximal length m(w) of the words in M(w) is logarithmic in the length of w, and that the reconstruction algorithm runs in linear time. These results have an interesting application to the fragment assembly problem, i.e., reconstructing a word w from a given set I of substrings (fragments). Indeed, under a suitable hypothesis on the set of fragments I, one can detect the elements of M(w) by looking at the minimal forbidden factors of the elements of I and then apply the reconstruction algorithm.
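To make the objects in the abstract concrete, the following is a minimal brute-force sketch in Python. It assumes the standard definition: a word v is a minimal forbidden factor of w if v is not a factor of w while every proper factor of v is. The function names `factors`, `minimal_forbidden_factors` and `reconstruct` are hypothetical, chosen for illustration. The reconstruction shown here is a naive search for the longest word avoiding M(w); it is not the linear-time algorithm of the paper, which this page does not describe.

```python
def factors(w):
    """Return the set of all factors (substrings) of w, including the empty word."""
    n = len(w)
    return {w[i:j] for i in range(n + 1) for j in range(i, n + 1)}


def minimal_forbidden_factors(w, alphabet):
    """Brute-force M(w) over the given alphabet (illustrative, not efficient).

    A word a+u+b of length >= 2 is minimal forbidden exactly when a+u and u+b
    are factors of w but a+u+b is not. A single letter is minimal forbidden
    when it does not occur in w.
    """
    fact = factors(w)
    mff = {a for a in alphabet if a not in fact}
    for au in fact:
        if not au:
            continue
        u = au[1:]
        for b in alphabet:
            if u + b in fact and au + b not in fact:
                mff.add(au + b)
    return mff


def reconstruct(mff, alphabet):
    """Naive recovery of w from M(w): the longest word avoiding every element.

    Assumes mff really is M(w) for some finite word w, so that the set of
    words avoiding it is finite and the search terminates.
    """
    longest = ""
    stack = [""]
    while stack:
        x = stack.pop()
        if len(x) > len(longest):
            longest = x
        for b in alphabet:
            y = x + b
            # x already avoids mff, so only the suffixes of y need checking.
            if not any(y[i:] in mff for i in range(len(y))):
                stack.append(y)
    return longest


m = minimal_forbidden_factors("abba", "ab")
print(sorted(m))             # ['aa', 'aba', 'bab', 'bbb']
print(max(len(v) for v in m))  # m(w) = 3
print(reconstruct(m, "ab"))  # abba
```

The search works because the words avoiding M(w) are exactly the factors of w, so w is the unique longest one. It visits every factor of w and is therefore quadratic at best, which is why an efficient reconstruction procedure matters for long sequences.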

Partially supported by MURST projects: Modelli di calcolo innovativi: metodi sintattici e combinatori and Bioinformatica e Ricerca Genomica

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Mignosi, F., Restivo, A., Sciortino, M. (2002). Forbidden Factors and Fragment Assembly. In: Kuich, W., Rozenberg, G., Salomaa, A. (eds) Developments in Language Theory. DLT 2001. Lecture Notes in Computer Science, vol 2295. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46011-X_31

  • DOI: https://doi.org/10.1007/3-540-46011-X_31

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-43453-5

  • Online ISBN: 978-3-540-46011-4

  • eBook Packages: Springer Book Archive
