Skip to main content

Computing Minimum Length Representations of Sets of Words of Uniform Length

  • Conference paper
  • First Online:
  • 665 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 8986))

Abstract

Motivated by text compression, the problem of representing sets of words of uniform length by partial words, i.e., sequences that may have some wildcard characters or holes, was recently considered and shown to be in \(\mathcal {P}\). Polynomial-time algorithms that construct representations were described using graph theoretical approaches. As more holes are allowed, representations shrink, and if representation is given, the set can be reconstructed. We further study this problem by determining, for a binary alphabet, the largest possible value of the size of a set of partial words that is important in deciding the representability of a given set S of words of uniform length. This largest value, surprisingly, is \(\varSigma _{i=0}^{|S|-1} 2^{\chi (i)}\) where \(\chi (i)\) is the number of ones in the binary representation of i, a well-studied digital sum, and it is achieved when the cardinality of S is a power of two. We show that circular representability is in \(\mathcal {P}\) and that unlike non-circular representability, it is easy to decide. We also consider the problem of computing minimum length representation (circular) total words, those without holes, and reduce it to a cost/flow network problem.

This material is based upon work supported by the National Science Foundation under Grant No. DMS–1060775.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Ahuja, R.K., Magnanti, T.L., Orlin, J.B.: Network Flows: Theory, Algorithms, and Applications. Prentice-Hall, New Jersey (1993)

    MATH  Google Scholar 

  2. Blanchet-Sadri, F., Munteanu, S.: Deciding representability of sets of words of equal length in polynomial time. In: Lecroq, T., Mouchard, L. (eds.) IWOCA 2013. LNCS, vol. 8288, pp. 28–40. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  3. Blanchet-Sadri, F., Simmons, S.: Deciding representability of sets of words of equal length. Theoret. Comput. Sci. 475, 34–46 (2013)

    Article  MATH  MathSciNet  Google Scholar 

  4. Flajolet, P., Grabner, P., Kirschenhofer, P., Prodinger, H., Tichy, F.: Mellin transforms and asymptotics: digital sums. Theoret. Comput. Sci. 123, 291–314 (1994)

    Article  MATH  MathSciNet  Google Scholar 

  5. Fredericksen, H.: A survey of full length nonlinear shift register cycle algorithms. SIAM Rev. 24, 195–221 (1982)

    Article  MathSciNet  Google Scholar 

  6. Gallant, J., Maier, D., Storer, J.A.: On finding minimal length superstrings. J. Comput. Syst. Sci. 20, 50–58 (1980)

    Article  MATH  MathSciNet  Google Scholar 

  7. Goldberg, A.V., Tarjan, R.E.: Finding minimum-cost circulations by successive approximation. Math. Oper. Res. 15, 430–466 (1990)

    Article  MATH  MathSciNet  Google Scholar 

  8. Harborth, H.: Number of odd binomial coefficients. Proc. Amer. Math. Soc. 62, 19–22 (1977)

    Article  MATH  MathSciNet  Google Scholar 

  9. Katz, J., Lindell, Y.: Introduction to Modern Cryptography: Principles and Protocols. Cryptography and Network Security. Chapman & Hall/CRC, Boca Raton (2008)

    Google Scholar 

  10. van Lint, J.H., MacWilliams, F.J., Sloane, N.J.A.: On pseudo-random arrays. SIAM J. Appl. Math. 36, 62–72 (1979)

    Article  MATH  MathSciNet  Google Scholar 

  11. Tan, S., Shallit, J.: Sets represented as the length-n factors of a word. In: Karhumäki, J., Lepistö, A., Zamboni, L. (eds.) WORDS 2013. LNCS, vol. 8079, pp. 250–261. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  12. Tarjan, R.: Depth-first search and linear graph algorithms. SIAM J. Comput. 1, 146–160 (1972)

    Article  MATH  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Francine Blanchet-Sadri .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Blanchet-Sadri, F., Lohr, A. (2015). Computing Minimum Length Representations of Sets of Words of Uniform Length. In: Jan, K., Miller, M., Froncek, D. (eds) Combinatorial Algorithms. IWOCA 2014. Lecture Notes in Computer Science(), vol 8986. Springer, Cham. https://doi.org/10.1007/978-3-319-19315-1_7

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-19315-1_7

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-19314-4

  • Online ISBN: 978-3-319-19315-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics