An approximation algorithm for alphabet indexing problem

Shimozono, Shinichi

doi:10.1007/BFb0015403

Shinichi Shimozono¹

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1004))

Included in the following conference series:

International Symposium on Algorithms and Computation

163 Accesses

Abstract

A K-indexing is a mapping from an alphabet Σ to a set Γ of K symbols forming the homomorphism to transform strings. Given Σ, two disjoint sets of strings P, Q over Σ and Γ={1,..., K}, Alphabet Indexing is the problem to find a K-indexing that transforms no two different strings taken from each P and Q into the same one. Although this problem is NP-complete, applying K-indexing to input data brings remarkable advantages in actual applications. In this paper, we introduce Max K-Indexing, a maximization version of Alphabet Indexing, that intends to maximize the number of pairs in P×Q whose strings are not transformed into the same ones. We show that the problem is MAX SNP-hard, that is, the problem seems to have no polynomial-time algorithm achieving an arbitrary small error ratio. Then we propose a simple polynomial-time greedy algorithm and show that the algorithm attains the constant error ratio 1/K for K-indexing. Also we define M-Tuple K-Indexing problem by extending pairs of strings in Max K-Indexing to tuples of strings over more than two sets, and show that a natural extension of the algorithm also achieves a constant error bound.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

S. Arikawa, S. Miyano, A. Shinohara, S. Kuhara, Y. Mukouchi, and T. Shinohara, A machine discovery from amino acid sequences by decision trees over regular patterns, New Gener. Comput., 11 (1993), pp. 361–375.
Google Scholar
S. Arora, C. Lund, R. Motwani, M. Sudan, and M. Szegedy, Proof verification and hardness of approximation problems, in Proc. 33rd Annual Symposium on Foundations of Computer Science, 1992, pp. 14–23.
Google Scholar
A. Blum, T. Jiang, M. Li, J. Tromp, and M. Yannakakis, Linear approximation of shortest superstrings, in Proc. 23rd Annual ACM Symposium on Theory of Computing, 1991, pp. 328–336.
Google Scholar
S. Fukamachi, S. Shimozono, H. Arimura, and T. Shinohara, Lossy text compression for string pattern matching, Technical Report of IEICE NLC95-6, 1995.
Google Scholar
M. R. Garey and D. S. Johnson, Computers and Intractability: A Guide to the Theory of NP-Completeness, W. H. Freeman and Company, New York, 1978.
Google Scholar
M. X. Goemans and D. P. Williamson, 878-approximation algorithms for MAX CUT and MAX 2SAT, in Proc. 26th Annual ACM Symposium on Theory of Computing, 1994, pp. 422–431.
Google Scholar
D. S. Johnson, Approximation algorithms for combinatorial problems, J. Comput. Sys. Sci., 9 (1974), pp. 256–278.
Google Scholar
C. H. Papadimitriou, Computational Complexity, Addison-Wesley Publishing Company, Reading, Massachusetts, 1994.
Google Scholar
C. H. Papadimitriou and M. Yannakakis, Optimization, approximation, and complexity classes, J. Comput. Sys. Sci., 43 (1991), pp. 425–440.
Google Scholar
S. Shimozono and S. Miyano, Complexity of finding alphabet indexing, IEICE Trans. Inf. Sys., E78-D (1995), pp. 13–18.
Google Scholar
S. Shimozono, A. Shinohara, T. Shinohara, S. Miyano, S. Kuhara, and S. Arikawa, Knowledge acquisition from amino acid sequences by machine learning system BONSAI, Trans. Inf. Proc. Soc. Japan, 35 (1994), pp. 2009–2018.
Google Scholar
M. Yannakakis, On the approximation of maximum satisfiability, J. Algorithms, 17 (1994), pp. 475–502.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Control Engineering and Science, Kyushu Institute of Technology, Kawazu 680-4, 820, Iizuka, Japan
Shinichi Shimozono

Authors

Shinichi Shimozono
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

John Staples Peter Eades Naoki Katoh Alistair Moffat

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Shimozono, S. (1995). An approximation algorithm for alphabet indexing problem. In: Staples, J., Eades, P., Katoh, N., Moffat, A. (eds) Algorithms and Computations. ISAAC 1995. Lecture Notes in Computer Science, vol 1004. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0015403

Download citation

DOI: https://doi.org/10.1007/BFb0015403
Published: 09 June 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-60573-7
Online ISBN: 978-3-540-47766-2
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics