Abstract
The shortest common superstring and the shortest common supersequence are two well studied problems having a wide range of applications. In this paper we consider both problems with resource constraints, denoted as the Restricted Common Superstring (shortly RCSstr) problem and the Restricted Common Supersequence (shortly RCSseq). In the RCSstr (RCSseq) problem we are given a set S of n strings, s 1, s 2, …, s n , and a multiset t = {t 1, t 2, …, t m }, and the goal is to find a permutation π: {1, …, m} → {1, …, m} to maximize the number of strings in S that are substrings (subsequences) of π(t) = t π(1) t π(2) ⋯ t π(m) (we call this ordering of the multiset, π(t), a permutation of t). We first show that in its most general setting the RCSstr problem is NP-complete and hard to approximate within a factor of n 1 − ε, for any ε > 0, unless P = NP. Afterwards, we present two separate reductions to show that the RCSstr problem remains NP-Hard even in the case where the elements of t are drawn from a binary alphabet or for the case where all input strings are of length two. We then present some approximation results for several variants of the RCSstr problem. In the second part of this paper, we turn to the RCSseq problem, where we present some hardness results, tight lower bounds and approximation algorithms.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Barone, P., Bonizzoni, P., Vedova, G.D., Mauri, G.: An approximation algorithm for the shortest common supersequence problem: an experimental analysis. In: Symposium on Applied Computing, pp. 56–60 (2001)
Berger, B., Peter, W.S.: Approximation algorithms for the maximum acyclic subgraph problem. In: SODA, pp. 236–243 (1990)
Bläser, M.: A 3/4-approximation algorithm for maximum atsp with weights zero and one. In: APPROX-RANDOM, pp. 61–71 (2004)
Blum, A., Jiang, T., Li, M., Tromp, J., Yannakakis, M.: Linear approximation of shortest superstrings. J. ACM 41(4), 630–647 (1994)
Charikar, M., Guruswami, V., Manokaran, R.: Every permutation csp of arity 3 is approximation resistant. In: IEEE Conference on Computational Complexity, pp. 62–73 (2009)
Cotta, C.: Memetic algorithms with partial lamarckism for the shortest common supersequence problem. In: IWINAC, vol. (2), pp. 84–91 (2005)
Engebretsen, L., Karpinski, M.: Approximation hardness of tsp with bounded metrics. In: ICALP, pp. 201–212 (2001)
Garey, M.R., Johnson, D.S.: Computers and intractability. A guide to the theory of NP-completeness. W. H. Freeman, New York (1979)
Gotthilf, Z., Lewenstein, M.: Improved approximation results on the shortest common supersequence problem. In: SPIRE, pp. 277–284 (2009)
Gotthilf, Z., Lewenstein, M., Popa, A.: On shortest common superstring and swap permutations. In: SPIRE, pp. 270–278 (2010)
Guruswami, V., Manokaran, R., Raghavendra, P.: Beating the random ordering is hard: Inapproximability of maximum acyclic subgraph. In: FOCS, pp. 573–582 (2008)
Jiang, T., Li, M.: On the approximation of shortest common supersequences and longest common subsequences. SIAM J. Comput. 24(5), 1122–1139 (1995)
Kaplan, H., Lewenstein, M., Shafrir, N., Sviridenko, M.: Approximation algorithms for asymmetric tsp by decomposing directed regular multigraphs. J. ACM 52(4), 602–626 (2005)
Maier, D.: The complexity of some problems on subsequences and supersequences. J. ACM 25(2), 322–336 (1978)
Middendorf, M.: The shortest common nonsubsequence problem is NP-complete. Theor. Comput. Sci. 108(2), 365–369 (1993)
Ott, S.: Lower bounds for approximating shortest superstrings over an alphabet of size 2. In: WG, pp. 55–64 (1999)
Papadimitriou, C.H., Yannakakis, M.: Optimization, approximation, and complexity classes. J. Comput. Syst. Sci. 43(3), 425–440 (1991)
Pevzner, P.A.: Multiple alignment, communication cost, and graph matching. SIAM Journal of Applied Mathematics 52(6), 1763–1779 (1992)
Räihä, K.-J., Ukkonen, E.: The shortest common supersequence problem over binary alphabet is NP-complete. Theor. Comput. Sci. 16, 187–198 (1981)
Rubinov, A.R., Timkovsky, V.G.: String noninclusion optimization problems. SIAM J. Discrete Math. 11(3), 456–467 (1998)
Sacerdoti, E.D.: A structure for plans and behavior. Elsevier, Amsterdam (1977)
Sankoff, D., Kruskal, J.: Time Warps, String Edits, and Macromolecules: The Theory and Practice of Sequence Comparison. CSLI Publications, Stanford (1983)
Sellis, T.K.: Multiple-query optimization. ACM Trans. Database Syst. 13(1), 23–52 (1988)
Storer, J.A.: Data Compression: Methods and Theory. Computer Science Press, Rockville (1988)
Sweedyk, Z.: A 2\(\frac{1}{2}\)-approximation algorithm for shortest superstring. SIAM J. Comput. 29(3), 954–986 (1999)
Tate, A.: Generating project networks. In: IJCAI, pp. 888–893 (1977)
Timkovsky, V.G.: Some approximations for shortest common nonsubsequences and supersequences. In: SPIRE, pp. 257–268 (2008)
Vazirani, V.V.: Approximation Algorithms. Springer, Heidelberg (2004)
Wilensky, R.: Planning and understanding. Addison-Wesley, Reading (1983)
Wilkins, D.E.: Practical planning: Extending the classical AI planning paradigm. Morgan Kaufmann, San Francisco (1988)
Zuckerman, D.: Linear degree extractors and the inapproximability of max clique and chromatic number. Theory of Computing 3(1), 103–128 (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Clifford, R., Gotthilf, Z., Lewenstein, M., Popa, A. (2011). Restricted Common Superstring and Restricted Common Supersequence. In: Giancarlo, R., Manzini, G. (eds) Combinatorial Pattern Matching. CPM 2011. Lecture Notes in Computer Science, vol 6661. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21458-5_39
Download citation
DOI: https://doi.org/10.1007/978-3-642-21458-5_39
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21457-8
Online ISBN: 978-3-642-21458-5
eBook Packages: Computer ScienceComputer Science (R0)