Approximate Runs - Revisited

Landau, Gad M.

doi:10.1007/978-3-540-89097-3_2

Gad M. Landau^4,5

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 5280))

Included in the following conference series:

International Symposium on String Processing and Information Retrieval

879 Accesses

Abstract

The problem of finding repeats within a string is an important computational problem with applications in data compression and in the field of molecular biology. Both exact and inexact repeats occur frequently in the genome, and certain repeats are known to be related to human diseases.

A multiple tandem repeat in a sequence S is a (periodic) substring r of S of the form r = u^au^′, where u (the period) is a prefix of r, u^′ is a prefix of u and a ≥ 2. A run is a maximal (non-extendable) multiple tandem repeat. An approximate run is a run with errors (i.e. the repeated subsequences are similar but not identical).

Many measures have been proposed that capture the similarity among all periods. We may measure the number of errors between consecutive periods, between all periods, or between each period and a consensus string. Another possible measure is the number of positions in the periods that may differ.

In this talk I will survey a range of our results in this area. Various parts of this work are joint work with Maxime Crochemore, Gene Myers, Jeanette Schmidt and Dina Sokol.

Download to read the full chapter text

Chapter PDF

Sequence Repeats

Alphabet-Independent Algorithms for Finding Context-Sensitive Repeats in Linear Time

Identification of All Exact and Approximate Inverted Repeats in Regular and Weighted Sequences

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Author information

Authors and Affiliations

Department of Computer Science, University of Haifa, Haifa, Israel
Gad M. Landau
Department of Computer and Information Science, Polytechnic University, New York, USA
Gad M. Landau

Authors

Gad M. Landau
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, Bar-Ilan University, 52900, Ramat-Gan, Israel
Amihood Amir
School of Computer Science and Information Technology, RMIT University, Melbourne, Australia
Andrew Turpin
NICTA Victoria Laboratory, Department of Computer Science and Software Engineering, The University of Melbourne, Victoria, Australia
Alistair Moffat

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Landau, G.M. (2008). Approximate Runs - Revisited. In: Amir, A., Turpin, A., Moffat, A. (eds) String Processing and Information Retrieval. SPIRE 2008. Lecture Notes in Computer Science, vol 5280. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89097-3_2

Download citation

DOI: https://doi.org/10.1007/978-3-540-89097-3_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-89096-6
Online ISBN: 978-3-540-89097-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Approximate Runs - Revisited

Abstract

Chapter PDF

Similar content being viewed by others

Sequence Repeats

Alphabet-Independent Algorithms for Finding Context-Sensitive Repeats in Linear Time

Identification of All Exact and Approximate Inverted Repeats in Regular and Weighted Sequences

Keywords

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Approximate Runs - Revisited

Abstract

Chapter PDF

Similar content being viewed by others

Sequence Repeats

Alphabet-Independent Algorithms for Finding Context-Sensitive Repeats in Linear Time

Identification of All Exact and Approximate Inverted Repeats in Regular and Weighted Sequences

Keywords

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation