
Formal Languages and Algorithms for Similarity Based Retrieval from Sequence Databases

  • Conference paper
FST TCS 2002: Foundations of Software Technology and Theoretical Computer Science (FSTTCS 2002)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2556))


Abstract

Similarity based retrieval is of major importance for querying sequence databases. We consider formalisms based on automata, temporal logics and regular expressions for querying such databases. We define two types of similarity measures: syntax based and semantics based. Each type is divided into a spectrum of measures according to the vector distance function that is employed. We consider norm vector distance functions and give efficient query processing algorithms for the case where these measures are employed.
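
The abstract does not spell out which norm vector distance functions are meant. As a minimal sketch, assuming the familiar p-norm family (the function name `p_norm_distance` and its parameters are hypothetical and not taken from the paper), the following shows how different choices of p give a spectrum of distances between two equal-length vectors:

```python
import math
from typing import Sequence


def p_norm_distance(u: Sequence[float], v: Sequence[float], p: float = 2.0) -> float:
    """Distance between two equal-length vectors under the p-norm.

    Illustrative assumption only: the paper's own definitions may differ.
    """
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    if p < 1:
        raise ValueError("p must be at least 1")
    # Component-wise absolute differences.
    diffs = [abs(a - b) for a, b in zip(u, v)]
    if math.isinf(p):
        # The infinity norm is the largest component-wise difference.
        return max(diffs, default=0.0)
    return sum(d ** p for d in diffs) ** (1.0 / p)


if __name__ == "__main__":
    u, v = [0.0, 0.0], [3.0, 4.0]
    for p in (1, 2, math.inf):
        print(p, p_norm_distance(u, v, p))
```

Smaller values of p let every component contribute to the total, while p set to infinity keeps only the single largest difference, which is one way to read the "spectrum" mentioned above.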




Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Sistla, A.P. (2002). Formal Languages and Algorithms for Similarity Based Retrieval from Sequence Databases. In: Agrawal, M., Seth, A. (eds) FST TCS 2002: Foundations of Software Technology and Theoretical Computer Science. FSTTCS 2002. Lecture Notes in Computer Science, vol 2556. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36206-1_29


  • DOI: https://doi.org/10.1007/3-540-36206-1_29


  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-00225-3

  • Online ISBN: 978-3-540-36206-7

  • eBook Packages: Springer Book Archive
