Advertisement

Applying Database Semantics to the WWW

  • Roland Hausser
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3307)

Abstract

Today’s search engines build their indices on the basis of document mark-up in XML and significant letter sequences (words) occurring in the document texts. There are some drawbacks, however: the XML mark-up requires skill as well as tedious work from the user posting the document, and the indexing based on significant word distributions, though automatic and highly effective, is not as precise as required by many applications.

As a complement to current methods, this paper presents an automatic content analysis of texts which is based on traditional linguistic methods in conjunction with a comparatively new data structure ([6]) and algorithm ([3]). Having already presented the formal definitions elsewhere, we aim here at illustrating the system in action, based on an ongoing implementation in JAVA.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Abney, S.: Parsing by Chunks. In: Berwick, R., Abney, S., Tenny, C. (eds.) Principle-Based Parsing. Kluwer Academic Publishers, Dordrecht (1991)Google Scholar
  2. 2.
    Berners-Lee, T., Hendler, J., Lassila, O.: The Semantic Web. Scientific American 284(5) (2001)Google Scholar
  3. 3.
    Hausser, R.: Complexity in Left-Associative Grammar. Theoretical Computer Science 106(2), 283–308 (1992)zbMATHCrossRefMathSciNetGoogle Scholar
  4. 4.
    Hausser, R. (ed.): Linguistische Verifikation. Dokumentation zur Ersten Morpholympics. Max Niemeyer Verlag, Tübingen (1996)Google Scholar
  5. 5.
    Hausser, R.: Foundations of Computational Linguistics. In: Human-Computer Communication in Natural Language, 2nd edn. Springer, Berlin (1999/2001)Google Scholar
  6. 6.
    Hausser, R.: Database Semantics for Natural Language. Artificial Intelligence 130(1), 27–74 (2001)zbMATHCrossRefGoogle Scholar
  7. 7.
    Hausser, R.: Turn Taking in Database Semantics. In: Kangassalo, H., et al. (eds.) Information Modeling and Knowledge Bases XVI. IOS Press, Amsterdam (2005) (to appear)Google Scholar
  8. 8.
    Kycia, A.: Implementierung der Datenbanksemantik in JAVA. MA-thesis. Universität Erlangen-Nürnberg (2004)Google Scholar
  9. 9.
    Vergne, J.: Une méthode pour l’analyse descendante et calculatoire de corpus multilingues: application au calcul des relations sujet-verbe. Actes de TALN, 63–74 (2002)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2004

Authors and Affiliations

  • Roland Hausser
    • 1
  1. 1.Abteilung Computerlinguistik (CLUE)Universität Erlangen-Nürnberg 

Personalised recommendations