Named Entity Recognition from Greek Texts: The GIE Project

Karkaletsis, Vangelis; Spyropoulos, Constantine D.; Petasis, George

doi:10.1007/978-94-011-4840-5_12

Part of the book series: International Series on Microprocessor-Based and Intelligent Systems Engineering (ISCA, volume 21)

Abstract

Today's information overload, particularly on the World Wide Web, makes it difficult for users to reach the right information. The problem is compounded because much of this information is written in different languages. It is therefore important to apply an information process that extracts from this volume only the facts that match users' interests, and that lets users access facts written in a different language. Information Extraction (IE) technology can meet these requirements: unlike information retrieval and filtering, IE focuses on specific facts extracted from documents rather than on the documents themselves. A document may contain the requested keywords and still be irrelevant to the user's interests. Working with specific facts instead of documents gives users information more relevant to their domain of interest. The IE systems developed so far extract, in most cases, fixed information from documents in a fixed language. For IE technology to be truly applicable in real-life applications and meet the above requirements, IE systems need to be easily adaptable (customisable) to new domains and user interests, as well as to multiple languages.

During the last decade, substantial progress has been made in developing reliable IE technology. IE technology is currently exploited in real applications, such as the extraction of information on company acquisitions [1],[2],[3], stock exchanges [4], company profits and losses [5], joint ventures and management succession events [6],[7],[8], as well as the understanding of military messages [9] and police reports [10],[11],[12].

References

[1] Cowie J., Wakao T., Jin W., Pustejovsky J. and Waterman S., The Diderot information extraction system. In Proceedings of the First Conference of the Pacific Association for Computational Linguistics (PACLING 93), Vancouver, Canada, 1993.
[2] Jacobs P.S. and Rau L.F., Scisor: Extracting information from on-line news. Communications of the ACM, 33(11):88–97, 1990.
[3] Wilks Y., Diderot: a text extraction system. In DARPA Speech and Natural Language Workshop. Morgan Kaufmann, San Mateo, CA, 1991.
[4] Vichot F., Wolinski F., Tomeh J., Guennou S., Dillet B. and Aydjian S., High Precision Hypertext Navigation Based on NLP Automatic Extractions. Hypertext, Information Retrieval, Multimedia (HTM'97), Dortmund, Germany, (30):161–174, October 1997.
[5] Andersen P.M., Hayes P.J., Huettner A.K., Nirenburg I.B., Schmandt L.M. and Weinstein S.P., Automatic extraction of facts from press releases to generate news stories. In Proceedings of the Third Conference on Applied Natural Language Processing, pages 170–177. ACL, 1992.
[6] ECRAN: Extraction of Content: Research at Near Market, http://www2.echo.lu/langeneg/en/le1/ecran/ecran.html
[7] MUC5, Proceedings of the Fifth Message Understanding Conference. San Francisco, Calif.: Morgan Kaufmann, 1993.
[8] MUC6, Proceedings of the Sixth Message Understanding Conference. San Francisco, Calif.: Morgan Kaufmann, 1995.
[9] DARPA Speech and Natural Language Workshop, Harriman, NY, 1992.
[10] AVENTINUS: Advanced Information System for Multinational Drug Enforcement, http://www2.echo.lu/langeneg/en/lel/aventinus/aventinus.html
[11] Evans R. and Hartley A.F., The traffic information collator. Expert Systems: The International Journal of Knowledge Engineering, 7(4):209–214, 1990.
[12] Gaizauskas R., Evans R., Cahill L.J., Richardson I. and Walker J., Poetic: A system for gathering and disseminating traffic information. In S.G. Ritchie and G.T. Hendrickson, editors, Conference Preprints of the International Conference on Artificial Intelligence Applications in Transportation Engineering, pages 79–98, San Buenaventura, California, 1992.
[13] Gaizauskas R. and Wilks Y., «Information Extraction beyond Document Retrieval». University of Sheffield, Dept. of Computer Science, CS-97-10, 1997.
[14] Cunningham H., Wilks Y. and Gaizauskas R., GATE - a General Architecture for Text Engineering. 16th Conference on Computational Linguistics (COLING'96), pages 274–279, 1996.
[15] Gazdar G. and Mellish C., Natural Language Processing in Prolog. Addison-Wesley, 1989.
[16] Paliouras G., Karkaletsis V. and Spyropoulos C.D., “Machine Learning for Domain-Adaptive Word Sense Disambiguation”. Proceedings of the LREC Workshop on “Adapting Lexical and Corpus Resources to Sublanguages and Applications”, Granada, Spain, May 26, 1998.

Author information

Authors and Affiliations

Software and Knowledge Engineering Laboratory, Institute of Informatics and Telecommunications, N.C.S.R. «Demokritos», Greece
Vangelis Karkaletsis, Constantine D. Spyropoulos & George Petasis

Editor information

Editors and Affiliations

Department of Electrical Engineering and Computer Engineering, National Technical University of Athens, Greece
Spyros G. Tzafestas

About this chapter

Cite this chapter

Karkaletsis, V., Spyropoulos, C.D., Petasis, G. (1999). Named Entity Recognition from Greek Texts: The GIE Project. In: Tzafestas, S.G. (eds) Advances in Intelligent Systems. International Series on Microprocessor-Based and Intelligent Systems Engineering, vol 21. Springer, Dordrecht. https://doi.org/10.1007/978-94-011-4840-5_12

DOI: https://doi.org/10.1007/978-94-011-4840-5_12
Publisher Name: Springer, Dordrecht
Print ISBN: 978-1-4020-0393-6
Online ISBN: 978-94-011-4840-5
eBook Packages: Springer Book Archive
