Towards an Enhanced Vector Model to Encode Textual Relations: Experiments Retrieving Information

Carrillo, Maya; López-López, A.

doi:10.1007/978-0-387-09695-7_37

Maya Carrillo² &
A. López-López³

Part of the book series: IFIP – The International Federation for Information Processing ((IFIPAICT,volume 276))

Included in the following conference series:

IFIP International Conference on Artificial Intelligence in Theory and Practice

1069 Accesses

Abstract

The constant growth of digital information, facilitated by storage technologies, imposes new challenges for information processing tasks, and maintains the need of effective search mechanisms, oriented towards improving in precision but simultaneously capable of producing useful information in a short time. Hence, this paper presents a document representation to encode textual relations. This representation does not consider each term as one entry in a vector but rather as a pattern, i.e. a set of contiguous entries. To deal with variations inherent in natural language, we plan to express textual relations (such as noun phrases, named entities, subject-verb, verb-object, adjective-noun, and adverb-verb) as composed patterns. An operator is applied to form bindings between terms encoding relations as new “terms”, thereby providing additional descriptive elements for indexing a document collection. The results of our first experiments, using the document representation to conduct information retrieval and incorporating two-word noun phrases, showed that the representation is feasible, retrieves, and improves the ranking of relevant documents, and consequently the values of mean average precision.

Download to read the full chapter text

Chapter PDF

Indexing of Textual Databases Based on Lexical Resources: A Case Study for Serbian

Improving Document Retrieval in Large Domain Specific Textual Databases Using Lexical Resources

Expansion-by-Analogy: A Vector Symbolic Approach to Semantic Search

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Becker J., Kuropa D.: Topic-based Vector Space Model. In: Procs. of the 6th International Conference on Business Information Systems, pp. 7-13, July 2003 Colorado, USA.
Google Scholar
Evans D., Zhai C.: Noun-phrase Analysis in Unrestricted Text for Information Retrieval. In: Procs. of the 34th Annual Meeting on Association for Computational Linguistics, pp. 17-24, June 1996.
Google Scholar
Gonçalves A., Zhu J., Song D., Uren V., Pacheco R.: LRD: Latent Relation Discovery for Vector Space Expansion and Information Retrieval. In: Procs. of the Seventh International Conference on Web-Age Information Management, pp. 122-133, June 2006, Hong Kong, China.
Google Scholar
Grinberg D., Lafferty J. and Sleator D.: A Robust Parsing Algorithm for Link Grammars. Carnegie Mellon University, Computer Science, Technical Report CMU-CS-95-125, 17p.,1995.
Google Scholar
Lewis D., Sparck K.: Natural Language Processing for Information Retrieval. In: Communications ACM 39, pp. 92-101, January 1996.
Google Scholar
Mitra M., Buckley C., Singhal A., Cardie C.: An Analysis of Statistical and Syntactic Phrases. In: Procs. of RIAO-97, 5th International Conference, pp. 200-214.
Google Scholar
Plate T.A.: Analogy Retrieval and Processing with Distributed Vector Representation, Victoria University of Wellington, Computer Science, Technical Report CS-TR-98-4, 16 p.
Google Scholar
Shi S., Wen J., Yu Q., Ruihua R., Ying Ma W.: Gravitation-Based Model for Information Retrieval. In: Procs. of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval 2005, pp. 488-495, Salvador, Brazil August 15-19, 2005.
Chapter Google Scholar
Vilares J., Gómez-Rodríguez C. and Alonso M.A.: Managing Syntactic Variation in Text Retrieval. In: Peter R. King, Procs. of the 2005 ACM Symposium on Document Engineering. Bristol, United Kingdom, pp. 162-164, November 2-4, 2005, ACM Press, New York, USA.
Google Scholar

Download references

Author information

Authors and Affiliations

Instituto Nacional de Astrofísica Óptica y Electrónica, Facultad de Ciencias de la Computación, BUAP, México
Maya Carrillo
Instituto Nacional de Astrofísica Óptica y Electrónica, Luis Enrique Erro #1 Santa María Tonantzintla, 72840, Puebla, México
A. López-López

Authors

Maya Carrillo
View author publications
You can also search for this author in PubMed Google Scholar
A. López-López
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

University of Portsmouth, UK
Max Bramer

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Carrillo, M., López-López, A. (2008). Towards an Enhanced Vector Model to Encode Textual Relations: Experiments Retrieving Information. In: Bramer, M. (eds) Artificial Intelligence in Theory and Practice II. IFIP AI 2008. IFIP – The International Federation for Information Processing, vol 276. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-09695-7_37

Download citation

DOI: https://doi.org/10.1007/978-0-387-09695-7_37
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-09694-0
Online ISBN: 978-0-387-09695-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Towards an Enhanced Vector Model to Encode Textual Relations: Experiments Retrieving Information

Abstract

Chapter PDF

Similar content being viewed by others

Indexing of Textual Databases Based on Lexical Resources: A Case Study for Serbian

Improving Document Retrieval in Large Domain Specific Textual Databases Using Lexical Resources

Expansion-by-Analogy: A Vector Symbolic Approach to Semantic Search

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Towards an Enhanced Vector Model to Encode Textual Relations: Experiments Retrieving Information

Abstract

Chapter PDF

Similar content being viewed by others

Indexing of Textual Databases Based on Lexical Resources: A Case Study for Serbian

Improving Document Retrieval in Large Domain Specific Textual Databases Using Lexical Resources

Expansion-by-Analogy: A Vector Symbolic Approach to Semantic Search

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation