Abstract
This work aims to present a new methodology to retrieve the documents relating to the traditional Thai medicine recipe that is translated from the ancient palm leaf manuscripts. This methodology is developed based on three main concepts: sematic data, latent search indexing (LSI), and cross language information retrieval (CLIR). Our methodology consists of four main processing steps. They are document indexing, document representation based on LSI, user’s query transformation, and document retrieval and ranking. After testing by the common performance measures for information retrieval system such as recall, precision, and F-measure, it would demonstrate that our methodology can achieve substantial improvements.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Iijima, A.: A Historical Approach to the Palm-Leaf Manuscripts Preserved in WatMahathat, Yasothon (Thailand), http://www.laomanuscripts.net/downloads/literaryheritageoflaos26_iijima_en.pdf
Manmart, L., Chamnongsri, N., Wuwongse, V.: Metadata Development for Palm Leaf Manuscripts in Thailand. In: Proc. International Conference on Dublin Core and Metadata Applications (2012)
Shi, Z., Setlur, S., Govindaraju, V.: Digital Enhancement of Palm Leaf Manuscript Images using Normalization Techniques. In: The 5th International Conference on Knowledge-based Computer Systems (2004)
Rosario, B.: Latent Semantic Indexing: An overview, http://people.ischool.berkeley.edu/~rosario/projects/LSI.pdf
Braschler, M., Peters, C., Schäuble, P.: Cross-Language Information Retrieval (CLIR) Track: Overview, http://trec.nist.gov/pubs/trec8/papers/trec8ov.pdf
van der Vlist, E.: XML Schema. O’Reilly (2002)
Haav, H.M., Lubi, T.L.: A Survey of Concept-based Information Retrieval Tools on the Web. The Fifth East-European Conference on Advances in Databases and Information Systems (ADBIS) (2001)
Egozi, O., Markovvitch, S., Gabrilovich, E.: Concept-based Information Retrieval Using Explicit Semantic Analysis. Journal of ACM Transactions on Information Systems (2011)
Guarino, N.: Formal Ontology and Information Systems. In: Guarino, N. (ed.) Proceedings of the 1st International Conference of Formal Ontology in Information Systems (1998)
Sowa, J.F.: Conceptual Structures: Information Processing in Minds and Machines. Addison-Wesley, Reading (1984)
Riloff, E., Lehnert, W.: Automated dictionary construction for information extraction from text. In: Proceedings of 9th IEEE Conference on Artificial Intelligence for Applications (1993); Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. Addison Wesley (1999)
Tan, P.N., Steinbach, M., Kumar, V.: Association Analysis: Basic Concepts and Algorithms. In: Introduction to Data Mining. Addison-Wesley (2005)
Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. Addison Wesley (1999)
Cormack, G.V., Lynam, T.R.: Statistical precision of information retrieval evaluation. In: Proceedings of the 29th Annual International ACM SIGIR (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Polpinij, J. (2014). Concept-Based Cross Language Retrieval for Thai Medicine Recipes. In: Tuamsuk, K., Jatowt, A., Rasmussen, E. (eds) The Emergence of Digital Libraries – Research and Practices. ICADL 2014. Lecture Notes in Computer Science, vol 8839. Springer, Cham. https://doi.org/10.1007/978-3-319-12823-8_33
Download citation
DOI: https://doi.org/10.1007/978-3-319-12823-8_33
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-12822-1
Online ISBN: 978-3-319-12823-8
eBook Packages: Computer ScienceComputer Science (R0)