Abstract
Information Retrieval (IR) is an ever evolving concept of computer and communication applications. The emergence of mobile technology and its widespread service domains have given new dimensions to this as a research topic. Number of key players of search industry, are coming up with problems and their solutions with respect to SMS based Information systems and retrieval facilities. An extensive literature survey reveals that SMS based IR models for Indic languages are not available at large scale. In addition to this an universal transliteration encoding formats for Indic languages named ITRANS, has attracted us to resolve the problem of a suitable knowledgebase. With all these exploration we have formulated our research problem by combining both these features Viz. ITRANS encoded document corpus and SMS based Information Retrieval on this corpus. With reference of a few initiations of this emerging topic, we present an extended IR model for Marathi documents in ITRANS format. A collection of approximately hundred documents including Marathi songs, poems, abhangas and powada is incorporated as the document corpus for our experimental work. Several queries are formulated in ITRANS format and are used for the testing. The system developed by us ranks the documents based on the similarity scores applying Cosine Similarity measures. The documents in their ranked orders are checked by an expert for their relevance with respective queries. The feedback submitted by the expert is deployed to compute the relevance measure in terms of precision and recall. Finally the results are analyzed to correlate the similarity degree to the relevance of documents.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Joshi M, Pathak V (2011) A functional taxonomy of SMSbIR systems. In: 3rd International conference on electronics computer technology, Kanyakumari, 8–10 April, 2011, IEEE Xpolre, pp V6:166–V6:170 ISBN: 978-1-4244-8677-9
Kopparadu SK, Shrvastava A, Pande A (2007) SMS based natural language interface to yellow pages directory
Mitra P, Samajpati A, Sarkar T, Das PK (2006) An SMS based rural application for agricultural consultancy and commodity booking service
So S (2009) The development of a SMS-based teaching and learning system. Hong Kong Institute of Education, Hong Kong
Kushchu I, Kushchu H (2003) From E-government to M-government: facing the inevitable. In: The proceedings of european conference on E-government (ECEG 2003), Trinity College, Dublin, pp 1–13
LEE DL (1997) Document ranking and the vector space model. Hong Kong University of Science and Technology, HUII CHAUNG, Information Dimensions, KENT SEAMONS, Transarc, MARCH/APRIL 1997 0740–7459/97/$10.00© 1997, IEEE
Ran A, Lencevicius R (2007) Natural language query system for RDF repositories. In: Proceedings of the 7th international symposium on natural language processing, SNLP, pp 1–6
Goh TT, Li C (2009) SMS-based library catalogue system: a preliminary investigation of user acceptance. Victoria University of Wellington, Wellington, New Zealand, Liew school of information management
Rocchio JJ (1971) Relevance feedback in information retrieval. In: Salton G (ed) The SMART retrieval system: experiments in automatic document processing, Prentice Hall Inc, NJ, pp 313–323
Pathak V, Joshi M (2012) ITRANSed marathi literature retrieval using SMS based natural language query. Adv Comput Res 4(1):125–129
Soubbotin MM, Soubbotin SM (2002) Use of patterns for detection of answer strings: A systematic approach. In: Proceedings of TREC, vol 11. Citeseer, pp 134–143
Robertson SE (1977) The probability ranking principle in IR. J Doc 33(4):294–304
Drucker H, Shahrary B, Gibbon DC (2002) AT & T Research and Monmouth University, West Long Branch, NJ 07764 USA, Support vector machines : relevance feedback and information retrieval. Inf Process Manage 38:305–323
Salton G, Buckley C (1990) Improving retrieval performance by relevance feedback. J Am Soc Inform Sci 41:288–297
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer Science+Business Media New York
About this paper
Cite this paper
Pathak, V.M., Joshi, M.R. (2013). ITRANS Encoded Marathi Literature Document Relevance Ranking for Natural Language Flexible Queries. In: Chaki, N., Meghanathan, N., Nagamalai, D. (eds) Computer Networks & Communications (NetCom). Lecture Notes in Electrical Engineering, vol 131. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-6154-8_41
Download citation
DOI: https://doi.org/10.1007/978-1-4614-6154-8_41
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-6153-1
Online ISBN: 978-1-4614-6154-8
eBook Packages: EngineeringEngineering (R0)