Abstract
This paper presents the novel concept of system-mediated information access, i.e. system support for the user in clarifying and refining a vague information need and in generating a good formulation for it. The concept is based on two main assumptions: firstly, on document clustering’s ability to reveal the topical, semantic structure of a domain of interest, represented by a specialized collection, and secondly, on the capacity of language models to convey content. Experimental results show that these assumptions are correct and that there is potential to significantly improve the retrieval performance by generating a better query through mediation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
M. Beaulieu, T. Do, A. Payne, and S. Jones. Enquire okapi project. British Library Research and Innovation Report 17, Centre for Interactive Systems Research, City University, London, January 1997.
I. Campbell. Applying ostensive functionalism in the place of descriptive proceduralism: “the query is dead”. In Workshop on Information Retrieval and Human Computer Interaction. University of Glasgow, September 1996.
D. Harman. Relevance feedback revisited. In Proceedings of SIGIR’92, pages 1–15, Copenhagen, Denmark, 1992. ACM.
D. J. Harper, M. Mechkour, and G. Muresan. Document clustering for mediated information access. In Proceedings of the 21st Annual BCS-IRSG Colloquium, Glasgow, April 1999.
N. Jardine and C. J. v. Rijsbergen. The use of hierarchic clustering in information retrieval. Information Storage and Retrieval, 7:217–240, 1971.
A. Leuski and J. Allan. Improving interactive retrieval by combining ranked lists and clustering. In Proceedings of RIAO2000, pages 665–681, Paris, April 2000.
M. Magennis and C. J. v. Rijsbergen. The potential and actual effectiveness of interactive query expansion. In Proceedings of SIGIR’ 97, pages 324–332, Philadelphia, July 1997. ACM.
C. D. Manning and H. Schutze. Foundations of Statistical Natural Language Processing. MIT Press, Cambridge, Massachusetts, 1999.
G. Muresan, D. J. Harper, and M. Mechkour. Webcluster, a tool for mediated information access. In M. Hearst, F. Gey, and R. Tong, editors, Proceedings of SIGIR’99, page 337, Berkeley, August 1999. ACM.
G. Muresan, D. J. H. Harper, A. Goker, and P. Lowit. Clusterbook, a tool for dual information access. In N. J. Belkin, P. Ingwersen, and M.-K. Leong, editors, Proceedings of SIGIR 2000, page 391, Athens, July 2000. ACM.
R. Nordlie. Unmediated and mediated information searching in the public library. In Proceedings of ASIS 1996, 1996.
E. M. Voorhees. The Effectiveness and Efficiency of Agglomerative Hierarchic Clustering in Document Retrieval. PhD thesis, Department of Computer Science, Cornell University, Ithaca, NY 14853, October 1985.
P. Willett. Similarity coefficients and weighting functions for automatic document classification: an empirical comparison. International Classification, 10(3):138–142, 1983.
M. Zizi and M. Beaudoin-Lafon. Hypermedia exploration with interactive dynamic maps. International Journal on Human Computer Interaction, 43, 1995.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Muresan, G., Harper, D.J. (2001). Document Clustering and Language Models for System-Mediated Information Access. In: Constantopoulos, P., Sølvberg, I.T. (eds) Research and Advanced Technology for Digital Libraries. ECDL 2001. Lecture Notes in Computer Science, vol 2163. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44796-2_37
Download citation
DOI: https://doi.org/10.1007/3-540-44796-2_37
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42537-3
Online ISBN: 978-3-540-44796-2
eBook Packages: Springer Book Archive