Abstract
In distributed work environments, where users are sharing and searching resources, ensuring an appropriate ranking at remote peers is a key problem. While this issue has been investigated for federated libraries, where the exchange of collection specific information suffices to enable homogeneous TFxIDF rankings across the participating collections, no solutions are known for PageRank-based ranking schemes, important for personalized retrieval on the desktop.
Connected users share fulltext resources and metadata expressing information about them and connecting them. Based on which information is shared or private, we propose several algorithms for computing personalized PageRank-based rankings for these connected peers. We discuss which information is needed for the ranking computation and how PageRank values can be estimated in case of incomplete information. We analyze the performance of our algorithms through a set of experiments, and conclude with suggestions for choosing among these algorithms.
Chapter PDF
References
Beagle++. (2006), http://beagle.kbs.uni-hannover.de/
NEPOMUK - The Social Semantic Desktop (2006), http://nepomuk.semanticdesktop.org
Bianchini, M., Gori, M., Scarselli, F.: Inside pagerank. ACM Trans. Inter. Tech. 5(1), 92–128 (2005)
Callan, J.P., Lu, Z., Croft, W.B.: Searching distributed collections with inference networks. In: Proc. of the Intl. Conf. on Research and Development in Information Retrieval (SIGIR) (1995)
Chien, S., Dwork, C., Kumar, S., Sivakumar, D.: Towards exploiting link evolution. In: Unpublished manuscript (2001)
Chirita, P.A., Costache, S., Nejdl, W., Paiu, R.: Beagle++: Semantically enhanced searching and ranking on the desktop. In: Sure, Y., Domingue, J. (eds.) ESWC 2006. LNCS, vol. 4011, Springer, Heidelberg (2006)
Chirita, P.A., Ghita, S., Nejdl, W., Paiu, R.: Semantically enhanced searching and ranking on the desktop. In: Proc. of the Semantic Desktop Workshop held at the Intl. Semantic Web Conf (2005)
Damian, A., Nejdl, W., Paiu, R.: Peer-sensitive objectrank: Valuing contextual information in social networks. In: Proc. of the Intl. Conf. on Web Information Systems Engineering (2005)
Dong, X., Halevy, A.Y.: A platform for personal information management and integration. In: Proc. of Conf. on Innovative Data Systems Research (CIDR) (2005)
Dong, X., Halevy, A.Y., Nemes, E., Sigundsson, S.B., Domingos, P.: Semex: Toward on-the-fly personal information integration. In: Proc. of the Workshop on Information Integration on the Web (2004)
Franklin, M., Halevy, A.Y., Maier, D.: From databases to dataspaces: a new abstraction for information management. SIGMOD Rec. 34(4), 27–33 (2005)
Green, N., Ipeirotis, P.G., Gravano, L.: SDLIP + STARTS = SDARTS a protocol and toolkit for metasearching. In: ACM/IEEE Joint Conference on Digital Libraries, pp. 207–214 (2001)
Haveliwala, T.: Topic-sensitive pagerank. In: Proc. of the Intl. WWW Conf (2002)
Parreira, J.X., Donato, D., Michel, S., Weikum, G.: Efficient and decentralized pagerank approximation in a peer-to-peer web search network. In: Proc. of the Intl. Conf. on Very Large Data Bases (VLDB) (2006)
Wang, Y., DeWitt, D.: Computing pagerank in a distributed internet search system. In: Proc. of the Intl. Conf. on Very Large Databases (VLDB) (2004)
Wu, J., Aberer, K.: Using siterank for decentralized computation of web document ranking. In: Proc. of Intl. Conf. on Adaptive Hypermedia and Adaptive WebBased Systems (2004)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer Berlin Heidelberg
About this paper
Cite this paper
Costache, S., Nejdl, W., Paiu, R. (2007). Personalizing PageRank-Based Ranking over Distributed Collections. In: Krogstie, J., Opdahl, A., Sindre, G. (eds) Advanced Information Systems Engineering. CAiSE 2007. Lecture Notes in Computer Science, vol 4495. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72988-4_9
Download citation
DOI: https://doi.org/10.1007/978-3-540-72988-4_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-72987-7
Online ISBN: 978-3-540-72988-4
eBook Packages: Computer ScienceComputer Science (R0)