Abstract
Person search is one of the most popular search types on the Web. Most conventional person-search technologies focus on mapping a person name to the specific people it refers to (i.e., its referents). In contrast, in this paper we propose a novel ranking measure for person search called famousness, which ranks people according to how well known they are. Intuitively, the famousness score is computed by analyzing the metadata of the search results returned by a search engine. The metadata used in our method include the URL, the snippet, and the number of search results. To compute the famousness score of a person, we first cluster the search results by using their metadata. Second, we compute the deviations in the size and number of these clusters. If the Web pages related to a person can be grouped into many large clusters of similar size, that person appears to have been mentioned in many Web pages from various domains and is therefore likely to be well known. In addition, we compare the clusters of search results with those of other people. People with more and larger clusters are given higher famousness scores. We also show experimental results that validate the ranking method based on the famousness score.
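The intuition in the abstract can be illustrated with a minimal Python sketch. It is not the paper's method: the abstract does not give the clustering algorithm or the scoring formula, so grouping results by URL host name and the formula in `famousness_sketch` are assumptions made here purely for illustration, and both function names are hypothetical. The sketch only captures the stated idea that more clusters, larger clusters, and more evenly sized clusters should yield a higher score.

```python
from collections import Counter
from math import log1p
from statistics import mean, pstdev
from urllib.parse import urlparse


def cluster_by_domain(urls):
    """Group result URLs by host name.

    Assumption: host name stands in for the metadata-based clustering,
    which the abstract does not specify.
    """
    return Counter(urlparse(u).netloc.lower() for u in urls)


def famousness_sketch(urls):
    """Toy score that rewards many, large, evenly sized clusters.

    Assumption: this formula is illustrative only, not the paper's measure.
    """
    sizes = list(cluster_by_domain(urls).values())
    if not sizes:
        return 0.0
    avg = mean(sizes)
    # Coefficient of variation: 0 when every cluster has the same size.
    cv = pstdev(sizes) / avg
    # More clusters and larger clusters raise the score; uneven sizes lower it.
    return len(sizes) * log1p(avg) / (1.0 + cv)


def rank_people(results_by_person):
    """Return person names sorted from highest to lowest toy score."""
    return sorted(
        results_by_person,
        key=lambda name: famousness_sketch(results_by_person[name]),
        reverse=True,
    )
```

Under these assumptions, a person whose results spread evenly over several hosts ranks above a person whose results all come from a single host, even when both have the same number of results.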
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ma, Q., Yoshikawa, M. (2008). Ranking People Based on Metadata Analysis of Search Results. In: Hartmann, S., Zhou, X., Kirchberg, M. (eds) Web Information Systems Engineering – WISE 2008 Workshops. WISE 2008. Lecture Notes in Computer Science, vol 5176. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85200-1_7
DOI: https://doi.org/10.1007/978-3-540-85200-1_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-85199-8
Online ISBN: 978-3-540-85200-1
eBook Packages: Computer Science, Computer Science (R0)