Abstract
In many situations, related to some types of systems or organizations’ tasks, it is necessary to identify people with similar profiles. In the case of a collaborative recommender system, items to be recommended are those associated to similar users. Another example, in the academic environment, is to identify new members to be part of a research group (people with similar profiles). This task of identifying people with similar profiles can be time-consuming. In this sense, this work considers that scientific papers written by people can be used to identify users with similar profiles. Considering this assumption, we have done some experiments to identify which parts of papers, which type of indexes (terms or concepts) and which type of similarity functions (Jaccard or a Fuzzy function) are more suitable to identify similar people. The paper presents the results of some experiments and some application scenarios considering academic environments.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Davenport, T.H., Pruzac, L.: Working Knowledge – How Organizations Manage what they know. Harvard Business School Press, Cambridge (1998)
Wang, J., de Vries, A.P., Reinders, M.J.T.: Unifying user-based and item-based collaborative filtering approaches by similarity fusion. In: Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval SIGIR 2006, Washington, USA, pp. 501–508 (2006)
Adomavicius, G., Tuzhilin, A.: Toward the Next Generation of Recommender systems: A Survey of the state-of-the-art and possible extensions. IEEE Transactions on Knowledge and Data Engineering 17(6), 734–749 (2005)
Drachsler, H., Hummel, H.G.K., Koper, R.: Personal recommender systems for learners in lifelong learning networks: the requirements, techniques and model. Int. J. Learn. Technol. 3(4), 404–423 (2008)
Stoilova, L., Holloway, T., Markines, B., Maguitman, A.G., Menczer, F.: Givealink: Mining a semantic network of bookmarks for web search and recommendation. In: Proceedings of the 3rd International Workshop on Link discovery LinkKDD, pp. 66–73 (2005)
Spertus, E., Sahami, M., Buyukkokten, O.: Evaluating similarity measures: a large-scale study in the orkut social network. In: Proceeding of the eleventh ACM SIGKDD international conference on Knowledge discovery and data mining KDD 2005, pp. 678–684 (2005)
Rashid, A.M., Albert, I., Cosley, D., Lam, S.K., McNee, S.M., Konstan, J.A., Riedl, J.: Getting to know you: learning new user preferences in recommender systems. In: IUI 2002: Proceedings of the 7th international conference on Intelligent user interfaces, pp. 127–134. ACM, New York (2002)
McNee, S., Albert, I., Cosley, D., Gopalkrishnan, P., Lam, S.K., Rashid, A.M., Konstan, J.A., Riedl, J.: On the recommending of citations for research paperss. In: Proceedings of the 2002 ACM Conference on Computer Supported Cooperative Work, pp. 116–125 (2002)
Middleton, S.E., Shadbolt, N.R., Roure, D.C.D.: Capturing interest through inference and visualization: ontological user profiling in recommender systems. In: International Conference on Knowledge Capture KCAP 2003, pp. 62–69. ACM Press, New York (2003)
Dumais, S.T., Nielsen, J.: Automating the assignment of submitted manuscripts to reviewers. In: 15th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 233–244. Copenhagen, Denmark (1992)
Yarowsky, D., Florian, R.: Taking the load off the conference chairs: towards a digital paper-routing assistant. In: Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, Washington, USA, pp. 220–230 (1999)
Basu, C., Hirsh, H., Cohen, W.: A study in combining multiple information sources. Journal of the Artificial Intelligence Research (JAIR) 14, 231–252 (2001)
Loh, S., Wives, L.K., Oliveira, J.P.M.: Concept-based knowledge discovery in texts extracted from the web. ACM SIGKDD Explorations 2(1), 29–39 (1998)
Chen, H.: The vocabulary problem in collaboration. IEEE Computer 27(5), 2–10 (1994)
Lin, C.h., Chen, H.: An automatic indexing and neural network approach to concept retrieval and classification of multilingual (chinese-english) documents. IEEE Transactions on Systems, Man and Cybernetics 26(1), 1–14 (1996)
Sowa, J.F.: Knowledge representation: logical, philosophical, and computational foundations. Brooks/Cole Publishing Co., Pacific Grove (2000)
Guarino, N.: Formal ontology and information systems. In: International Conference on Formal Ontologies in Information Systems - FOIS 1998, Trento, Italy, pp. 3–15 (1998)
Willet, P.: Recent trends in hierarchic document clustering: a critical review. Information Processing & Management 24(5), 577–597 (1998)
Loh, S.: Concept-based approach for knowledge discovery in texts (in Portuguese). PhD thesis, Federal University of Rio Grande do Sul (2001)
Pedrycz, W.: Fuzzy neural networks and neurocomputations. Fuzzy Sets and Systems 56(1), 1–28 (1993)
Salton, G., McGill, M.: Introduction to Modern Information Retrieval. McGraw- Hill, New York (1983)
Brutlag, J., Meek, C.: Challenges of the email domain for text classification. In: 7th International Conference on Machine Learning (ICML 2000), pp. 103–110. Stanford University, USA (2000)
Kraft, R., Chang, C.C., Maghoul, F., Kumar, R.: Searching with context. In: WWW 2006: Proceedings of the 15th international conference on World Wide Web, pp. 477–486. ACM, New York (2006)
Koller, D., Sahami, M.: Hierarchically Classifying Documents using very few Words. In: ICML 1997: Proceedings of the Fourteenth International Conference on Machine Learning, pp. 170–178. Morgan Kaufmann Publishers Inc., San Francisco (1997)
Chang, H.C., Hsu, C.C.: Using topic keyword clusters for automatic document clustering. IEICE - Trans. Inf. Syst. E88-D(8), 1852–1860 (2005)
Ding, Y., Li, X.: Time weight collaborative filtering. In: CIKM 2005: Proceedings of the 14th ACM International Conference on Information and Knowledge Management, pp. 485–492. ACM, New York (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Loh, S., Lorenzi, F., Granada, R., Lichtnow, D., Wives, L.K., Palazzo Moreira de Oliveira, J. (2010). Using Scientific Publications to Identify People with Similar Interests. In: Cordeiro, J., Filipe, J. (eds) Web Information Systems and Technologies. WEBIST 2009. Lecture Notes in Business Information Processing, vol 45. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12436-5_17
Download citation
DOI: https://doi.org/10.1007/978-3-642-12436-5_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12435-8
Online ISBN: 978-3-642-12436-5
eBook Packages: Computer ScienceComputer Science (R0)