Skip to main content

Using Scientific Publications to Identify People with Similar Interests

  • Conference paper
Web Information Systems and Technologies (WEBIST 2009)

Abstract

In many situations, related to some types of systems or organizations’ tasks, it is necessary to identify people with similar profiles. In the case of a collaborative recommender system, items to be recommended are those associated to similar users. Another example, in the academic environment, is to identify new members to be part of a research group (people with similar profiles). This task of identifying people with similar profiles can be time-consuming. In this sense, this work considers that scientific papers written by people can be used to identify users with similar profiles. Considering this assumption, we have done some experiments to identify which parts of papers, which type of indexes (terms or concepts) and which type of similarity functions (Jaccard or a Fuzzy function) are more suitable to identify similar people. The paper presents the results of some experiments and some application scenarios considering academic environments.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.00
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Davenport, T.H., Pruzac, L.: Working Knowledge – How Organizations Manage what they know. Harvard Business School Press, Cambridge (1998)

    Google Scholar 

  2. Wang, J., de Vries, A.P., Reinders, M.J.T.: Unifying user-based and item-based collaborative filtering approaches by similarity fusion. In: Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval SIGIR 2006, Washington, USA, pp. 501–508 (2006)

    Google Scholar 

  3. Adomavicius, G., Tuzhilin, A.: Toward the Next Generation of Recommender systems: A Survey of the state-of-the-art and possible extensions. IEEE Transactions on Knowledge and Data Engineering 17(6), 734–749 (2005)

    Article  Google Scholar 

  4. Drachsler, H., Hummel, H.G.K., Koper, R.: Personal recommender systems for learners in lifelong learning networks: the requirements, techniques and model. Int. J. Learn. Technol. 3(4), 404–423 (2008)

    Article  Google Scholar 

  5. Stoilova, L., Holloway, T., Markines, B., Maguitman, A.G., Menczer, F.: Givealink: Mining a semantic network of bookmarks for web search and recommendation. In: Proceedings of the 3rd International Workshop on Link discovery LinkKDD, pp. 66–73 (2005)

    Google Scholar 

  6. Spertus, E., Sahami, M., Buyukkokten, O.: Evaluating similarity measures: a large-scale study in the orkut social network. In: Proceeding of the eleventh ACM SIGKDD international conference on Knowledge discovery and data mining KDD 2005, pp. 678–684 (2005)

    Google Scholar 

  7. Rashid, A.M., Albert, I., Cosley, D., Lam, S.K., McNee, S.M., Konstan, J.A., Riedl, J.: Getting to know you: learning new user preferences in recommender systems. In: IUI 2002: Proceedings of the 7th international conference on Intelligent user interfaces, pp. 127–134. ACM, New York (2002)

    Chapter  Google Scholar 

  8. McNee, S., Albert, I., Cosley, D., Gopalkrishnan, P., Lam, S.K., Rashid, A.M., Konstan, J.A., Riedl, J.: On the recommending of citations for research paperss. In: Proceedings of the 2002 ACM Conference on Computer Supported Cooperative Work, pp. 116–125 (2002)

    Google Scholar 

  9. Middleton, S.E., Shadbolt, N.R., Roure, D.C.D.: Capturing interest through inference and visualization: ontological user profiling in recommender systems. In: International Conference on Knowledge Capture KCAP 2003, pp. 62–69. ACM Press, New York (2003)

    Chapter  Google Scholar 

  10. Dumais, S.T., Nielsen, J.: Automating the assignment of submitted manuscripts to reviewers. In: 15th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 233–244. Copenhagen, Denmark (1992)

    Chapter  Google Scholar 

  11. Yarowsky, D., Florian, R.: Taking the load off the conference chairs: towards a digital paper-routing assistant. In: Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, Washington, USA, pp. 220–230 (1999)

    Google Scholar 

  12. Basu, C., Hirsh, H., Cohen, W.: A study in combining multiple information sources. Journal of the Artificial Intelligence Research (JAIR) 14, 231–252 (2001)

    MATH  Google Scholar 

  13. Loh, S., Wives, L.K., Oliveira, J.P.M.: Concept-based knowledge discovery in texts extracted from the web. ACM SIGKDD Explorations 2(1), 29–39 (1998)

    Article  Google Scholar 

  14. Chen, H.: The vocabulary problem in collaboration. IEEE Computer 27(5), 2–10 (1994)

    Google Scholar 

  15. Lin, C.h., Chen, H.: An automatic indexing and neural network approach to concept retrieval and classification of multilingual (chinese-english) documents. IEEE Transactions on Systems, Man and Cybernetics 26(1), 1–14 (1996)

    Google Scholar 

  16. Sowa, J.F.: Knowledge representation: logical, philosophical, and computational foundations. Brooks/Cole Publishing Co., Pacific Grove (2000)

    Google Scholar 

  17. Guarino, N.: Formal ontology and information systems. In: International Conference on Formal Ontologies in Information Systems - FOIS 1998, Trento, Italy, pp. 3–15 (1998)

    Google Scholar 

  18. Willet, P.: Recent trends in hierarchic document clustering: a critical review. Information Processing & Management 24(5), 577–597 (1998)

    Article  Google Scholar 

  19. Loh, S.: Concept-based approach for knowledge discovery in texts (in Portuguese). PhD thesis, Federal University of Rio Grande do Sul (2001)

    Google Scholar 

  20. Pedrycz, W.: Fuzzy neural networks and neurocomputations. Fuzzy Sets and Systems 56(1), 1–28 (1993)

    Article  Google Scholar 

  21. Salton, G., McGill, M.: Introduction to Modern Information Retrieval. McGraw- Hill, New York (1983)

    MATH  Google Scholar 

  22. Brutlag, J., Meek, C.: Challenges of the email domain for text classification. In: 7th International Conference on Machine Learning (ICML 2000), pp. 103–110. Stanford University, USA (2000)

    Google Scholar 

  23. Kraft, R., Chang, C.C., Maghoul, F., Kumar, R.: Searching with context. In: WWW 2006: Proceedings of the 15th international conference on World Wide Web, pp. 477–486. ACM, New York (2006)

    Chapter  Google Scholar 

  24. Koller, D., Sahami, M.: Hierarchically Classifying Documents using very few Words. In: ICML 1997: Proceedings of the Fourteenth International Conference on Machine Learning, pp. 170–178. Morgan Kaufmann Publishers Inc., San Francisco (1997)

    Google Scholar 

  25. Chang, H.C., Hsu, C.C.: Using topic keyword clusters for automatic document clustering. IEICE - Trans. Inf. Syst. E88-D(8), 1852–1860 (2005)

    Article  Google Scholar 

  26. Ding, Y., Li, X.: Time weight collaborative filtering. In: CIKM 2005: Proceedings of the 14th ACM International Conference on Information and Knowledge Management, pp. 485–492. ACM, New York (2005)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Loh, S., Lorenzi, F., Granada, R., Lichtnow, D., Wives, L.K., Palazzo Moreira de Oliveira, J. (2010). Using Scientific Publications to Identify People with Similar Interests. In: Cordeiro, J., Filipe, J. (eds) Web Information Systems and Technologies. WEBIST 2009. Lecture Notes in Business Information Processing, vol 45. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12436-5_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-12436-5_17

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-12435-8

  • Online ISBN: 978-3-642-12436-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics