Academic Access Data Analysis for Literature Recommendation

  • Yixing FanEmail author
  • Jiafeng Guo
  • Yanyan Lan
  • Jun Xu
  • Xueqi Cheng
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10390)


Academic reading plays an important role in researchers’ daily life. To alleviate the burden of seeking relevant literature from rapidly growing academic repository, different kinds of recommender systems have been introduced in recent years. However, most existing work focused on adopting traditional recommendation techniques, like content-based filtering or collaborative filtering, in the literature recommendation scenario. Little work has yet been done on analyzing the academic reading behaviors to understand the reading patterns and information needs of real-world academic users, which would be a foundation for improving existing recommender systems or designing new ones. In this paper, we aim to tackle this problem by carrying out empirical analysis over large scale academic access data, which can be viewed as a proxy of academic reading behaviors. We conduct global, group-based and sequence-based analysis to address the following questions: (1) Are there any regularities in users’ academic reading behaviors? (2) Will users with different levels of activeness exhibit different information needs? (3) How to correlate one’s future demands with his/her historical behaviors? By answering these questions, we not only unveil useful patterns and strategies for literature recommendation, but also identify some challenging problems for future development.


Academic access data Literature recommendation User study 



The work was funded by 973 Program of China under Grant No. 2014CB340401, the National Key R&D Program of China under Grant No. 2016QY02D0405, the National Natural Science Foundation of China (NSFC) under Grants No. 61232010, 61472401, 61433014, 61425016, and 61203298, the Key Research Program of the CAS under Grant No. KGZD-EW-T03-2, and the Youth Innovation Promotion Association CAS under Grants No. 20144310 and 2016102.


  1. 1.
    Le Anh, V., Hoang Hai, V., Tran, H.N., Jung, J.J.: SciRecSys: a recommendation system for scientific publication by discovering keyword relationships. In: Hwang, D., Jung, J.J., Nguyen, N.-T. (eds.) ICCCI 2014. LNCS (LNAI), vol. 8733, pp. 72–82. Springer, Cham (2014). doi: 10.1007/978-3-319-11289-3_8 Google Scholar
  2. 2.
    Asunka, S., Chae, H.S., Hughes, B., Natriello, G.: Understanding academic information seeking habits through analysis of web server log files. J. Acad. Librarianship 35(1), 33–45 (2009)CrossRefGoogle Scholar
  3. 3.
    Bogers, T., van den Bosch, A.: Recommending scientific articles using citeulikeGoogle Scholar
  4. 4.
    Bollen, J., de Sompel, H.V.: An architecture for the aggregation and analysis of scholarly usage data. In: Proceedings of ACM/IEEE Joint Conference on Digital Libraries. Chapel Hill, pp. 298–307, 11–15 June 2006Google Scholar
  5. 5.
    Bollen, J., Van de Sompel, H., Rodriguez, M.A.: Towards usage-based impact metrics: first results from the mesur project. JCDL 2008, pp. 231–240. ACM, New York (2008)Google Scholar
  6. 6.
    Clauset, A., Shalizi, C.R., Newman, M.E.J.: Power-law distributions in empirical data. SIAM Review 51(4), 661–703 (2009)CrossRefzbMATHMathSciNetGoogle Scholar
  7. 7.
    Dogan, R.I., Murray, G.C., Névéol, A., Lu, Z.: Understanding pubmed\(^{\textregistered }\) user search behavior through log analysis. Database, 2009 (2009)Google Scholar
  8. 8.
    Gorraiz, J., Gumpenberger, C., Schlögl, C.: Usage versus citation behaviours in four subject areas. Scientometrics 101(2), 1077–1095 (2014)CrossRefGoogle Scholar
  9. 9.
    Han, H., Jeong, W., Wolfram, D.: Log analysis of academic digital library: user query patterns. iConference 2014 Proceedings (2014)Google Scholar
  10. 10.
    Hemminger, B.M., Lu, D., Vaughan, K.T.L., Adams, S.J.: Information seeking behavior of academic scientists. JASIST 58(14), 2205–2225 (2007)CrossRefGoogle Scholar
  11. 11.
    Huang, W., Wu, Z., Mitra, P., Giles, C.L.: Refseer: a citation recommendation system. In: JCDL. U.K, pp. 371–374, 8–12 Sep 2014Google Scholar
  12. 12.
    Kurtz, M.J., Bollen, J.: Usage bibliometrics. CoRR, abs/1102.2891 (2011)Google Scholar
  13. 13.
    Mönnich, M., Spiering, M.: Adding value to the library catalog by implementing a recommendation system. D-Lib Magazine 14(5), 4 (2008)Google Scholar
  14. 14.
    Niu, X., Hemminger, B.M.: A study of factors that affect the information-seeking behavior of academic scientists. JASIST 63(2), 336–353 (2012)CrossRefGoogle Scholar
  15. 15.
    Pohl, S., Radlinski, F., Joachims, T.: Recommending related papers based on digital library access records. In: JCDL 2007, Proceedings. Vancouver, Canada, pp. 417–418, 18–23 June 2007Google Scholar
  16. 16.
    Tenopir, C., King, D.W., Bush, A.: Medical faculty’s use of print and electronic journals: changes over time and in comparison with scientists. J. Med. Libr. Assoc. 92(2), 233 (2004)Google Scholar
  17. 17.
    Tenopir, C., Volentine, R., King, D.W.: Article and book reading patterns of scholars: findings for publishers. Learn. Publish. 25(4), 279–291 (2012)CrossRefGoogle Scholar
  18. 18.
    Torres, R., McNee, S.M., Abel, M., Konstan, J.A., Riedl, J.: Enhancing digital libraries with techlens+. In: ACM/IEEE Joint Conference on Digital Libraries, JCDL 2004, Proceedings. Tucson, pp. 228–236, 7–11 June 2004Google Scholar
  19. 19.
    Vellino, A.: A comparison between usage-based and citation-based methods for recommending scholarly research articles. ASIST 47(1), 1–2 (2010)Google Scholar
  20. 20.
    Villn-Rueda, L., Senso, J.A., de Moya-Anegn, F.: The use of OPAC in a large academic library: a transactional log analysis study of subject searching. J. Acad. Librarianship 33(3), 327–337 (2007)CrossRefGoogle Scholar

Copyright information

© Springer International Publishing AG 2017

Authors and Affiliations

  • Yixing Fan
    • 1
    Email author
  • Jiafeng Guo
    • 1
  • Yanyan Lan
    • 1
  • Jun Xu
    • 1
  • Xueqi Cheng
    • 1
  1. 1.CAS Key Lab of Network Data Science and Technology, Institute of Computing Technology, Chinese Academy of SciencesBeijingChina

Personalised recommendations