Skip to main content

Finding Survey Papers via Link and Content Analysis

  • Conference paper
  • First Online:
Communication Technologies, Information Security and Sustainable Development (IMTIC 2013)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 414))

Included in the following conference series:

  • 741 Accesses

Abstract

Survey articles provide a comprehensive overview of a specific area of research. Automatic detection of survey articles from huge scientific literature is interesting and useful knowledge discovery task in academic social networks. There are different features which can be exploited to differentiate between survey articles and other research articles. Surveys articles are usually citing many important articles this important feature is used in the past for finding surveys using HITS algorithm in addition to base words, base cues, and article length features. The rank of authors writing the articles and text of articles is not considered. In this paper, two additional features based on Author Rank (author authority score of her papers) and textual feature Entropy (paper disorder score) are introduced. Entropy feature has its special significance as it can be used even when there is no link structure. Empirical results show that proposed enhancements are useful and better results are obtained. Especially for large number of top n papers our proposed methods performance is very stable as compared to existing methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://citeseer.ist.psu.edu/

  2. 2.

    http://www.cs.umass.edu/~mccallum/code-data.html

References

  1. Azzopardi, L., Girolami, M., van Risjbergen, K.: Investigating the relationship between language model perplexity and IR precision-recall measures. In: Proceedings of the 26th ACM SIGIR International Conference on Research and Development in Information Retrieval (2003)

    Google Scholar 

  2. Brin, S., Page, L.: The anatomy of a large-scale hypertextual Web search engine. In: Proceedings of the 7th International Conference on World Wide Web, pp. 107–117 (1998)

    Google Scholar 

  3. Cohn, D., Chang, H.: Learning to probabilistically identify authoritative documents. In: Proceedings of the 17th International Conference on Machine Learning, pp. 167–174 (2000)

    Google Scholar 

  4. Kleinberg, J.M.: Authoritative sources in a hyperlinked environment. In: Proceedings of the 9th Annual ACM–SIAM Symposium on Discrete Algorithms, pp. 668–677 (1998)

    Google Scholar 

  5. Lawrence, S., Giles, L., Bollacker, K.: Digital libraries and autonomous citation indexing. IEEE Comput. 32(6), 67–71 (1999)

    Article  Google Scholar 

  6. Nanba, H., Okumura, M.: Automatic detection of survey articles. In: Rauber, A., Christodoulakis, S., Tjoa, A.M. (eds.) ECDL 2005. LNCS, vol. 3652, pp. 391–401. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  7. n-gram, http://en.wikipedia.org/wiki/N-gram

  8. Zhai, C., Lafferty, J.: A study of smoothing methods for language models applied to ad hoc information retrieval. In: Proceedings of the 24th ACM SIGIR International Conference on Research and Development in Information Retrieval, pp. 334–342 (2001)

    Google Scholar 

  9. Hotho, A., Jäschke, R., Schmitz, C., Stumme, G.: Information retrieval in folksonomies: search and ranking. In: Sure, Y., Domingue, J. (eds.) ESWC 2006. LNCS, vol. 4011, pp. 411–426. Springer, Heidelberg (2006)

    Google Scholar 

  10. Ivn, G., Grolmusz, V.: When the web meets the cell: using personalized PageRank for analyzing protein interaction networks. Biofinformatics 27(3), 405–407 (2011)

    Article  Google Scholar 

  11. Daud, A.: Using time topic modeling for semantics-based dynamic research interest finding. Knowl.-Based Syst. (KBS) 26, 154–163 (2012)

    Article  Google Scholar 

  12. Daud, A., Li, J., Zhou, L., Muhammad, F.: Temporal expert finding through generalized time topic modeling. Knowl.-Based Syst. (KBS) 23(6), 615–625 (2010)

    Article  Google Scholar 

  13. Daud, A., Shaikh, M.A., Rajpar, A.H.: Scientific reference mining using semantic information through topic modeling. Mehran Univ. Res. J. Eng. Technol. 28(2), 253–262 (2009)

    Google Scholar 

  14. Daud, A., Abbasi, R., Muhammad, F.: Finding rising stars in social networks. In: Meng, W., Feng, L., Bressan, S., Winiwarter, W., Song, W. (eds.) DASFAA 2013, Part I. LNCS, vol. 7825, pp. 13–24. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  15. Daud, A., Hussain, S.: Publication venue based language modeling for expert finding. In: Proceedings of International Conference on Future Communication and Computer Technology (ICFCCT 2012), 19−20 May 2012

    Google Scholar 

  16. Manaskasemsak, B., Rungsawang, A., Yamana, H.: Time-weighted web authoritative ranking. Inf. Retrieval J. 14(2), 133–157 (2011)

    Article  Google Scholar 

  17. Shu, L., Long, B., Meng, W.: A latent topic model for complete entity resolution. In: Proceedings of the International Conference on Data Engineering (ICDE) (2009)

    Google Scholar 

  18. Xing, W., Ghorbani, A.: Weighted PageRank algorithm. In: Proceedings of the 2nd Annual Conference on Communication Networks and Services Research, pp. 305−314 (2004)

    Google Scholar 

Download references

Acknowledgments

The work is supported by Higher Education Commission (HEC), Islamabad, Pakistan.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ali Daud .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Daud, A., Sikandar, A., Mansha, S. (2014). Finding Survey Papers via Link and Content Analysis. In: Shaikh, F., Chowdhry, B., Zeadally, S., Hussain, D., Memon, A., Uqaili, M. (eds) Communication Technologies, Information Security and Sustainable Development. IMTIC 2013. Communications in Computer and Information Science, vol 414. Springer, Cham. https://doi.org/10.1007/978-3-319-10987-9_7

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-10987-9_7

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-10986-2

  • Online ISBN: 978-3-319-10987-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics