Abstract
Survey articles provide a comprehensive overview of a specific area of research. Automatic detection of survey articles from huge scientific literature is interesting and useful knowledge discovery task in academic social networks. There are different features which can be exploited to differentiate between survey articles and other research articles. Surveys articles are usually citing many important articles this important feature is used in the past for finding surveys using HITS algorithm in addition to base words, base cues, and article length features. The rank of authors writing the articles and text of articles is not considered. In this paper, two additional features based on Author Rank (author authority score of her papers) and textual feature Entropy (paper disorder score) are introduced. Entropy feature has its special significance as it can be used even when there is no link structure. Empirical results show that proposed enhancements are useful and better results are obtained. Especially for large number of top n papers our proposed methods performance is very stable as compared to existing methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Azzopardi, L., Girolami, M., van Risjbergen, K.: Investigating the relationship between language model perplexity and IR precision-recall measures. In: Proceedings of the 26th ACM SIGIR International Conference on Research and Development in Information Retrieval (2003)
Brin, S., Page, L.: The anatomy of a large-scale hypertextual Web search engine. In: Proceedings of the 7th International Conference on World Wide Web, pp. 107–117 (1998)
Cohn, D., Chang, H.: Learning to probabilistically identify authoritative documents. In: Proceedings of the 17th International Conference on Machine Learning, pp. 167–174 (2000)
Kleinberg, J.M.: Authoritative sources in a hyperlinked environment. In: Proceedings of the 9th Annual ACM–SIAM Symposium on Discrete Algorithms, pp. 668–677 (1998)
Lawrence, S., Giles, L., Bollacker, K.: Digital libraries and autonomous citation indexing. IEEE Comput. 32(6), 67–71 (1999)
Nanba, H., Okumura, M.: Automatic detection of survey articles. In: Rauber, A., Christodoulakis, S., Tjoa, A.M. (eds.) ECDL 2005. LNCS, vol. 3652, pp. 391–401. Springer, Heidelberg (2005)
Zhai, C., Lafferty, J.: A study of smoothing methods for language models applied to ad hoc information retrieval. In: Proceedings of the 24th ACM SIGIR International Conference on Research and Development in Information Retrieval, pp. 334–342 (2001)
Hotho, A., Jäschke, R., Schmitz, C., Stumme, G.: Information retrieval in folksonomies: search and ranking. In: Sure, Y., Domingue, J. (eds.) ESWC 2006. LNCS, vol. 4011, pp. 411–426. Springer, Heidelberg (2006)
Ivn, G., Grolmusz, V.: When the web meets the cell: using personalized PageRank for analyzing protein interaction networks. Biofinformatics 27(3), 405–407 (2011)
Daud, A.: Using time topic modeling for semantics-based dynamic research interest finding. Knowl.-Based Syst. (KBS) 26, 154–163 (2012)
Daud, A., Li, J., Zhou, L., Muhammad, F.: Temporal expert finding through generalized time topic modeling. Knowl.-Based Syst. (KBS) 23(6), 615–625 (2010)
Daud, A., Shaikh, M.A., Rajpar, A.H.: Scientific reference mining using semantic information through topic modeling. Mehran Univ. Res. J. Eng. Technol. 28(2), 253–262 (2009)
Daud, A., Abbasi, R., Muhammad, F.: Finding rising stars in social networks. In: Meng, W., Feng, L., Bressan, S., Winiwarter, W., Song, W. (eds.) DASFAA 2013, Part I. LNCS, vol. 7825, pp. 13–24. Springer, Heidelberg (2013)
Daud, A., Hussain, S.: Publication venue based language modeling for expert finding. In: Proceedings of International Conference on Future Communication and Computer Technology (ICFCCT 2012), 19−20 May 2012
Manaskasemsak, B., Rungsawang, A., Yamana, H.: Time-weighted web authoritative ranking. Inf. Retrieval J. 14(2), 133–157 (2011)
Shu, L., Long, B., Meng, W.: A latent topic model for complete entity resolution. In: Proceedings of the International Conference on Data Engineering (ICDE) (2009)
Xing, W., Ghorbani, A.: Weighted PageRank algorithm. In: Proceedings of the 2nd Annual Conference on Communication Networks and Services Research, pp. 305−314 (2004)
Acknowledgments
The work is supported by Higher Education Commission (HEC), Islamabad, Pakistan.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Daud, A., Sikandar, A., Mansha, S. (2014). Finding Survey Papers via Link and Content Analysis. In: Shaikh, F., Chowdhry, B., Zeadally, S., Hussain, D., Memon, A., Uqaili, M. (eds) Communication Technologies, Information Security and Sustainable Development. IMTIC 2013. Communications in Computer and Information Science, vol 414. Springer, Cham. https://doi.org/10.1007/978-3-319-10987-9_7
Download citation
DOI: https://doi.org/10.1007/978-3-319-10987-9_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-10986-2
Online ISBN: 978-3-319-10987-9
eBook Packages: Computer ScienceComputer Science (R0)