Keywords Extraction via Multi-relational Network Construction

Lei, Kai; Tang, Hanhong; Zeng, YiFan

doi:10.1007/978-3-319-00951-3_4

Kai Lei⁴,
Hanhong Tang⁴ &
YiFan Zeng⁴

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 225))

1238 Accesses

Abstract

Keywords extraction can be regarded as a process of ranking the words in a given document (set) according to their importance to this document (set). Previous graph-based methods usually consider only one kind of relation between words, such as co-occurrence, ignoring the fact that words in a text interact with each other via multiple relations, which collaborate to decide the importance of words. Although some recently published methods use more than one relation type, they fail to consider the interactions between relations. Therefore, we propose a new approach for keywords extraction by constructing a multi-relational network from texts, which evaluates the various relations at the same time. Experiments shows that our approach is competitive compared with some typical methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Mihalcea, R., Tarau, P.: Textrank: Bringing order into texts. In: Proceedings of EMNLP, Barcelona, Spain, vol. 4, pp. 404–411 (2004)
Google Scholar
Wan, X., Xiao, J.: Collabrank: towards a collaborative approach to single-document keyphrase extraction. In: Proceedings of the 22nd International Conference on Computational Linguistics, vol. 1, pp. 969–976. Association for Computational Linguistics (2008)
Google Scholar
Wang, J., Liu, J., Wang, C.: Keyword extraction based on pageRank. In: Zhou, Z.-H., Li, H., Yang, Q. (eds.) PAKDD 2007. LNCS (LNAI), vol. 4426, pp. 857–864. Springer, Heidelberg (2007)
Chapter Google Scholar
Liu, Z., Huang, W., Zheng, Y., Sun, M.: Automatic keyphrase extraction via topic decomposition. In: Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pp. 366–376. Association for Computational Linguistics (2010)
Google Scholar
Pedersen, T., Patwardhan, S., Michelizzi, J.: Wordnet: Similarity: measuring the relatedness of concepts. In: Demonstration Papers at HLT-NAACL 2004, pp. 38–41. Association for Computational Linguistics (2004)
Google Scholar
Van Der Plas, L., Pallotta, V., Rajman, M., Ghorbel, H.: Automatic keyword extraction from spoken text. a comparison of two lexical resources: the edr and wordnet. arXiv preprint cs/0410062 (2004)
Google Scholar
Ng, M.K.P., Li, X., Ye, Y.: Multirank: co-ranking for objects and relations in multi-relational data. In: Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1217–1225. ACM (2011)
Google Scholar
Hulth, A.: Improved automatic keyword extraction given more linguistic knowledge. In: Proceedings of the 2003 Conference on Empirical Methods in Natural Language Processing, pp. 216–223. Association for Computational Linguistics (2003)
Google Scholar
Toutanova, K., Klein, D., Manning, C.D., Singer, Y.: Feature-rich part-of-speech tagging with a cyclic dependency network. In: Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, vol. 1, pp. 173–180. Association for Computational Linguistics (2003)
Google Scholar
Hofmann, T.: Probabilistic latent semantic indexing. In: Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 50–57. ACM (1999)
Google Scholar
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. The Journal of Machine Learning Research 3, 993–1022 (2003)
MATH Google Scholar
Landauer, T.K., Foltz, P.W., Laham, D.: An introduction to latent semantic analysis. Discourse Processes 25(2-3), 259–284 (1998)
Article Google Scholar
Phan, X.H., Nguyen, C.T.: Gibbslda++: A c/c++implementation of latent dirichlet allocation (lda) (2007)
Google Scholar
Zhao, X., Jiang, J., He, J., Song, Y., Achananuparp, P., Lim, E.P., Li, X.: Topical keyphrase extraction from twitter (2011)
Google Scholar
Miller, G.A.: Wordnet: a lexical database for english. Communications of the ACM 38(11), 39–41 (1995)
Article Google Scholar
Stark, M.M., Riesenfeld, R.F.: Wordnet: An electronic lexical database. In: Proceedings of 11th Eurographics Workshop on Rendering. Citeseer (1998)
Google Scholar
Matsuo, Y., Ishizuka, M.: Keyword extraction from a single document using word co-occurrence statistical information. International Journal on Artificial Intelligence Tools 13(01), 157–169 (2004)
Article Google Scholar
Brin, S., Page, L.: The anatomy of a large-scale hypertextual web search engine. Computer Networks and ISDN Systems 30(1), 107–117 (1998)
Article Google Scholar
Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Information Processing and Management 24(5), 513–523 (1988)
Article Google Scholar
Turney, P.D.: Learning algorithms for keyphrase extraction. Information Retrieval 2(4), 303–336 (2000)
Article Google Scholar
Wartena, C., Brussee, R., Slakhorst, W.: Keyword extraction using word co-occurrence. In: DEXA, vol. 10, pp. 54–58 (2010)
Google Scholar
Yin, W., Pei, Y., Huang, L.: Automatic multi-document summarization based on new sentence similarity measures. In: Anthony, P., Ishizuka, M., Lukose, D. (eds.) PRICAI 2012. LNCS, vol. 7458, pp. 832–837. Springer, Heidelberg (2012)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Shenzhen Key Lab for Cloud Computing Technology & Applications (SPCCTA), School of Electronics and Computer Engineering, Peking University, Shenzhen, 518055, P.R. China
Kai Lei, Hanhong Tang & YiFan Zeng

Authors

Kai Lei
View author publications
You can also search for this author in PubMed Google Scholar
Hanhong Tang
View author publications
You can also search for this author in PubMed Google Scholar
YiFan Zeng
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kai Lei .

Editor information

Editors and Affiliations

, Dept of Computer Engineering, KTO Karatay University, 130 Karatay, Konya, 42020, Turkey
Dhinaharan Nagamalai
, School of Computing and Informatics, University of Louisiana at Lafayette, 214 Oliver Hall, Lafayette, 70504-4330, Louisiana, USA
Ashok Kumar
, Dept. of Electrical &, Prairie View A&M University, Prairie View, 77446-0519, Texas, USA
Annamalai Annamalai

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lei, K., Tang, H., Zeng, Y. (2013). Keywords Extraction via Multi-relational Network Construction. In: Nagamalai, D., Kumar, A., Annamalai, A. (eds) Advances in Computational Science, Engineering and Information Technology. Advances in Intelligent Systems and Computing, vol 225. Springer, Heidelberg. https://doi.org/10.1007/978-3-319-00951-3_4

Download citation

DOI: https://doi.org/10.1007/978-3-319-00951-3_4
Publisher Name: Springer, Heidelberg
Print ISBN: 978-3-319-00950-6
Online ISBN: 978-3-319-00951-3
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics