Skip to main content

Research of New Words Identification in Social Network for Monitoring Public Opinion

  • Conference paper
Trustworthy Computing and Services (ISCTCS 2012)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 320))

Included in the following conference series:

  • 3233 Accesses

Abstract

With the rapid development of Internet, a large number of new words have emerged and widely been used in social network. Traditional segmentation algorithm can’t identify these new words efficiently, which will greatly affect the accuracy in extracting out these hot words and keywords. Moreover, it will affect the performance of the network public opinion monitoring system. In this paper, we use tweets collected from Twitter as the experimental data-set. By calculating frequency statistics of k-gram strings, we can find out new words as candidates, and then identify new words by their practical application frequency using Twitter’s search function. The experiment shows: this segmentation algorithm can effectively identify the new keywords and is more suitable for public opinion monitoring system.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Xu, X.-R.: Study on the Way to Solve the Paroxysmal Public Feelings on Internet. Journal of North China Electric Power University (Social Sciences) (1), 89–93 (2007) (in Chinese)

    Google Scholar 

  2. Wei, W., Xin, X.: Online Public Opinion Hotspot Detection and Analysis Based on Document Clustering. New Technology of Library and Information Service (3), 74–79 (2009) (in Chinese)

    Google Scholar 

  3. Du, J., Xiong, H.-L.: Algorithm to recognize unknown Chinese words based on BBS corpus. Computer Engineering and Design 31(3), 630–633 (2010) (in Chinese)

    Google Scholar 

  4. Li, D., Cao, Y.-D., Wan, Y.-L.: Internet-Oriented New Words Identification. Journal of Beijing University of Posts and Telecommunications 31(1), 26–29 (2008) (in Chinese)

    Google Scholar 

  5. Zou, G., Liu, Y., Liu, Q., et al.: Internet oriented Chinese New Words Detection. Journal of Chinese Information Processing 18(6), 1–9 (2004) (in Chinese)

    Google Scholar 

  6. Cui, S.-Q., Liu, Q., et al.: New Word Detection Based on Large- Scale Corpus. Journal of Computer Research and Development 43(5), 927–932 (2006) (in Chinese)

    Article  Google Scholar 

  7. He, M., Gong, C.C., Zhang, H.-P., et al.: Method of new word identification based on larger-scale corpus. Computer Engineering and Applications 43(21), 157–159 (2007) (in Chinese)

    Google Scholar 

  8. Tang, J.-T., Li, F., Guo, C.S.: Research of New Word Pattern Recognization in Network Monitoring Public Opinion. Computer Technology and Development 22(1), 119–125 (2012) (in Chinese)

    Google Scholar 

  9. Trendistic, http://trendistic.com/

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Xiaoyan, W., kai, X., Ying, S., Jian-long, T., Li, G. (2013). Research of New Words Identification in Social Network for Monitoring Public Opinion. In: Yuan, Y., Wu, X., Lu, Y. (eds) Trustworthy Computing and Services. ISCTCS 2012. Communications in Computer and Information Science, vol 320. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35795-4_75

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-35795-4_75

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-35794-7

  • Online ISBN: 978-3-642-35795-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics