Skip to main content

A SVM Applied Text Categorization of Academia-Industry Collaborative Research and Development Documents on the Web

  • Conference paper
  • First Online:
Analysis and Modeling of Complex Data in Behavioral and Social Sciences
  • 2253 Accesses

Abstract

A method of automatically extracting Japanese documents describing University-Industry (U-I) relations from the Web is proposed. The proposed method consists of Japanese text processing and support vector machine (SVM) classification. The SVM feature selections were customized for U-I relations documents. The strongest experimental result was 79.95 of accuracy and 81.17 of f-measure.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://mecab.googlecode.com/svn/trunk/mecab/doc/index.html.

  2. 2.

    http://svmlight.joachims.org/.

References

  • Aizawa, A. (2003). An information-theoretic perspective of tf-idf measures. Information Processing and Management, 39(1), 45–65. doi:10.1016/S0306-4573(02)00021-3.

    Article  MATH  MathSciNet  Google Scholar 

  • Bouckaert, R. (2003). Choosing between two learning algorithms based on calibrated tests. In Proceedings of the 20th International Conference on Machine Learning (ICML-2003) (pp. 51–58), Washington, DC.

    Google Scholar 

  • Leydesdorff, L., & Meyer, M. (2003). The triple helix of university-industry-government relations. Scientometrics, 58(2), 191–203.

    Article  Google Scholar 

  • Salton, G., & Buckley, C. (1988). Term-weighting approaches in automatic text retrieval. Information Processing & Management, 24(5), 513–523.

    Article  Google Scholar 

  • Vapnik, V. N. (1995). The nature of statistical learning theory. New York: Springer.

    Book  MATH  Google Scholar 

  • Yang, Y., & Pedersen, J. O. (1997). A comparative study on feature selection in text categorization. In D. H. Fisher (Ed.), Proceedings of ICML-97, 14th International Conference on Machine Learning (pp. 412–420). San Francisco: Morgan Kaufmann.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kei Kurakawa .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Kurakawa, K., Sun, Y., Yamashita, N., Baba, Y. (2014). A SVM Applied Text Categorization of Academia-Industry Collaborative Research and Development Documents on the Web. In: Vicari, D., Okada, A., Ragozini, G., Weihs, C. (eds) Analysis and Modeling of Complex Data in Behavioral and Social Sciences. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Cham. https://doi.org/10.1007/978-3-319-06692-9_19

Download citation

Publish with us

Policies and ethics