A SVM Applied Text Categorization of Academia-Industry Collaborative Research and Development Documents on the Web

Kurakawa, Kei; Sun, Yuan; Yamashita, Nagayoshi; Baba, Yasumasa

doi:10.1007/978-3-319-06692-9_19

Kei Kurakawa²²,
Yuan Sun²²,
Nagayoshi Yamashita²³ &
…
Yasumasa Baba²⁴

Part of the book series: Studies in Classification, Data Analysis, and Knowledge Organization ((STUDIES CLASS))

2253 Accesses

Abstract

A method of automatically extracting Japanese documents describing University-Industry (U-I) relations from the Web is proposed. The proposed method consists of Japanese text processing and support vector machine (SVM) classification. The SVM feature selections were customized for U-I relations documents. The strongest experimental result was 79.95 of accuracy and 81.17 of f-measure.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

References

Aizawa, A. (2003). An information-theoretic perspective of tf-idf measures. Information Processing and Management, 39(1), 45–65. doi:10.1016/S0306-4573(02)00021-3.
Article MATH MathSciNet Google Scholar
Bouckaert, R. (2003). Choosing between two learning algorithms based on calibrated tests. In Proceedings of the 20th International Conference on Machine Learning (ICML-2003) (pp. 51–58), Washington, DC.
Google Scholar
Leydesdorff, L., & Meyer, M. (2003). The triple helix of university-industry-government relations. Scientometrics, 58(2), 191–203.
Article Google Scholar
Salton, G., & Buckley, C. (1988). Term-weighting approaches in automatic text retrieval. Information Processing & Management, 24(5), 513–523.
Article Google Scholar
Vapnik, V. N. (1995). The nature of statistical learning theory. New York: Springer.
Book MATH Google Scholar
Yang, Y., & Pedersen, J. O. (1997). A comparative study on feature selection in text categorization. In D. H. Fisher (Ed.), Proceedings of ICML-97, 14th International Conference on Machine Learning (pp. 412–420). San Francisco: Morgan Kaufmann.
Google Scholar

Download references

Author information

Authors and Affiliations

National Institute of Informatics, Tokyo, 101-8430, Japan
Kei Kurakawa & Yuan Sun
GMO Research, Tokyo, 150-8512, Japan
Nagayoshi Yamashita
The Institute of Statistical Mathematics, Tokyo, 190-8562, Japan
Yasumasa Baba

Authors

Kei Kurakawa
View author publications
You can also search for this author in PubMed Google Scholar
Yuan Sun
View author publications
You can also search for this author in PubMed Google Scholar
Nagayoshi Yamashita
View author publications
You can also search for this author in PubMed Google Scholar
Yasumasa Baba
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kei Kurakawa .

Editor information

Editors and Affiliations

Department of Statistical Science, University of Rome "La Sapienza", Rome, Italy
Donatella Vicari
and Information Sciences, Tama University Graduate School of Management, Tokyo, Japan
Akinori Okada
Department of Political Science, University of Naples "Federico II", Naples, Italy
Giancarlo Ragozini
Fakultät Statistik, Technische Universität Dortmund, Dortmund, Germany
Claus Weihs

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kurakawa, K., Sun, Y., Yamashita, N., Baba, Y. (2014). A SVM Applied Text Categorization of Academia-Industry Collaborative Research and Development Documents on the Web. In: Vicari, D., Okada, A., Ragozini, G., Weihs, C. (eds) Analysis and Modeling of Complex Data in Behavioral and Social Sciences. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Cham. https://doi.org/10.1007/978-3-319-06692-9_19

Download citation

DOI: https://doi.org/10.1007/978-3-319-06692-9_19
Published: 17 June 2014
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-06691-2
Online ISBN: 978-3-319-06692-9
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics