An Automatic Library Data Classification System Using Layer Structure and Voting Strategy
This paper deals with issues of traditional one-layered book classification systems and employs the complementary attribute of various classifiers to propose a two layered book classification system using voting strategy. Moreover, the collection of dissertations from a university library and books from an electronic bookstore are used as the training and testing corpus. The classification codes of dissertations and books are employed as the gold standard as well. Each dissertation contains various components such as title, authors, table of contents, abstract or cited papers et al. To understand the classification effect of all the combinations of components, various combinations are studied as well and the best combination is recommended. The features extracted from abstracts and table of content are found to be most useful for document classification. On the other hand, to obtain the best classification performance, the combination of classifiers for a two-layered book classification system is studied and the best combination was also recommended as well.
KeywordsSupport Vector Machine Classification Performance Vote Strategy Automatic Classification Document Classification
Unable to display preview. Download preview PDF.
- 1.Chen, K.H., Lo, S.C., Lin, C.J.: The Investigation of the Consistency of Subject Cataloging for Academic Journal Articles of Library and Information Science. In: Proceedings of Information and Communication Conference, Taipei, pp. 125–142 (2002) (Chinese)Google Scholar
- 2.Chen, S.Y., Yeh, J.Y., Hwang, M.J., Lin, X.J., Ke, H.R., Yang, W.P.: Automatic Book Classification Method combined with Support Vector Machine and Metadata. International Journal of Advanced Information Technologies (IJAIT) 3(1), 2–21 (2009) (Chinese)Google Scholar
- 3.AL-Nabi, D.L.A., Ahmed, S.: Survey on Classification Algorithms for Data Mining(Comparison and Evaluation). Computer Engineering and Intelligent Systems 4(8), 18–24 (2013)Google Scholar
- 5.Huang, C.M.: A Neural Network Approach to Automatic Classification of Thesis Documents.,Report of National Science Council (NSC 89-2416-H-224-053), Taiwan (2002) (Chinese)Google Scholar
- 6.Huang, J.H.: A Study of Book Title Feature Extraction Based on the Automatic Classification -An Example of BibliographyAutomatically Classified System.Unpublished Master Thesis, Department of Library and Information Science of Fu-Jen Catholic University, Taipei (2008) (Chinese)Google Scholar
- 8.Magdy, W., Darwish, K.: Book Search: Indexing the Valuable Parts. In: Proceedings of the 2008 Workshop on Research Advances in Large Digital Book Repositories, pp. 53–56 (2008)Google Scholar
- 9.Tokkola, K.: Discriminative Features for Document Classification. In: Proceedings of the 16th International Conference on Pattern Recognition (ICPR 2002), vol. 1, pp. 472–475 (2002)Google Scholar
- 12.Yi, K.: Challenges in Automatic Classification using Library Classification Schemes. In: Proceedings of World Library and Information Congress: 72nd IFLA General Conference and Council, pp. 1–14 (2006)Google Scholar