Parallel Training of An Improved Neural Network for Text Categorization

Li, Cheng Hua; Yang, Laurence T.; Lin, Man

doi:10.1007/s10766-013-0245-x

Parallel Training of An Improved Neural Network for Text Categorization

Published: 07 April 2013

Volume 42, pages 505–523, (2014)
Cite this article

International Journal of Parallel Programming Aims and scope Submit manuscript

Cheng Hua Li^1,2,
Laurence T. Yang² &
Man Lin²

421 Accesses
6 Citations
Explore all metrics

Abstract

This paper studies parallel training of an improved neural network for text categorization. With the explosive growth on the amount of digital information available on the Internet, text categorization problem has become more and more important, especially when millions of mobile devices are now connecting to the Internet. Improved back-propagation neural network (IBPNN) is an efficient approach for classification problems which overcomes the limitations of traditional BPNN. In this paper, we utilize parallel computing to speedup the neural network training process of IBPNN. The parallel IBNPP algorithm for text categorization is implemented on a Sun Cluster with 34 nodes (processors). The communication time and speedup for the parallel IBPNN versus various number of nodes are studied. Experiments are conducted on various data sets and the results show that the parallel IBPNN together with SVD technique achieves fast computational speed and high text categorization correctness.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Development and Application of Artificial Neural Network

Article 30 December 2017

Automated machine learning: past, present and future

Article Open access 18 April 2024

A survey of the recent architectures of deep convolutional neural networks

Article 21 April 2020

References

ai.mit. 20-news-18828 version: http://www.ai.mit.edu/jrennie/20Newsgroups (2010)
Chen, G.A., Yu, X.H., Cheng, S.X.: Acceleration of backpropagation learning using optimized learning rate and momentum. Electron. Lett. 29(14), 1288–1289 (1993)
Article Google Scholar
Costa, M.A., Braga, A., de Menezes, B.R.: Improving neural networks generalization with new constructive and pruning methods. J. Intell. Fuzzy Syst. 13, 75–83 (2003)
MATH Google Scholar
Dahl, G., McAvinney, A., Newhall, T.: Parallelizing neural network training for cluster systems. In: Proceedings of the IASTED International Conference on Parallel and Distributed Computing and Networks (2008)
daviddlewis: Reuters21578 data set. http://www.daviddlewis.com/resources/testcollections/reuters21578 (2010)
Joachims, T.: Text categorization with support vector machines: learning with many relevant features. In: Nedellec, C., Rouveirol, C. (Eds.) Proceedings of the 10th European Conference on Machine Learning (ECML’98), pp. 137–142, Springer, Berlin (1998)
Kontar, S.: Parallel training of neural network for speech recognition. In: Proceedings of the 12th International Conference on, Soft Computing (2006)
Kramer, A.H., Sangiovanni-Vincentelli, A.: Efficient parallel learning algorithms for neural networks. In: Touretzky, S. (Ed.) Advances in Neural Information Processing Systems, pp. 40–48 (1989)
Lai, K.K., Yu, L., Wang, S.: Neural network metalearning for parallel textual information retrieval. Int. Jo. Artif. Intell. 1(A08), 173–184 (2008)
Google Scholar
Lewis, D.D.: Naive (Bayes) at forty. The independence assumption in information retrieval. In: Proceedings of the 10th European Conference on, Machine Learning (ECML’98), pp. 4–15 (1998)
Lewis, D.D., Gale, W.A.: A sequential algorithm for training text classifiers. In: SIGIR ’94 Proceedings of the 17th Annual International ACM SIGIR Conference, pp. 3–12 (1994)
Li, C.H., Park, S.C.: Combination of modified bpnn algorithms and an efficient feature selection method for text categorization. Inf. Process. Manag. 45, 329–340 (2009)
Article Google Scholar
Li, C.H., Park, S.C.: An efficient document classification model using an improved back propagation neural network and singular value decomposition. Expert Syst. Appl. 36(2), 3208–3215 (2009)
Article MathSciNet Google Scholar
Lin, M., Ding, C.: Parallel genetic algorithms for dvs scheduling of distributed embedded systems. High Perform. Comput. Commun. LNCS 4782, 180–191 (2007)
Article Google Scholar
McCallum, A., Nigam, K.: A comparison of event models for naive bayes text classification. In: AAAI’98 Workshop on Learning for Text Categorization, pp. 41–48 (1998)
Porter, M.F.: An algorithm for suffix stripping. Program 14(3), 130–137 (1980)
Article Google Scholar
Skucas, I., Remeikis, N., Melninkaite, V.: A combined neural network and decision tree approach for text categorization. Inf. Syst. Dev. XXVII, 173–184 (2005)
Google Scholar
Srinivasan, P., Ruiz, M.E.: Automatic text categorization using neural network. In: Proceedings of the 8th ASIS SIG/CR Workshop on Classification Research, pp. 59–72 (1998)
Tamura, H., Ishii, M., Wang, X.G., Tang, Z., Sun, W.D.: An improved backpropagation algorithm to avoid the local minima problem. Neurocomputing 56, 455–460 (2004)
Article Google Scholar
Tan, S.B.: An effective refinement strategy for KNN text classifier. Expert Syst. Appl. 30(2), 290–298 (2006)
Article Google Scholar
Windheuser, U., Zick, F.K., Krahl, D.: Data Mining–Einsatz Inder Praxis. Addison Wesley/Longman, Bonn (1998)
Google Scholar
Yam, J.Y.F., Chow, T.W.S.: A weight initialization method for improving training speed in feed forward neural network. IEEE Trans. Neural Netw. 2(30), 219–232 (2000)
Google Scholar
Yang, L.T., Xu, L., Lin, M.: Integer factorization by a parallel gnfs algorithm for public key cryptosystems. Embed. Softw. Syst. LNCS 3820, 683–695 (2005)
Article Google Scholar
Zelikovitz, S., Hirsh, H.: Using lsi for text classification in the presence of background text. In: Proceedings of the Tenth International Conference on Information and Knowledge Management, pp. 113–118, ACM Press (2001)
Zeng, H.J., Lu, Y.C., Shi, C.Y., Sun, J.T., Chen, Z., Ma, W.Y.: Supervised latent semantic indexing for document categorization. In: ICDM, pp. 535–538. IEEE Press (2004)

Download references

Acknowledgments

This work was supported by NSERC (Natural Sciences and Engineering Research Council, Canada) and CFI (Canadian Foundation of Innovation).

Author information

Authors and Affiliations

CUIIUC, ChangZhou University, Changzhou, People’s Republic of China
Cheng Hua Li
Department of Mathematics, Statistics and Computer Science, St. Francis Xavier University, Antigonish, Canada
Cheng Hua Li, Laurence T. Yang & Man Lin

Authors

Cheng Hua Li
View author publications
You can also search for this author in PubMed Google Scholar
Laurence T. Yang
View author publications
You can also search for this author in PubMed Google Scholar
Man Lin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Man Lin.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Li, C.H., Yang, L.T. & Lin, M. Parallel Training of An Improved Neural Network for Text Categorization. Int J Parallel Prog 42, 505–523 (2014). https://doi.org/10.1007/s10766-013-0245-x

Download citation

Received: 02 December 2012
Accepted: 19 March 2013
Published: 07 April 2013
Issue Date: June 2014
DOI: https://doi.org/10.1007/s10766-013-0245-x

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Parallel Training of An Improved Neural Network for Text Categorization

Abstract

Access this article

Similar content being viewed by others

Development and Application of Artificial Neural Network

Automated machine learning: past, present and future

A survey of the recent architectures of deep convolutional neural networks

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Parallel Training of An Improved Neural Network for Text Categorization

Abstract

Access this article

Similar content being viewed by others

Development and Application of Artificial Neural Network

Automated machine learning: past, present and future

A survey of the recent architectures of deep convolutional neural networks

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation