Abstract
This paper proposes a framework for soft real-time text classification system, which use control theory as a scientific underpinning, rather than ad hoc solutions. In order to provide real-time guarantee, two control loops are adopted. The feed forward control loop estimates the suitable number of classifiers according to the current workload, while the feedback control loop provides fine-grained control to the number of classifiers that perform imprecise computation. The soft real-time classification system can accommodate to the change of workload and transitional overload. The theory analysis and experiments result further prove its effectiveness: the variation range of the average response time is kept within ± 3% of the desired value; the computational resource is dynamically reallocated and reclaimed.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Sebastiani, F.: Machine learning in automated text categorization. ACM Computing Surveys 34(1), 1–47 (2002)
Yang, Y., Zhang, J., Kisiel, B.: A scalability analysis of classifiers in text categorization. In: Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 96–103. ACM Press, New York (2003)
Grossman, D.A., Frieder, O.: Information Retrieval - Algorithms and Heuristics. Kluwer Academic Publishers, Massachusetts (1998)
Li, R.-L., Hu, Y.-F.: Noise reduction to text categorization based on density for KNN. In: Proceedings of the International Conference on Machine Learning and Cybernetics, vol. 5, pp. 3119–3124. IEEE Computer Society, California (2003)
Zhou, S., Ling, T.-W., Guan, J., et al.: Fast text classification: a training-corpus pruning based approach. In: Proceedings of 8th International Conference on Database Systems for Advanced Applications, pp. 127–136. IEEE Computer Society, California (2003)
Deng, Z.-H., Tang, S.-W., Yang, D.-Q., et al.: SRFW: a simple, fast and effective text classification algorithm. In: Proceedings of International Conference on Machine Learning and Cybernetics, vol. 3, pp. 1267–1271. IEEE Computer Society, California (2002)
Liu, C.L., Layland, J.W.: Scheduling algorithms for multiprogramming in a hard real time environment. Journal of the ACM 20 (1973)
Krishna, C.M., Shin, K.G.: Real-Time Systems. McGraw-Hill Companies, Columbus (1997)
Buttazzo, G.C.: Rate monotonic vs. EDF: judgment day. Real-Time Systems 28(1), 5–26 (2005)
Buttazzo, G.C.: Hard Real-Time Computing System: Predictable Scheduling Algorithms and Applications. Kluwer Academic Publishers, Massachusetts (2000)
Sha, L., Abdelzaher, T.F., Arzen, K.-E., et al.: Real time scheduling theory: a historical perspective. Real-Time Systems 28(2/3), 101–155 (2004)
Lu, F.S.: Queuing Theory and Its Applications. Hunan Scientific Publication Press, Hunan (1984)
Lin, Y.L.: Applied Stochastic Process. Tsinghua University Press, Beijing (2002)
Cheng, S., Guo, B., Li, X., et al.: Signal and System. Press of Xian Electronic Technology University, Xian (2001)
Shun, Z.: System Analysis and Control. Tsinghua University Press, Beijing (1994)
Franklin, G.F., Powell, J.D., Workman, M.L.: Digital Control of Dynamic Systems, 3rd edn. Addison-Wesley, Boston (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, H., Chen, Y., Dai, Y. (2005). A Soft Real-Time Web News Classification System with Double Control Loops. In: Fan, W., Wu, Z., Yang, J. (eds) Advances in Web-Age Information Management. WAIM 2005. Lecture Notes in Computer Science, vol 3739. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11563952_8
Download citation
DOI: https://doi.org/10.1007/11563952_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29227-2
Online ISBN: 978-3-540-32087-6
eBook Packages: Computer ScienceComputer Science (R0)