Distributing Text Classification in Grid Environments
The previous chapters looked at several ways to improve the performance of support vector machines (SVMs) and relevance vector machines (RVMs) in text classification applications.
Most data mining problems are nowadays faced with two great challenges. First, the volume of digital data available is growing massively in almost all application areas. Second, state-of-the-art learning machines are becoming increasingly demanding in terms of computing power. This chapter establishes a high-performance distributed computing environment model where the learning techniques proposed in the previous chapters are efficiently deployed and tested in large scale corpora.
KeywordsDirect Acyclic Graph Schedule Scheme Testing Document Computing Node Grid Environment
Unable to display preview. Download preview PDF.