Abstract
Sparse Grids (SG), due to Zenger, are the basis for efficient high dimensional approximation and have recently been applied successfully to predictive modelling. They are spanned by a collection of simpler function spaces represented by regular grids. The combination technique prescribes how approximations on simple grids can be combined to approximate the high dimensional functions. It can be improved by iterative refinement.
Fitting sparse grids admits the exploitation of parallelism at various stages. The fit can be done entirely by fitting partial models on regular grids. This allows parallelism over the partial grids. In addition, each of the partial grid fits can be parallelised as well, both in the assembly phase where parallelism is done over the data and in the solution stage using traditional parallel solvers for the resulting PDEs. A simple timing model confirms that the most effective methods are obtained when both types of parallelism are used.
Part of the work was supported by the German Bundesministerium für Bildung und Forschung (BMB+F) within the project 03GRM6BN.
Chapter PDF
Similar content being viewed by others
References
Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and Regression Trees. Statistics/Probability Series. Wadsworth Publishing Company, Belmont, California, U.S.A. (1984)
Friedman, J.H.: Multivariate adaptive regression splines. Ann. Statist. 19 (1991) 1–141 With discussion and a rejoinder by the author.
Zenger, C.: Sparse grids. In Hackbusch, W., ed.: Parallel Algorithms for Partial Differential Equations, Proceedings of the Sixth GAMM-Seminar, Kiel, 1990. Volume 31 of Notes on Num. Fluid Mech., Vieweg (1991) 241–251
Griebel, M., Schneider, M., Zenger, C.: A combination technique for the solution of sparse grid problems. In de Groen, P., Beauwens, R., eds.: Iterative Methods in Linear Algebra, IMACS, Elsevier, North Holland (1992) 263–281
Garcke, J., Griebel, M.: Classification with sparse grids using simplicial basis functions. Intelligent Data Analysis 6 (2002) 483–502 (shortened version appeared in KDD 2001, Proc. Seventh ACM SIGKDD, F. Provost and R. Srikant (eds.), pages 87–96, ACM, 2001).
Garcke, J., Griebel, M., Thess, M.: Data mining with sparse grids. Computing 67 (2001) 225–253
Hastie, T.J., Tibshirani, R.J.: Generalized additive models. Volume 43 of Monographs on Statistics and Applied Probability. Chapman and Hall Ltd., London (1990)
Hastie, T., Tibshirani, R.: Generalized additive models. Statist. Sci. 1 (1986) 297–318 With discussion.
Hegland, M.: Additive sparse grid fitting. In: Proceedings of the Fifth International Conference on Curves and Surfaces, Saint-Malo, France 2002. (2002) submitted.
Hegland, M., Nielsen, O.M., Shen, Z.: High dimensional smoothing based on multilevel analysis. Submitted (2000) Available at http://datamining.anu.edu.au/publications/2000/hisurf2000.ps.gz.
Blackford, L.S., Choi, J., Cleary, A., D’Azevedo, E., Demmel, J., Dhillon, I., Dongarra, J., Hammarling, S., Henry, G., Petitet, A., Stanley, K., Walker, D., Whaley, R.C.: ScaLAPACK Users’ Guide. Society for Industrial and Applied Mathematics, Philadelphia, PA (1997)
Griebel, M.: The combination technique for the sparse grid solution of PDEs on multiprocessor machines. Parallel Processing Letters 2 (1992) 61–70
Garcke, J., Griebel, M.: On the parallelization of the sparse grid approach for data mining. In Margenov, S., Wasniewski, J., Yalamov, P., eds.: Large-Scale Scientific Computations, Third International Conference, Sozopol, Bulgaria. Volume 2179 of Lecture Notes in Computer Science, (2001) 22–32
Griebel, M., Huber, W., Störtkuhl, T., Zenger, C.: On the parallel solution of 3D PDEs on a network of workstations and on vector computers. In Bode, A., Cin, M.D., eds.: Lecture Notes in Computer Science 732, Parallel Computer Architectures: Theory, Hardware, Software, Applications, Springer Verlag (1993) 276–291
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Garcke, J., Hegland, M., Nielsen, O. (2003). Parallelisation of Sparse Grids for Large Scale Data Analysis. In: Sloot, P.M.A., Abramson, D., Bogdanov, A.V., Gorbachev, Y.E., Dongarra, J.J., Zomaya, A.Y. (eds) Computational Science — ICCS 2003. ICCS 2003. Lecture Notes in Computer Science, vol 2659. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44863-2_67
Download citation
DOI: https://doi.org/10.1007/3-540-44863-2_67
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40196-4
Online ISBN: 978-3-540-44863-1
eBook Packages: Springer Book Archive