Parallelisation of the PC Algorithm

Madsen, Anders L.; Jensen, Frank; Salmerón, Antonio; Langseth, Helge; Nielsen, Thomas D.

doi:10.1007/978-3-319-24598-0_2

Anders L. Madsen^20,21,
Frank Jensen²⁰,
Antonio Salmerón²²,
Helge Langseth²³ &
…
Thomas D. Nielsen²¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9422))

Included in the following conference series:

Conference of the Spanish Association for Artificial Intelligence

1044 Accesses
2 Citations

Abstract

This paper describes a parallel version of the PC algorithm for learning the structure of a Bayesian network from data. The PC algorithm is a constraint-based algorithm consisting of five steps where the first step is to perform a set of (conditional) independence tests while the remaining four steps relate to identifying the structure of the Bayesian network using the results of the (conditional) independence tests. In this paper, we describe a new approach to parallelisation of the (conditional) independence testing as experiments illustrate that this is by far the most time consuming step. The proposed parallel PC algorithm is evaluated on data sets generated at random from five different real-world Bayesian networks. The results demonstrate that significant time performance improvements are possible using the proposed algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
http://www.sagemath.org.

References

Andreassen, S., Jensen, F.V., Andersen, S.K., Falck, B., Kjærulff, U., Woldbye, M., Sørensen, A.R., Rosenfalck, A., Jensen, F.: MUNIN – an expert EMG assistant. In: Computer-Aided Electromyography and Expert Systems, Chapter 21. Elsevier Science (1989)
Google Scholar
Andreassen, S., Hovorka, R., Benn, J., Olesen, K.G., Carson, E.R.: A model-based approach to insulin adjustment. In: Stefanelli, S., Hasman, A., Fieschi, M., Talmon, J. (eds.) Proceedings of the Third Conference on Artificial Intelligence in Medicine. Lecture Notes in Medical Informatics, pp. 239–248. Springer, Heidelberg (1991)
Google Scholar
Basak, A., Brinster, I., Ma, X., Mengshoel, O.J.: Accelerating Bayesian network parameter learning using hadoop and MapReduce. In: Proceedings of the 1st International Workshop on Big Data, Streams a nd Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications, pp. 101–108 (2012)
Google Scholar
Chen, W., Zong, L., Huang, W., Ou, G., Wang, Y., Yang, D.: An empirical study of massively parallel Bayesian networks learning for sentiment extraction from unstructured text. In: Du, X., Fan, W., Wang, J., Peng, Z., Sharaf, M.A. (eds.) APWeb 2011. LNCS, vol. 6612, pp. 424–435. Springer, Heidelberg (2011)
Chapter Google Scholar
Chu, C.-T., Kim, S.K., Lin, Y.-A., Yu, Y., Bradski, G., Ng, A.Y., Olukotun, K.: Map-reduce for machine learning on multicore. In: NIPS, pp. 281–288 (2006)
Google Scholar
de Jongh, M.: Algorithms for constraint-based learning of Bayesian network structures with large numbers of variables. Ph.D. thesis, Uni\(\dot{\rm o}\)f Pittsburgh (2014)
Google Scholar
Fang, Q., Yue, K., Fu, X., Wu, H., Liu, W.: A MapReduce-based method for learning Bayesian network from massive data. In: Ishikawa, Y., Li, J., Wang, W., Zhang, R., Zhang, W. (eds.) APWeb 2013. LNCS, vol. 7808, pp. 697–708. Springer, Heidelberg (2013)
Chapter Google Scholar
Jensen, F.V., Skaanning, C., Kjærulff, U.: The SACSO system for troubleshooting of printing systems. In: Proceedings of the Seventh Scandinavian Conference on Artificial Intelligence (2001)
Google Scholar
Jensen, F.V., Nielsen, T.D.: Bayesian Networks and Decision Graphs, 2nd edn. Springer, New York (2007)
Book MATH Google Scholar
Kalisch, M., Buhlmann, P.: Estimating high-dimensional directed acyclic graphs with the PC-algorithm. J. Mach. Learn. Res. 8, 613–636 (2008)
Google Scholar
Kjærulff, U.B., Madsen, A.L.: Bayesian Networks and Influence Diagrams: A Guide to Construction and Analysis, 2nd edn. Springer, New York (2013)
Book Google Scholar
Knuth, D.E.: The Art of Computer Programming, Volume 4, Fascicle 3. Addison-Wesley, Reading (2005)
Google Scholar
Madsen, A.L., Jensen, F., Salmeron, A., Karlsen, M., Langseth, H., Nielsen, T.D.: A new method for vertical parallelisation of tan learning based on balanced incomplete block designs. In: Proceedings of PGM, pp. 302–317 (2014)
Google Scholar
Nikolova, O., Aluru, S.: Parallel discovery of direct causal relations and Markov boundaries with applications to gene networks. In: 2011 International Conference IEEE Parallel Processing (ICPP), pp. 512–521 (2011)
Google Scholar
Papanikolaou, A.: Presents Modern Risk-based Methods and Applications to Ship Design, Operation, and Regulations. Springer, Heidelberg (2009)
Google Scholar
Scutari, M.: Learning Bayesian Networks with the bnlearn R Package. J. Stat. Softw. 35(3), 1–22 (2010)
Article MathSciNet Google Scholar
Spirtes, P., Glymour, C., Scheines, R.: Causation, Prediction, and Search. Adaptive Computation and Machine Learning, 2nd edn. MIT Press, Cambridge (2000)
Google Scholar
Stinson, D.: Combinatorial Designs - Constructions and Analysis. Springer, New York (2003)
Google Scholar

Download references

Author information

Authors and Affiliations

HUGIN EXPERT A/S, Aalborg, Denmark
Anders L. Madsen & Frank Jensen
Department of Computer Science, Aalborg University, Aalborg, Denmark
Anders L. Madsen & Thomas D. Nielsen
Department of Mathematics, University of Almería, Almería, Spain
Antonio Salmerón
Department of Computer and Information Science, Norwegian University of Science and Technology, Trondheim, Norway
Helge Langseth

Authors

Anders L. Madsen
View author publications
You can also search for this author in PubMed Google Scholar
Frank Jensen
View author publications
You can also search for this author in PubMed Google Scholar
Antonio Salmerón
View author publications
You can also search for this author in PubMed Google Scholar
Helge Langseth
View author publications
You can also search for this author in PubMed Google Scholar
Thomas D. Nielsen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Anders L. Madsen .

Editor information

Editors and Affiliations

University of Castilla-La Mancha, Albacete, Spain
José M. Puerta
University of Castilla-La Mancha, Albacete, Spain
José A. Gámez
University of Cadiz, Cadiz, Spain
Bernabe Dorronsoro
Public University of Navarre, Pamplona, Spain
Edurne Barrenechea
Pablo de Olavide University, Sevilla, Spain
Alicia Troncoso
Department of Civil Engineering, University of Burgos, Burgos, Spain
Bruno Baruque
Public University of Navarre, Pamplona, Spain
Mikel Galar

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Madsen, A.L., Jensen, F., Salmerón, A., Langseth, H., Nielsen, T.D. (2015). Parallelisation of the PC Algorithm. In: Puerta, J., et al. Advances in Artificial Intelligence. CAEPIA 2015. Lecture Notes in Computer Science(), vol 9422. Springer, Cham. https://doi.org/10.1007/978-3-319-24598-0_2

Download citation

DOI: https://doi.org/10.1007/978-3-319-24598-0_2
Published: 14 November 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-24597-3
Online ISBN: 978-3-319-24598-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics