A Community-Driven Graph Partitioning Method for Constraint-Based Causal Discovery

Chaudhary, Mandar S.; Ranshous, Stephen; Samatova, Nagiza F.

doi:10.1007/978-3-319-72150-7_21

Mandar S. Chaudhary⁶,
Stephen Ranshous⁶ &
Nagiza F. Samatova⁶

Part of the book series: Studies in Computational Intelligence ((SCI,volume 689))

Included in the following conference series:

International Conference on Complex Networks and their Applications

4791 Accesses
1 Citations

Abstract

Constraint-based (CB) methods are widely used for discovering causal relationships in observational data. The PC-stable algorithm is a prominent example of CB methods. A critical component of the PC-stable algorithm is to find d-separators and perform conditional independence (CI) tests to eliminate spurious causal relationships. While the pairwise CI tests are necessary for identifying causal relationships, the error rate, where true causal relationships are erroneously removed, increases with the number of tests performed. Efficiently searching for the true d-separator set is thus a critical component to increase the accuracy of the causal graph. To this end, we propose a novel recursive algorithm for constructing causal graphs, based on a two-phase divide and conquer strategy. In phase one, we recursively partition the undirected graph using community detection, and subsequently construct partial skeletons from each partition. Phase two uses a bottom-up approach to merge the subgraph skeletons, ultimately yielding the full causal graph. Simulations on several real-world data sets show that our approach effectively finds the d-separators, leading to a significant improvement in the quality of causal graphs.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 259.00; Price excludes VAT (USA)

Softcover Book: USD 329.99; Price excludes VAT (USA)

Hardcover Book: USD 329.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
A moral graph is the same as a skeleton except it has edges between pairs of variables having a common child.
2.
In this work, we will refer to nodes and variables interchangeably.
3.
http://www.bnlearn.com/bnrepository/.

References

Abellán, J., Gómez-Olmedo, M., Moral, S., et al.: Some variations on the pc algorithm. In: Probabilistic Graphical Models, pp. 1–8 (2006)
Google Scholar
Aliferis, C.F., Statnikov, A., Tsamardinos, I., Mani, S., Koutsoukos, X.D.: Local causal and markov blanket induction for causal discovery and feature selection for classification part i: algorithms and empirical evaluation. J. Mach. Learn. Res. 11, 171–234 (2010)
Google Scholar
Blondel, V.D., Guillaume, J.L., Lambiotte, R., Lefebvre, E.: Fast unfolding of communities in large networks. J. Stat. Mech. Theor. Exp. 2008(10), P10008 (2008)
Google Scholar
Brandes, U., Delling, D., Gaertler, M., Görke, R., Hoefer, M., Nikoloski, Z., Wagner, D.: Maximizing modularity is hard. arXiv preprint physics/0608255 (2006)
Google Scholar
Cai, R., Zhang, Z., Hao, Z.: Sada: A general framework to support robust causation discovery. In: International Conference on Machine Learning, pp. 208–216 (2013)
Google Scholar
Chaudhary, M.S., Gonzalez, D.L., Bello, G.A., Angus, M.P., Desai, D., Harenberg, S., Doraiswamy, P.M., Semazzi, F.H., Kumar, V., Samatova, N.F.: Causality-guided feature selection. In: Advanced Data Mining and Applications, pp. 391–405. Springer (2016)
Google Scholar
Colombo, D., Maathuis, M.H.: Order-independent constraint-based causal structure learning. J. Mach. Learn. Res. 15(1), 3741–3782 (2014)
MathSciNet MATH Google Scholar
Geng, Z., Wang, C., Zhao, Q.: Decomposition of search for v-structures in dags. J. Multivariate Anal. 96(2), 282–294 (2005)
Article MathSciNet MATH Google Scholar
Harenberg, S., Bello, G., Gjeltema, L., Ranshous, S., Harlalka, J., Seay, R., Padmanabhan, K., Samatova, N.: Community detection in large-scale networks: a survey and empirical evaluation. Wiley Interdisciplinary Rev. Comput. Statistics 6(6), 426–439 (2014)
Article Google Scholar
Kalisch, M., Bühlmann, P.: Estimating high-dimensional directed acyclic graphs with the pc-algorithm. J. Mach. Learn. Res. 8, 613–636 (2007)
MATH Google Scholar
Le, T., Hoang, T., Li, J., Liu, L., Liu, H., Hu, S.: A fast pc algorithm for high dimensional causal discovery with multi-core pcs. IEEE/ACM Trans. Comput. Biol. Bioinf (2016)
Google Scholar
Liu, H., Zhou, S., Lam, W., Guan, J.: A new hybrid method for learning bayesian networks: separation and reunion. Knowl. Based Syst. 121, 185–197 (2017)
Article Google Scholar
Meek, C.: Causal inference and causal explanation with background knowledge. In: Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence, pp. 403–410. Morgan Kaufmann Publishers Inc. (1995)
Google Scholar
Spirtes, P., Glymour, C.N., Scheines, R.: Causation, Prediction, and Search. MIT press (2000)
Google Scholar
Tsamardinos, I., Brown, L.E., Aliferis, C.F.: The max-min hill-climbing Bayesian network structure learning algorithm. Mach. Learn. 65(1), 31–78 (2006)
Article Google Scholar
Wille, A., Bühlmann, P.: Low-order conditional independence graphs for inferring genetic networks. Stat. Appl. Genet. Molec. Biol. 5(1) (2006)
Google Scholar
Xie, X., Geng, Z.: A recursive method for structural learning of directed acyclic graphs. J. Mach. Learn. Res. 9, 459–483 (2008)
Google Scholar
Xie, X., Geng, Z., Zhao, Q.: Decomposition of structural learning about directed acyclic graphs. Artif. Intell. 170(4–5), 422–439 (2006)
Article MathSciNet MATH Google Scholar
Zhang, J., Mayer, W., et al.: Weakening faithfulness: some heuristic causal discovery algorithms. Int. J. Data Sci. Anal. 3(2), 93–104 (2017)
Article Google Scholar

Download references

Acknowledgements

This material is based upon work supported by the NSF grant 1029711. In addition, this material is based on work supported in part by the DOE SDAVI Institute and the U.S. National Science Foundation (Expeditions in Computing program).

Author information

Authors and Affiliations

North Carolina State University, Raleigh, NC, USA
Mandar S. Chaudhary, Stephen Ranshous & Nagiza F. Samatova

Authors

Mandar S. Chaudhary
View author publications
You can also search for this author in PubMed Google Scholar
Stephen Ranshous
View author publications
You can also search for this author in PubMed Google Scholar
Nagiza F. Samatova
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mandar S. Chaudhary .

Editor information

Editors and Affiliations

University of Lyon 2, Lyon, France
Chantal Cherifi
University of Burgundy, Dijon, France
Hocine Cherifi
École Normale Supérieure de Lyon, Lyon, France
Márton Karsai
University College London, London, United Kingdom
Mirco Musolesi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chaudhary, M.S., Ranshous, S., Samatova, N.F. (2018). A Community-Driven Graph Partitioning Method for Constraint-Based Causal Discovery. In: Cherifi, C., Cherifi, H., Karsai, M., Musolesi, M. (eds) Complex Networks & Their Applications VI. COMPLEX NETWORKS 2017. Studies in Computational Intelligence, vol 689. Springer, Cham. https://doi.org/10.1007/978-3-319-72150-7_21

Download citation

DOI: https://doi.org/10.1007/978-3-319-72150-7_21
Published: 27 November 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-72149-1
Online ISBN: 978-3-319-72150-7
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics