A Brief Survey of Machine Learning Methods for Classification in Networked Data and an Application to Suspicion Scoring

Macskassy, Sofus Attila; Provost, Foster

doi:10.1007/978-3-540-73133-7_13

Sofus Attila Macskassy¹ & Foster Provost²

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNCCN, volume 4503)

Abstract

This paper surveys work from the field of machine learning on the problem of within-network learning and inference. To give motivation and context to the rest of the survey, we start by presenting some (published) applications of within-network inference. After a brief formulation of this problem and a discussion of probabilistic inference in arbitrary networks, we survey machine learning work applied to networked data, along with some important predecessors—mostly from the statistics and pattern recognition literature. We then describe an application of within-network inference in the domain of suspicion scoring in social networks. We close the paper with pointers to toolkits and benchmark data sets used in machine learning research on classification in network data. We hope that such a survey will be a useful resource to workshop participants, and perhaps will be complemented by others.

Author information

Authors and Affiliations

Fetch Technologies, 2041 Rosecrans Ave, Suite 245, El Segundo, CA 90245
Sofus Attila Macskassy
New York University, Stern School of Business, 44 W. 4th Street, New York, NY 10012
Foster Provost

Editor information

Edoardo Airoldi, David M. Blei, Stephen E. Fienberg, Anna Goldenberg, Eric P. Xing, Alice X. Zheng

About this paper

Cite this paper

Macskassy, S.A., Provost, F. (2007). A Brief Survey of Machine Learning Methods for Classification in Networked Data and an Application to Suspicion Scoring. In: Airoldi, E., Blei, D.M., Fienberg, S.E., Goldenberg, A., Xing, E.P., Zheng, A.X. (eds) Statistical Network Analysis: Models, Issues, and New Directions. ICML 2006. Lecture Notes in Computer Science, vol 4503. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73133-7_13

DOI: https://doi.org/10.1007/978-3-540-73133-7_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73132-0
Online ISBN: 978-3-540-73133-7
eBook Packages: Computer Science, Computer Science (R0)
