Abstract
Most real-word data can be modeled as heterogeneous information networks (HINs), which are composed of multiple types of nodes and links. Classification for objects in HINs is a fundamental problem with broad applications. However, traditional methods cannot involve in heterogeneous information networks. These approaches could not involve the relatedness between objects and various path semantics. In this paper, we proposed a novel framework called CHIN for classification. It utilizes the relevance measurement on objects to iteratively label objects in HINs. As different meta-path performs different accuracy for classification, the proposed framework incorporates the weights of meta-paths. As our experiments show, CHIN generates more accurate classes than the other classification algorithm, but also provides meaningful weights for meta-paths for classification task.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Sun, Y., Han, J., Yan, X., Yu, P.S., Wu, T.: PathSim: meta path-based top-k similarity search in heterogeneous information networks. Proc. VLDB Endow. 4(11), 992–1003 (2011)
Sun, Y., Yu, Y., Han, J.: Ranking-based clustering of heterogeneous information networks with star network schema. In: Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 797–806. ACM (2009)
Ellison, N.B.: Social network sites: definition, history, and scholarship. J. Comput. Mediat. Commun. 13(1), 210–230 (2007)
Völkel, M., Krötzsch, M., Vrandecic, D., Haller, H., Studer, R.: Semantic wikipedia. In: Proceedings of the 15th International Conference on World Wide Web, pp. 585–594. ACM (2006)
Gupta, M., Kumar, P., Bhasker, B.: A new relevance measure for heterogeneous networks. In: Madria, S., Hara, T. (eds.) DaWaK 2015. LNCS, vol. 9263, pp. 165–177. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-22729-0_13
Brin, S., Page, L.: The anatomy of a large-scale hypertextual web search engine. Comput. Netw. ISDN Syst. 30(1–7), 107–117 (1998)
Li, J., Ge, B., Yang, K., Chen, Y., Tan, Y.: Meta-path based heterogeneous combat network link prediction. Phys. A Stat. Mech. Appl. 482, 507–523 (2017)
Santiago, A., Benito, R.M.: Robustness of heterogeneous complex networks. Phys. A Stat. Mech. Appl. 388(11), 2234–2242 (2009)
Gupta, M., Kumar, P., Bhasker, B.: DPRel: a meta-path based relevance measure for mining heterogeneous networks. Inf. Syst. Front., 1–17 (2017)
Macskassy, S.A., Provost, F.: Classification in networked data: a toolkit and a univariate case study. J. Mach. Learn. Res. 8(May), 935–983 (2007)
Wan, C., Li, X., Kao, B., Yu, X., Gu, Q., Cheung, D., Han, J.: Classification with active learning and meta-paths in heterogeneous information networks. In: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, pp. 443–452. ACM (2015)
Ji, M., Han, J., Danilevsky, M.: Ranking-based classification of heterogeneous information networks. In: Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2011, pp. 1298–1306. ACM, New York (2011)
Pio, G., Serafino, F., Malerba, D., Ceci, M.: Multi-type clustering and classification from heterogeneous networks. Inf. Sci. 425, 107–126 (2018)
Zhou, D., Bousquet, O., Lal, T.N., Weston, J., Schölkopf, B.: Learning with local and global consistency. In: Advances in Neural Information Processing Systems, pp. 321–328 (2004)
Ji, M., Sun, Y., Danilevsky, M., Han, J., Gao, J.: Graph regularized transductive classification on heterogeneous information networks. In: Balcázar, J.L., Bonchi, F., Gionis, A., Sebag, M. (eds.) ECML PKDD 2010. LNCS (LNAI), vol. 6321, pp. 570–586. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-15880-3_42
Macskassy, S.A., Provost, F.: A simple relational classifier. Technical report, New York Univ NY STERN School of Business (2003)
Sun, Y., Han, J.: Mining heterogeneous information networks: a structural analysis approach. ACM SIGKDD Explor. Newsl. 14(2), 20–28 (2013)
Shi, C., Li, Y., Zhang, J., Sun, Y., Philip, S.Y.: A survey of heterogeneous information network analysis. IEEE Trans. Knowl. Data Eng. 29(1), 17–37 (2017)
Acknowledgments
This work is supported by National Key R&D Program of China (No. 2017YFC08033007), the National Natural Science of Foundation of China (No. 91546111, 91646201) and Basic Research Funding of Beijing University of Technology (No. 040000546318516).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Zhang, J., Jiang, Z., Li, T. (2018). CHIN: Classification with META-PATH in Heterogeneous Information Networks. In: Florez, H., Diaz, C., Chavarriaga, J. (eds) Applied Informatics. ICAI 2018. Communications in Computer and Information Science, vol 942. Springer, Cham. https://doi.org/10.1007/978-3-030-01535-0_5
Download citation
DOI: https://doi.org/10.1007/978-3-030-01535-0_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-01534-3
Online ISBN: 978-3-030-01535-0
eBook Packages: Computer ScienceComputer Science (R0)