Abstract
With the rapid development of social networks, the current scale of graph data continues to increase, and the performance of anonymous social network methods is limited. Node reachability query is essential in directed graphs, which can reflect the relationship between nodes and the direction of information dissemination. Aiming at the problem of the reachability of nodes between directed social network privacy technologies, this paper proposes a reachability preserving distribution perturbation (RPDP) algorithm, which is based on the distributed graph processing system GraphX. This algorithm first generates a Random Neighborhood Table (RNT) composed of four tuples for the nodes and then uses the message transmission of GraphX and “probe” mechanism. The proposed algorithm improves the disposal efficiency of the large-scale social network while maintaining the reachability of the nodes. Experiments based on the real social network data show that the proposed algorithm can keep the node reachability and deal with large-scale social network efficiently while protecting the character of the graph structure.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Backstrom, L, Dwork, C.: Kleinberg, J.: Wherefore art thou r3579x? anonymized social networks, hidden patterns, and structural steganography. In: 16th International Conference on World Wide Web on Proceedings, pp. 181–190. ACM, Canada (2007)
Hay, M., Miklau, G., Jensen, D.: Anonymizing social networks. Computer science department faculty publication series, pp. 173–187 (2007)
Ying, X.W., Wu, X.T.: Randomizing social networks: a spectrum preserving approach. In: 8th SIAM International Conference on Data Mining on Proceedings, pp. 739–750. Industrial and Applied Mathematics, United States (2008)
Ying, X., Wu, X.: On link privacy in randomizing social networks. In: Theeramunkong, T., Kijsirikul, B., Cercone, N., Ho, T.-B. (eds.) PAKDD 2009. LNCS (LNAI), vol. 5476, pp. 28–39. Springer, Heidelberg (2009). https://doi.org/10.1007/978-3-642-01307-2_6
Fu, L.Z., Meng, X.F.: Reachability indexing for large-scale graphs: studies and forecasts. J. Comput. Res. Dev. 2(1), 116–129 (2015)
Seufert, S., Anand, A., Bedathur, S.: FERRARI: flexible and efficient reachability range assignment for graph indexing. In IEEE 29th International Conference on Data Engineering on Proceedings, pp. 1009–1020. IEEE, Brisbane (2013)
Cheng, J., Shang, Z., Cheng, H.: Efficient processing of k-hop reachability queries. VLDB 23(2), 227–252 (2014)
Zhou, J.F., Chen, W., Fei, C, P.: BiRch: a bidirectional search algorithm for k-step reachability queries. Communications 36(8), 50–60 (2015)
Jurczyk, P., Xiong, L.: Distributed anonymization: achieving privacy for both data subjects and data providers. In: Gudes, E., Vaidya, J. (eds.) DBSec 2009. LNCS, vol. 5645, pp. 191–207. Springer, Heidelberg (2009). https://doi.org/10.1007/978-3-642-03007-9_13
Zhang, X., Dou, W., Pei, J.: Proximity-aware local-recoding anonymization with MapReduce for scalable big data privacy preservation in cloud. IEEE Trans. Comput. 64(8), 2293–2307 (2015)
Zhang, X,. Yang, L, T., Liu, C.: A scalable two-phase top-down specialization approach for data anonymization using MapReduce on cloud. IEEE Trans. Parallel Distrib. Syst. 25(2), 363–373 (2014)
Fard, A.M., Wang, K., Yu, P.S.: Limiting link disclosure in social network analysis through subgraph-wise perturbation. In: 15th International Conference on Extending Database Technology on Proceedings, pp. 109–119, ACM, New York (2012)
Fard, A.M., Wang, K.: Neighborhood randomization for link privacy in social network analysis. World Wide Web 18(1), 9–32 (2015)
Bhagat, S., Cormode, G., Krishnamurthy, B.: Class-based graph anonymization for social network data. VLDB Endow. 2(1), 766–777 (2009)
Wang, L., Li, X.: A clustering-based bipartite graph privacy preserving approach for sharing high-dimensional data. Int. J. Softw. Eng. Knowl. Eng. 24(07), 1091–1111 (2017)
Wang, Y., Zheng, B.: Preserving privacy in social networks against connection fingerprint attacks. In: IEEE 31st International Conference on Data Engineering on Proceedings, pp. 54–65. IEEE, Seoul (2015)
Masoumzadeh, A., Joshi, J.: Preserving structural properties in edge-perturbing anonymization techniques for social networks. IEEE Trans. Dependable Secure Comput. 9(6), 877–889 (2012)
Masoumzadeh, A., Joshi, J.: Preserving structural properties in anonymization of social networks. In: 6th International Conference on Collaborative Computing: Networking Applications and Worksharing on Proceedings, pp. 1–10. IEEE, Chicago (2011)
Zhang, L., Zhang, W.: Edge anonymity in social network graphs. In: Proceedings International Conference on Computational Science & Engineering, pp. 1–8. IEEE, Vancouver (2009)
Casas-Roma, J.: Privacy-preserving on graphs using randomization and edge-relevance. In: Torra, V., Narukawa, Y., Endo, Y. (eds.) MDAI 2014. LNCS (LNAI), vol. 8825, pp. 204–216. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-12054-6_18
Mittal, P., Papamanthou, C., Song, D.: Preserving Link Privacy in Social Network Based Systems. arXiv preprint arXiv:1208.6189 (2012)
Liu, C., Mittal, P.: LinkMirage: enabling privacy-preserving analytics on social relationships. In: Network & Distributed System Security Symposium on Proceedings (2016)
Liu, C., Mittal, P.: LinkMirage: How to Anonymize Links in Dynamic Social Systems. Eprint arXiv:1501.01361 (2015)
Gonzalez, J.E., Xin, R.S., Dave, A.: Graphx: graph processing in a distributed dataflow framework. In: Usenix Conference on Operating Systems Design & Implementation on Proceedings, pp. 599–613 (2014)
Xin, R.S., Crankshaw, D., Dave, A.: GraphX: unifying data-parallel and graph-parallel analytics. arXiv preprint arXiv:1402.2394 (2014)
Xin, R.S., Gonzalez, J.E., Franklin, M.J.: GraphX: a resilient distributed graph system on Spark. In: First International Workshop on Graph Data Management Experiences and Systems on Proceedings. ACM, 2 (2013)
Daly, E., Haahr, M.: Social network analysis for information flow in disconnected delay-tolerant MANETs. IEEE Trans. Mob. Comput. 8(5), 621 (2009)
Acknowledgments
This work is partially supported by Natural Science Foundation of China (No. 61562065). The authors also gratefully acknowledge the helpful comments and suggestions of the reviewers, which have improved the presentation.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Zhang, X., Li, J., He, X., Liu, J. (2019). Distributed Graph Perturbation Algorithm on Social Networks with Reachability Preservation. In: Jin, H., Lin, X., Cheng, X., Shi, X., Xiao, N., Huang, Y. (eds) Big Data. BigData 2019. Communications in Computer and Information Science, vol 1120. Springer, Singapore. https://doi.org/10.1007/978-981-15-1899-7_14
Download citation
DOI: https://doi.org/10.1007/978-981-15-1899-7_14
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-1898-0
Online ISBN: 978-981-15-1899-7
eBook Packages: Computer ScienceComputer Science (R0)