Abstract
There is an ever growing number of users with accounts on multiple social media and networking sites. Consequently, there is increasing interest in matching user accounts and profiles across different social networks in order to create aggregate profiles of users. In this paper, we present models for Digital Stylometry, which is a method for matching users through stylometry inspired techniques. We experimented with linguistic, temporal, and combined temporal-linguistic models for matching user accounts, using standard and novel techniques. Using publicly available data, our best model, a combined temporal-linguistic one, was able to correctly match the accounts of 31% of 5,612 distinct users across Twitter and Facebook.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Balakrishnan, N., Nevzorov, V.B.: A primer on statistical distributions. John Wiley & Sons (2004)
Balduzzi, M., Platzer, C., Holz, T., Kirda, E., Balzarotti, D., Kruegel, C.: Abusing social networks for automated user profiling. In: Jha, S., Sommer, R., Kreibich, C. (eds.) RAID 2010. LNCS, vol. 6307, pp. 422–441. Springer, Heidelberg (2010)
Bird, S., Klein, E., Loper, E.: Natural language processing with Python. O’Reilly Media, Inc. (2009)
Charniak, E.: Statistical language learning. MIT press (1996)
Cover, T.M., Thomas, J.A.: Elements of information theory. John Wiley & Sons (2012)
Goga, O., Loiseau, P., Sommer, R., Teixeira, R., Gummadi, K.P.: On the reliability of profile matching across large online social networks. In: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1799–1808. ACM (2015)
Iofciu, T., Fankhauser, P., Abel, F., Bischoff, K.: Identifying users across social tagging systems. In: ICWSM (2011)
Jurafsky, D., Martin, J.H.: Speech and language processing: An introduction to natural language processing, computational linguistics, and speech recognition, 2nd edn. Prentice Hall (2008)
Labitzke, S., Taranu, I., Hartenstein, H.: What your friends tell others about you: low cost linkability of social network profiles. In: Proc. 5th International ACM Workshop on Social Network Mining and Analysis, San Diego, CA, USA (2011)
Lutosawski, W.: Principes de stylomtrie (1890)
Malhotra, A., Totti, L., Meira Jr., W., Kumaraguru, P., Almeida, V.: Studying user footprints in different online social networks. In: Proceedings of the 2012 International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2012), pp. 1065–1070. IEEE Computer Society (2012)
Manning, C.D., Schütze, H.: Foundations of statistical natural language processing. MIT press (1999)
Peekyou: http://www.peekyou.com/
Peled, O., Fire, M., Rokach, L., Elovici, Y.: Entity matching in online social networks. In: 2013 International Conference on Social Computing (SocialCom), pp. 339–344. IEEE (2013)
Qazvinian, V., Rosengren, E., Radev, D.R., Mei, Q.: Rumor has it: identifying misinformation in microblogs. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 1589–1599. Association for Computational Linguistics (2011)
Raad, E., Chbeir, R., Dipanda, A.: User profile matching in social networks. In: 2010 13th International Conference on Network-Based Information Systems (NBiS), pp. 297–304. IEEE (2010)
Rajaraman, A., Ullman, J.D.: Mining of massive datasets. Cambridge University Press (2011)
Roy, B.C., Vosoughi, S., Roy, D.: Grounding language models in spatiotemporal context. In: Fifteenth Annual Conference of the International Speech Communication Association (2014)
Social Intelligence Corp: http://www.socialintel.com/
Spokeo: http://www.spokeo.com/
Takahashi, T., Igata, N.: Rumor detection on twitter. In: 2012 Joint 6th International Conference on Soft Computing and Intelligent Systems (SCIS) and 13th International Symposium on Advanced Intelligent Systems (ISIS), pp. 452–457. IEEE (2012)
Vosecky, J., Hong, D., Shen, V.Y.: User identification across multiple social networks. In: First International Conference on Networked Digital Technologies, NDT 2009, pp. 360–365. IEEE (2009)
Vosoughi, S.: Automatic detection and verification of rumors on Twitter. Ph.D. thesis, Massachusetts Institute of Technology (2015)
Vosoughi, S., Roy, D.: A human-machine collaborative system for identifying rumors on twitter. In: ICDM Workshop on Event Analytics using Social Media Data (2015)
Vosoughi, S., Zhou, H., Roy, D.: Enhanced twitter sentiment classification using contextual information. In: Proceedings of the 6th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, pp. 16–24. Association for Computational Linguistics, Lisboa, September 2015. http://aclweb.org/anthology/W15-2904
You, G.W., Hwang, S.W., Nie, Z., Wen, J.R.: Socialsearch: enhancing entity search with social network matching. In: Proceedings of the 14th International Conference on Extending Database Technology, pp. 515–519. ACM (2011)
Zafarani, R., Liu, H.: Connecting corresponding identities across communities. In: ICWSM (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Vosoughi, S., Zhou, H., Roy, D. (2015). Digital Stylometry: Linking Profiles Across Social Networks. In: Liu, TY., Scollon, C., Zhu, W. (eds) Social Informatics. SocInfo 2015. Lecture Notes in Computer Science(), vol 9471. Springer, Cham. https://doi.org/10.1007/978-3-319-27433-1_12
Download citation
DOI: https://doi.org/10.1007/978-3-319-27433-1_12
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-27432-4
Online ISBN: 978-3-319-27433-1
eBook Packages: Computer ScienceComputer Science (R0)