Abstract
Emoji are a contemporary and extremely popular way to enhance electronic communication. Without rigid semantics attached to them, emoji symbols take on different meanings based on the context of a message. Thus, like the word sense disambiguation task in natural language processing, machines also need to disambiguate the meaning or ‘sense’ of an emoji. In a first step toward achieving this goal, this paper presents EmojiNet, the first machine readable sense inventory for emoji. EmojiNet is a resource enabling systems to link emoji with their context-specific meaning. It is automatically constructed by integrating multiple emoji resources with BabelNet, which is the most comprehensive multilingual sense inventory available to date. The paper discusses its construction, evaluates the automatic resource creation process, and presents a use case where EmojiNet disambiguates emoji usage in tweets. EmojiNet is available online for use at http://emojinet.knoesis.org.
The rights of this work are transferred to the extent transferable according to title 17 U.S.C. 105.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
- 8.
- 9.
- 10.
- 11.
- 12.
- 13.
- 14.
- 15.
References
Emogi research team - 2015 emoji report (2015)
Balasuriya, L., Wijeratne, S., Doran, D., Sheth, A.: Finding street gang members on Twitter. In: The 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2016), vol. 8, San Francisco, CA, USA, pp. 685–692 (08 2016)
Basile, P., Caputo, A., Semeraro, G.: An enhanced lesk word sense disambiguation algorithm through a distributional semantic model. In: COLING, pp. 1591–1600 (2014)
Camacho-Collados, J., Pilehvar, M.T., Navigli, R.: Nasari: a novel approach to a semantically-aware representation of items. In: Proceedings of NAACL, pp. 567–577 (2015)
Davis, M., Edberg, P.: Unicode emoji - unicode technical report #51. Technical report 51(3) (2016)
Dimson, T.: Emojineering part 1: machine learning for emoji trends. Instagram Engineering Blog (2015)
Kelly, R., Watts, L.: Characterising the inventive appropriation of emoji as relationally meaningful in mediated close personal relationships. Experiences of Technology Appropriation: Unanticipated Users, Usage, Circumstances, and Design (2015)
Miller, H., Thebault-Spieker, J., Chang, S., Johnson, I., Terveen, L., Hecht, B.: Blissfully happy or ready to fight: varying interpretations of emoji. In: ICWSM 2016 (2016)
Moro, A., Navigli, R., Tucci, F.M., Passonneau, R.J.: Annotating the MASC corpus with BabelNet. In: LREC, pp. 4214–4219 (2014)
Navigli, R., Ponzetto, S.P.: BabelNet: the automatic construction, evaluation and application of a wide-coverage multilingual semantic network. Artif. Intell. 193, 217–250 (2012)
Novak, P.K., Smailović, J., Sluban, B., Mozetič, I.: Sentiment of emojis. PLOS One 10(12), e0144296 (2015)
Pavalanathan, U., Eisenstein, J.: Emoticons vs. emojis on Twitter: a causal inference approach. arXiv preprint arXiv:1510.08480 (2015)
Rezabek, L., Cochenour, J.: Visual cues in computer-mediated communication: supplementing text with emoticons. J. Vis. Lit. 18(2), 201–215 (1998)
Santos, R.: Java image processing cookbook (2010). http://www.lac.inpe.br/JIPCookbook
SwiftKey, P.: Most-used emoji revealed: Americans love skulls, Brazilians love cats, the French love hearts [blog] (2015). http://bit.ly/2c5biPU
Vasilescu, F., Langlais, P., Lapalme, G.: Evaluating variants of the lesk approach for disambiguating words. In: LREC (2004)
Wang, W., Chen, L., Thirunarayan, K., Sheth, A.P.: Harnessing Twitter big data for automatic emotion identification. In: 2012 International Conference on Privacy, Security, Risk and Trust (PASSAT), 2012 International Conference on Social Computing (SocialCom), pp. 587–592. IEEE (2012)
Wijeratne, S., Balasuriya, L., Doran, D., Sheth, A.: Word embeddings to enhance Twitter gang member profile identification. In: IJCAI Workshop on Semantic Machine Learning (SML 2016), vol. 07, pp. 18–24. CEUR-WS, New York City (2016)
Acknowledgments
We are grateful to Sujan Perera for thought-provoking discussions on the topic. We acknowledge partial support from the National Institute on Drug Abuse (NIDA) Grant No. 5R01DA039454-02: “Trending: Social Media Analysis to Monitor Cannabis and Synthetic Cannabinoid Use”, National Institutes of Health (NIH) award: MH105384-01A1: “Modeling Social Behavior for Healthcare Utilization in Depression”, and Grant No. 2014-PS-PSN-00006 awarded by the Bureau of Justice Assistance. The Bureau of Justice Assistance is a component of the U.S. Department of Justice’s Office of Justice Programs, which also includes the Bureau of Justice Statistics, the National Institute of Justice, the Office of Juvenile Justice and Delinquency Prevention, the Office for Victims of Crime, and the SMART Office. Points of view or opinions in this document are those of the authors and do not necessarily represent the official position or policies of the U.S. Department of Justice, NIH or NIDA.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Wijeratne, S., Balasuriya, L., Sheth, A., Doran, D. (2016). EmojiNet: Building a Machine Readable Sense Inventory for Emoji. In: Spiro, E., Ahn, YY. (eds) Social Informatics. SocInfo 2016. Lecture Notes in Computer Science(), vol 10046. Springer, Cham. https://doi.org/10.1007/978-3-319-47880-7_33
Download citation
DOI: https://doi.org/10.1007/978-3-319-47880-7_33
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-47879-1
Online ISBN: 978-3-319-47880-7
eBook Packages: Computer ScienceComputer Science (R0)