On Predicting Geolocation of Tweets Using Convolutional Neural Networks

  • Binxuan Huang
  • Kathleen M. Carley
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10354)

Abstract

In many Twitter studies, it is important to know where a tweet came from in order to use the tweet content to study regional user behavior. However, researchers using Twitter to understand user behavior often lack sufficient geo-tagged data. Given the huge volume of Twitter data, there is a need for accurate, automated geolocation solutions. Herein, we present a new method to predict a Twitter user’s location based on the information in a single tweet. We integrate text and user profile meta-data into a single model using a convolutional neural network. Our experiments demonstrate that our neural model substantially outperforms baseline methods, achieving 52.8% and 92.1% accuracy on city-level and country-level prediction, respectively.
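
The abstract does not specify the architecture, so the following is only a generic sketch of how several text fields could be combined in one convolutional model. It assumes each field is already embedded as a token-by-dimension matrix, applies a convolution with max-over-time pooling per field, concatenates the pooled features, and outputs a probability for each candidate location. All field names, sizes, and random weights are hypothetical and chosen for illustration; this is not the authors' implementation.

```python
import numpy as np

# All sizes below are hypothetical, chosen only to keep the sketch small.
EMBED_DIM = 8      # word-embedding dimension
NUM_FILTERS = 4    # convolution filters per field
WINDOW = 3         # tokens covered by each filter
NUM_CITIES = 5     # candidate locations


def conv_max_pool(embeddings, filters, bias):
    """Convolve over token positions, apply ReLU, then max-pool over time.

    embeddings: (num_tokens, EMBED_DIM)
    filters:    (NUM_FILTERS, WINDOW, EMBED_DIM)
    returns:    (NUM_FILTERS,)
    """
    num_tokens = embeddings.shape[0]
    window = filters.shape[1]
    if num_tokens < window:
        # Zero-pad short fields so at least one window exists.
        pad = np.zeros((window - num_tokens, embeddings.shape[1]))
        embeddings = np.vstack([embeddings, pad])
        num_tokens = window
    # Stack every contiguous window: (positions, WINDOW, EMBED_DIM).
    windows = np.stack(
        [embeddings[i:i + window] for i in range(num_tokens - window + 1)]
    )
    feature_maps = np.einsum("pwd,fwd->pf", windows, filters) + bias
    return np.maximum(feature_maps, 0.0).max(axis=0)


def softmax(logits):
    exp = np.exp(logits - logits.max())  # shift for numerical stability
    return exp / exp.sum()


def predict_location(fields, params):
    """Map a dict of embedded fields to a probability per candidate location."""
    pooled = [
        conv_max_pool(fields[name], *params["conv"][name])
        for name in sorted(fields)
    ]
    features = np.concatenate(pooled)
    return softmax(params["W"] @ features + params["b"])


# Hypothetical fields: the tweet text plus two profile fields.
rng = np.random.default_rng(0)
field_names = ["profile_description", "profile_location", "tweet_text"]
params = {
    "conv": {
        name: (rng.normal(size=(NUM_FILTERS, WINDOW, EMBED_DIM)),
               np.zeros(NUM_FILTERS))
        for name in field_names
    },
    "W": rng.normal(size=(NUM_CITIES, NUM_FILTERS * len(field_names))),
    "b": np.zeros(NUM_CITIES),
}
fields = {
    name: rng.normal(size=(length, EMBED_DIM))
    for name, length in zip(field_names, (6, 2, 10))
}
probs = predict_location(fields, params)
print(probs.argmax(), probs)
```

With untrained random weights the output is meaningless as a prediction; the sketch only shows the data flow from several text fields to one probability vector over locations.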

Keywords

Convolutional Neural Network, Location Prediction, Output Probability, Twitter User, Text Field
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Notes

Acknowledgments

This work was supported in part by the Office of Naval Research (ONR) N000140811186, and the National Science Foundation (NSF) 00361150115291. The views and conclusions contained in this document are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of the ONR or the NSF. We want to thank tutors in the Global Communication Center at Carnegie Mellon for their valuable advice.

Copyright information

© Springer International Publishing AG 2017

Authors and Affiliations

  1. School of Computer Science, Carnegie Mellon University, Pittsburgh, USA
