A multilayer recognition model for Twitter user geolocation
Geolocation is important for many emerging applications, such as disaster management and recommendation systems. In this paper, we propose a multilayer recognition model (MRM) that predicts the city-level location of social network users based solely on their tweet content. Through a series of optimizations, including entity selection, spatial clustering, and outlier filtering, suitable features are extracted to model the geographic coordinates of Twitter users. A Multinomial Naive Bayes classifier is then applied to classify the datasets into different groups. The model is evaluated by comparing it with an existing algorithm on Twitter datasets. The experimental results show that our method achieves a better prediction accuracy of 54.82% on the test set, and the average error is reduced to 400.97 miles at best.
Keywords: Twitter, Geolocation, Spatial clustering, Text classification
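The classification and evaluation steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' MRM implementation: it omits entity selection, spatial clustering, and outlier filtering, and the toy tweets, the city labels, the coordinates, and the names `tokenize`, `MultinomialNB`, and `haversine_miles` are all assumptions made for illustration. It shows only a plain Multinomial Naive Bayes classifier with add-one smoothing over tweet words, plus a great-circle distance in miles of the kind an average-error figure could be based on.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase and split on whitespace (a stand-in for real feature extraction)."""
    return text.lower().split()


class MultinomialNB:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, docs, labels):
        self.class_counts = Counter(labels)        # documents per city
        self.word_counts = defaultdict(Counter)    # word frequencies per city
        for doc, label in zip(docs, labels):
            self.word_counts[label].update(tokenize(doc))
        self.vocab = {w for c in self.word_counts.values() for w in c}
        return self

    def predict(self, doc):
        total_docs = sum(self.class_counts.values())
        vocab_size = len(self.vocab)
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_counts.items():
            counts = self.word_counts[label]
            total_words = sum(counts.values())
            score = math.log(n_docs / total_docs)  # log prior
            for word in tokenize(doc):
                if word in self.vocab:  # skip words never seen in training
                    score += math.log((counts[word] + 1) / (total_words + vocab_size))
            if score > best_score:
                best_label, best_score = label, score
        return best_label


def haversine_miles(a, b):
    """Great-circle distance in miles between two (lat, lon) pairs in degrees."""
    lat1, lon1, lat2, lon2 = map(math.radians, (*a, *b))
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * 3958.8 * math.asin(math.sqrt(h))  # 3958.8 = mean Earth radius in miles


# Hypothetical toy data: each "document" is one user's concatenated tweets.
CITY_COORDS = {"New York": (40.7128, -74.0060), "Los Angeles": (34.0522, -118.2437)}
train_docs = [
    "subway broadway yankees",
    "brooklyn subway bagel",
    "hollywood beach lakers",
    "lakers dodgers freeway",
]
train_labels = ["New York", "New York", "Los Angeles", "Los Angeles"]

model = MultinomialNB().fit(train_docs, train_labels)

predicted = model.predict("stuck on the subway again")
true_city = "Los Angeles"  # pretend ground truth, to show a nonzero error
error = haversine_miles(CITY_COORDS[predicted], CITY_COORDS[true_city])
print(predicted, round(error, 1))
```

Averaging `haversine_miles` over all test users would give a mean error distance, while the fraction of users whose predicted city matches the true one would give a city-level accuracy.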
This work was partially supported by the China National Science and Technology Major Project (2017ZX03001015, 2018ZX03001015, and 2018ZX03001021). It was also supported by the Chinese Academy of Sciences project under Grant No. CXJJ-16M119.