Transportation in Social Media: An Automatic Classifier for Travel-Related Tweets
- 3 Citations
- 2k Downloads
Abstract
In the last years researchers in the field of intelligent transportation systems have made several efforts to extract valuable information from social media streams. However, collecting domain-specific data from any social media is a challenging task demanding appropriate and robust classification methods. In this work we focus on exploring geo-located tweets in order to create a travel-related tweet classifier using a combination of bag-of-words and word embeddings. The resulting classification makes possible the identification of interesting spatio-temporal relations in São Paulo and Rio de Janeiro.
Keywords
Geo-located Twitter Transportation Text classificationNotes
Acknowledgements
This work was partially supported by Project “TEC4Growth - Pervasive Intelligence, Enhancers and Proofs of Concept with Industrial Impact/NORTE-01-0145-FEDER-000020”.
References
- 1.Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: NIPS (2013)Google Scholar
- 2.Saleiro, P., Amir, S., Silva, M., Soares, C.: Popmine: tracking political opinion on the web. In: 2015 IEEE International Conference on Computer and Information Technology; Ubiquitous Computing and Communications; Dependable, Autonomic and Secure Computing; Pervasive Intelligence and Computing (CIT/IUCC/DASC/PICOM), pp. 1521–1526. IEEE (2015)Google Scholar
- 3.Saleiro, P., Gomes, L., Soares, C.: Sentiment aggregate functions for political opinion polling using microblog streams. In: Proceedings of the Ninth International C* Conference on Computer Science & Software Engineering, pp. 44–50. ACM (2016)Google Scholar
- 4.Saleiro, P., Rodrigues, E.M., Soares, C., Oliveira, E.: FEUP at SemEval-2017 task 5: predicting sentiment polarity and intensity with financial word embeddings. In: Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017), pp. 895–899. Association for Computational Linguistics, Vancouver, August 2017Google Scholar
- 5.Saleiro, P., Soares, C.: Learning from the news: predicting entity popularity on Twitter. In: Boström, H., Knobbe, A., Soares, C., Papapetrou, P. (eds.) IDA 2016. LNCS, vol. 9897, pp. 171–182. Springer, Cham (2016). doi: 10.1007/978-3-319-46349-0_15CrossRefGoogle Scholar
- 6.Kokkinogenis, Z., Filguieras, J., Carvalho, S., Sarmento, L., Rossetti, R.J.F.: Mobility network evaluation in the user perspective: real-time sensing of traffic information in Twitter messages. In: Advances in Artificial Transportation Systems and Simulation, pp. 219–234 (2015)CrossRefGoogle Scholar
- 7.Rashidi, T.H., Abbasi, A., Maghrebi, M., Hasan, S., Waller, T.S.: Exploring the capacity of social media data for modelling travel behaviour. Transp. Res. Part C: Emerg. Technol. 75, 197–211 (2017)CrossRefGoogle Scholar
- 8.Ulloa, D., Saleiro, P., Rossetti, R.J., Silva, E.R.: Mining social media for open innovation in transportation systems. In: 2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC), pp. 169–174. IEEE (2016)Google Scholar
- 9.Kurkcu, A., Ozbay, K., Morgul, E.F.: Evaluating the usability of geo-located twitter as a tool for human activity and mobility patterns: a case study for New York city. In: Transportation Research Board 95th Annual Meeting, no. 16–3901 (2016)Google Scholar
- 10.Maghrebi, M., Abbasi, A., Waller, S.T.: Transportation application of social media: travel mode extraction. In: 2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC), pp. 1648–1653. IEEE (2016)Google Scholar
- 11.Mai, E., Hranac, R.: Twitter interactions as a data source for transportation incidents. In: Proceedings of Transportation Research Board 92nd Annual Meeting, no. 13–1636 (2013)Google Scholar
- 12.Rebelo, F., Soares, C., Rossetti, R.J.F.: TwitterJam: identification of mobility patterns in urban centers based on tweets. In: 2015 IEEE First International Smart Cities Conference (ISC2), pp. 1–6. IEEE (2015)Google Scholar
- 13.Carvalho, S., Sarmento, L., Rossetti, R.J.F.: Real-time sensing of traffic information in Twitter messages. In: 2010 13th International IEEE Conference on Intelligent Transportation Systems (ITSC 2010) 4th Workshop on Artificial Transportation Systems and Simulation (ATSS), Funchal, Portugal, 19–22 September 2010, pp. 1–4 (2010)Google Scholar
- 14.Kuflik, T., Minkov, E., Nocera, S., Grant-Muller, S., Gal-Tzur, A., Shoor, I.: Automating a framework to extract and analyse transport related social media content: the potential and the challenges. Transp. Res. Part C: Emerg. Technol. 77, 275–291 (2017)CrossRefGoogle Scholar
- 15.Le, Q., Mikolov, T.: Distributed representations of sentences and documents. In: Proceedings of the 31st International Conference on Machine Learning (ICML-14), pp. 1188–1196 (2014)Google Scholar
- 16.Mikolov, T., Yih, W.-T., Zweig, G.: Linguistic regularities in continuous space word representations. In: HLT-NAACL, vol. 13 (2013)Google Scholar