Transportation in Social Media: An Automatic Classifier for Travel-Related Tweets

  • João PereiraEmail author
  • Arian Pasquali
  • Pedro Saleiro
  • Rosaldo Rossetti
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10423)


In the last years researchers in the field of intelligent transportation systems have made several efforts to extract valuable information from social media streams. However, collecting domain-specific data from any social media is a challenging task demanding appropriate and robust classification methods. In this work we focus on exploring geo-located tweets in order to create a travel-related tweet classifier using a combination of bag-of-words and word embeddings. The resulting classification makes possible the identification of interesting spatio-temporal relations in São Paulo and Rio de Janeiro.


Geo-located Twitter Transportation Text classification 



This work was partially supported by Project “TEC4Growth - Pervasive Intelligence, Enhancers and Proofs of Concept with Industrial Impact/NORTE-01-0145-FEDER-000020”.


  1. 1.
    Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: NIPS (2013)Google Scholar
  2. 2.
    Saleiro, P., Amir, S., Silva, M., Soares, C.: Popmine: tracking political opinion on the web. In: 2015 IEEE International Conference on Computer and Information Technology; Ubiquitous Computing and Communications; Dependable, Autonomic and Secure Computing; Pervasive Intelligence and Computing (CIT/IUCC/DASC/PICOM), pp. 1521–1526. IEEE (2015)Google Scholar
  3. 3.
    Saleiro, P., Gomes, L., Soares, C.: Sentiment aggregate functions for political opinion polling using microblog streams. In: Proceedings of the Ninth International C* Conference on Computer Science & Software Engineering, pp. 44–50. ACM (2016)Google Scholar
  4. 4.
    Saleiro, P., Rodrigues, E.M., Soares, C., Oliveira, E.: FEUP at SemEval-2017 task 5: predicting sentiment polarity and intensity with financial word embeddings. In: Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017), pp. 895–899. Association for Computational Linguistics, Vancouver, August 2017Google Scholar
  5. 5.
    Saleiro, P., Soares, C.: Learning from the news: predicting entity popularity on Twitter. In: Boström, H., Knobbe, A., Soares, C., Papapetrou, P. (eds.) IDA 2016. LNCS, vol. 9897, pp. 171–182. Springer, Cham (2016). doi: 10.1007/978-3-319-46349-0_15CrossRefGoogle Scholar
  6. 6.
    Kokkinogenis, Z., Filguieras, J., Carvalho, S., Sarmento, L., Rossetti, R.J.F.: Mobility network evaluation in the user perspective: real-time sensing of traffic information in Twitter messages. In: Advances in Artificial Transportation Systems and Simulation, pp. 219–234 (2015)CrossRefGoogle Scholar
  7. 7.
    Rashidi, T.H., Abbasi, A., Maghrebi, M., Hasan, S., Waller, T.S.: Exploring the capacity of social media data for modelling travel behaviour. Transp. Res. Part C: Emerg. Technol. 75, 197–211 (2017)CrossRefGoogle Scholar
  8. 8.
    Ulloa, D., Saleiro, P., Rossetti, R.J., Silva, E.R.: Mining social media for open innovation in transportation systems. In: 2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC), pp. 169–174. IEEE (2016)Google Scholar
  9. 9.
    Kurkcu, A., Ozbay, K., Morgul, E.F.: Evaluating the usability of geo-located twitter as a tool for human activity and mobility patterns: a case study for New York city. In: Transportation Research Board 95th Annual Meeting, no. 16–3901 (2016)Google Scholar
  10. 10.
    Maghrebi, M., Abbasi, A., Waller, S.T.: Transportation application of social media: travel mode extraction. In: 2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC), pp. 1648–1653. IEEE (2016)Google Scholar
  11. 11.
    Mai, E., Hranac, R.: Twitter interactions as a data source for transportation incidents. In: Proceedings of Transportation Research Board 92nd Annual Meeting, no. 13–1636 (2013)Google Scholar
  12. 12.
    Rebelo, F., Soares, C., Rossetti, R.J.F.: TwitterJam: identification of mobility patterns in urban centers based on tweets. In: 2015 IEEE First International Smart Cities Conference (ISC2), pp. 1–6. IEEE (2015)Google Scholar
  13. 13.
    Carvalho, S., Sarmento, L., Rossetti, R.J.F.: Real-time sensing of traffic information in Twitter messages. In: 2010 13th International IEEE Conference on Intelligent Transportation Systems (ITSC 2010) 4th Workshop on Artificial Transportation Systems and Simulation (ATSS), Funchal, Portugal, 19–22 September 2010, pp. 1–4 (2010)Google Scholar
  14. 14.
    Kuflik, T., Minkov, E., Nocera, S., Grant-Muller, S., Gal-Tzur, A., Shoor, I.: Automating a framework to extract and analyse transport related social media content: the potential and the challenges. Transp. Res. Part C: Emerg. Technol. 77, 275–291 (2017)CrossRefGoogle Scholar
  15. 15.
    Le, Q., Mikolov, T.: Distributed representations of sentences and documents. In: Proceedings of the 31st International Conference on Machine Learning (ICML-14), pp. 1188–1196 (2014)Google Scholar
  16. 16.
    Mikolov, T., Yih, W.-T., Zweig, G.: Linguistic regularities in continuous space word representations. In: HLT-NAACL, vol. 13 (2013)Google Scholar

Copyright information

© Springer International Publishing AG 2017

Authors and Affiliations

  • João Pereira
    • 1
    • 2
    Email author
  • Arian Pasquali
    • 3
  • Pedro Saleiro
    • 1
    • 2
  • Rosaldo Rossetti
    • 1
    • 2
  1. 1.FEUPUniversidade do PortoPortoPortugal
  2. 2.LIACCUniversidade do PortoPortoPortugal
  3. 3.INESC TECUniversidade do PortoPortoPortugal

Personalised recommendations