Skip to main content

Analysis of Diverse Tourist Information Distributed Across the Internet

  • Conference paper
  • First Online:
Future Data and Security Engineering (FDSE 2018)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11251))

Included in the following conference series:

  • 939 Accesses

Abstract

Herein, we propose and discuss a new method for analyzing various types of tourist information about the Suwa area of Nagano Prefecture, Japan, available on the Internet. This information includes not only long sentences that can be found on web pages and in blogs, but also short sentences comprising a few words posted on social media. In this paper, we propose a novel method based on a neural network, called paragraph vector, for expressing relationships between words included in sentences. Our method achieves high retrieval accuracy even across social media posts comprising just a few words. Based on our evaluation results, the proposed method outperforms the conventional information retrieval technique wherein sufficient accuracy cannot be achieved as it is based on the occurrence probability of words in sentences. This improvement is achieved by using the word order as an input feature to the neural network model.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 1107–1135 (2003)

    MATH  Google Scholar 

  2. https://www.tripadvisor.jp/

  3. Le, Q., Mikolov, T.: Distributed representations of sentences and documents. In: Proceedings of International Conference on Machine Learning, Beijing, China, vol. 32 (2014)

    Google Scholar 

  4. Topic modelling for humans. https://radimrehurek.com/gensim/

  5. Hara, Niizuma, Ota: Research on Effective Parameter Search Method for Paragraph Vector. DEIM Forum (2017)

    Google Scholar 

Download references

Acknowledgements

This work was partly supported by MEXT KAKENHI Grant Number 17K01149

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Takeshi Tsuchiya .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Tsuchiya, T., Hirose, H., Miyosawa, T., Yamada, T., Sawano, H., Koyanagi, K. (2018). Analysis of Diverse Tourist Information Distributed Across the Internet. In: Dang, T., Küng, J., Wagner, R., Thoai, N., Takizawa, M. (eds) Future Data and Security Engineering. FDSE 2018. Lecture Notes in Computer Science(), vol 11251. Springer, Cham. https://doi.org/10.1007/978-3-030-03192-3_31

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-03192-3_31

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-03191-6

  • Online ISBN: 978-3-030-03192-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics