Abstract
Herein, we propose and discuss a new method for analyzing various types of tourist information about the Suwa area of Nagano Prefecture, Japan, available on the Internet. This information includes not only long sentences that can be found on web pages and in blogs, but also short sentences comprising a few words posted on social media. In this paper, we propose a novel method based on a neural network, called paragraph vector, for expressing relationships between words included in sentences. Our method achieves high retrieval accuracy even across social media posts comprising just a few words. Based on our evaluation results, the proposed method outperforms the conventional information retrieval technique wherein sufficient accuracy cannot be achieved as it is based on the occurrence probability of words in sentences. This improvement is achieved by using the word order as an input feature to the neural network model.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 1107–1135 (2003)
Le, Q., Mikolov, T.: Distributed representations of sentences and documents. In: Proceedings of International Conference on Machine Learning, Beijing, China, vol. 32 (2014)
Topic modelling for humans. https://radimrehurek.com/gensim/
Hara, Niizuma, Ota: Research on Effective Parameter Search Method for Paragraph Vector. DEIM Forum (2017)
Acknowledgements
This work was partly supported by MEXT KAKENHI Grant Number 17K01149
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Tsuchiya, T., Hirose, H., Miyosawa, T., Yamada, T., Sawano, H., Koyanagi, K. (2018). Analysis of Diverse Tourist Information Distributed Across the Internet. In: Dang, T., Küng, J., Wagner, R., Thoai, N., Takizawa, M. (eds) Future Data and Security Engineering. FDSE 2018. Lecture Notes in Computer Science(), vol 11251. Springer, Cham. https://doi.org/10.1007/978-3-030-03192-3_31
Download citation
DOI: https://doi.org/10.1007/978-3-030-03192-3_31
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-03191-6
Online ISBN: 978-3-030-03192-3
eBook Packages: Computer ScienceComputer Science (R0)