Mining Tourist Preferences with Twice-Learning

  • Chen Zhang
  • Jie Zhang
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7104)


Data mining techniques have been recognized as powerful tools for predictive modeling tourist decision-making process. However, two practical yet important problems have not been resolved by the data miners in empirical tourism research. Firstly, comprehensibility-the role of the data mining should not only generate accurate predictions, but also provide insights why certain prediction is made. But most widely used data mining methods that can generalize well are black-box in nature and can provide little information on the tourist decision-making facts. Secondly, the lack of training samples-it is usually rather difficult to collect enough training samples through surveying the tourist on site, especially for surveying the tourist’s decision-making facts. Many data mining methods may not achieve satisfactory performance if learned on small data set. In this paper, we show that these two problems can be addressed simultaneously using a twice-learning framework on the travel preference data. The results indicate that by addressing these two problems properly, we can predict tourist preferences accurately as well as extracting meaningful insights which would be useful for tourism marketing.


Target Concept Gain Ratio Data Mining Method Data Mining Approach Neural Network Ensemble 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Au, N., Law, R.: Categorical classification of tourism dining. Annals of Tourism Research 29, 819–833 (2002)CrossRefGoogle Scholar
  2. 2.
    Gibson, H., Yiannakis, A.: Tourist roles: Needs and the lifecourse. Annals of Tourism Research 29, 358–383 (2002)CrossRefGoogle Scholar
  3. 3.
    Hansen, L.K., Salamon, P.: Neural network ensembles. IEEE Trans. Pattern Analysis and Machine Intelligence 12, 993–1001 (1990)CrossRefGoogle Scholar
  4. 4.
    Jiang, Y., Li, M., Zhou, Z.-H.: Generation of Comprehensible Hypotheses from Gene Expression Data. In: Li, J., Yang, Q., Tan, A.-H. (eds.) BioDM 2006. LNCS (LNBI), vol. 3916, pp. 116–123. Springer, Heidelberg (2006)CrossRefGoogle Scholar
  5. 5.
    Kim, H., Gu, Z.: Financial features of dividend-paying firms in the hospitality industry: A logistic regression analysis. International Journal of Hospitality Management 28, 359–366 (2009)CrossRefGoogle Scholar
  6. 6.
    Kon, S.C., Turner, W.L.: Neural network forecasting of tourism demand. Tourism Economics 11, 301–328 (2005)CrossRefGoogle Scholar
  7. 7.
    Law, R., Au, N.: A neural network model to forecast Japanese demand for travel to Hong Kong. Tourism Management 20, 89–97 (1999)CrossRefGoogle Scholar
  8. 8.
    Pai, P.F., Hong, W.C., Chang, P.T., Chen, C.T.: The application of support vector machines to forecast tourist arrivals in Barbados: An empirical study. International Journal of Management 23, 375–385 (2006)Google Scholar
  9. 9.
    Palmer, A., Jose Montano, J.J., Sese, A.: Designing an artificial neural network for forecasting tourism time-series. Tourism Management 27, 781–790 (2006)CrossRefGoogle Scholar
  10. 10.
    Quinlan, J.R.: C4.5: Programs for machine learning. Morgan Kaufmann, San Mateo (1993)Google Scholar
  11. 11.
    Song, H., Li, G.: Tourism demand modeling and forecasting A review of recent research. Tourism Management 29, 203–220 (2008)CrossRefGoogle Scholar
  12. 12.
    Cao, L.: In-depth Behavior Understanding and Use: the Behavior Informatics Approach. Information Science 180(17), 3067–3085 (2010)CrossRefGoogle Scholar
  13. 13.
    Witten, I., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann, San Francisco (2000)Google Scholar
  14. 14.
    Wong, J.-Y., Yeh, C.: Tourism hesitation in destination decision making. Annals of Tourism Research 36, 6–23 (2009)CrossRefGoogle Scholar
  15. 15.
    Wong, K.K.F., Song, H., Chon, K.S.: Bayesian models for tourism demand forecasting. Tourism Management 27, 773–780 (2006)CrossRefGoogle Scholar
  16. 16.
    World Tourism Organization. Chinese outbound tourism, Madrid (2003)Google Scholar
  17. 17.
    Zhou, Z.-H., Jiang, Y.: Medical diagnosis with C4.5 rule preceded by artificial neural network ensemble. IEEE Transactions on Information Technology in Biomedicine 7, 37–42 (2003)CrossRefGoogle Scholar
  18. 18.
    Zhou, Z.-H., Jiang, Y.: NeC4.5: neural ensemble based C4.5. IEEE Transactionson Knowledge and Data Engineering 16, 770–773 (2004)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Chen Zhang
    • 1
  • Jie Zhang
    • 1
  1. 1.Department of Land Resources and Tourism SciencesNanjing UniversityNanjingChina

Personalised recommendations