Solving data preprocessing problems in existing location-aware systems

  • Toly Chen
  • Katsuhiro Honda
Original Research


Location-aware services, or location-based services, are widely available and guide users to suitable service locations by considering distance and other contextual information. Despite the success stories reported by previous research, the formulae developed for evaluating the utility of a service location in existing location-aware service systems have discrepancies. Examining several representative cases revealed that most of these discrepancies were caused by improper data preprocessing, including huge data, incomplete data normalization, subjective data linearization or nonlinearization, biased weight adjustment, and information-loss discretization. This study reviews these discrepancies and provides corrections for overcoming them.


Location-based service Utility Normalization Linearization Nonlinearization Big data 



This study was supported by the Ministry of Science and Technology, Taiwan.


  1. Chandel MSS, Bhanarkar MP (2015) Identifying feasible central meeting location from multiple geo-points. Int J Emerg Trends Sci Technol 2(3):1969–1972Google Scholar
  2. Charansiriphaisan K, Chiewchanwattana S, Sunat K (2011) The expelling gaussian-base kernel fuzzy c-means algorithm. In: Proceedings of the third international conference on knowledge and smart technologies, pp 44–50Google Scholar
  3. Chen T (2015a) Creating a just-in-time location-aware service using fuzzy logic. Appl Spatial Anal Policy (in press)Google Scholar
  4. Chen T (2015b) A fuzzy integer-nonlinear programming approach for creating a flexible just-in-time location-aware service in a mobile environment. Appl Soft Comput (in press)Google Scholar
  5. Chen T, Wu HC (2013) Finding the just-in-time service location and path in a ubiquitous service network. Int J Internet Manuf Serv 3(2):137–147Google Scholar
  6. Chen T, Wang Y-C, Tsai H-R (2009) Lot cycle time prediction in a ramping-up semiconductor manufacturing factory with a SOM-FBPN-ensemble approach with multiple buckets and partial normalization. Int J Adv Manuf Technol 42(11–12):1206–1216CrossRefGoogle Scholar
  7. Chen T, Wang YC, Lin Z (2014a) Predictive distant operation and virtual control of computer numerical control machines. J Intell Manuf (in press)Google Scholar
  8. Chen Z, Xia JC, Irawan B, Caulfied C (2014b) Development of location-based services for recommending departure stations to park and ride users. Transp Res Part C 48:256–268CrossRefGoogle Scholar
  9. Chow CY, Mokbel MF, Liu X (2006) A peer-to-peer spatial cloaking algorithm for anonymous location-based service. In: Proceedings of the 14th Annual ACM International Symposium on Advances in Geographic Information Systems, pp 171–178Google Scholar
  10. Cristianini N, Shawe-Taylor J (2000) An introduction to support vector machines and other kernel-based learning methods. Cambridge University Press, CambridgeCrossRefzbMATHGoogle Scholar
  11. Etzkorn B (2011) Data normalization and standardization.
  12. Hofmann T, Schölkopf B, Smola AJ (2008) Kernel methods in machine learning. Ann Stat 36(3):1171–1220MathSciNetCrossRefzbMATHGoogle Scholar
  13. Honda K, Notsu A, Ichihashi H (2010) Fuzzy pca-guided robust k-means clustering. IEEE Trans Fuzzy Syst 18(1):67–79CrossRefGoogle Scholar
  14. Kotsiantis S, Kanellopoulos D, Pintelas P (2006) Data preprocessing for supervised leaning. Int J Computer Sci 1(2):111–117Google Scholar
  15. Krevl A, Ciglarič M (2006) A framework for developing distributed location based applications. In: 20th International parallel and distributed processing symposium, Rhodes Island, Greece, pp 25–29Google Scholar
  16. Kuo MH, Chen LC, Liang CW (2009) Building and evaluating a location-based service recommendation system with a preference adjustment mechanism. Expert Syst Appl 36(2):3543–3554CrossRefGoogle Scholar
  17. Lin YC, Chen T (2013) A biobjective fuzzy integer-nonlinear programming approach for creating an intelligent location-aware service. J Appl Math 1–11 (article ID 423415)Google Scholar
  18. Mamdani EH (1977) Application of fuzzy logic to approximate reasoning using linguistic synthesis. IEEE Trans Computer C 26(12):1182–1191Google Scholar
  19. Mateo RMA, Lee M, Joo SC, Lee J (2006) Location-aware data mining for mobile users based on neuro-fuzzy system. Lecture Notes in Computer Science, vol 4223, pp 1269–1278Google Scholar
  20. Miller GA (1956) The magical number seven, plus or minus two: some limits on our capacity for processing information. Psychol Rev 63(2):81–97CrossRefGoogle Scholar
  21. Miyamoto S, Ichihashi H, Honda K (2008) Algorithms for fuzzy clustering. Springer, New YorkzbMATHGoogle Scholar
  22. Otebolaku AM, Andrade MT (2015) Context-aware media recommendations for smart devices. J Ambient Intell Humaniz Comput 6(1):13–36CrossRefGoogle Scholar
  23. Reinhardt A, Englert F, Christin D (2013) Enhancing user privacy by preprocessing distributed smart meter data. In: The third IFIP conference on sustainable internet and ICT for sustainability, pp 1–7Google Scholar
  24. Rockette HE, Gur D (2008) Selection of a rating scale in receiver operating characteristic studies: some remaining issues. Acad Radiol 15(2):245–248CrossRefGoogle Scholar
  25. Savage NS, Baranski M, Chavez NE, Höllerer T (2012) I’m feeling LoCo: a location based context aware recommendation system. Adv Location Based Serv 37–54Google Scholar
  26. Schölkopf B, Smola A, Müller K-R (1998) Nonlinear component analysis as a kernel eigenvalue problem. Neural Comput 10(5):1299–1319CrossRefGoogle Scholar
  27. Sola J, Sevilla J (1997) Importance of input data normalization for the application of neural networks to complex industrial problems. IEEE Trans Nucl Sci 44(3):1464–1468CrossRefGoogle Scholar
  28. Sorte WF Jr (2014) Standardisation for incremental innovation: a case study in the Brazilian automobile industry. Int J Lean Enterprise Res 1(1):81–93CrossRefGoogle Scholar
  29. Takagi T, Sugeno M (1985) Fuzzy identification of systems and its applications to modeling and control. IEEE Trans Syst Man Cybern SMC 15(1):116–132CrossRefzbMATHGoogle Scholar
  30. Tsai HR, Chen T (2014) Enhancing the sustainability of a location-aware service through optimization. Sustainability 6:9441–9455CrossRefGoogle Scholar
  31. Witten IH, Frank E, Hall MA (2011) Data mining: practical machine learning tools and techniques. Morgan Kaufmann, BurlingtonGoogle Scholar
  32. Yang M-S, Tsai H-S (2008) A gaussian kernel-based fuzzy c-means algorithm with a spatial bias correction. Pattern Recogn Lett 29:1713–1725CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2015

Authors and Affiliations

  1. 1.Department of Industrial Engineering and Systems ManagementFeng Chia UniversityTaichungTaiwan
  2. 2.Graduate School of EngineeringOsaka Prefecture UniversitySakaiJapan

Personalised recommendations