Abstract
Spatial analysis is the core of geographic information system (GIS), of which, spatial overlay of vector data is a major job. Computational intensity of the spatial overlay has a direct effect on the overall performance of the GIS. High precision modeling for the computational intensity and analysis of the vector data overlay has been a challenging task. Thus, the paper proposes a novel approach, which utilizes self-learning and self-training features of optimized random forest algorithm to the vector data overlay analysis. Simulation experiments show that the proposed model is superior to non-optimized random forest algorithm and support vector machine model with higher prediction precision and is also capable of eliminate redundant computational intensity features.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Chang, K.T.: Geographic Information System. Wiley, Hoboken (2006)
Wang, S., Armstrong, M.P.: A theoretical approach to the use of cyberinfrastructure in geographical analysis. Int. J. Geogr. Inf. Sci. 23(2), 169–193 (2009)
Shook, E., Wang, S., Tang, W.: A communication-aware framework for parallel spatially explicit agent-based models. Int. J. Geogr. Inf. Sci. 27(11), 2160–2181 (2013)
Zhao, Y., Padmanabhan, A., Wang, S.: A parallel computing approach to viewshed analysis of large terrain data using graphics processing units. Int. J. Geogr. Inf. Sci. 27(2), 363–384 (2013)
Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001)
Nguyen, T.T., Huang, J.Z., Nguyen, T.T.: Unbiased feature selection in learning random forests for high-dimensional data. Sci. World J. 2015 (2015)
Goodchild, M.F.: A spatial analytical perspective on geographical information systems. Int. J. Geogr. Inf. Syst. 1(4), 327–334 (1987)
XueLian, H., ChuanYong, Y., JingZu, L.: Research and application of vector data overlay analysis based on ArcGIS engine. Urban Geotech. Invest. Surv. 3, 014 (2010)
Armstrong, M.P., Densham, P.J.: Domain decomposition for parallel processing of spatial problems. Comput. Environ. Urban Syst. 16(6), 497–513 (1992)
Zhao, Y., Padmanabhan, A., Wang, S.: A parallel computing approach to viewshed analysis of large terrain data using graphics processing units. Int. J. Geogr. Information Sci. 27(2), 363–384 (2013)
Du, S., Wang, X., Feng, C.C., et al.: Classifying natural-language spatial relation terms with random forest algorithm. Int. J. Geogr. Inf. Sci. 31(3), 542–568 (2017)
Ding, S., Chen, L.: Intelligent optimization methods for high-dimensional data classification for support vector machines. Intell. Inf. Manag. 2(6), 354–364 (2010)
Delmelle, E., Dony, C., Casas, I., et al.: Visualizing the impact of space-time uncertainties on dengue fever patterns. Int. J. Geogr. Inf. Sci. 28(5), 1107–1127 (2014)
Wang, S.: A CyberGIS framework for the synthesis of cyberinfrastructure, GIS, and spatial analysis. Ann. Assoc. Am. Geogr. 100(3), 535–557 (2010)
Yang, C., Goodchild, M., Huang, Q., et al.: Spatial cloud computing: how can the geospatial sciences use and help shape cloud computing? Int. J. Digit. Earth 4(4), 305–329 (2011)
Fang, K.N., Wu, J.B., Zhu, J.P., et al.: A review of technologies on random forests. Stat. Inf. Forum 26(3), 32–38 (2011)
Boulesteix, A.L., Janitza, S., Kruppa, J., et al.: Overview of random forest methodology and practical guidance with emphasis on computational biology and bioinformatics. Wiley Interdisc. Rev.: Data Mining Knowl. Discov. 2(6), 493–507 (2002)
Cheng, G., Jing, N., Chen, L.: A theoretical approach to domain decomposition for parallelization of Digital Terrain Analysis. Ann. GIS 19(1), 45–52 (2013)
Nicodemus, K.K.: Letter to the Editor: On the stability and ranking of predictors from random forest variable importance measure. Brief. Bioinf. 12(4), 369–373 (2011)
Acknowledgements
Thanks to the support by the National Natural Science Foundation of China (No. 41271387)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Wang, Q., Cao, H., Guo, YH. (2017). Computational Intensity Prediction Model of Vector Data Overlay with Random Forest Method. In: Zou, B., Li, M., Wang, H., Song, X., Xie, W., Lu, Z. (eds) Data Science. ICPCSEE 2017. Communications in Computer and Information Science, vol 727. Springer, Singapore. https://doi.org/10.1007/978-981-10-6385-5_49
Download citation
DOI: https://doi.org/10.1007/978-981-10-6385-5_49
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-6384-8
Online ISBN: 978-981-10-6385-5
eBook Packages: Computer ScienceComputer Science (R0)