Skip to main content

Computational Intensity Prediction Model of Vector Data Overlay with Random Forest Method

  • Conference paper
  • First Online:
Data Science (ICPCSEE 2017)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 727))

  • 2528 Accesses

Abstract

Spatial analysis is the core of geographic information system (GIS), of which, spatial overlay of vector data is a major job. Computational intensity of the spatial overlay has a direct effect on the overall performance of the GIS. High precision modeling for the computational intensity and analysis of the vector data overlay has been a challenging task. Thus, the paper proposes a novel approach, which utilizes self-learning and self-training features of optimized random forest algorithm to the vector data overlay analysis. Simulation experiments show that the proposed model is superior to non-optimized random forest algorithm and support vector machine model with higher prediction precision and is also capable of eliminate redundant computational intensity features.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Chang, K.T.: Geographic Information System. Wiley, Hoboken (2006)

    Google Scholar 

  2. Wang, S., Armstrong, M.P.: A theoretical approach to the use of cyberinfrastructure in geographical analysis. Int. J. Geogr. Inf. Sci. 23(2), 169–193 (2009)

    Article  Google Scholar 

  3. Shook, E., Wang, S., Tang, W.: A communication-aware framework for parallel spatially explicit agent-based models. Int. J. Geogr. Inf. Sci. 27(11), 2160–2181 (2013)

    Article  Google Scholar 

  4. Zhao, Y., Padmanabhan, A., Wang, S.: A parallel computing approach to viewshed analysis of large terrain data using graphics processing units. Int. J. Geogr. Inf. Sci. 27(2), 363–384 (2013)

    Article  Google Scholar 

  5. Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001)

    Article  MATH  Google Scholar 

  6. Nguyen, T.T., Huang, J.Z., Nguyen, T.T.: Unbiased feature selection in learning random forests for high-dimensional data. Sci. World J. 2015 (2015)

    Google Scholar 

  7. Goodchild, M.F.: A spatial analytical perspective on geographical information systems. Int. J. Geogr. Inf. Syst. 1(4), 327–334 (1987)

    Article  Google Scholar 

  8. XueLian, H., ChuanYong, Y., JingZu, L.: Research and application of vector data overlay analysis based on ArcGIS engine. Urban Geotech. Invest. Surv. 3, 014 (2010)

    Google Scholar 

  9. Armstrong, M.P., Densham, P.J.: Domain decomposition for parallel processing of spatial problems. Comput. Environ. Urban Syst. 16(6), 497–513 (1992)

    Article  Google Scholar 

  10. Zhao, Y., Padmanabhan, A., Wang, S.: A parallel computing approach to viewshed analysis of large terrain data using graphics processing units. Int. J. Geogr. Information Sci. 27(2), 363–384 (2013)

    Article  Google Scholar 

  11. Du, S., Wang, X., Feng, C.C., et al.: Classifying natural-language spatial relation terms with random forest algorithm. Int. J. Geogr. Inf. Sci. 31(3), 542–568 (2017)

    Article  Google Scholar 

  12. Ding, S., Chen, L.: Intelligent optimization methods for high-dimensional data classification for support vector machines. Intell. Inf. Manag. 2(6), 354–364 (2010)

    Google Scholar 

  13. Delmelle, E., Dony, C., Casas, I., et al.: Visualizing the impact of space-time uncertainties on dengue fever patterns. Int. J. Geogr. Inf. Sci. 28(5), 1107–1127 (2014)

    Article  Google Scholar 

  14. Wang, S.: A CyberGIS framework for the synthesis of cyberinfrastructure, GIS, and spatial analysis. Ann. Assoc. Am. Geogr. 100(3), 535–557 (2010)

    Article  Google Scholar 

  15. Yang, C., Goodchild, M., Huang, Q., et al.: Spatial cloud computing: how can the geospatial sciences use and help shape cloud computing? Int. J. Digit. Earth 4(4), 305–329 (2011)

    Article  Google Scholar 

  16. Fang, K.N., Wu, J.B., Zhu, J.P., et al.: A review of technologies on random forests. Stat. Inf. Forum 26(3), 32–38 (2011)

    Google Scholar 

  17. Boulesteix, A.L., Janitza, S., Kruppa, J., et al.: Overview of random forest methodology and practical guidance with emphasis on computational biology and bioinformatics. Wiley Interdisc. Rev.: Data Mining Knowl. Discov. 2(6), 493–507 (2002)

    Google Scholar 

  18. Cheng, G., Jing, N., Chen, L.: A theoretical approach to domain decomposition for parallelization of Digital Terrain Analysis. Ann. GIS 19(1), 45–52 (2013)

    Article  Google Scholar 

  19. Nicodemus, K.K.: Letter to the Editor: On the stability and ranking of predictors from random forest variable importance measure. Brief. Bioinf. 12(4), 369–373 (2011)

    Article  Google Scholar 

Download references

Acknowledgements

Thanks to the support by the National Natural Science Foundation of China (No. 41271387)

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Han Cao .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Wang, Q., Cao, H., Guo, YH. (2017). Computational Intensity Prediction Model of Vector Data Overlay with Random Forest Method. In: Zou, B., Li, M., Wang, H., Song, X., Xie, W., Lu, Z. (eds) Data Science. ICPCSEE 2017. Communications in Computer and Information Science, vol 727. Springer, Singapore. https://doi.org/10.1007/978-981-10-6385-5_49

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-6385-5_49

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-6384-8

  • Online ISBN: 978-981-10-6385-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics