Abstract
Lidar, photogrammetry, and various other survey technologies enable the collection of massive point clouds. Faced with hundreds of billions or trillions of points the traditional solutions for handling point clouds usually under-perform even for classical loading and retrieving operations. To obtain insight in the features affecting performance the authors carried out single-user tests with different storage models on various systems, including Oracle Spatial and Graph, PostgreSQL-PostGIS, MonetDB and LAStools (during the second half of 2014). In the summer of 2015, the tests are further extended with the latest developments of the systems, including the new version of Point Data Abstraction Library (PDAL) with efficient compression. Web services based on point cloud data are becoming popular and they have requirements that most of the available point cloud data management systems can not fulfil. This means that specific custom-made solutions are constructed. We identify the requirements of these web services and propose a realistic benchmark extension, including multi-user and level-of-detail queries. This helps in defining the future lines of work for more generic point cloud data management systems, supporting such increasingly demanded web services.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Actueel Hoogtebestand Nederland (AHN). (2015). http://www.ahn.nl/.
AHN viewer. (2015). http://ahn.maps.arcgis.com/apps/webappviewer/index.html.
AHN2 3D viewer and download tool. (2015). http://ahn2.pointclouds.nl/.
De Kleijn, M., De Hond, R., Martinez-Rubi, O., & Svetachov, P. (2015). A 3d geographic information system for mapping the via appia. Tech. rep. research memorandum (VU-FEWEB) 2015-1, VU University Amsterdam, Amsterdam, The Netherlands.
Fiore, S., DAnca, A., Palazzo, C., Foster, I., Williams, D., & Aloisio, G. (2013). Ophidia: Toward big data analytics for escience. Procedia Computer Science, 18, 2376 – 2385. http://dx.doi.org/10.1016/j.procs.2013.05.409, http://www.sciencedirect.com/science/article/pii/S1877050913005528. 2013 international conference on computational science.
Group, T.P.G.D. (2014). PostgreSQL 9.3.5 Documentation. Tech. Rep. 9.3.5, The PostgreSQL Global Development Group.
laz-perf. (2015). https://github.com/verma/laz-perf.git.
Mapping the Via Appia in 3D. (2015). http://mappingtheviaappia.nl/4dgis/.
Martinez-Rubi, O., Kersten, M., Goncalves, R., & Ivanova, M. (2014). A column-store meets the point clouds. FOSS4GEurope.
Martinez-Rubi, O., van Oosterom, P., Gonçalves, R., Tijssen, T., Ivanova, M., Kersten, M. L., et al. (2015). Benchmarking and improving point cloud data management in monetdb. SIGSPATIAL Special, 6(2), 11–18. doi:10.1145/2744700.2744702. http://doi.acm.org/10.1145/2744700.2744702.
Oracle Database Online Documentation 12c Release 1 (12.1): Spatial and Graph Developer’s Guide / SDO_PC_PKG Package (Point Clouds). (2014). https://docs.oracle.com/database/121/SPATL/sdo_pc_pkg_ref.htm.
Oracle Exadata Database Machine X4-2. (2015). https://www.oracle.com/engineered-systems/exadata/database-machine-x4-2/index.html.
PDAL. (2015). http://www.pdal.io/.
Python/pandas. (2015). http://pandas.pydata.org/.
Ramsey, P. (2015). A PostgreSQL extension for storing point cloud (LIDAR) data. https://github.com/pramsey/pointcloud.
Rapidlasso GmbH. (2015). http://rapidlasso.com/.
Rapidlasso GmbH LASzip—free and lossless LiDAR compression. (2014). http://www.laszip.org/.
Suijker, P. M., Alkemade, I., Kodde, M. P., & Nonhebel, A. E. (2014). User requirements massive point clouds for eSciences (WP1). Tech. rep., Delft University of Technology. http://repository.tudelft.nl/view/ir/uuid%3A351e0d1e-f473-4651-bf15-8f9b29b7b800/.
van Oosterom, P., Martinez-Rubi, O., Ivanova, M., Horhammer, M., Geringer, D., Ravada, S., Tijssen, T., Kodde, M., & Gonçalves, R. (2015). Massive point cloud data management: Design, implementation and execution of a point cloud benchmark. Computers and Graphics, 49, 92–125. http://dx.doi.org/10.1016/j.cag.2015.01.007, http://www.sciencedirect.com/science/article/pii/S0097849315000084.
van Oosterom, P., & Meijers, M. (2014). Vario-scale data structures supporting smooth zoom and progressive transfer of 2d and 3d data. International Journal of Geographical Information Systems, 28(3), 455–478.
Acknowledgments
We thank all the members of the project Massive Point Clouds for eSciences, which is supported in part by the Netherlands eScience Center under project code 027.012.101. Also special thanks for their assistance to Mike Horhammer, Daniel Geringer, Siva Ravada (all Oracle), Markus Schütz (developer of potree), Martin Isenburg (developer of LAStools), and to Howard Butler, Andrew Bell and the rest of PDAL developers.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendix
Appendix
1.1 Appendix A: Executable Benchmark Data Sets and Queries
1.1.1 Data sets
Tables 6 and 7 contains information on the used data sets in the executable benchmark and their usage in the different stages. Figure 7 shows the extent of the used data sets.
Note that for the full AHN2 we include two versions of the data set. The first one, 639478M was used in our previous full-benchmark execution while the second one, 638860Mc is the one used for in the new execution and does not contain erroneous and duplicate points that are found in the first version. Also note the difference in the data sets sizes. This is due to the fact that for the cleaning process that was required to generate the second (cleaned) version of the full AHN2 data set the points in the files needed to be resorted and that affected dramatically the compression performance of LAZ. In the first version the files were separated according to their nature, in object and terrain files, and that improved the compressor performance. However, as part of the cleaning processes, these files were joined and the compression ratio was affected. The compression ratio improves when the data is resorted by LAStools as part of the benchmark execution (see Table 1) but even in that case it is not optimal due to the mixing of point cloud from different nature (total size 1,66 Tb).
1.1.2 Queries
Figures 8 and 9 show the first 20 query geometries that were used in the several benchmark stages. Table 8 describes all of them, their ID, the number of points in the boundary of the query geometry (Pnts) and the test data set name in which the query geometry is located.
1.2 Appendix B: Executable Benchmark Loading Results
Table 9 contains the loading details of the medium-benchmark execution for various PCDMS’s and data sets. The results of LAStools are when using LAS (instead of LAZ). The PCDMS using the blocks model were using the compression available at that time (second half 2014) and with optimal block sizes previously computed. Note that all the Oracle Exadata approaches (oe* on the table) run in a different hardware than the other approaches.
Table 10 contains the loading details of the full-benchmark execution that was done with LAStools and Oracle Exadata PCDMS’s. Note that for this execution the 6394784M data set was used, i.e. the AHN2 version with duplicate and erroneous points. For an in-deep analysis of these results we refer the reader to our previous work (van Oosterom et al. 2015).
1.3 Appendix C: Executable Benchmark Querying Results
Table 11 contains the number of returned points and the response times of the first seven queries for the different PCDMS’s and data sets. Note that each query was executed twice, the numbers in the table are from the second execution, usually called hot query because of the fact that the PCDMS may be able to reuse cached data either by the PCDMS itself or the file system or the operative system (OS).
Table 12 contains the number of returned points and the response times of the execution of the 30 full-benchmark queries for the LAStools and Oracle Exadata PCDMS. Note that for LAStools two columns are given. The first one is when using a DBMS in a pre-filtering step for the queries and the other is without it. For an in-deep analysis of these results we refer the reader to our previous work (van Oosterom et al. 2015).
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this chapter
Cite this chapter
van Oosterom, P., Martinez-Rubi, O., Tijssen, T., Gonçalves, R. (2017). Realistic Benchmarks for Point Cloud Data Management Systems. In: Abdul-Rahman, A. (eds) Advances in 3D Geoinformation. Lecture Notes in Geoinformation and Cartography. Springer, Cham. https://doi.org/10.1007/978-3-319-25691-7_1
Download citation
DOI: https://doi.org/10.1007/978-3-319-25691-7_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-25689-4
Online ISBN: 978-3-319-25691-7
eBook Packages: Earth and Environmental ScienceEarth and Environmental Science (R0)