Realistic Benchmarks for Point Cloud Data Management Systems

van Oosterom, Peter; Martinez-Rubi, Oscar; Tijssen, Theo; Gonçalves, Romulo

doi:10.1007/978-3-319-25691-7_1

Realistic Benchmarks for Point Cloud Data Management Systems

Peter van Oosterom⁶,
Oscar Martinez-Rubi⁷,
Theo Tijssen⁶ &
…
Romulo Gonçalves⁷

Chapter
First Online: 18 October 2016

1516 Accesses
3 Citations

Part of the book series: Lecture Notes in Geoinformation and Cartography ((LNGC))

Abstract

Lidar, photogrammetry, and various other survey technologies enable the collection of massive point clouds. Faced with hundreds of billions or trillions of points the traditional solutions for handling point clouds usually under-perform even for classical loading and retrieving operations. To obtain insight in the features affecting performance the authors carried out single-user tests with different storage models on various systems, including Oracle Spatial and Graph, PostgreSQL-PostGIS, MonetDB and LAStools (during the second half of 2014). In the summer of 2015, the tests are further extended with the latest developments of the systems, including the new version of Point Data Abstraction Library (PDAL) with efficient compression. Web services based on point cloud data are becoming popular and they have requirements that most of the available point cloud data management systems can not fulfil. This means that specific custom-made solutions are constructed. We identify the requirements of these web services and propose a realistic benchmark extension, including multi-user and level-of-detail queries. This helps in defining the future lines of work for more generic point cloud data management systems, supporting such increasingly demanded web services.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Actueel Hoogtebestand Nederland (AHN). (2015). http://www.ahn.nl/.
AHN viewer. (2015). http://ahn.maps.arcgis.com/apps/webappviewer/index.html.
AHN2 3D viewer and download tool. (2015). http://ahn2.pointclouds.nl/.
De Kleijn, M., De Hond, R., Martinez-Rubi, O., & Svetachov, P. (2015). A 3d geographic information system for mapping the via appia. Tech. rep. research memorandum (VU-FEWEB) 2015-1, VU University Amsterdam, Amsterdam, The Netherlands.
Google Scholar
Fiore, S., DAnca, A., Palazzo, C., Foster, I., Williams, D., & Aloisio, G. (2013). Ophidia: Toward big data analytics for escience. Procedia Computer Science, 18, 2376 – 2385. http://dx.doi.org/10.1016/j.procs.2013.05.409, http://www.sciencedirect.com/science/article/pii/S1877050913005528. 2013 international conference on computational science.
Group, T.P.G.D. (2014). PostgreSQL 9.3.5 Documentation. Tech. Rep. 9.3.5, The PostgreSQL Global Development Group.
Google Scholar
laz-perf. (2015). https://github.com/verma/laz-perf.git.
Mapping the Via Appia in 3D. (2015). http://mappingtheviaappia.nl/4dgis/.
Martinez-Rubi, O., Kersten, M., Goncalves, R., & Ivanova, M. (2014). A column-store meets the point clouds. FOSS4GEurope.
Google Scholar
Martinez-Rubi, O., van Oosterom, P., Gonçalves, R., Tijssen, T., Ivanova, M., Kersten, M. L., et al. (2015). Benchmarking and improving point cloud data management in monetdb. SIGSPATIAL Special, 6(2), 11–18. doi:10.1145/2744700.2744702. http://doi.acm.org/10.1145/2744700.2744702.
Oracle Database Online Documentation 12c Release 1 (12.1): Spatial and Graph Developer’s Guide / SDO_PC_PKG Package (Point Clouds). (2014). https://docs.oracle.com/database/121/SPATL/sdo_pc_pkg_ref.htm.
Oracle Exadata Database Machine X4-2. (2015). https://www.oracle.com/engineered-systems/exadata/database-machine-x4-2/index.html.
PDAL. (2015). http://www.pdal.io/.
Python/pandas. (2015). http://pandas.pydata.org/.
Ramsey, P. (2015). A PostgreSQL extension for storing point cloud (LIDAR) data. https://github.com/pramsey/pointcloud.
Rapidlasso GmbH. (2015). http://rapidlasso.com/.
Rapidlasso GmbH LASzip—free and lossless LiDAR compression. (2014). http://www.laszip.org/.
Suijker, P. M., Alkemade, I., Kodde, M. P., & Nonhebel, A. E. (2014). User requirements massive point clouds for eSciences (WP1). Tech. rep., Delft University of Technology. http://repository.tudelft.nl/view/ir/uuid%3A351e0d1e-f473-4651-bf15-8f9b29b7b800/.
van Oosterom, P., Martinez-Rubi, O., Ivanova, M., Horhammer, M., Geringer, D., Ravada, S., Tijssen, T., Kodde, M., & Gonçalves, R. (2015). Massive point cloud data management: Design, implementation and execution of a point cloud benchmark. Computers and Graphics, 49, 92–125. http://dx.doi.org/10.1016/j.cag.2015.01.007, http://www.sciencedirect.com/science/article/pii/S0097849315000084.
van Oosterom, P., & Meijers, M. (2014). Vario-scale data structures supporting smooth zoom and progressive transfer of 2d and 3d data. International Journal of Geographical Information Systems, 28(3), 455–478.
Article Google Scholar

Download references

Acknowledgments

We thank all the members of the project Massive Point Clouds for eSciences, which is supported in part by the Netherlands eScience Center under project code 027.012.101. Also special thanks for their assistance to Mike Horhammer, Daniel Geringer, Siva Ravada (all Oracle), Markus Schütz (developer of potree), Martin Isenburg (developer of LAStools), and to Howard Butler, Andrew Bell and the rest of PDAL developers.

Author information

Authors and Affiliations

Faculty of Architecture and the Built Environment, Department OTB, Section GIS Technology, TU Delft, Delft, The Netherlands
Peter van Oosterom & Theo Tijssen
Netherlands eScience Center, Amsterdam, The Netherlands
Oscar Martinez-Rubi & Romulo Gonçalves

Authors

Peter van Oosterom
View author publications
You can also search for this author in PubMed Google Scholar
Oscar Martinez-Rubi
View author publications
You can also search for this author in PubMed Google Scholar
Theo Tijssen
View author publications
You can also search for this author in PubMed Google Scholar
Romulo Gonçalves
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Peter van Oosterom .

Editor information

Editors and Affiliations

Department of Geoinformation, Universiti Teknologi Malaysia, Johor Bahru, Malaysia
Alias Abdul-Rahman

Appendix

1.1 Appendix A: Executable Benchmark Data Sets and Queries

Table 6 Data sets name, benchmarks in which they are used, number of files and disk size

Full size table

Table 7 Data sets area and description

Full size table

1.1.1 Data sets

Tables 6 and 7 contains information on the used data sets in the executable benchmark and their usage in the different stages. Figure 7 shows the extent of the used data sets.

Note that for the full AHN2 we include two versions of the data set. The first one, 639478M was used in our previous full-benchmark execution while the second one, 638860Mc is the one used for in the new execution and does not contain erroneous and duplicate points that are found in the first version. Also note the difference in the data sets sizes. This is due to the fact that for the cleaning process that was required to generate the second (cleaned) version of the full AHN2 data set the points in the files needed to be resorted and that affected dramatically the compression performance of LAZ. In the first version the files were separated according to their nature, in object and terrain files, and that improved the compressor performance. However, as part of the cleaning processes, these files were joined and the compression ratio was affected. The compression ratio improves when the data is resorted by LAStools as part of the benchmark execution (see Table 1) but even in that case it is not optimal due to the mixing of point cloud from different nature (total size 1,66 Tb).

1.1.2 Queries

Figures 8 and 9 show the first 20 query geometries that were used in the several benchmark stages. Table 8 describes all of them, their ID, the number of points in the boundary of the query geometry (Pnts) and the test data set name in which the query geometry is located.

1.2 Appendix B: Executable Benchmark Loading Results

Table 9 contains the loading details of the medium-benchmark execution for various PCDMS’s and data sets. The results of LAStools are when using LAS (instead of LAZ). The PCDMS using the blocks model were using the compression available at that time (second half 2014) and with optimal block sizes previously computed. Note that all the Oracle Exadata approaches (oe* on the table) run in a different hardware than the other approaches.

Table 8 Description of the different queries

Full size table

Table 9 Times and sizes of the data loading procedure for the different PCDMSs and datasets. The names of approaches encode the PCDMS name (o for Oracle, p for PostgreSQL, etc.), flat or blocked model (f and b, respectively), and the dataset name. For example ob2201M stands for the dataset 2201M loaded in the Oracle blocks PCDMS

Full size table

Table 10 contains the loading details of the full-benchmark execution that was done with LAStools and Oracle Exadata PCDMS’s. Note that for this execution the 6394784M data set was used, i.e. the AHN2 version with duplicate and erroneous points. For an in-deep analysis of these results we refer the reader to our previous work (van Oosterom et al. 2015).

1.3 Appendix C: Executable Benchmark Querying Results

Table 11 contains the number of returned points and the response times of the first seven queries for the different PCDMS’s and data sets. Note that each query was executed twice, the numbers in the table are from the second execution, usually called hot query because of the fact that the PCDMS may be able to reuse cached data either by the PCDMS itself or the file system or the operative system (OS).

Table 12 contains the number of returned points and the response times of the execution of the 30 full-benchmark queries for the LAStools and Oracle Exadata PCDMS. Note that for LAStools two columns are given. The first one is when using a DBMS in a pre-filtering step for the queries and the other is without it. For an in-deep analysis of these results we refer the reader to our previous work (van Oosterom et al. 2015).

Table 10 Full-benchmark loading results for the LAStools and Oracle Exadata PCDMSs

Full size table

Table 11 Comparison of number of points returned and response times by the hot queries 1 to 7 in the different approaches

Full size table

Table 12 Full benchmark query results of LAStools and Oracle Exadata. Notes (a.) Nearest neighbours queries (#18, #19 and #20) were not executed as functionality was not implemented, and (b.) Oracle Exadata query #25 was also re-run using an MBR instead of a geometry close to an MBR with and improved the time to 353.93 s with 3.6546E+10 selected points

Full size table

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

van Oosterom, P., Martinez-Rubi, O., Tijssen, T., Gonçalves, R. (2017). Realistic Benchmarks for Point Cloud Data Management Systems. In: Abdul-Rahman, A. (eds) Advances in 3D Geoinformation. Lecture Notes in Geoinformation and Cartography. Springer, Cham. https://doi.org/10.1007/978-3-319-25691-7_1

Download citation

DOI: https://doi.org/10.1007/978-3-319-25691-7_1
Published: 18 October 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-25689-4
Online ISBN: 978-3-319-25691-7
eBook Packages: Earth and Environmental ScienceEarth and Environmental Science (R0)

Publish with us

Policies and ethics

Abstract

Buying options

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Appendix

Appendix

1.1 Appendix A: Executable Benchmark Data Sets and Queries

1.1.1 Data sets

1.1.2 Queries

1.2 Appendix B: Executable Benchmark Loading Results

1.3 Appendix C: Executable Benchmark Querying Results

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation