Abstract
Nowadays computers are equipped with multicore processors and large RAM to support high performance processing. In-memory analytical processing and just-in-time data warehousing have become realistic for various enterprises. An analytical query normally requires a small proportion of ‘hot’ data, usually defined by a set of keywords, instead of the entire data set, which involves large volume table scan and costly star joins. Therefore, identifying frequent keywords to retrieve hot data can dramatically reduce the cost of full table scan or star-join. In this paper, we propose a keyword oriented bitmap join index to improve the space efficiency and performance of in-memory data warehouse. Keyword oriented bitmap join index is a global bitmap join index for the entire data warehouse, as opposed to conventional bitmap join indexes which are indicated for specified attributes. With our index, star-join is first converted into keyword search and bitmap combination. The resulting bitmap filters are then employed to filter records. Through the filtering by bitmaps, a star-join is converted into positional scan on the fact table and additional dimension filtering. Both memory bandwidth and analytical performance can then be improved.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
http://www.sap.com/solutions/technology/in-memory-computing-platform/hana/overview/index.epx
http://www.oracle.com/us/products/database/exadata-db-machine-x3-2-1851253.pdf
Boncz, P.A., Mangegold, S., Kersten, M.L.: Database architecture optimized for the new bottleneck: Memory access. In: VLDB, pp. 266–277 (1999)
Funke, F., Kemper, A., Neumann, T.: HyPer-sonic Combined Transaction AND Query Processing. PVLDB 4(12), 1367–1370 (2011)
O’Neil, P., O’Neil, B., Chen, X.: The Star Schema Benchmark (SSB), http://www.cs.umb.edu/~poneil/StarSchemaB.PDF
http://docs.oracle.com/cd/B10500_01/server.920/a96520/indexes.htm
Levandoski, J., Larson, P., Stoica, R.: Identifying Hot and Cold Data in Main-Memory Databases. In: ICDE 2013 (2013)
Park, D., Du, D.H.C.: Hot data identification for flash-based storage systems using multiple bloom filters. In: Proceedings of the 2011 IEEE 27th Symposium on Mass Storage Systems and Technologies, MSST 2011, pp. 1–11 (2011)
Aouiche, K., Darmont, J., Boussaid, O., Bentayeb, F.: Automatic Selection of Bitmap Join Indexes in Data Warehouses. CoRR abs/cs/0703113 (2007)
Zhang, Y., Wang, S., Lu, J.: Improving performance by creating a native join-index for OLAP. Frontiers of Computer Science in China 5(2), 236–249 (2011)
Abadi, D.J., Madden, S., Hachem, N.: Column-stores vs. row-stores: how different are they really? In: SIGMOD Conference 2008, pp. 967–980 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhang, Y., Su, M., Zhou, X., Wang, S., Wang, X. (2013). Keyword Oriented Bitmap Join Index for In-Memory Analytical Processing. In: Wang, J., Xiong, H., Ishikawa, Y., Xu, J., Zhou, J. (eds) Web-Age Information Management. WAIM 2013. Lecture Notes in Computer Science, vol 7923. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38562-9_41
Download citation
DOI: https://doi.org/10.1007/978-3-642-38562-9_41
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38561-2
Online ISBN: 978-3-642-38562-9
eBook Packages: Computer ScienceComputer Science (R0)