Skip to main content

Efficient Approximate Top-k Query Algorithm Using Cube Index

  • Conference paper
Web Technologies and Applications (APWeb 2011)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6612))

Included in the following conference series:

  • 1087 Accesses

Abstract

Exact top-k query processing has attracted much attention recently because of its wide use in many research areas. Since missing the truly best answers is inherent and unavoidable due to the user’s subjective judgment, and the cost of processing exact top-k queries is highly expensive for datasets with huge volume, it is intriguing to answer approximate top-k query instead. In this paper, we first define a novel kind of approximate top-k query, called μ - approximate top-k query. Then we introduce an efficient index structure, i.e. cube index, based on which, we propose our novel Cube Index Algorithm (CIA). We analyze the complexity of both constructing cube index and CIA algorithm. Moreover, extensive experiments show that CIA performs much better than the well-known approximate TA θ algorithm [3].

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Ilyas, I., Beskales, G., Soliman, M.A.: A Survey of Top-k Query Processing Techniques in Relational Database Systems. In: ACM Computing Surveys, New York (2008)

    Google Scholar 

  2. Michel, S., Triantafillou, P., Weikum, G.: KLEE: A frame work for distributed top-k query algorithms. In: VLDB (2005)

    Google Scholar 

  3. Fagin, R., Lotem, A., Naor, M.: Optimal aggregation algorithms for middleware. In: PODS (2001)

    Google Scholar 

  4. Gong, N.Z., Sun, G.Z.: Parallel Algorithms for Top-k Query Processing. ACM SIGMOD (2010)

    Google Scholar 

  5. Cormen, T.H., Leiserson, C.E., Rivest, R.L., Stein, C.: Introduction to Algorithms. MIT Press, Cambridge (2001)

    MATH  Google Scholar 

  6. Theobald, M., Weikum, G., Schenkel, R.: Top-k Query Evaluation with Probabilistic Guarantees. In: VLDB (2004)

    Google Scholar 

  7. Zou, L., Chen, L.: Dominant Graph An Efficient Indexing Structure to Answer Top-K Queries. In: ICDE (2008)

    Google Scholar 

  8. Amato, G., Rabitti, F., Savino, P., Zezula, P.: Region Proximity in Metric Spaces and Its Use For Approximate Similarity Search. ACM Trans. Inform. Syst. (2003)

    Google Scholar 

  9. Xin, D., Han, J., Cheng, H., Li, X.: Answering Top-k Queries with Multi-Dimensional Selections: The Ranking Cube Approach. In: VLDB (2006)

    Google Scholar 

  10. Fagin, R., Kumar, R., Sivakumar, D.: Comparing Top K Lists. ACM-SIAM SODA (2003)

    Google Scholar 

  11. Donjerkovic, D., Ramakrishnan, R.: Probabilistic Optimization of Top N Queries. In: VLDB (1999)

    Google Scholar 

  12. Hellerstein, J., Haas, P., Wang, H.: Online Aggregation. ACM SIGMOD (1997)

    Google Scholar 

  13. Ilyas, I., Aref, W., Elmagarmid, A.: Supporting Top-K Join Queries in Relational Databases. In: VLDB (2004)

    Google Scholar 

  14. Kendall, M., Gibbons, J.D.: Rank Correlation Methods. Oxford University Press, Oxford (1990)

    MATH  Google Scholar 

  15. Re, C., Dalvi, N., Suciu, D.: Efficient Top-K Query Evaluation on Probabilistic Data. In: ICDE (2007)

    Google Scholar 

  16. Balke, W.-T., Nejdl, W., Siberski, W., Thaden, U.: Progressive distributed top-k retrieval in peer-to-peer networks. In: ICDE Conf. (2005)

    Google Scholar 

  17. Kimelfeld, B., Sagiv, Y.: Finding and approximating top-k answers in keyword proximity search. In: PODS Conf. (2006)

    Google Scholar 

  18. Akbarinia, R., Pacitti, E., Valduriez, P.: Reducing network traffic in unstructured P2P systems using Top-k queries. Distributed and Parallel Databases 19(2) (2006)

    Google Scholar 

  19. Akbarinia, R., Pacitti, E., Valduriez, P.: Processing top-k queries in distributed hash tables. In: Kermarrec, A.-M., Bougé, L., Priol, T. (eds.) Euro-Par 2007. LNCS, vol. 4641, pp. 489–502. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  20. Chaudhuri, S., Gravano, L., Marian, A.: Optimizing top-k selection queries over multimedia repositories. IEEE Trans. on Knowledge and Data Engineering 16(8) (2004)

    Google Scholar 

  21. Nepal, S., Ramakrishna, M.V.: Query processing issues in image (multimedia) databases. In: ICDE Conf. (1999)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Chen, D., Sun, GZ., Gong, N.Z. (2011). Efficient Approximate Top-k Query Algorithm Using Cube Index. In: Du, X., Fan, W., Wang, J., Peng, Z., Sharaf, M.A. (eds) Web Technologies and Applications. APWeb 2011. Lecture Notes in Computer Science, vol 6612. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20291-9_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-20291-9_17

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-20290-2

  • Online ISBN: 978-3-642-20291-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics