Abstract
XML filtering applications have become increasingly popular. Automata are commonly used to build query indexes for evaluating large numbers of XPath queries over XML streams. Existing approaches usually consider only shallow data; processing deep and recursive XML data efficiently under a tight memory limit remains a challenging problem. In this paper, we propose HFilter, a stream filtering approach based on a Hybrid Finite Automaton (HFA), to address this problem. We first introduce the basic two-tier HFA (a lazy DFA tier and an NFA tier), which provides data prefix sharing and memory overflow control to improve filtering throughput. We then present an optimized three-tier HFA with an additional pre-expanded DFA tier, which significantly reduces the cost of restarting the HFA after a memory overflow. Experiments show that our approaches are more efficient than existing ones.
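To make the two-tier idea concrete, the following is a minimal sketch of a lazily built DFA layered over a path NFA, with a cap on how many DFA transitions are kept in memory. It is not the HFA construction of the paper: the class names (`PathNFA`, `HybridFilter`), the `max_cached_transitions` budget, the restriction to linear paths using only `/`, `//`, names and `*`, and the policy of computing uncached transitions directly on the NFA once the budget is reached (rather than restarting) are all assumptions made for illustration.

```python
import re
from typing import Dict, FrozenSet, Iterable, List, Set, Tuple


class PathNFA:
    """NFA shared by many linear XPath queries ('/', '//', names, '*')."""

    def __init__(self) -> None:
        self.trans: List[Dict[str, int]] = [{}]  # state -> {label: next state}
        self.eps: Dict[int, int] = {}            # state -> its '//' helper state
        self.loop: Set[int] = set()              # helper states that loop on any tag
        self.accept: Dict[int, Set[str]] = {}    # state -> ids of queries ending there

    def _new_state(self) -> int:
        self.trans.append({})
        return len(self.trans) - 1

    def add_query(self, qid: str, path: str) -> None:
        s = 0
        for axis, name in re.findall(r"(//|/)([^/]+)", path):
            if axis == "//":
                # Descendant axis: epsilon move to a state that loops on every tag.
                if s not in self.eps:
                    helper = self._new_state()
                    self.eps[s] = helper
                    self.loop.add(helper)
                s = self.eps[s]
            if name not in self.trans[s]:
                self.trans[s][name] = self._new_state()  # common prefixes are shared
            s = self.trans[s][name]
        self.accept.setdefault(s, set()).add(qid)

    def closure(self, states: Iterable[int]) -> FrozenSet[int]:
        seen, todo = set(states), list(states)
        while todo:
            t = self.eps.get(todo.pop())
            if t is not None and t not in seen:
                seen.add(t)
                todo.append(t)
        return frozenset(seen)

    def start(self) -> FrozenSet[int]:
        return self.closure([0])

    def step(self, states: FrozenSet[int], tag: str) -> FrozenSet[int]:
        out: Set[int] = set()
        for s in states:
            for label in (tag, "*"):
                t = self.trans[s].get(label)
                if t is not None:
                    out.add(t)
            if s in self.loop:
                out.add(s)
        return self.closure(out)


class HybridFilter:
    """Lazy DFA tier (memoized NFA state sets) with an NFA fallback tier."""

    def __init__(self, nfa: PathNFA, max_cached_transitions: int) -> None:
        self.nfa = nfa
        self.budget = max_cached_transitions
        self.table: Dict[Tuple[FrozenSet[int], str], FrozenSet[int]] = {}

    def _next(self, state: FrozenSet[int], tag: str) -> FrozenSet[int]:
        key = (state, tag)
        hit = self.table.get(key)
        if hit is not None:
            return hit                        # DFA tier: transition already known
        nxt = self.nfa.step(state, tag)       # NFA tier: compute on demand
        if len(self.table) < self.budget:     # memoize only while under budget
            self.table[key] = nxt
        return nxt

    def run(self, events: Iterable[Tuple[str, str]]) -> Set[str]:
        stack = [self.nfa.start()]            # one entry per open element
        matched: Set[str] = set()
        for kind, tag in events:
            if kind == "start":
                state = self._next(stack[-1], tag)
                stack.append(state)
                for s in state:
                    matched |= self.nfa.accept.get(s, set())
            else:                             # "end": return to the parent's state
                stack.pop()
        return matched


nfa = PathNFA()
for qid, path in [("q1", "/a//b"), ("q2", "/a/b/c"), ("q3", "//c/*")]:
    nfa.add_query(qid, path)

# SAX-style events for <a><x><b/></x><b><c><d/></c></b></a>
events = [("start", "a"), ("start", "x"), ("start", "b"), ("end", "b"),
          ("end", "x"), ("start", "b"), ("start", "c"), ("start", "d"),
          ("end", "d"), ("end", "c"), ("end", "b"), ("end", "a")]

print(sorted(HybridFilter(nfa, max_cached_transitions=1000).run(events)))
```

Setting `max_cached_transitions=0` turns the sketch into a pure NFA simulation, so the budget trades memory for repeated transition computation without changing which queries match.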
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sun, W., Qin, Y., Yu, P., Zhang, Z., He, Z. (2008). HFilter: Hybrid Finite Automaton Based Stream Filtering for Deep and Recursive XML Data. In: Bhowmick, S.S., Küng, J., Wagner, R. (eds) Database and Expert Systems Applications. DEXA 2008. Lecture Notes in Computer Science, vol 5181. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85654-2_48
DOI: https://doi.org/10.1007/978-3-540-85654-2_48
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-85653-5
Online ISBN: 978-3-540-85654-2
eBook Packages: Computer Science, Computer Science (R0)