A Distributed Rule Engine for Streaming Big Data

Cai, Debo; Hou, Di; Qi, Yong; Yan, Jinpei; Lu, Yu

doi:10.1007/978-3-030-02934-0_12

A Distributed Rule Engine for Streaming Big Data

Debo Cai¹⁸,
Di Hou¹⁸,
Yong Qi¹⁸,
Jinpei Yan¹⁸ &
…
Yu Lu¹⁹

Conference paper
First Online: 20 November 2018

1416 Accesses
4 Citations

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11242))

Abstract

The rules engine has been widely used in industry and academia, because it can separate the rules from the execution logic and incorporate the features of expert knowledge. With the advent of big data era, the amount of data has grown at an unprecedented rate. However, traditional rule engines based on PCs or servers are hard to handle streaming big data owing to limitation of hardware performance. The structured streaming computing framework can provide new solutions for these challenges. In this paper, we design a distributed rule engine based on Kafka and Structured Streaming (KSSRE), and propose a rule-fact matching strategy using the Spark SQL engine to support a large number of event stream inferences. KSSRE uses DataFrame to store data and inherits the load balancing, scalability and fault-tolerance mechanisms of Spark2.x. In addition, in order to remove the possible repetitive rules and optimize the matching process, we use the ternary grid model [1] for representing rules and design a scheduling model to improve the memory sharing in the matching process. The evaluation shows that KSSRE has a better performance, scalability and fault tolerance based on DBLP data sets.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Erdani, Y.: Developing algorithms of ternary grid technique for optimizing expert system’s knowledge base. In: 2006 Seminar Nasional Aplikasi Teknologi Informasi (2006)
Google Scholar
Apache Kafka. http://kafka.apache.org/. Accessed May 2018
Structured Streaming. http://spark.apache.org. Accessed May 2018
Dean, J., Ghemawat, S.: MapReduce: simplified data processing on large clusters. Commun. ACM 51, 107–113 (2008)
Article Google Scholar
Cao, B., Yin, J., Zhang, Q., Ye, Y.: A MapReduce-based architecture for rule matching in production system, pp. 790–795. IEEE (2010)
Google Scholar
Zhou, R., Wang, G., Wang, J., Li, J.: RUNES II: a distributed rule engine based on rete network in cloud computing. Int. J. Grid Distrib. Comput. 7, 91–110 (2014)
Article Google Scholar
Chen, Y., Bordbar, B.: DRESS: a rule engine on spark for event stream processing, pp. 46–51. ACM (2016)
Google Scholar
Zhang, J., Yang, J., Li, J.: When rule engine meets big data: design and implementation of a distributed rule engine using spark, pp. 41–49. IEEE (2017)
Google Scholar
Liang, S., Fodor, P., Wan, H., Kifer, M.: OpenRuleBench: an analysis of the performance of rule engines. In: Proceedings of the 18th International Conference on World Wide Web, pp. 601–610. ACM (2009)
Google Scholar
DBLP: computer science bibliography. http://dblp.uni-trier.de/db/. Accessed May 2018
Forgy, C.L.: Rete: a fast algorithm for the many pattern/many object pattern match problem. Artif. Intell. 19, 17–37 (1982)
Article Google Scholar
Drools. https://www.drools.org/. Accessed May 2018

Download references

Acknowledgment

This work is partially supported by the National Key Research and Development Program of China under Grant No. 2016YFB1000600.

Author information

Authors and Affiliations

Department of Computer Science and Technology, Xi’an Jiaotong University, Xi’an, China
Debo Cai, Di Hou, Yong Qi & Jinpei Yan
Troops 69064 of PLA, Xinjiang, China
Yu Lu

Authors

Debo Cai
View author publications
You can also search for this author in PubMed Google Scholar
Di Hou
View author publications
You can also search for this author in PubMed Google Scholar
Yong Qi
View author publications
You can also search for this author in PubMed Google Scholar
Jinpei Yan
View author publications
You can also search for this author in PubMed Google Scholar
Yu Lu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Debo Cai .

Editor information

Editors and Affiliations

Renmin University of China, Beijing, China
Xiaofeng Meng
Huazhong University of Science and Technology, Wuhan, China
Ruixuan Li
Renmin University of China, Beijing, China
Kanliang Wang
Taiyuan University of Technology, Yuci, China
Baoning Niu
Tianjin University, Tianjin, China
Xin Wang
South China Normal University, Guangzhou, China
Gansen Zhao

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Cai, D., Hou, D., Qi, Y., Yan, J., Lu, Y. (2018). A Distributed Rule Engine for Streaming Big Data. In: Meng, X., Li, R., Wang, K., Niu, B., Wang, X., Zhao, G. (eds) Web Information Systems and Applications. WISA 2018. Lecture Notes in Computer Science(), vol 11242. Springer, Cham. https://doi.org/10.1007/978-3-030-02934-0_12

Download citation

DOI: https://doi.org/10.1007/978-3-030-02934-0_12
Published: 20 November 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-02933-3
Online ISBN: 978-3-030-02934-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics