WBPL: An Open-Source Library for Predicting Web Surfing Behaviors

Gueniche, Ted; Fournier-Viger, Philippe; Nkambou, Roger; Tseng, Vincent S.

doi:10.1007/978-3-319-08326-1_55

WBPL: An Open-Source Library for Predicting Web Surfing Behaviors

Ted Gueniche²²,
Philippe Fournier-Viger²²,
Roger Nkambou²³ &
…
Vincent S. Tseng²⁴

Conference paper

1559 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8502))

Abstract

We present WBPL (Web users Behavior Prediction Library), a cross-platform open-source library for predicting the behavior of web users. WBPL allows training prediction models from server logs. The proposed library offers support for three of the most used webservers (Apache, Nginx and Lighttpd). Models can then be used to predict the next resources fetched by users and can be updated with new logs efficiently. WBPL offers multiple state-of-the-art prediction models such as PPM, All-K-Order-Markov and DG and a novel prediction model CPT (Compact Prediction Tree). Experiments on various web click-stream datasets shows that the library can be used to predict web surfing or buying behaviors with a very high overall accuracy (up to 38 %) and is very efficient (up to 1000 predictions /s).

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Cleary, J., Witten, I.: Data compression using adaptive coding and partial string matching. IEEE Trans. on Inform. Theory 24(4), 413–421 (1984)
MathSciNet Google Scholar
Deshpande, M., Karypis, G.: Selective Markov models for predicting Web page accesses. ACM Transactions on Internet Technology 4(2), 163–184 (2004)
Article Google Scholar
Google Prediction API, https://developers.google.com/prediction (accessed: February 15, 2014)
Gueniche, T., Fournier-Viger, P., Tseng, V.-S.: Compact Prediction Tree: A Lossless Model for Accurate Sequence Prediction. In: Motoda, H., Wu, Z., Cao, L., Zaiane, O., Yao, M., Wang, W. (eds.) ADMA 2013, Part II. LNCS (LNAI), vol. 8347, pp. 177–188. Springer, Heidelberg (2013)
Chapter Google Scholar
Hassan, M.T., Junejo, K.N., Karim, A.: Learning and Predicting Key Web Navigation Patterns Using Bayesian Models. In: Gervasi, O., Taniar, D., Murgante, B., Laganà, A., Mun, Y., Gavrilova, M.L. (eds.) ICCSA 2009, Part II. LNCS, vol. 5593, pp. 877–887. Springer, Heidelberg (2009)
Chapter Google Scholar
HMMgene (v. 1.1), http://www.cbs.dtu.dk/services/HMMgene (accessed: February 15, 2014)
Padmanabhan, V.N., Mogul, J.C.: Using Prefetching to Improve World Wide Web Latency. Computer Communications 16, 358–368 (1998)
Google Scholar
Domenech, J., de la Ossa, B., Sahuquillo, J., Gil, J.A., Pont, A.: A taxonomy of web prediction algorithms. Expert Systems with Applications (9) (2012)
Google Scholar
Pitkow, J., Pirolli, P.: Mining longest repeating subsequence to predict world wide web surng. In: Proc. 2nd USENIX Symposium on Internet Technologies and Systems, Boulder, CO, pp. 13–25 (1999)
Google Scholar

Download references

Author information

Authors and Affiliations

Dept. of computer science, University of Moncton, Canada
Ted Gueniche & Philippe Fournier-Viger
Dept. d’informatique, Université du Québec á Montréal, Canada
Roger Nkambou
Dept. of computer science and inf. eng., National Cheng Kung University, Taiwan
Vincent S. Tseng

Authors

Ted Gueniche
View author publications
You can also search for this author in PubMed Google Scholar
Philippe Fournier-Viger
View author publications
You can also search for this author in PubMed Google Scholar
Roger Nkambou
View author publications
You can also search for this author in PubMed Google Scholar
Vincent S. Tseng
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Research Group PLIS: Programming, Logic and Intelligent Systems Dept. of Communication, Business and Information Technologies, Roskilde University, Denmark
Troels Andreasen & Henning Christiansen &
Department of Computer Science and Artificial Intelligence, CITIC, University of Granada, 18071, Granada, Spain
Juan-Carlos Cubero
University of North Carolina, , , 9201 University City Blvd, Charlotte, NC 28223 USA, and Warsaw University of Technology, ul. Nowowiejska 15/19, 00-665 Warsaw, Poland
Zbigniew W. Raś

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Gueniche, T., Fournier-Viger, P., Nkambou, R., Tseng, V.S. (2014). WBPL: An Open-Source Library for Predicting Web Surfing Behaviors. In: Andreasen, T., Christiansen, H., Cubero, JC., Raś, Z.W. (eds) Foundations of Intelligent Systems. ISMIS 2014. Lecture Notes in Computer Science(), vol 8502. Springer, Cham. https://doi.org/10.1007/978-3-319-08326-1_55

Download citation

DOI: https://doi.org/10.1007/978-3-319-08326-1_55
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-08325-4
Online ISBN: 978-3-319-08326-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics