Abstract
Nowadays Web plays a vital role in everyone’s life. Whether one has to search something, perform any online task or anything on Internet, one has to come across Web pages throughout. For small screen terminals such as small mobile phone screen, palmtop, pagers, tablet computers, and many more, reading a Web page is time-consuming task because there are many unwanted content parts which are to be scrolled or there are advertisements, etc. To overcome such issue, the source code of a Web page is organized in tree format. In this paper, backtracking is applied on the tree as per some mentioned rules (I, II, III) to filter out the Web page headings. These rules are applied until none of the nodes is left unvisited. The filtered data is then mapped to a multidimensional (MDB) data cube. Again MDB is used by online analytical processing (OLAP) which in turn is one of the greatest tools for analysts in today’s entrepreneur’s scenario. This strategy makes Web page access a very simple and interesting task.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
S. Shefali, N. Garg, Hybrid web-page segmentation and block extraction for small screen terminals, in IJCA Proceedings on 4th International IT Summit Confluence 2013—The Next Generation Information Technology Summit Confluence (2013) (2), 12–15 Jan 2014
M. Álvarez, A. Pan, J. Raposo, F. Bellas, F. Cacheda, Extracting lists of data records from semi-structured web pages, Department of Information and Communication Technologies, University of A Coruña, Campus de Elviña s/n, 15071 A Coruña, Spain, 11 Oct 2007
D. Cai, S. Yu, J.-R. Wen, W.-Y. Ma, VIPS: a vision-based page segmentation algorithm, in Microsoft Research Microsoft Corporation One Microsoft Way Redmond, WA 98052, Nov 1 2003
S. Baluja, Browsing on small screens: recasting web-page segmentation into an efficient machine learning framework, in Proceeding of the 15th International Conference on World Wide Web, 2006-05-23
R. Song, H. Liu, J.-R. Wen, W.-Y. Ma, Learning block importance models for web pages, in Proceedings of the 13th International Conference on World Wide Web, 2004-05-17. ISBN:1-58113-844-X
D. Cai, S. Yu, J.-R. Wen, W.-Y. Ma, Block-based web search, in Proceedings of the 27th Annual International ACM SIGIR Conference on Research and development in Information Retrieval, 2004-07-25
K. Vieira, S. Altigran da Silva, N. Pinto, S. Edleno de Moura, J.M.B. Cavalcanti, J. Freire, A fast and robust method for web page template detection and removal, in Proceedings of the 15th ACM International Conference on Information and Knowledge Management, 2006-11-06
Y. Guo, H. Tang; L. Song, Y. Wang, G. Ding, ECON: an approach to extract content from web news page, in 2010 12th International Asia-Pacific Web Conference (APWEB), 6–8 Apr 2010
F. Zhao, The algorithm analyses and design about the subjective test online basing on the DOM tree, in 2008 International Conference on Computer Science and Software Engineering, 12–14 Dec 2008
G. Colliat, OLAP, relational, and multidimensional database systems. ACM SIGMOD Record 25(3) (1996)
P. Vassiliadis, Modeling multidimensional databases, cubes and cube operations, in Proceedings of Tenth International Conference on Scientific and Statistical Database Management, 3 July 1998
C. Stolte, D. Tang, P. Hanrahan, Polaris: a system for query, analysis, and visualization of multidimensional relational databases. IEEE Trans. Vis. Comput. Graph. (Jan/Mar 2002) 8(1) (2002)
P. Vassiliadis, T. Sellis, A survey of logical models for OLAP databases. ACM SIGMOD Record 28(4) (1999)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this chapter
Cite this chapter
Singhal, S., Garg, N. (2018). Web Page Representation Using Backtracking with Multidimensional Database for Small Screen Terminals. In: Panda, B., Sharma, S., Batra, U. (eds) Innovations in Computational Intelligence . Studies in Computational Intelligence, vol 713. Springer, Singapore. https://doi.org/10.1007/978-981-10-4555-4_21
Download citation
DOI: https://doi.org/10.1007/978-981-10-4555-4_21
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-4554-7
Online ISBN: 978-981-10-4555-4
eBook Packages: EngineeringEngineering (R0)