Skip to main content

Web Page Representation Using Backtracking with Multidimensional Database for Small Screen Terminals

  • Chapter
  • First Online:
Innovations in Computational Intelligence

Part of the book series: Studies in Computational Intelligence ((SCI,volume 713))

Abstract

Nowadays Web plays a vital role in everyone’s life. Whether one has to search something, perform any online task or anything on Internet, one has to come across Web pages throughout. For small screen terminals such as small mobile phone screen, palmtop, pagers, tablet computers, and many more, reading a Web page is time-consuming task because there are many unwanted content parts which are to be scrolled or there are advertisements, etc. To overcome such issue, the source code of a Web page is organized in tree format. In this paper, backtracking is applied on the tree as per some mentioned rules (I, II, III) to filter out the Web page headings. These rules are applied until none of the nodes is left unvisited. The filtered data is then mapped to a multidimensional (MDB) data cube. Again MDB is used by online analytical processing (OLAP) which in turn is one of the greatest tools for analysts in today’s entrepreneur’s scenario. This strategy makes Web page access a very simple and interesting task.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. S. Shefali, N. Garg, Hybrid web-page segmentation and block extraction for small screen terminals, in IJCA Proceedings on 4th International IT Summit Confluence 2013—The Next Generation Information Technology Summit Confluence (2013) (2), 12–15 Jan 2014

    Google Scholar 

  2. M. Álvarez, A. Pan, J. Raposo, F. Bellas, F. Cacheda, Extracting lists of data records from semi-structured web pages, Department of Information and Communication Technologies, University of A Coruña, Campus de Elviña s/n, 15071 A Coruña, Spain, 11 Oct 2007

    Google Scholar 

  3. D. Cai, S. Yu, J.-R. Wen, W.-Y. Ma, VIPS: a vision-based page segmentation algorithm, in Microsoft Research Microsoft Corporation One Microsoft Way Redmond, WA 98052, Nov 1 2003

    Google Scholar 

  4. S. Baluja, Browsing on small screens: recasting web-page segmentation into an efficient machine learning framework, in Proceeding of the 15th International Conference on World Wide Web, 2006-05-23

    Google Scholar 

  5. R. Song, H. Liu, J.-R. Wen, W.-Y. Ma, Learning block importance models for web pages, in Proceedings of the 13th International Conference on World Wide Web, 2004-05-17. ISBN:1-58113-844-X

    Google Scholar 

  6. D. Cai, S. Yu, J.-R. Wen, W.-Y. Ma, Block-based web search, in Proceedings of the 27th Annual International ACM SIGIR Conference on Research and development in Information Retrieval, 2004-07-25

    Google Scholar 

  7. K. Vieira, S. Altigran da Silva, N. Pinto, S. Edleno de Moura, J.M.B. Cavalcanti, J. Freire, A fast and robust method for web page template detection and removal, in Proceedings of the 15th ACM International Conference on Information and Knowledge Management, 2006-11-06

    Google Scholar 

  8. Y. Guo, H. Tang; L. Song, Y. Wang, G. Ding, ECON: an approach to extract content from web news page, in 2010 12th International Asia-Pacific Web Conference (APWEB), 6–8 Apr 2010

    Google Scholar 

  9. F. Zhao, The algorithm analyses and design about the subjective test online basing on the DOM tree, in 2008 International Conference on Computer Science and Software Engineering, 12–14 Dec 2008

    Google Scholar 

  10. G. Colliat, OLAP, relational, and multidimensional database systems. ACM SIGMOD Record 25(3) (1996)

    Google Scholar 

  11. P. Vassiliadis, Modeling multidimensional databases, cubes and cube operations, in Proceedings of Tenth International Conference on Scientific and Statistical Database Management, 3 July 1998

    Google Scholar 

  12. C. Stolte, D. Tang, P. Hanrahan, Polaris: a system for query, analysis, and visualization of multidimensional relational databases. IEEE Trans. Vis. Comput. Graph. (Jan/Mar 2002) 8(1) (2002)

    Google Scholar 

  13. P. Vassiliadis, T. Sellis, A survey of logical models for OLAP databases. ACM SIGMOD Record 28(4) (1999)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shefali Singhal .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Singapore Pte Ltd.

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Singhal, S., Garg, N. (2018). Web Page Representation Using Backtracking with Multidimensional Database for Small Screen Terminals. In: Panda, B., Sharma, S., Batra, U. (eds) Innovations in Computational Intelligence . Studies in Computational Intelligence, vol 713. Springer, Singapore. https://doi.org/10.1007/978-981-10-4555-4_21

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-4555-4_21

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-4554-7

  • Online ISBN: 978-981-10-4555-4

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics