Skip to main content

WebSuite—A Tool Suite for Harnessing Web Data

  • Conference paper
The World Wide Web and Databases (WebDB 1998)

Abstract

We present a system for searching, collecting, and integrating Web-resident data. The system consists of five tools, where each tool provides a specific functionality aimed at solving one aspect of the complex task of using and managing Web data. Each tool can be used in a stand-alone mode, in combination with the other tools, or even in conjunction with other systems. Together, the tools offer a wide range of capabilities that overcome many of the limitations in existing systems for harnessing Web data. The paper describes each tool, possible ways of combining the tools, and the architecture of the combined system.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. W3SQL, http://www.cs.technion.ac.il/~konop/w3qs.html

  2. W3QS, http://www.cs.technion.ac.il/~W3QS

  3. West university, http://www.west.edu/datamining/publications.html

  4. W3TRANS, http://www.math.tau.ac.il/~pinim/w3trans-home.html

  5. VRG, http://www.cs.technion.ac.il/~anda/files/WebSuite.html

  6. Abiteboul, S., Cluet, S., Milo, T.: Querying the file. In: Proc. of Intl. Conf. on Very Large Data Bases, Dublin (1993)

    Google Scholar 

  7. Abiteboul, S., Cluet, S., Milo, T.: A database interface for files update. In: Proc. ACM SIGMOD Int. Conf. on Management of Data (May 1995)

    Google Scholar 

  8. Abiteboul, S., Cluet, S., Milo, T.: Correspondence and Translation for Hetero-geneous Data. In: Proc. Int. Conf. on Database Theory (ICDT), pp. 351–363 (1997)

    Google Scholar 

  9. Atzeni, P., Labonia, S., Masci, A., Mecca, G., Merialdo, P., Tabet, E.: The ARANEUS Project, http://poincare.inf.uniroma3.it:8080/Araneus/araneus.html

  10. Bell, G., Parisi, A., Pesce, M.: The Virtual Reality Modeling Language: Version 1 Specification (May 1995), http://www.virtpark.com/theme/vrml/

  11. Buneman, P., Davidson, S., Hart, K., Overton, C., Wong, L.: A data transformation system for biological data sources. In: Proc. Int. Conf. on Very Large Data Bases (VLDB), Zurich, Switzerland, pp. 158–169 (1995)

    Google Scholar 

  12. Buneman, P., Davidson, S., Suciu, D.: Programming constructs for unstructured data (May 1996)

    Google Scholar 

  13. Carey, M.J., et al.: Towards heterogeneous multimedia information systems: The Garlic approach. Technical Report RJ 9911, IBM Almaden Research Center (1994)

    Google Scholar 

  14. Christophides, V., Abiteboul, S., Cluet, S., Scholl, M.: From structured documents to novel query facilities. In: Proc. ACM Sigmod, Minneapolis (1994)

    Google Scholar 

  15. Chang, T.-P., Hull, R.: Using witness generators to support bi-directional up-date between object-based databases. In: Proc. ACM SIGMOD/SIGACT Conf. on Princ. of Database Syst. (PODS), San Jose, California (May 1995)

    Google Scholar 

  16. Consens, M., Milo, T.: Optimizing Queries on Files. In: ACM SIGMOD Int. Conf. on Management of Data, Minneapolis, Minnesota, May 1994, pp. 301–312 (1994)

    Google Scholar 

  17. Das Neves, F.: The aleph: A tool to spatially represent user knowledge about the www docuverse. In: Proc. ACM Hypertext 1997 (1997)

    Google Scholar 

  18. Doemel, P.: WebMap - a graphical hypertext navigation tool. In: Proceedings of the 2nd Int’l World Wide Web Conference, Chicago (October 1994), http://www.ncsa.uiuc.edu/SDG/IT94/Proceedings/Searching/doemel/-www-fall94.html

  19. Excite Inc. Excite: Main page (1996), http://www.excite.com

  20. Feng, A., Wakayama, T.: Simon: A Grammar Based Transformation System of Structured Documents. In: Proc. Int. Conf. Electronic Publishing (1994)

    Google Scholar 

  21. Franchitti, J.C., King, R.: Amalgame: a tool for creating interoperating persistent, heterogeneous components. In: Advanced Database Systems, pp. 313–336 (1993)

    Google Scholar 

  22. Garcia-Molina, H., Quass, D., Papakonstantinou, Y., Rajaraman, A., Sagiv, Y., Ullman, J.D., Widom, J.: The TSIMMIS Approach to Mediation: Data Models and Languages. A special issue of the International Journal of Intelligent Information Systems (to appear)

    Google Scholar 

  23. Hemmje, M.: Lyberworld - a visulalization user interface with full text retrieval. In: Proceedings of SIGIR 1994 (1994)

    Google Scholar 

  24. Kifer, M., Lausen, G., Wu, J.: Logical Foundations of Object-Oriented and Frame-Based Languages. JACM 42(4), 741–843 (1995)

    Article  MATH  MathSciNet  Google Scholar 

  25. Kirk, T., Levy, A.Y., Sagiv, Y., Srivastava, D.: The Information Manifold. In: AI Spring Symp. (1995)

    Google Scholar 

  26. Konopnicki, D., Shmueli, O.: Information Gathering in the World-Wide Web: The W3QL Query Language and the W3QS system. ACM TODS (to appear)

    Google Scholar 

  27. Konopnicki, D., Shmueli, O.: W3QS: A Query System for the World-Wide Web. In: Proceedings of 1995 VLDB Conference, Zurich, Switzerland (September 1995)

    Google Scholar 

  28. Levy, A., Rajaraman, A., Ordille, J.: The World-Wide Web as a Collection of Views: Query Processing in the Information Manifold. In: Proc. Workshop on Mate- rialized Views: Techniques and Applications, Montreal, Canada, pp. 43–55 (1996)

    Google Scholar 

  29. Levy, A.Y., Mendelzon, A.O., Sagiv, Y., Srivastava, D.: Answering Queries Using Views. In: Proc. 14th PODS (1995)

    Google Scholar 

  30. Mamrak, A., O’Connell, C.: Technical Documentation for the Integrated Chameleon Architecture: ftp.ifi.uio.no /pub/SGML/ICA (1992)

    Google Scholar 

  31. Mendelzon, A., Mihaila, G., Milo, T.: Querying the world wide. In: Proc. of PDIS (1996)

    Google Scholar 

  32. Mogilevski, P.: Integration and Translation of Heterogeneous Data. M.Sc Thesis, Tel-Aviv University (1997)

    Google Scholar 

  33. Mukherjea, S., Foley, J.D.: Visualizing the World Wide Web with the Navigational View Builder. Computer Networks and ISDN Systems 27, 1075–1087 (1995)

    Article  Google Scholar 

  34. Papakonstantinou, Y., Garcia-Molina, H., Ullman, J.: Medmaker: A mediation system based on declarative specifications. Available at db.stanford.edu /pub/papakonstantinou/1995/medmaker.ps

    Google Scholar 

  35. Papakonstantinou, Y., Garcia-Molina, H., Widom, J.: Object exchange across heterogeneous information sources. In: Int’l Conf. on Data Engineering (1995)

    Google Scholar 

  36. Pitkow, J.E., Bharat, K.A.: WebViz: A tools for World Wide Web access log analysis. In: Proc. 1st Int’l World Wide Web Conf., Geneva, Switzerland (May 1994), http://www1.cern.ch/PapersWWW94/pitkow-webvis.ps

  37. Quass, D., Rajaraman, A., Sagiv, Y., Ullman, J.D., Widom, J.: Querying Semi-structured Heterogeneous Information. In: Ling, T.W., Mendelzon, A.O., Vieille, L. (eds.) Proc. 4th Int. Conf., DOOD 1995, December 1995. LNCS, vol. 1013, pp. 319–344. Springer, Heidelberg (1995)

    Google Scholar 

  38. Rajaraman, A., Sagiv, Y., Ullman, J.D.: Answering Queries Using Templates With Binding Patterns. In: Proc. 14th PODS (1995)

    Google Scholar 

  39. Shoens, K., Luniewski, A., Schwartz, P., Stamos, J., Thomas, J.: The rofus system: Information organization for semi-structured data. In: Proc. of the 19th Int. conf. on Very Large Databases, VLDB 1993, pp. 97–107 (1993)

    Google Scholar 

  40. Slonim, N., Tishby, N.: Automatic statistical categorization and segmentation of text, HUJI Technical Report (to appear)

    Google Scholar 

  41. Subrahmanian, V.S., Adali, S., Brink, A., Emery, R., Lu, J., Rajput, A., Rogers, T., Ross, R., Ward, C.: HERMES: A Heterogeneous Reasoning and Mediator System. Tech. Report, U. of Maryland (1995)

    Google Scholar 

  42. Walker, J.: HTML Converters. In (1994), http://www2.ncsu.edu/bae/people/faculty/walker/hotlist/htmlconv.html

  43. Yahoo Inc. Yahoo: Main page (1996), http://www.yahoo.com

  44. W3QS Home Page, http://www.cs.technion.ac.il/~konop/w3qs.html

  45. The W3QS System, http://www.cs.technion.ac.il/~W3QS

  46. PERLCOND Home Page, http://www.cs.technion.ac.il/~konop/perlcond.html

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1999 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Beeri, C. et al. (1999). WebSuite—A Tool Suite for Harnessing Web Data. In: Atzeni, P., Mendelzon, A., Mecca, G. (eds) The World Wide Web and Databases. WebDB 1998. Lecture Notes in Computer Science, vol 1590. Springer, Berlin, Heidelberg. https://doi.org/10.1007/10704656_10

Download citation

  • DOI: https://doi.org/10.1007/10704656_10

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-65890-0

  • Online ISBN: 978-3-540-48909-2

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics