A Research of the Internet Based on Web Information Extraction and Data Fusion

  • Conference paper
New Horizons in Web-Based Learning - ICWL 2010 Workshops (ICWL 2010)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 6537)

Abstract

This paper proposes a strategy for personalizing Internet search. The strategy filters, extracts, and integrates the massive volume of information on the web according to specific user requirements, with the aim of relieving users of the tedious process of manually selecting and retrieving relevant information and of the confusion caused by inconsistencies in that information. The strategy has been applied to searching for laptop product information, and the results show much less human effort and a much more accurate price range.
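
The abstract does not say how the integration step works, so the following is only a minimal sketch of one way that inconsistent price listings from several sites might be fused into a single price range. The `(product, site, price)` schema, the function name `fuse_price_range`, the `tolerance` parameter, and the median-based consistency check are all assumptions made for illustration, not the authors' method.

```python
from statistics import median_low


def fuse_price_range(listings, tolerance=0.5):
    """Fuse per-site price listings into one (low, high) range per product.

    listings:  iterable of (product, site, price) tuples (hypothetical schema).
    tolerance: a price further than this fraction from the product's low
               median is treated as inconsistent and dropped (assumed rule).
    """
    # Group the extracted prices by product, ignoring which site they came from.
    by_product = {}
    for product, _site, price in listings:
        by_product.setdefault(product, []).append(price)

    fused = {}
    for product, prices in by_product.items():
        # median_low always returns an actual listing, so at least one
        # price survives the consistency check below.
        mid = median_low(prices)
        kept = [p for p in prices if abs(p - mid) <= tolerance * mid]
        fused[product] = (min(kept), max(kept))
    return fused


# Hypothetical listings: three sites roughly agree, one is clearly inconsistent.
listings = [
    ("laptop-x", "site-a", 1000.0),
    ("laptop-x", "site-b", 1050.0),
    ("laptop-x", "site-c", 990.0),
    ("laptop-x", "site-d", 10.0),
]
price_ranges = fuse_price_range(listings)
```

In this sketch the outlying listing from the hypothetical `site-d` is dropped before the range is computed, which is one simple way to obtain a tighter price range than the raw minimum and maximum.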


© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Jiang, Y., Wu, Z., Zhan, Z., Xu, L. (2011). A Research of the Internet Based on Web Information Extraction and Data Fusion. In: Luo, X., Cao, Y., Yang, B., Liu, J., Ye, F. (eds) New Horizons in Web-Based Learning - ICWL 2010 Workshops. ICWL 2010. Lecture Notes in Computer Science, vol 6537. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20539-2_22

  • DOI: https://doi.org/10.1007/978-3-642-20539-2_22

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-20538-5

  • Online ISBN: 978-3-642-20539-2

  • eBook Packages: Computer Science, Computer Science (R0)