A Research of the Internet Based on Web Information Extraction and Data Fusion

Jiang, Yajun; Wu, Zaoliang; Zhan, Zengrong; Xu, Lingyu

doi:10.1007/978-3-642-20539-2_22

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 6537)

Included in the following conference series:

International Conference on Web-Based Learning

Abstract

This paper proposes a strategy for personalizing Internet search. The strategy filters, extracts, and integrates the massive volume of information on the web according to specific user requirements, with the aim of relieving users of the tedious work of manually selecting and retrieving relevant information and of the confusion caused by inconsistencies in that information. The strategy has been applied to searching for laptop product information, and the results show much less human effort and a much more accurate price range.

Author information

Authors and Affiliations

School of Information Engineering, Guangzhou Panyu Polytechnic College, No.1342, Shiliang Road, Panyu District, 511483, Guangzhou, P.R. China
Yajun Jiang, Zaoliang Wu, Zengrong Zhan & Lingyu Xu

Editor information

Editors and Affiliations

Shanghai University, Xingjian Building, No. 149 Yanchang Road, 200072, Shanghai, China
Xiangfeng Luo
Information Systems and Databases, RWTH Aachen University, Ahornstr. 55, 52056, Aachen, Germany
Yiwei Cao
School of Computer Science and Engineering, University of Electronic Science and Technology of China, No. 2006 Xiyuan Avenue, High-Tech Zone (West), 611731, Chengdu, China
Bo Yang
Knowledge Grid Lab, Hunan University of Science and Technology, 411202, Xiangtan, Hunan, China
Jianxun Liu
School of Computer Engineering and Science, Shanghai University, Xingjian Building, No. 149, Yanchang Road, 200072, Shanghai, China
Feiyue Ye

About this paper

Cite this paper

Jiang, Y., Wu, Z., Zhan, Z., Xu, L. (2011). A Research of the Internet Based on Web Information Extraction and Data Fusion. In: Luo, X., Cao, Y., Yang, B., Liu, J., Ye, F. (eds) New Horizons in Web-Based Learning - ICWL 2010 Workshops. ICWL 2010. Lecture Notes in Computer Science, vol 6537. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20539-2_22

DOI: https://doi.org/10.1007/978-3-642-20539-2_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20538-5
Online ISBN: 978-3-642-20539-2
eBook Packages: Computer Science, Computer Science (R0)