Abstract
This paper proposes a strategy to personalized the Internet searching, which would help to filter, extract and integrate the massive information from the web based on the specific user requirements in the hopes that it can relieve them from the tedious process of manually selecting and retrieving the relevant information as well as the confusion caused by the inconsistencies of the information. The strategy proposed in this paper has been applied to the searching of the laptop product information and the result shows a much less human effort involved and a much more accurate price range.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Fard, A.M., Ghaemi, R., Mohammad, R., Akbarzadeh, T., Kavosh, A.H.: An intelligent neuro-fuzzy search engine. Intelligent Systems Design and Application (4), 597–602 (2007)
Hou, J.: Research on Design of an Auotomatic Evaluation System of search engine. Future Computer and Communication, 16–18 (2009)
Wang, X.-Y., Hu, Q.-S., Li, B., Zhuang, Z.-Q.: A system of personalized intelligent information retrieval for Internet. Journal of Computer Research and Development 36(9), 1039–1046 (1999)
Wu, X.-J.: Research of named entity recognition and automatic pattern acquisition in information extraction. Northeastern University, Shenyang (2004)
Soderland, S.: Learning information extraction rules for semi-structured and Free Text. Machine Learning 34(1-3), 233–272 (1999)
Muslea, I., Minton, S., Knolock, C.: Hierarchical wrapper induction for semi-structured information sources. Autonomous Agents and Multi-Agent System 4(1/2), 93–114 (2001)
Knoblock, C.A., Kristina, L., et al.: Accurately and reliably extraction data from: A machine learning approach. Data Engineering Bulletin 23(4), 33–41 (2000)
Muslea, I., Minton, S., Craig, A., et al.: Active learning for hierarchical wrapper induction. In: Proceedings of the Sixteenth National Conference on Artificial Intelligence and Eleventh Conference on Innovative Applications of Artificial Intelligence, Orlando, Florida, USA (1999)
Muslea, I., Minton, S., Craig, A., et al.: A hierarchical approach to wrapper induction. In: Proceedings of the Third International Conference on Autonomous Agents, Washington, USA (1999)
Embley, D., Campbell, D., Jiang, S., et al.: Conceptual-model-based data extraction from multiple record web pages. Data and Knowledge Engineering 31(3), 227–251 (1999)
Crescenzi, V., Mecca, G.: RoadRunner: towards automatic data extraction from large Web sites. In: Proceedings of the 27th International Conference on Very Large Database, Roma, Italy (2001)
Doorenbos, R., Etzinoni, O., Weld, D.: A scalable comparison-shopping agent for the world-wide web. In: Proceeding of the AAAI 15th National Conference on Artificial Intelligence (1998)
Embley, R., Xu, L.: Locating and reconfiguring records in unstructured multiple-record web documents. In: Suciu, D., Vossen, G. (eds.) WebDB 2000. LNCS, vol. 1997, p. 256. Springer, Heidelberg (2001)
Lu, R.: The Research of XML-Based Web information extraction. Dalian Maritime University, Dalian
Chunying, K.: DOM-based web page to Detemine the structure of the similatity Algorithm. Intelligent Information Technology Application, 245–248 (2009)
Intelligence Fusion Pushed. Auiation Week and Space Technology, 205-211 (1979)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Jiang, Y., Wu, Z., Zhan, Z., Xu, L. (2011). A Research of the Internet Based on Web Information Extraction and Data Fusion. In: Luo, X., Cao, Y., Yang, B., Liu, J., Ye, F. (eds) New Horizons in Web-Based Learning - ICWL 2010 Workshops. ICWL 2010. Lecture Notes in Computer Science, vol 6537. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20539-2_22
Download citation
DOI: https://doi.org/10.1007/978-3-642-20539-2_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20538-5
Online ISBN: 978-3-642-20539-2
eBook Packages: Computer ScienceComputer Science (R0)