Skip to main content

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3334))

Included in the following conference series:

Abstract

This paper presents our implementation techniques for an intelligent Web image search engine. A reference architecture of the system is provided and addressed in this paper. The system includes several components such as a crawler, a preprocessor, a semantic extractor, an indexer, a knowledge learner and a query engine. The crawler traverses web sites in multithread accesses model. And it can dynamically control its access load to a Web server based on the corresponding capacity of the local system. The preprocessor is used to clean and normalize the information resource downloaded from Web sites. In this process, stop-word removing and word stemming are applied to the raw resources. The semantic extractor derives Web image semantics by partitioning combining the associated text. The indexer of the system creates and maintains inverted indices with relational model. Our knowledge learner is designed to automatically acquire knowledge from users’ query activities. Finally, the query engine delivers search results in two phases in order to mine out the users’ feedbacks.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Chakrabarti, S., Berg, M.V.D., Dom, B.: Focused Crawling: a New Approach to Topic-Specific Web Resource Discovery. Computer Networks 31(11-16), 1623–1640 (1999)

    Article  Google Scholar 

  2. Chakrabarti, S., Berg, M.V.D., Dom, B.: Focused Crawling: a New Approach to Topic-Specific Web Resource Discovery. Computer Networks 31(11-16), 1623–1640 (1999)

    Article  Google Scholar 

  3. Chang, S.-K., Hsu, A.: Image information systems: Where do we go from here? IEEE Trans. on Knowledge and Data Eng. 4(5), 431–442 (1992)

    Article  MathSciNet  Google Scholar 

  4. Chen, Z. et al.: Web Mining for Web Image Retrieval. To appear in the special issue of Journal of the American Society for Information Science on Visual Based Retrieval Systems and Web Mining

    Google Scholar 

  5. DOM: http://www.w3.org/DOM/

  6. Gong, Z., Hou U, L., Cheang, C.W.: Web Image Semantic Extractions from its Associated Texts. In: The 8th IASTED International Conference on Internet & Multimedia Systems & Applications, Kauai, Hawaii, USA, August 16-18 (2004)

    Google Scholar 

  7. Harmandas, V., Sanderson, M., Dunlop, M.D.: Image Retrieval By Hypertext Links. In: Proceedings of SIGIR-1997, 20th ACM International Conference on Research and Development in Information Retrieval (1997)

    Google Scholar 

  8. Li, M., Chen, Z., Zhang, H.: Statistical Correlation Analysis in Image Retrieval. Pattern Recognition 35, 2687–2693 (2002)

    Article  MATH  MathSciNet  Google Scholar 

  9. Lin, H.: Discovering Informative Content Blocks from Web Documents. In: ACM SIGKDD 2002, Edmonton, Alberta, Canada, July 23 - 26 (2002)

    Google Scholar 

  10. Porter, M.F.: An Algorithm For Suffix Stripping. Program 14(3), 130–137 (1980)

    Google Scholar 

  11. Stop Word List, http://www.searchengineworld.com/spy/stopwords.htm

  12. Tamura, H., Yokoya, N.: Image database systems: A survey. Patt. Recog 17(1), 29–43 (1984)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Gong, Z., U, L.H., Cheang, C.W. (2004). An Implementation of Web Image Search Engines. In: Chen, Z., Chen, H., Miao, Q., Fu, Y., Fox, E., Lim, Ep. (eds) Digital Libraries: International Collaboration and Cross-Fertilization. ICADL 2004. Lecture Notes in Computer Science, vol 3334. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30544-6_39

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-30544-6_39

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-24030-3

  • Online ISBN: 978-3-540-30544-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics