A Meta-search Engine with Hits Redundancies Filtering

  • Suet Peng Yong
  • Wan Nordiana Wan Abd Kadir
Part of the Lecture Notes in Electrical Engineering book series (LNEE, volume 4)

Searching for information on the Web can be extremely easy or extremely difficult, because the WWW is not indexed like a library catalog or a journal-article index. When we search the Web, we are not searching it directly; we are searching the Web pages that a search tool has collected and indexed from the computers all over the world that hold the actual pages. Examples of search tools are Yahoo! Search, Google, and Alta Vista.

The different types of search tools have their own strengths and weaknesses, and depending on your information needs, one may work better for you than another. Search directories are hierarchical databases of references to websites; the websites included are classified according to the rules of that particular search service. Search engines, on the other hand, use software to crawl the Internet and match pages against the terms or keywords you supply. Specialized databases are the hidden parts of the WWW that regular search engines normally do not find [1]. This paper extends the work on the Integrated Filtered Web-Search Engine (IFWSE) [2] by enhancing the program structure used to retrieve the search hits. Further experiments were also carried out to compare its performance with that of a well-established meta-search engine, MetaCrawler [3] [4].
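The idea of filtering redundant hits when several engines are queried can be illustrated with a minimal sketch. This is not the IFWSE implementation, which the abstract does not describe; the function names `normalize_url` and `merge_hits`, the engine labels, and the URL-normalization rule are all assumptions made for illustration only.

```python
from urllib.parse import urlsplit


def normalize_url(url):
    """Build a comparison key for a URL.

    Hypothetical rule (not from the paper): ignore the scheme, the case of
    the host, a leading "www.", and a trailing slash on the path.
    """
    parts = urlsplit(url.strip())
    host = parts.netloc.lower()
    if host.startswith("www."):
        host = host[4:]
    key = host + parts.path.rstrip("/")
    if parts.query:
        key += "?" + parts.query
    return key


def merge_hits(results_by_engine):
    """Merge per-engine hit lists, keeping the first occurrence of each URL.

    results_by_engine maps an engine label to a list of (title, url) pairs.
    Each merged hit records every engine that returned it.
    """
    merged = {}
    for engine, hits in results_by_engine.items():
        for title, url in hits:
            key = normalize_url(url)
            if key in merged:
                # Redundant hit: note the extra source, do not list it twice.
                merged[key]["engines"].append(engine)
            else:
                merged[key] = {"title": title, "url": url, "engines": [engine]}
    return list(merged.values())
```

Calling `merge_hits` with two hypothetical engines that both return the same page under slightly different URLs yields a single entry for that page, with both engine labels attached. A real meta-search engine would likely also need ranking and near-duplicate detection, which this sketch leaves out.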


Keywords: Search Engine, User Query, Alta Vista, Hierarchical Database, Metasearch Engine




  1. Chris S (2005) Internet Search Strategies: Search Tools. Minnesota West Home, Minnesota West Community and Technical College. Available: http://www.mwctc.cc.mn.us/libraries/strategies/tools.htm
  2. Yong SP and Wan Nordiana WAK (2007) Search Engines Integration with Hits Redundancy Filtering. In Proceedings of International Multi Conference of Engineers and Computer Scientists, Hong Kong, International Association of Engineers (IAENG)
  3. Selberg E and Etzioni O (1997) The MetaCrawler Architecture for Resource Aggregation on the Web. IEEE Expert, 12(1): 8–14
  4. Computer Science and Engineering (2001) MetaCrawler/Husky/Grouper. Available: http://www.cs.washington.edu/research/projects/WebWarel/www/metacrawler/
  5. Joe B (2005) Meta-Search Engines. University of California. Available: http://www.lib.berkeley.edu/TeachingLib/Guides/Internet/MetaSearch.html
  6. Copernic (2005) About us – Technologies, Indexing. Available: http://www.copernic.com/en/company/technologies.html
  7. Jensen B, Spink A and Saracevic T (2000) Real life, real users and real needs: a study and analysis of user queries on the Web. Information Processing and Management, 36(2): 207–227
  8. Richard J (2000) Measuring Search Effectiveness. Available: Creighton University Health Sciences Library and Learning Resources Center
  9. Meng W, Yu C and Liu KL (2002) Building Effective and Efficient Metasearch Engine. ACM Computing Surveys, 34(1): 48–89

Copyright information

© Springer Science+Business Media, LLC 2008

Authors and Affiliations

  • Suet Peng Yong
    • 1
  • Wan Nordiana Wan Abd Kadir
    • 1
  1. Universiti Teknologi PETRONAS, Malaysia
