A Localness-Filter for Searched Web Pages

  • Qiang Ma
  • Chiyako Matsumoto
  • Katsumi Tanaka
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2642)


With the spreading of the Internet, information about our daily life and our residential region is becoming to be more and more active on the WWW (World Wide Web). That’s to say, there are a lot of Web pages, whose content is ‘local’ and may only interest residents of a narrow region. The conventional information retrieval systems and search engines, such as Google[1], Yahoo[2], etc., are very useful to help users finding interesting information. However, it’s not yet easy to find or exclude ‘local’ information about our daily life and residential region. In this paper, we propose a localness-filter for searched Web pages, which can discover and exclude information about our daily life and residential region from the searched Web pages. We compute the localness degree of a Web page by 1) estimating its region dependence: the frequency of geographical words and the content coverage of this Web page, and 2) estimating the ubiquitousness of its topic: in other words, we estimate if it is usual information that appears everyday and everywhere in our daily life.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. [1]
    Google., 2002.
  2. [2]
    Yahoo! Japan., 2002.
  3. [3]
    Qiang Ma, Kazutoshi Sumiya, and Katsumi Tanaka. Information filtering based on time-series features for data dissemination systems (in Japanese). IPSJ TOD7, 41(SIG6(TOD7)):46–57, 2000.Google Scholar
  4. [4]
    Qiang Ma, Shinya Miyazaki, and Katsumi Tanaka. Webscan: Discovering and notifying important changes of web sites. proc. of DEXA2001, LNCS 2113, pages 587–598, 2001.Google Scholar
  5. [5]
    Shinya Miyazaki, Qiang Ma, and Katsumi Tanaka. Webscan: Content-based change discovery and broadcast-notification of web sites (in Japanese). IPSJ TOD10, 42(SIG8(TOD10)):96–107, 2001.Google Scholar
  6. [6]
    Yahoo!regional., 2002.
  7. [7]
    MACHIgoo., 2002.
  8. [8]
    Chiyako Matsumoto, Ma Qiang, and Katsumi Tanaka. Web information retrieval based on the localness degree. proc. of DEXA 2002, LNCS 2453, pages 172–181, 2002.Google Scholar
  9. [9]
    Antonin Guttman. R-trees: A dynamic index structure for spatial searching. Proc. ACM SIGMOD Conference on Management of Data, 14(2):47–57, 1984.CrossRefGoogle Scholar
  10. [10]
    Carlo Zaniolo, Stefano Ceri, Christos Faloutsos, Richard T. Snodgrass, V. S. Subrahmanian, and Roberto Zicari. Advanced Database Systems. The Morgan Kaufmann, 1997.Google Scholar
  11. [11]
    Nobuyuki Miura, Katsumi Takahashi, Seiji Yokoji, and Kenichi Shima. Location oriented information integration — mobile info search 2 experiment — (in Japanese). The 57th National Convention of IPSJ, 3: 637–638, 1998.Google Scholar
  12. [12]
    KOKONONET., 2002.
  13. [13]
    Orkut Buyukkokten, Junghoo Cho, Hector Garcia-Molina, Luis Gravano, and Narayanan Shivakumar. Exploiting geographical location information of web pages. proc. of WebDB (Informal Proceedings), pages 91–96, 1999.Google Scholar
  14. [14]
    Daniel Egnor. Google programing contest, 2002.Google Scholar
  15. [15]
    T. Takeda. The latitude / longitude position database of all-prefectures cities, towns and villages in japan, 2000.Google Scholar
  16. [16]
    Google Web API., 2002.
  17. [17], 2002.
  18. [18]

Copyright information

© Springer-Verlag Berlin Heidelberg 2003

Authors and Affiliations

  • Qiang Ma
    • 1
  • Chiyako Matsumoto
    • 2
  • Katsumi Tanaka
    • 1
  1. 1.Graduate School of InformaticsKyoto UniversityKyotoJapan
  2. 2.Graduate School of Science and TechnologyKobe UniversityKobeJapan

Personalised recommendations