Probabilistic classification techniques to perform geographical labeling of web objects

Article
  • 13 Downloads

Abstract

Web search engines provide relevant documents to answer user query. These result-set documents might also contain redundant information which the user might not require. The user has to invest effort in navigating each document to identify relevant information. To overcome such cumbersome overheads, Web object search engines were proposed. These systems provide powerful vertical search facility, so that, the result set of a query will only contain the relevant Web object information. Many techniques have been proposed to geographically label documents for Web search engines, however, geographical labeling of Web objects has got limited attention. The presence of noise in the Web objects due to inaccurate object extraction process complicates the task of assigning geographical labels. Recently in the literature. Gaussian mixture model oriented classification technique was proposed to achieve geographical labeling of Web objects, even then, there is an ample scope to improve labeling accuracy. In this work, two probabilistic classier namely-Bayesian Classifier and Variational Inference Classifier are utilized to achieve geographical labeling of Web objects. The proposed technique provides at least 30% better labeling accuracy and twice better computational efficiency when compared with the contemporary technique.

Keywords

Cluster classification techniques Web objects Bayesian classification model 

References

  1. 1.
  2. 2.
  3. 3.
    Raper, J.: Geographic relevance. J. Doc. 63(6), 836–852 (2007)CrossRefGoogle Scholar
  4. 4.
    Li, K., Changqing, Z., Shuhui, B., Yun, L., Jian, Z., Minglun, G.: Multi-modal feature fusion for geographic image annotation. Pattern Recognit. 73, 1–14 (2018)CrossRefGoogle Scholar
  5. 5.
    Sathiyamoorthi, V.: Data mining and data warehousing: introduction to data mining and data warehousing. In: Web Data Mining and the Development of Knowledge-Based Decision Support Systems, IGI Global , pp. 312–337 (2017)Google Scholar
  6. 6.
    Tezuka, T., Kondo, H., Tanaka, K.: Estimation of Geographic Relevance for Web objects Using Probabilistic Models. Springer-Verlag, Berlin Heidelberg (2008)CrossRefGoogle Scholar
  7. 7.
    McCurley, K.S.: Geospatial mapping and navigation of the Web. In: Proceedings of the 10th International World Wide Web Conference, Hong kong, China, pp. 221–229 (2001)Google Scholar
  8. 8.
    Gao, W., Lee, H.C., Miao, Y.: Geographically focused collaborative crawling. In: Proceedings of the 15th International World Wide Web Conference, Edinburgh, Scotland, pp. 287–296 (2006)Google Scholar
  9. 9.
    Mei, Q., Liu, C., Su, H., Zhai, C.: A probabilistic approach to spatiotemporal theme pattern mining on weblogs. In: Proceedings of the 15th International World Wide Web Conference, Edinburgh, Scotland, pp. 533–542 (2006)Google Scholar
  10. 10.
    Tezuka, T., Kurashima, T., Tanaka, k.: Toward tighter integration of Web search with a geo graphic information system. In: Proceedings of the 15th World Wide Web Conference, Edin -burgh, Scotland, pp. 277–286 (2006)Google Scholar
  11. 11.
    Gravano, L., Hatzivassiloglou, V., Litchenstein, R.: Categorizing Web queries according to geo- graphical locality. In: Proceedings of the 12th International Conference on Information and Knowledge Management, New Orleans, Lousiana, pp. 325–333(2003)Google Scholar
  12. 12.
    Chen, L., Zhang, L., Jing, F., Deng, k., Ma, W Y.: Ranking Web objects from multiple communities. In: Proceedings of the International Conference on Information and Knowledge Management, Arlington, Virginia, pp. 377–386 (2006)Google Scholar
  13. 13.
    Nie, Z., Ma, Y., Shi, S., Wen, J.R., Ma, W.Y.: Web object retrieval. In: Proceedings of the 16th International World Wide Web Conference, Ban, Canada, pp. 81–90 (2007)Google Scholar
  14. 14.
    Nie, Z., Wen, J R., Ma, W.Y.: Object-level vertical search. In: Proceedings of the 3rd Biennial Conference on Innovative Data Systems Research, Asilomar, California, pp. 235–246 (2007)Google Scholar
  15. 15.
    Buyukkokten, O., Cho, J., Garcia-Molina, H., Gravano, L., shivakumar, N.: Exploiting geographical location information of Web pages. In: Proceedings of the ACM SIGMOD Workshop on the Web and databases, Philadelphia, Pennsylvania (1999)Google Scholar
  16. 16.
    Davis, C.A., Fonseca, F.T.: Assessing the certainty of locations produced by an address geo coding system. Geoinformatica 11(1), 103–129 (2007)CrossRefGoogle Scholar
  17. 17.
    Amitay, E., Har, N., Sivan, R., Soffer, A.: Geotagging web content. In: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Sheffield, United Kingdom, pp. 273–280Google Scholar
  18. 18.
    Lieberman, M.D., Sperling, J.: STEWARD Architecture of a spatio-textual search engine. In: Proceedings of the 15th Annual ACM International Symposium on Advances in Geo-graphic Information Systems, Seattle, Washington, Article No. 25 (2007)Google Scholar
  19. 19.
    Sengar, V., Joshi, T., Joy, Prakash. S.: Toyama K Robust location search from text queries. In: Proceedings of the 15th Annual ACM International Symposium on Advances in Geographic Information Systems, Seattle, Washington, Article No. 24 (2007)Google Scholar
  20. 20.
    Schneider, M.: Geographic data modeling: Fuzzy topological Predicates, their properties and their integration into query languages. In: Proceedings of the 9th ACM International Symposium on advances in geographic information systems, Atlanta, Georgia, pp. 9–14 (2001)Google Scholar
  21. 21.
    Coffman, J., Weaver, A.C.: A framework for evaluating database keyword search strategies. In: Proceedings of the 19th ACM International Conference on Information and Knowledge Management (2010)Google Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  • K. N. AnjanKumar
    • 1
  • S. Chitra
    • 2
  • T. Satish Kumar
    • 3
  1. 1.Anna University ChennaiChennaiIndia
  2. 2.Er. Perumal Manimekalai College of EngineeringHosurIndia
  3. 3.Department of Computer Science and EngineeringGlobal Academy of TechnologyBangaloreIndia

Personalised recommendations