Towards the Extraction of Intelligence about Competitor from the Web

  • Jie Zhao
  • Peiquan Jin
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5736)


In this paper we present a system framework for the extraction of intelligence about competitor from the Web. With the surprising increasing of the data volume in the Web, how to get useful intelligence about competitor has been an interesting issue. Previous study shows that most people prefer to look up information by competitor. We first analyze the requirements on the extraction of competitor intelligence from the Web and define three types of intelligence for competitor. And then a system framework to extract competitor intelligence from the Web is described. We discuss the three key issues of the system in detail, which are the profile intelligence extraction, the events intelligence extraction, and the relations intelligence extraction. Some new techniques to deal with those issues are introduced in the paper.


competitive intelligence competitor Web intelligence extraction 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    LaMar, J.: Competitive Intelligence Survey Report (2007),
  2. 2.
    Thompson, S., Wing, C.Y.: Assessing the Impact of Using the Internet for Competitive Intelligence. Information & Management 39(1), 67–83 (2001)CrossRefGoogle Scholar
  3. 3.
    Mikroyannidis, A., Theodoulidis, B., Persidis, A.: PARMENIDES: Towards Business Intelligence Discovery from Web Data. In: Proc. of IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006), pp. 1057–1060 (2006)Google Scholar
  4. 4.
    Kahaner, L.: Competitive Intelligence. Simon & Schuster, New York (1996)Google Scholar
  5. 5.
    Hotho, A., Nürnberger, A., Paass, G.: A Brief Survey of Text Mining. LDV Forum (LDVF) 20(1), 19–62 (2005)Google Scholar
  6. 6.
    Khoury, I., El-Mawas, R.M., El-Rawas, O., et al.: An Efficient Web Page Change Detection System Based on an Optimized Hungarian Algorithm. IEEE Transaction on Knowledge Data Engineering (TKDE) 19(5), 599–613 (2007)CrossRefGoogle Scholar
  7. 7.
  8. 8.
    Whitelaw, C., Kehlenbeck, A., Petrovic, N., et al.: Web-scale Named Entity Recognition. In: Proc. of CIKM 2008, pp. 123–132 (2008)Google Scholar
  9. 9.
    Sundheim, M.: Named Entity Task Definition-Version 2.1. In: Proc. of the Sixth Message Understanding Conference, pp. 319–332 (1995)Google Scholar
  10. 10.
    Khalid, M., Jijkoun, V., de Rijke, M.: The impact of named entity normalization on information retrieval for question answering. In: Macdonald, C., Ounis, I., Plachouras, V., Ruthven, I., White, R.W. (eds.) ECIR 2008. LNCS, vol. 4956, pp. 705–710. Springer, Heidelberg (2008)CrossRefGoogle Scholar
  11. 11.
  12. 12.
    Sun, B., Mitra, P., Giles, C.L., et al.: Topic segmentation with shared topic detection and alignment of multiple documents. In: Proc. of SIGIR, pp. 199–206 (2007)Google Scholar
  13. 13.
    Wong, K.F., Xia, Y., Li, W., et al.: An Overview of Temporal Information Extraction. Int. Journal of Computer Processing of Oriental Languages 18(2), 137–152 (2005)CrossRefGoogle Scholar
  14. 14.
  15. 15.
  16. 16.
    ACE (Automatic Content Extraction) English Annotation Guidelines for Relations, Version 6.2, Linguistic Data Consortium (2008),

Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • Jie Zhao
    • 1
    • 2
  • Peiquan Jin
    • 3
  1. 1.School of Industry and AdministrationAnhui UniversityHefeiChina
  2. 2.School of ManagementUniversity of Science and Technology of ChinaHefeiChina
  3. 3.Department of Computer Science and TechnologyUniversity of Science and Technology of ChinaHefeiChina

Personalised recommendations