Abstract
Information acquisition and validation play an important role in the decision making process during negotiation. In this chapter we briefly present the framework of a smart data mining system for providing contextual information extracted from the Internet to a negotiation agent. We then present one of its components in more details – an effective automated technique for extracting relevant articles from news web sites, so that they can be used further by the mining agents. Most current techniques experience difficulties in coping with changes in web site structure and formats. The proposed extraction process is completely automatic and independent of web site formats. Proposed technique identifies regularities in both format and content of news web sites. The algorithms are applicable to both single- and multi-document web sites. Since invalid URLs can cause errors in data extraction, we also present a method for the negotiation agent to estimate the validity of the extracted data based on the frequency of the relevant words in the news title. Once the news articles are extracted the next task is to construct sets of given articles. This chapter presents a new procedure for constructing news data sets on given topics. The extracted news data set is further utilised by the parties involved in negotiation. The information retrieved from the data set can support both human and automated negotiators.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Simoff, S.J., Debenham, J.K.: Curious negotiator. In: Klusch, M., Ossowski, S., Shehory, O. (eds.) CIA 2002. LNCS (LNAI), vol. 2446, p. 104. Springer, Heidelberg (2002)
Gerding, E.H., van Bragt, D.D.B., et al.: Multi-issue negotiation processes by evolutionary simulation: validation and social extensions. In: Proceedings Workshop on Complex Behavior in Economics. Aix-en-Provence, France (2000)
Gomes, A., Jehiel, P.: Dynamic process of social and economic interactions: On the persistence of inefficiencies. Journal of Political Economy 113(3), 626–667 (2005)
Kraus, S.: Strategic Negotiation in Multiagent Environments. MIT Press, Cambridge (2001)
Ströbel, M.: Design of Roles and Protocols for Electronic Negotiations. Electronic Commerce Research Journal, Special Issue on Market Design (2001)
Milgrom, P., Weber, R.A.: Theory of Auctions with Competitive Bidding. Econometrica 50(5), 1089–1122 (1982)
Watkins, M.: Breakthrough Business Negotiation-A Toolbox for Managers. Jossey-Bass (2002)
Hand, D., Mannila, H., et al.: Principles of Data Mining. MIT Press, Cambridge (2001)
Franz, M., Ittycheriah, A., et al.: First Story Detection: Combining Similarity and Novelty Based Approaches. In: Topic Detection and Tracking Workshop Report (2001)
Simoff, S.J., Debenham, J.K.: Time-constrained support for decision-making in e-market environments. In: Proceedings of the 6th International Conference of The International Society for Decision Support Systems ISDSS 2001, London, UK, pp. 193–206 (2001)
Chidlovskii, B., Ragetli, J., et al.: Automatic wrapper generation for web search engines. In: Lu, H., Zhou, A. (eds.) WAIM 2000. LNCS, vol. 1846, pp. 399–410. Springer, Heidelberg (2000)
Freitag, D., Kushmerick, N.: Boosted wrapper induction. In: Proceedings of the 17th National Conference on Artificial Intelligence, AAAI 2000 (2000)
Kushmerick, N., Grace, B.: The wrapper induction environment. In: Workshop on Software Tools for Developing Agents, AAAI 1998 (1998)
Kushmerick, N.: Wrapper induction: Efficiency and expressiveness. Artificial Intelligence 118(1-2), 15–68 (2000)
Muslea, I., Minton, S., et al.: STALKER: Learning extraction rules for semistructured, Web-based information sources. In: Proceedings of AAAI 1998 Workshop on AI and Information Integration. AAAI Press, Menlo Park (1998)
Gao, X., Sterling, L.: Semi-structured Data Extraction from Heterogeneous Sources. In: Bratjevik, T., Schwartz, D., Divitini, M. (eds.) Internet-based Knowledge Management and Organizational Memories, pp. 83–102. Idea Group Publishing, USA (2000)
McKeown, K.R., Barzilay, R., et al.: Columbia multi-document summarization: Approach and evaluation. In: Proceedings of the Workshop on Text Summarization, ACM SIGIR Conference, DARPA/NIST Document Understanding Conferences, DUC (2001)
Lin, S.H., Ho, J.M.: Discovering informative content blocks from Web documents. In: Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining KDD 2002. ACM Press, New York (2002)
Salton, G.: Automatic Text Processing: The Transformation, Analysis and Retrieval of Information by Computer. Addison-Wesley, Reading (1989)
Hulth, A., Karlgren, J., Jonsson, A., Boström, H., Asker, L.: Automatic Keyword Extraction Using Domain Knowledge. In: Gelbukh, A. (ed.) CICLing 2001. LNCS, vol. 2004, p. 472. Springer, Heidelberg (2001)
Andrade, M., Valencia, A.: Automatic extraction of keywords from scientific text: application to the knowledge domain of protein families. Bioinformatics (14), 600–607 (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Zhang, D., Simoff, S.J. (2006). Informing the Curious Negotiator: Automatic News Extraction from the Internet. In: Williams, G.J., Simoff, S.J. (eds) Data Mining. Lecture Notes in Computer Science(), vol 3755. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11677437_14
Download citation
DOI: https://doi.org/10.1007/11677437_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-32547-5
Online ISBN: 978-3-540-32548-2
eBook Packages: Computer ScienceComputer Science (R0)