Abstract
We have developed a system that gathers information about a person from the Web, uses it to create a personal history, and presents that history as a chronological table. It simplifies two tasks: separating the information that belongs to different people who share a name (namesakes), and bringing together information from widely scattered sources. The system comprises five components: namesake disambiguation, date expression extraction, date expression normalization and completion, relevant information extraction, and chronological table generation.
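To make the date-related components concrete, the following is a minimal sketch of date expression extraction, normalization and completion, and chronological table generation. It is not the authors' implementation: the English month names, the regular expression, the `Event` type, the function names, and the rule of completing a missing year from the most recently seen year are all assumptions made for illustration. Namesake disambiguation and relevant information extraction are not covered.

```python
import re
from typing import List, NamedTuple, Optional

# Assumption: English month names; the choice of language here is illustrative only.
MONTHS = {name: i + 1 for i, name in enumerate(
    ["January", "February", "March", "April", "May", "June", "July",
     "August", "September", "October", "November", "December"])}
_MONTH_ALT = "|".join(MONTHS)

# Hypothetical pattern: "[Month] YYYY", or a bare month name with no year.
DATE_RE = re.compile(
    r"\b(?:(?P<month>" + _MONTH_ALT + r")\s+)?(?P<year>\d{4})\b"
    r"|\b(?P<month_only>" + _MONTH_ALT + r")\b"
)


class Event(NamedTuple):
    year: int
    month: Optional[int]  # None when the sentence gave only a year
    text: str


def extract_events(sentences: List[str]) -> List[Event]:
    """Extract one normalized date per sentence, completing missing years."""
    events = []
    last_year = None  # context used to complete year-less expressions
    for sentence in sentences:
        m = DATE_RE.search(sentence)
        if m is None:
            continue  # no date expression: sentence is not an event
        if m.group("year"):
            year = int(m.group("year"))
            month = MONTHS[m.group("month")] if m.group("month") else None
            last_year = year
        else:
            if last_year is None:
                continue  # nothing to complete the year from
            # Assumed completion rule: reuse the most recently seen year.
            year = last_year
            month = MONTHS[m.group("month_only")]
        events.append(Event(year, month, sentence))
    return events


def chronological_table(events: List[Event]) -> List[Event]:
    """Sort events by (year, month); year-only events come first in a year."""
    return sorted(events, key=lambda e: (e.year, e.month or 0))


if __name__ == "__main__":
    page = [
        "She joined the company in April 2001.",
        "In June she moved to Kyoto.",
        "She was born in 1975.",
        "She likes tea.",
    ]
    for e in chronological_table(extract_events(page)):
        date = f"{e.year}-{e.month:02d}" if e.month else str(e.year)
        print(date, e.text, sep="\t")
```

In this sketch, "In June" carries no year, so it is completed with 2001 from the preceding sentence, and the undated sentence is dropped before the table is sorted.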
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kimura, R., Oyama, S., Toda, H., Tanaka, K. (2007). Creating Personal Histories from the Web Using Namesake Disambiguation and Event Extraction. In: Baresi, L., Fraternali, P., Houben, GJ. (eds) Web Engineering. ICWE 2007. Lecture Notes in Computer Science, vol 4607. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73597-7_33
DOI: https://doi.org/10.1007/978-3-540-73597-7_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73596-0
Online ISBN: 978-3-540-73597-7
eBook Packages: Computer Science (R0)