Advertisement

Discussion Data Analytics

  • Katashi NagaoEmail author
Chapter

Abstract

Evidence-based research, such as research on big data applications, has been receiving much attention and has led to the proposal of techniques for improving the quality of life by storing and analyzing data on daily activities in large quantities. These types of techniques have been applied in the education sector, but a crucial problem remains to be overcome: it is generally difficult to record intellectual activities and accumulate and analyze such data on a large scale. Since this kind of data is not possible to compress in a manner, such as taking the average, it is necessary to maintain the original data as the instances of cases. Such human intellectual-activity data should be treated as big data in the near future. We have been developing a discussion mining system that records face-to-face meetings in detail, analyzes their content, and conducts knowledge discovery. Looking back on past discussion content by browsing documents, such as minutes, is an effective means for conducting future activities. In meetings at which some research topics are regularly discussed, such as seminars in laboratories, the presenters are required to discuss future issues by checking urgent matters from the discussion records. We call statements including advice or requests proposed at previous meetings “task statements” and propose a method for automatically extracting them. With this method, based on certain semantic attributes and linguistic characteristics of statements, a statistical machine learning model is created using logistic regression analysis. A statement is judged whether it is a task statement according to its probability. We also developed a method that maintains the extraction accuracy by using the discussion mining system and its extension on the basis of task statement extraction over a long period. Specifically, we constructed an initial discriminant model of task statements and then applied active learning to new meeting minutes to improve the extraction accuracy. Active learning also has the advantage of reducing labeling costs in supervised machine learning. We explain the improvement in extraction accuracy and reduction in labeling costs with our method and confirm its effectiveness through simulations we conducted.

Keywords

Discussion mining Discussion structure Summarization Task discovery Statistical machine learning Logistic regression analysis Active learning 

References

  1. R. Barzilay, M. Lapata, Modeling local coherence: an entity-based approach. Comput. Linguist. 34(1), 1–34 (2008)CrossRefGoogle Scholar
  2. K. Georgala, A. Kosmopoulos, and G. Paliouras, Spam Filtering: An Active Learning Approach using Incremental Clustering, in Proceedings of the 4th International Conference on Web Intelligence, Mining and Semantics, No. 23 (2014)Google Scholar
  3. D. D. Lewis, W. A. Gale, A Sequential Algorithm for Training Text Classifiers, in Proceedings of the ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 3–12 (ACM/Springer, 1994)Google Scholar
  4. A. Liu, L. Reyzin, B. D. Ziebart, Shift-Pessimistic Active Learning using Robust Bias-Aware Prediction, in Proceedings of the AAAI Conference on Artificial Intelligence (2015)Google Scholar
  5. T. Mikolov, I. Sutskever, K. Chen, G. Corrado, J. Dean, Distributed Representations of Words and Phrases and their Compositionality, in Advances in Neural Information Processing Systems, ed by Burges, C.J.C., Bottou, L., Welling, M., Ghahramani, Z., Weinberger, K.Q., pp. 3111–3119 (2013)Google Scholar
  6. T. Mikolov, K. Chen, G. Corrado, J. Dean, Efficient Estimation of Word Representations in Vector Space. arXiv preprint, arXiv:1301.3781 (2013)
  7. K. Nagao, K. Hasida, Automatic Text Summarization Based on the Global Document Annotation, in Proceedings of the Seventeenth International Conference on Computational Linguistics (COLING-98), pp. 917–921 (1998)Google Scholar
  8. K. Nagao, K. Inoue, N. Morita, S. Matsubara, Automatic Extraction of Task Statements from Structured Meeting Content, in Proceedings of the 7th International Conference on Knowledge Discovery and Information Retrieval (KDIR 2015) (2015)Google Scholar
  9. K. Nagao, K. Kaji, D. Yamamoto, H. Tomobe, Discussion Mining: Annotation-Based Knowledge Discovery from Real World Activities, in Advances in Multimedia Information Processing—PCM 2004, LNCS, Vol. 3331, pp. 522–531 (Springer, 2005)Google Scholar
  10. W. Nakano, T. Kobayashi, Y. Katsuyama, S. Naoi, H. Yokota, Treatment of Laser Pointer and Speech Information in Lecture Scene Retrieval, in Proceedings of the 8th IEEE International Symposium on Multimedia 2006, pp. 927–932 (2006)Google Scholar
  11. T. Nishida, Conversation quantization for conversational knowledge process. Int. J. Comput. Sci. Eng. 3(2), 134–144 (2007)Google Scholar
  12. N. Roy, A. Mccallum, Toward Optimal Active Learning through Monte Carlo Estimation of Error Reduction, in Proceedings of the 18th International Conference on Machine Learning, pp. 441–448 (ICML 2001)Google Scholar
  13. B. Settles, M. Craven, An Analysis of Active Learning Strategies for Sequence Labeling Tasks, in Proceedings of the Conference on Empirical Methods in Natural Language Processing (Association for Computational Linguistics, 2008)Google Scholar
  14. B. Settles, Active Learning Literature Survey, Computer Sciences Technical Report 1648, University of Wisconsin-Madison (2010)Google Scholar
  15. H. Shimodaira, Improving Predictive Inference under Covariate Shift by Weighting the Log-Likelihood Function. J. Stat. Plann. Infer. 90, 227–244 (2000)MathSciNetCrossRefGoogle Scholar
  16. M. Sugiyama, M. Kawanabe, Machine Learning in Non-Stationary Environments: Introduction to Covariate Shift Adaptation (The MIT Press, 2012)Google Scholar
  17. T. Schultz, A. Waibel, M. Bett, F. Metze, Y. Pan, K. Ries, T. Schaaf, H. Soltau, W. Martin, H. Yu, K. Zechner, The ISL Meeting Room System, in Proceedings of the Workshop on Hands-Free Speech Communication (HSC-2001) (2001)Google Scholar

Copyright information

© Springer Nature Singapore Pte Ltd. 2019

Authors and Affiliations

  1. 1.Nagoya UniversityNagoyaJapan

Personalised recommendations