Abstract
In text mining, importance indices of terms such as simple frequency, document frequency including the terms, and tf-idf of the terms, play a key role for finding valuable patterns in documents. As for the documents, they are often published daily, monthly, annually, and irregularly for each purpose. Although the purposes of each set of documents are not changed, roles of terms and the relationship among them in the documents change temporally. In order to detect such temporal changes, we decomposed the process into three sub-processes: automatic term extraction, importance index calculation, and temporal trend detection. On the basis of the consideration, we propose a method for detecting temporal trends of technical terms based on importance indices and clustering methods. By focusing on technical phrases, we carried out an experimentation to detect emergent and subsiding trends in a set of research document. The result shows that our method determined the temporal trends of technical phrases related to finding of patterns for innovations of research topics.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Lent, B., Agrawal, R., Srikant, R.: Discovering trends in text databases, pp. 227–230. AAAI Press, Menlo Park (1997)
Kontostathis, A., Galitsky, L., Pottenger, W.M., Roy, S., Phelps, D.J.: A survey of emerging trend detection in textual data mining. A Comprehensive Survey of Text Mining (2003)
Anderberg, M.R.: Cluster Analysis for Applications. Monographs and Textbooks on Probability and Mathematical Statistics. Academic Press, Inc., New York (1973)
Nakagawa, H.: Automatic term recognition based on statistics of compound nouns. Terminology 6(2), 195–210 (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Abe, H., Tsumoto, S. (2009). Detecting Temporal Patterns of Importance Indices about Technical Phrases. In: Velásquez, J.D., RÃos, S.A., Howlett, R.J., Jain, L.C. (eds) Knowledge-Based and Intelligent Information and Engineering Systems. KES 2009. Lecture Notes in Computer Science(), vol 5712. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04592-9_32
Download citation
DOI: https://doi.org/10.1007/978-3-642-04592-9_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04591-2
Online ISBN: 978-3-642-04592-9
eBook Packages: Computer ScienceComputer Science (R0)