Abstract
The Technology Opportunities Analysis System (TOAS), being developed under a Defense Advanced Research Projects Agency (DARPA) project, enables mining of text files using bibliometrics. TOAS, a software system, extracts useful information from literature abstract files, which have identified fields that repeat in each abstract record of specific databases, such as Engineering Index (ENGI), INSPEC, Business Index, U.S. Patents, and the National Technical Information Service (NTIS) Research Reports. The TOAS applies various technologies, which include natural language processing (NLP), computational linguistics (CL), fuzzy analysis, latent semantic indexing, and principle components analysis (PCA). This software system combines simple operations (i.e., listing, counting, list comparisons and sorting of search term retrieved consolidated records' field results) with complex matrix manipulations, statistical inference and artificial intelligence approaches to reveal patterns and provide insights from large amounts of information, primarily related to technology-oriented management issues.
The authors apply the TOAS tool on its own root technologies, NLP and computational linguistics-two apparently synonymous terms. These terms, however, when used in a literature search of the same abstract databases, ENGI and INSPEC, provide distinctly different search results with only 10% to 25% search result abstract records overlap. This paper introduces TOAS, summarizes analyses comparing NLP and CL, and then discusses the underlying development implications.
Chapter PDF
Similar content being viewed by others
Keywords
- Singular Value Decomposition
- Speech Recognition
- Principle Component Analysis
- Natural Language Processing
- Logic Programming
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Cunningham, S. (1996), The Content Evaluation of British Scientific Research. Science Policy Research Unit: Brighton, U.K.
Deerwester, S. and S.T. Dumais, G.W. Furnas, T.K. Landauer, D. Harshman (1990), “Indexing by Latent Semantic Analysis,” Journal of the American Society for Information Science, Vol. 41 No. 6, p. 391–407.
Kostoff, R. N. (1993), “Database Tomography for Technical Intelligence,” Competitive Intelligence Review Vol. 5 No. 1, p. 1–9.
Kostoff, R. N. (1994), “Database Tomography: Origins and Applications,” Competitive Intelligence Review, Vol. 5 No. 1, p. 48–55.
Porter, Alan L. and Detampel, Michael, J. (1995), “Technology Opportunities Analysis,” echnology Forecasting and Social Change, Vol. 49, p. 237–255.
Press, W. H. and B. P. Flannery, S.A. Teukolsky, W. T. Vetterling (1986), Numerical Recipes in C. Cambridge University Press: Cambridge.
Watts, Robert J. and Porter, Alan L. (to appear), “Innovation Forecasting,” Technology Forecasting and Social Change.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1997 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Watts, R.J., Porter, A.L., Cunningham, S., Zhu, D. (1997). TOAS intelligence mining; analysis of natural language processing and computational linguistics. In: Komorowski, J., Zytkow, J. (eds) Principles of Data Mining and Knowledge Discovery. PKDD 1997. Lecture Notes in Computer Science, vol 1263. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-63223-9_131
Download citation
DOI: https://doi.org/10.1007/3-540-63223-9_131
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-63223-8
Online ISBN: 978-3-540-69236-2
eBook Packages: Springer Book Archive