Semi-automatic Content Extraction from Specifications

Thirunarayan, Krishnaprasad; Berkovich, Aaron; Sokol, Dan

doi:10.1007/3-540-36271-1_4


Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2553)

Included in the following conference series:

International Conference on Application of Natural Language to Information Systems


Abstract

Specifications are critical to companies involved in complex manufacturing. The constant reading, reviewing, and analysis of materials and process specifications is extremely labor-intensive, quality-impacting, and time-consuming. A conceptual design for a tool that provides computer assistance in the interpretation of specification requirements has been created, and a strategy for semantic markup, which is the overlaying of abstract syntax (“the essence”) on the text, has been developed. The solution is based on Information Extraction techniques and XML technology, and it captures the specification content within a semantic ontology. The working prototype of the tool being built will serve as the foundation for potential full-scale commercialization.
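As a rough illustration of what overlaying markup on specification text could look like, the sketch below tags temperature values in a sentence with XML elements while leaving the original wording intact. It is not the authors' tool: the element names (`requirement`, `temperature`), the `deg C`/`deg F` pattern, and the function `mark_up` are all hypothetical, chosen only to make the idea concrete.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical extraction pattern: a number followed by "deg C" or "deg F".
TEMPERATURE = re.compile(r"(?P<value>\d+(?:\.\d+)?)\s*deg\s*(?P<unit>[CF])\b")


def mark_up(sentence: str) -> ET.Element:
    """Wrap a sentence in <requirement>, tagging each temperature it mentions.

    The original text is preserved: concatenating all text nodes of the
    result yields the input sentence unchanged.
    """
    root = ET.Element("requirement")
    root.text = ""
    last = None  # most recently created <temperature> element
    pos = 0      # end of the previous match in the sentence
    for m in TEMPERATURE.finditer(sentence):
        gap = sentence[pos:m.start()]
        if last is None:
            root.text = gap   # text before the first tagged span
        else:
            last.tail = gap   # text between two tagged spans
        last = ET.SubElement(
            root, "temperature", value=m.group("value"), unit=m.group("unit")
        )
        last.text = m.group(0)  # keep the matched wording as element content
        pos = m.end()
    rest = sentence[pos:]
    if last is None:
        root.text = rest
    else:
        last.tail = rest
    return root


if __name__ == "__main__":
    example = "Heat the part to 980 deg C and hold, then cool below 540 deg C."
    print(ET.tostring(mark_up(example), encoding="unicode"))
```

Keeping the matched wording as element content and storing the extracted values as attributes means the markup sits on top of the text rather than replacing it, which is one plausible reading of "overlaying" in the abstract.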

This work was supported in part by NSF SBIR Phases I and II Grants DMI-0078525 (1999–2002). It does not necessarily reflect the opinions of NSF.



Author information

Authors and Affiliations

Department of Computer Science and Engineering, Wright State University, 3640 Col. Glenn Hwy, Dayton, Ohio 45435, USA
Krishnaprasad Thirunarayan
Cohesia Corporation, First National Plaza, Suite 1414, Dayton, Ohio 45402, USA
Aaron Berkovich & Dan Sokol


Editor information

Editors and Affiliations

Department of Computer and Systems Sciences, Royal Institute of Technology, Forum 100, 16440 Kista, Sweden
Birger Andersson, Maria Bergholtz & Paul Johannesson


About this paper

Cite this paper

Thirunarayan, K., Berkovich, A., Sokol, D. (2002). Semi-automatic Content Extraction from Specifications. In: Andersson, B., Bergholtz, M., Johannesson, P. (eds) Natural Language Processing and Information Systems. NLDB 2002. Lecture Notes in Computer Science, vol 2553. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36271-1_4


DOI: https://doi.org/10.1007/3-540-36271-1_4
Published: 28 February 2003
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00307-6
Online ISBN: 978-3-540-36271-5
eBook Packages: Springer Book Archive
