Methodology of Mining Massive Data Sets for Improving Manufacturing Quality/Efficiency

Data Mining for Design and Manufacturing

Part of the book series: Massive Computing (MACO, volume 3)

Abstract

Many enterprises have begun exploring ways to use information stored in various databases to create a competitive edge in managing their supply chains and networked manufacturing processes. This practice requires tools that automatically synthesize large volumes of data to extract the needed knowledge. Although several data mining techniques exist, most are not effective at processing large amounts of data with possibly nonstationary and dynamically changing trends. Our procedure first reduces the massive data sets to smaller ones by using data splitting and other data reduction techniques. Methods traditionally used in data mining, signal/image processing, and statistical analysis can then handle the reduced-size data, and decision rules for identifying and classifying process problems can be constructed from these data to improve manufacturing quality and efficiency. Finally, the results obtained from the split data are integrated by weighted averaging or voting procedures, including artificial neural networks. Our real-life examples show the great potential of the proposed methods for mining knowledge from massive manufacturing data sets and for making a significant impact in many fields, including E-business operations.
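The split, reduce, classify, and integrate steps outlined in the abstract can be illustrated with a toy sketch. This is not the chapter's implementation: the simulated signals, the choice of the signal mean as the reduced feature, the midpoint threshold rule, and all function names are assumptions made purely for illustration, and accuracy-weighted voting stands in for the weighted averaging or voting procedures the abstract mentions.

```python
import random
import statistics


def make_record(rng, length=200):
    """Simulate one process signal (hypothetical data): faulty runs have a shifted mean."""
    faulty = rng.random() < 0.5
    shift = 1.0 if faulty else 0.0
    return [rng.gauss(shift, 1.0) for _ in range(length)], int(faulty)


def split(records, n_parts):
    """Data splitting: deal the records into n_parts interleaved subsets."""
    return [records[i::n_parts] for i in range(n_parts)]


def reduce_signal(signal):
    """Data reduction (assumed here): replace a long signal with one summary feature."""
    return statistics.fmean(signal)


def fit_rule(subset):
    """Fit a threshold decision rule on one reduced subset.

    Returns (threshold, weight), where weight is the rule's accuracy on its own subset.
    """
    feats = [(reduce_signal(s), y) for s, y in subset]
    mean_normal = statistics.fmean(f for f, y in feats if y == 0)
    mean_faulty = statistics.fmean(f for f, y in feats if y == 1)
    threshold = (mean_normal + mean_faulty) / 2
    correct = sum((f > threshold) == bool(y) for f, y in feats)
    return threshold, correct / len(feats)


def weighted_vote(rules, signal):
    """Integrate the per-subset rules by accuracy-weighted voting."""
    f = reduce_signal(signal)
    score = sum(w if f > t else -w for t, w in rules)
    return int(score > 0)


rng = random.Random(0)
train = [make_record(rng) for _ in range(400)]
rules = [fit_rule(part) for part in split(train, n_parts=4)]

held_out = [make_record(rng) for _ in range(100)]
accuracy = sum(weighted_vote(rules, s) == y for s, y in held_out) / len(held_out)
print(f"rules fitted: {len(rules)}, held-out accuracy: {accuracy:.2f}")
```

Each subset is processed independently, so the fitting step could run in parallel or on data that never fits in memory at once; only the small (threshold, weight) pairs need to be brought together for the final vote.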

Copyright information

© 2001 Springer Science+Business Media Dordrecht

About this chapter

Cite this chapter

Lu, JC. (2001). Methodology of Mining Massive Data Sets for Improving Manufacturing Quality/Efficiency. In: Braha, D. (eds) Data Mining for Design and Manufacturing. Massive Computing, vol 3. Springer, Boston, MA. https://doi.org/10.1007/978-1-4757-4911-3_11

  • DOI: https://doi.org/10.1007/978-1-4757-4911-3_11

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-1-4419-5205-9

  • Online ISBN: 978-1-4757-4911-3

  • eBook Packages: Springer Book Archive
