Pattern-Based Energy Consumption Analysis by Chaining Principle Component Analysis and Logistic Regression

  • Seog-Chan OhEmail author
  • Alfred J. Hildreth
Part of the Springer Series in Advanced Manufacturing book series (SSAM)


It is often required to carry out sensor-based condition monitoring for machines or operations (e.g., machining centre, foundry) during production to ensure the effectiveness. Due to the requirements of a non-invasive installation or no interruption during production, however, it may be difficult to fully instrument the machine or production equipment with monitoring sensors. As an alternative to the direct monitoring, it is possible to use energy power or temperature data, and other easy-to-install sensors measured with relatively high time resolution (~2 s) to provide enough information to effectively infer events and other properties. From this reason, the ability of inferring becomes important. To introduce how the inferencing technology can be used in the energy management, this chapter presents a pattern-based energy consumption analysis by chaining Principle Component Analysis (PCA) and logistic regression. The PCA provides an unsupervised dimension reduction to mitigate the issue of multicollinearity (high dependence) among the explanatory variables, while the logistic regression does the prediction based on the reduced dataset expressed in orthogonal axes that are uncorrelated principle components represented by Eigenvectors found in the PCA. By chaining the PCA and logistic regression, it is possible to train manually time-logged energy data and to infer the events associated with the manufacturing operations. It is expected that the proposed analysis method will enable manufacturing companies to correlate energy and operations and further use the power data to predict when operation events of interest (e.g. start up, idle, peak operation, etc.) occur, resulting in determining how current energy usage levels in manufacturing operations compares to the optimal usage patterns. This chapter also provides a short instruction to Python and IPython Notebook. It illustrates a supervised learning process by using Python to carry out pipelining PCA and logistic regression and applying a grid search to training and inference energy consumption patterns.


Logistic Regression Principle Component Analysis Multinomial Logistic Regression Multinomial Logistic Regression Model Manufacturing Operation 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. Aguilera AM, Escabias M, Valderrama MJ (2006) Using principal components for estimating logistic regression with high-dimensional multicollinear data. Comput Stat Data Anal 50:1905–1924MathSciNetCrossRefzbMATHGoogle Scholar
  2. Cangelosi R, Goriely A (2007) Component retention in principal component analysis with application to cDNA microarray data. Available online: Accessed 18 August 2015
  3. Camminatiello I, Lucadamo A (2010) Estimating multinomial logit model with multicollinear data. Asian J Math Stat 3:93–101MathSciNetCrossRefGoogle Scholar
  4. Dahmus JB, Gutowski TC (2004) An environmental analysis of machining. In: ASME international mechanical engineering congress and RD&D ExpoGoogle Scholar
  5. Fang K, Uhan N, Zhao F, Sutherland WJ (2011) A new approach to scheduling in manufacturing for power consumption and carbon footprint reduction. J Manuf Syst 30:234–240CrossRefGoogle Scholar
  6. Greene WH (2012) Econometric analysis, 7th edn. Prentice Hall, New JerseyGoogle Scholar
  7. Gutowski TC, Murphy C, Allen D, Bauer D, Bras B, Piwonka T, Sheng P, Sutherland J, Thurston D, Wolff E (2005) Environmentally benign manufacturing: observations from Japan, Europe and the United States. J Cleaner Prod 13:1–17CrossRefGoogle Scholar
  8. Oh S-C, Hidreth AJ (2013) Decisions on energy demand response option contracts in smart grids based on activity-based costing and stochastic programming. Energies 6:425–443CrossRefGoogle Scholar
  9. Oh S-C, Hidreth AJ (2014) Estimating the technical improvement of energy efficiency in the automotive industry—stochastic and deterministic frontier benchmarking approaches. Energies 9:6198–6222Google Scholar
  10. Oh SC, D’Arcy JB, Arinez JF, Biller SR, Hidreth AJ (2011) Assessment of energy demand response options in smart grid utilizing the stochastic programming approach. In: Proceedings of IEEE power and energy society general meeting, Detroit, MI, USA, 24–28 JulyGoogle Scholar
  11. Raschka S (2014) Implementing a Principal Component Analysis (PCA) in Python step by step. Available online: Accessed 12 August 2015
  12. Scikit-Learn Machine Learning in Python. Available online: Accessed 6 August 2015
  13. Smith LI (2002) A tutorial on Principal Components Analysis. Available online: Accessed 11 August 2015
  14. Train KE (2003) Discrete choice methods with simulation. Cambridge University Press, Cambridge, USACrossRefzbMATHGoogle Scholar
  15. Turk M, Pentland A (1991) Eigenfaces for recognition. J Cogn Neurosci 3(1):71–86CrossRefGoogle Scholar
  16. Weisstein E (2014) K-Means clustering algorithm. Available online: Accessed 18 August 2015

Copyright information

© Springer International Publishing Switzerland 2016

Authors and Affiliations

  1. 1.TroyUSA
  2. 2.RochesterUSA

Personalised recommendations