Abstract
Large sets of data (numerical, textural and image) have been accumulating in all aspects of our lives for a long time. Advances in sensor technology, the Internet, social networks, wireless communication, and inexpensive memory have all contributed to an explosion of “Big Data,” as part of thrust in robust intelligence. Big data is created in many ways in today’s highly inter-connected world. Social networks, system of systems (SoS: complex interoperable) systems and wireless systems are only some of the platforms creating big data. Recent efforts have developed a promising approach, called “Data Analytics”, which uses statistical and computational intelligence (CI) tools such as principal component analysis (PCA), clustering, fuzzy logic, neuro-computing, evolutionary computation, Regression analysis, Bayesian networks, etc. to reduce the size of “Big Data” to a manageable size and apply these tools to (a) extract information, (b) build a knowledge base using the derived data, and (c) eventually develop a non-parametric model for the “Big Data”. This chapter attempts to construct a bridge between SoS and Data Analytics to develop reliable models for such systems. One of the recent most promising data analytic too is “Deep Learning”. Deep learning is the broad term for the recent development and extensions of neural networks in the machine learning community, which has allowed for state of the art results in speech, image, and natural language processing tasks. Hierarchical learning is an area of research which focuses on learning high order representations from low level data. Learning to recognize objects from images, recognizing words or syllables from audio, or recognizing poses and movement from video are all good examples of modern hierarchical learning research, which is a central focus of the “deep learning” movement in the machine learning and computational statistics community. This chapter will give a rather comprehensive look at all the BIG data analytic tools—old and new. Data bases of photovoltaic and wind energy (from US National Renewable Energy Laboratory) as well as transportation data will all be used here.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Beale M, Hagan M, Demuth H (2014) MATLAB neural network toolbox user’s guide. http://www.mathworks.com/help/pdf_doc/nnet/nnet_ug.pdf
Bengio Y (2009) Learning deep architectures for AI. Found trends Mach Learn 2(1):1–127
Bengio Y, LeCun Y, Yann S, Chopra R, Hadsell M, Ranzato HF (2006) A tutorial on energy-based learning. In: Bakir G, Hofman T, Schölkopf B, Smola A, Taskar B (eds) Predicting structured data. MIT Press, Cambridge, pp 1–59
Hinton G (2010) A practical guide to training restricted Boltzmann machines. Momentum 9(1):926
Hinton G, Osindero S, The Y-W (2006) A fast learning algorithm for deep belief nets. Neural Comput 18(7):1527–1554
IEM—Iowa Environmental Mesonet (2014) ASOS/AWOS data download. http://mesonet.agron.iastate.edu/request/download.phtml
Jamshidi M (ed) (2008) Systems of systems engineering—principles and applications. CRC—Taylor & Francis Publishers, Oxford
Jamshidi M (ed) (2009) System of systems engineering—innovations for the 21st century. John Wiley & Sons, New York
Moussavi-Khalkhali A, Jamshidi M (2014) Leveraging machine learning algorithms to perform online and offline highway traffic flow predictions. In: Machine Learning and Applications (ICMLA), 2014 13th International Conference on, IEEE, Dec 2014, pp 419–423
Murphy KP (2012) Machine learning: a probabilistic perspective. MIT Press, Cambridge
NREL—National Renewable Energy Laboratory (2014) MIDC SOLPOS Calculator. http://www.nrel.gov/midc/solpos/solpos.html
Ng A (2011) Sparse autoencoder. CS294A Lecture notes 72
Shlens J (2009) A tutorial on principal component analysis. http://www.snl.salk.edu/~shlens/pca.pdf Accessed 22 April 2009
Smith LI (2002) A tutorial on principal components analysis. http://www.cs.otago.ac.nz/cosc453/student_tutorials/principal_components.pdf. Accessed 26 Feb 2002
Tannahill B (2014) Big data analytic techniques: Predicting renewable energy. MS Thesis, ACE Laboratory, University of Texas, San Antonio, April, 2014
Tannahill B, Jamshidi M (2014) System of systems and big data analytics—bridging the gap. Int J Comput Electr Eng 40:2–15, Elsevier
Acknowledgements
This work has been supported, in part, by University of Texas System’s Lutcher Brown Endowment, University of Texas at San Antonio. Original version of this chapter, is in parts, based on a keynote by first author at AAAI Workshop on Big Data, Stanford, CA, April 24, 2014.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer Science+Business Media (outside the USA)
About this chapter
Cite this chapter
Jamshidi, M., Tannahill, B., Moussavi, A. (2016). Big Data Analytic Paradigms: From Principle Component Analysis to Deep Learning. In: Mittu, R., Sofge, D., Wagner, A., Lawless, W. (eds) Robust Intelligence and Trust in Autonomous Systems. Springer, Boston, MA. https://doi.org/10.1007/978-1-4899-7668-0_5
Download citation
DOI: https://doi.org/10.1007/978-1-4899-7668-0_5
Published:
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4899-7666-6
Online ISBN: 978-1-4899-7668-0
eBook Packages: Computer ScienceComputer Science (R0)