Abstract
Data mining has been successfully applied in many businesses, thus aiding managers to make informed decisions that are based on facts, rather than having to rely on guesswork and incorrect extrapolations. Data mining algorithms equip institutions to predict the movements of financial indicators, enable companies to move towards more energy-efficient buildings, as well as allow businesses to conduct targeted marketing campaigns and forecast sales. Specific data mining success stories include customer loyalty prediction, economic forecasting, and fraud detection. The strength of data mining lies in the fact that it allows for not only predicting trends and behaviors, but also for the discovery of previously unknown patterns. However, a number of challenges remain, especially in this era of big data. These challenges are brought forward due to the sheer Volume of today’s databases, as well as the Velocity (in terms of speed of arrival) and the Variety, in terms of the various types of data collected. This chapter focuses on techniques that address these issues. Specifically, we turn our attention to the financial sector, which has become paramount to business. Our discussion centers on issues such as considering data distributions with high fluctuations, incorporating late arriving data, and handling the unknown. We review the current state-of-the-art, mainly focusing on model-based approaches. We conclude the chapter by providing our perspective as to what the future holds, in terms of building accurate models against today’s business, and specifically financial, data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Baaquie, B.E.: Interest Rates and Coupon Bonds in Quantum Finance. Cambridge University Press, Cambridge (2010)
NYSE Market Data, Data Products, Product Summaries. http://www.nyxdata.com/Data-Products/Product-Summaries. Accessed 14 Nov 2014
Han, J., Kamber, M.: Data Mining Concepts and Techniques, 2nd edn. Morgan Kauffman (2008)
Shasha, D., Zhu, Y.: High Performance Discovery In Time Series: Techniques and Case Studies. Springer (2004)
Yi, B.K., Sidiropoulos, N., Johnson, T., Jagadish, H.V., Faloutsos, C., Biliris, A.: Online data mining for co-evolving time sequences. In: Proceedings of 2000 International Conference on Data Engineering (ICDE 2000), pp. 13–22. San Diego, CA (2000)
Shumway, R.H., Stoffer, D.S.: Time Series Analysis and Its Applications. Springer (2005)
Oshaug, C.A.J.: Lvy Processes and Path Integral Methods with Applications in the Energy Markets. Norwegian University of Science and Technology, Thesis (2011)
Paquet, E., Viktor, H.L., Guo, H.: Learning in the presence of large fluctuations: a study of aggregation and correlation, new frontiers in mining complex patterns. LNCS 7765, 49–63 (2013)
Rachev, S.T., et al.: Bayesian Methods in Finance. Wiley, Hoboken (2008)
McNeil., A.J., Wendin, J.P.: Bayesian inference for generalized linear mixed models of portfolio credit risk. J. Empirical Finan. 14(2), 131–147 (2007)
Garlappi, L., Uppal, R., Wang, T.: Portfolio selection with parameter and model uncertainty: a multi-prior approach. Rev. Finan. Stud. Oxford J. 20(1), 41–81 (2007)
Jacquier, E., Polson, N.G., Rossi, P.E.: Bayesian analysis of stochastic volatility models. J. Busin. Econ. Statis. 20(1), 69–87 (2002)
Masujima, M.: Path Integral Quantization and Stochastic Quantization. Springer, Berlin (2009)
Campbell, J.Y., Low, A.S., Mackinlay, A.C.: The Econometric of Financial Markets. Princeton University Press, Princeton (1997)
Lloyds fined 218m over Libor rate rigging scandal. http://www.bbc.com/news/business-28528349?print=true. Accessed 21 Dec 2014
Merton, R.C.: Continuous Time Finance. Blackwell Publishing, Oxford (1990)
Lemmens, D., Wouters, M., Tempere, J.: Path integral approach to closed-form option pricing formulas with applications to stochastic volatility and interest rate models. Phys. Rev. E 78, 016101-1–016101-8 (2008)
Baaquie, B.E.: Quantum Finance: Path Integrals and Hamiltonians for Options and Interest Rates. Cambridge University Press (2004)
Dalakouras, G.V., Kwon, R.H., Pardalos, P.M.: Semidefinite programming approaches for bounding Asian option prices. In: Computational Methods in Financial, Engineering, pp. 103–116 (2008)
Devreese, J.P.A., Lemmens, M., Tempere, J.: Path integral approach to asian options in the BlackâĂŞcholes model. Physica A: Statis. Mech. Appl. 384, 780–788 (2010)
Binder, K., Heermann, D.W.: Monte Carlo Simulation in Statistical Physics: An Introduction. Springer, Berlin (2010)
Heath, D., Jarrow, R., Morton, A.: Bond pricing and the term structure of interest rates: a new methodology for contingent claim valuation. Econometrica 60, 77–105 (1992)
Baaquie, B.E.: Financial modeling and quantum mathematics. Comput. Math. Appl. 65, 1665–1673 (2013)
Krempl, G., \(\tilde{Z}\)liobaite, I., Brzezinski, D., Hullermeier, E., Last, M., Lemaire, V., Stefanowski, J.: Open challenges for data stream mining research. ACM SIGKDD Explor. Newslett. 16(1), 1–10 (2014)
Bollen, J., Mao, H.: Twitter mood as a stock market predictor. Computer 44(10), 91–94 (2011)
Zhang, X., Fuehres, H., Gloor, P.: Predicting stock market indicators through twitter ("I hope it is not as bad as I fear"). Procedia: Social Behav. Sci. 26, 55–62 (2011)
Ferrer, J.N., Olivero, S., Medarova-Bergstorm, K., Rizos, V.: Financing models for smart cities. November 2013. http://eu-smartcities.eu/sites/all/files/GuidelineFinancingModelsforsmartcities-january.pdf. Accessed 15 Dec 2014
NationalGridUS, Managing Energy Costs in Office Buildings. https://www.nationalgridus.com/non_html/shared_energyeff_office.pdf. Accessed 15 Dec 2014
European PPP Expertise Centre (EPEC): Guidance on energy efficiency in public buildings. http://www.eib.org/epec/resources/epec_guidance_ee_public_buildings_en.pdf. Accessed 15 Dec 2014
Guo, H.: Modeling short-term energy load with continuous conditional random fields. In: ECML/PKDD 2013 machine learning and knowledge discovery in databases. In: Lecture Notes in Computer Science, vol. 8188, pp. 433–448 (2013)
Bosnić, Z., Demsar, J., Ke\(\tilde{\rm {s}}\)pret, G., Rodrigues, P.P., Gama, J., Kononenko, I: Enhancing data stream predictions with reliability estimators and explanation. Eng. Appl. Artif. Intell. 34, 174-188 (2014)
Kanagal, B., Deshpande, A.: Online filtering, smoothing and probabilistic modeling of streaming data. In: IEEE 24th International Conference on Data Engineering, 2008, ICDE 2008, pp. 1160–1169 (2008)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Paquet, E., Viktor, H., Guo, H. (2016). Data Mining in Finance: Current Advances and Future Challenges. In: Japkowicz, N., Stefanowski, J. (eds) Big Data Analysis: New Algorithms for a New Society. Studies in Big Data, vol 16. Springer, Cham. https://doi.org/10.1007/978-3-319-26989-4_7
Download citation
DOI: https://doi.org/10.1007/978-3-319-26989-4_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-26987-0
Online ISBN: 978-3-319-26989-4
eBook Packages: EngineeringEngineering (R0)