Classification and Preprocessing in the Stock Data

  • Przemysław JuszczukEmail author
  • Jan Kozak
Conference paper
Part of the Lecture Notes in Business Information Processing book series (LNBIP, volume 303)


In this paper we deal with the problem of assigning classes to the given market situation. We consider approach in which every market situation can be connected with one of the following decision classes: BUY, SELL or WAIT. Each of two classes: BUY and SELL can be assigned only on the basis of significant rises or drops of the given instrument. In all remaining cases WAIT class is assigned. Such approach allows to be independent of indicator values which nowadays are considered to have the significant prediction power. To achieve the goal we selected various stock instruments and with the use of the preprocessing and data discretization we generated decision tables for every considered datasets.

Furthermore, decision trees is built on the basis of generated decision tables. Decision trees are used in the process of classification of newly generated stock data. Presented approach is tested with the use of two independent sets: training set – used to built classifiers – decision classes, and test set – used to estimate accuracy of the generated decision trees. Finally we refer results to other approach in which forex data were used.


Stock data Machine learning Decision trees Data classification 


  1. 1.
    Atsalakis, G.S., Valavanis, K.P.: Forecasting stock market short-term trends using a neuro-fuzzy based methodology. Expert Syst. Appl. 36(7), 10696–10707 (2009)CrossRefGoogle Scholar
  2. 2.
    Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and Regression Trees. CRC Press, Boca Raton (1984)Google Scholar
  3. 3.
    Chena, S.H., Yeh, C.H.: Evolving traders and the business school with genetic programming: a new architecture of the agent-based artificial stock market. J. Econ. Dyn. Control 25(3–4), 363–393 (2001)CrossRefGoogle Scholar
  4. 4.
    Gomber, P., Arndt, B., Lutat, M., Uhle, T.: High-Frequency Trading (2011).
  5. 5.
    Huang, W., Nakamoria, Y., Wang, S.Y.: Forecasting stock market movement direction with support vector machine. Comput. Oper. Res. 32(10), 2513–2522 (2005)CrossRefGoogle Scholar
  6. 6.
    Przemyslaw, J., Jan, K., Katarzyna, T.: Decision trees on the foreign exchange market. In: Czarnowski, I., Caballero, A.M., Howlett, R.J., Jain, L.C. (eds.) Intelligent Decision Technologies 2016. SIST, vol. 57, pp. 127–138. Springer, Cham (2016). doi: 10.1007/978-3-319-39627-9_12 CrossRefGoogle Scholar
  7. 7.
    Korczak, J., Hernes, M., Bac, M.: Risk avoiding strategy in multi-agent trading system. In: Proceedings of Federated Conference Computer Science and Information Systems (FedCSIS), pp. 1119–1126 (2013)Google Scholar
  8. 8.
    Kuo, R.J., Chen, C.H., Hwang, Y.C.: An intelligent stock trading decision support system through integration of genetic algorithm based fuzzy neural network and artificial neural network. Fuzzy Sets Syst. 118(1), 21–45 (2001)CrossRefGoogle Scholar
  9. 9.
    Lai, K.K., Yu, L., Wang, S.: A neural network and web-based decision support system for forex forecasting and trading. Data Min. Knowl. Manag. 3327, 243–253 (2005)CrossRefGoogle Scholar
  10. 10.
    Menkveld, A.J.: High frequency trading and the new market makers. J. Financ. Markets 16(4), 712–740 (2013)CrossRefGoogle Scholar
  11. 11.
    Quinlan, J.R.: Induction of decision trees. Mach. Learn. 1(1), 81–106 (1986)Google Scholar
  12. 12.
    Safavin, S., Landgrebe, R.D.: A survey of decision tree classiffier methodology. IEEE Trans. Syst. 21(3), 660–674 (1991)Google Scholar
  13. 13.
    Samaras, G.D., Matsatsinis, N.F., Zopounidis, C.: A multicriteria DSS for stock evaluation using fundamental analysis. Eur. J. Oper. Res. 187(3), 1380–1401 (2008)CrossRefGoogle Scholar
  14. 14.
    Woodside-Oriakhi, M., Lucas, C., Beasley, J.E.: Heuristic algorithms for the cardinality constrained efficient frontier. Eur. J. Oper. Res. 213(3), 538–550 (2011)CrossRefGoogle Scholar

Copyright information

© Springer International Publishing AG 2017

Authors and Affiliations

  1. 1.Department of Knowledge Engineering, Faculty of Informatics and CommunicationUniversity of EconomicsKatowicePoland

Personalised recommendations