Skip to main content

Computational Semantics for Asset Correlations

  • Chapter
  • First Online:
Intelligent Asset Management

Part of the book series: Socio-Affective Computing ((SAC,volume 9))

  • 505 Accesses

Abstract

This chapter explores the possibility to leverage semantic knowledge for robust estimation of correlations among financial assets. A graphical model for high-dimensional stochastic dependence termed a “vine” structure, which is derived from copula theory, is introduced here. To model the prior semantic knowledge, we use a neural network-based language model to generate distributed semantic representations for financial documents. The semantic representations are used for computing similarities between the assets they respectively refer. The constructed dependence structure is experimented with real-world data. Results suggest that our semantic vine construction-based method is superior to the state-of-the-art covariance matrix estimation method, which is based on an arbitrary vine that at least guarantees robustness of the estimated covariance matrix. The effectiveness of using semantic vines for robust correlation estimation for Markowitz’s asset allocation model on a large scale of assets (up to 50 stocks) is also showed and discussed.

We use a machine, or the drawing of a machine, to symbolize a particular action of the machine. — Ludwig Wittgenstein

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Shallow refers to the neural networks that only have one hidden layer of neurons.

  2. 2.

    The correlation between a 1 and a 3 conditions on a pivot asset a 2. Therefore, we use a different type of dashed link to denote this conditional correlation. The dashed link is abbreviated as 1,  3|2.

  3. 3.

    Proof of this theorem uses trigonometric substitution. For details, see Lemma 12 in Bedford and Cooke [11].

  4. 4.

    This is defined as the optimal truncation of vines as a minimum number of edges would have large absolute partial correlations and rest of the edges are assumed insignificant (independent).

  5. 5.

    Information on the stock list is elaborated in Appendix A.

  6. 6.

    http://quandl.com/tools/api

  7. 7.

    http://daviddlewis.com/resources/testcollections/reuters21578

  8. 8.

    Retrieved from the Internet on 2017-10-09.

  9. 9.

    See Sect. 4.2.3 for the definition of the optimal vine truncation.

  10. 10.

    This time span is roughly chosen because it is reasonable to assume the asset correlations keep the same. If simulation is carried out for a longer period, we have to access historical corpus of the Reuters Company Business Descriptions and Wikipedia pages, which is out of scope for our discussion.

References

  1. K. Aas, D. Berg, Models for construction of multivariate dependence – a comparison study. Eur. J. Financ. 15, 639–659 (2009)

    Article  Google Scholar 

  2. H. Bai, F.Z. Xing, E. Cambria, W.-B. Huang, Business taxonomy construction using concept-level hierarchical clustering, in The First Workshop on Financial Technology and Natural Language Processing (FinNLP-IJCAI), 2019, pp. 1–7

    Google Scholar 

  3. T. Bedford, R.M. Cooke, Probability density decomposition for conditionally dependent random variables modeled by vines. Ann. Math. Artif. Intell. 32, 245–268 (2001)

    Article  Google Scholar 

  4. T. Bedford, R.M. Cooke, Vines: a new graphical model for dependent random variables. Ann. Stat. 30(4), 1031–1068 (2002)

    Article  Google Scholar 

  5. Y. Bengio, R. Ducharme, P. Vincent, A neural probabilistic language model. J. Mach. Learn. Res. 3, 1137–1155 (2003)

    Google Scholar 

  6. L.K.C. Chan, J. Lakonishok, B. Swaminathan, Industry classification and return comovement. Financ. Anal. J. 63(6), 56–70 (2007)

    Article  Google Scholar 

  7. I. Chaturvedi, Y.-S. Ong, I. Tsang, R.E. Welsch, E. Cambria, Learning word dependencies in text by means of a deep recurrent belief network. Knowl. Based Syst. 108, 144–154 (2016)

    Article  Google Scholar 

  8. R.M. Cooke, D. Kurowicka, K. Wilson, Sampling, conditionalizing, counting, merging, searching regular vines. J. Multivar. Anal. 138, 4–18 (2015)

    Article  Google Scholar 

  9. W. Croft, D.A. Cruse, Cognitive Linguistics (Cambridge University Press, New York, 2004)

    Book  Google Scholar 

  10. A.B. Davidow, J.D. Peterson, A modern approach to asset allocation and portfolio construction. Technical Report MKT81752HL-02, Schwab Center for Financial Research, 2014

    Google Scholar 

  11. F. Durante, C. Sempi, Principles of Copula Theory (CRC Press, Boca Raton, 2016)

    Google Scholar 

  12. G. Elidan, Copulas in machine learning, in Copulae in Mathematical and Quantitative Finance, vol. 213 (Springer, Berlin/Heidelberg, 2013), pp. 39–60

    Book  Google Scholar 

  13. E.F. Fama, K.R. French, Luck versus skill in the cross-section of mutual fund returns. J. Financ. 65(5), 1915–1947 (2010)

    Article  Google Scholar 

  14. K.K. Hung, C.C. Cheung, L. Xu, New Sharpe-ratio-related methods for portfolio selection, in Proceedings of the Conference on Computational Intelligence for Financial Engineering (CIFEr), 2000, pp. 34–37

    Google Scholar 

  15. D.P. Kingma, J. Ba, Adam: a method for stochastic optimization, in Proceedings of International Conference on Learning Representations, 2015

    Google Scholar 

  16. D. Kurowicka, H. Joe (eds.), Dependence Modeling: Vine Copula Handbook (World Scientific, London, 2011)

    Google Scholar 

  17. Q. Le, T. Mikolov, Distributed representations of sentences and documents, in Proceedings of the 31st International Conference on Machine Learning (ICML), 2014, pp. 1188–1196

    Google Scholar 

  18. G. Leech, Semantics: The Study of Meaning, 2 edn. (Harmondsworth, Penguin, 1981)

    Google Scholar 

  19. X. Li, H. Xie, Y. Song, S. Zhu, Q. Li, F.L. Wang, Does summarization help stock prediction? A news impact analysis. IEEE Intell. Syst. 30(3), 26–34 (2015)

    CAS  Google Scholar 

  20. L. Luo, Y. Xiong, Y. Liu, X. Sun, Adaptive gradient methods with dynamic bound of learning rate, in Proceedings of International Conference on Learning Representations, 2019

    Google Scholar 

  21. R.C. Merton, On estimating the expected return on the market: an exploratory investigation. J. Financ. Econ. 8(4), 323–361 (1980)

    Article  Google Scholar 

  22. D. Metzler, W.B. Croft, A Markov random field model for term dependencies, in Proceedings of the 28th Annual International Conference on Research and Development in Information Retrieval (SIGIR), 2005, pp. 472–479

    Google Scholar 

  23. T. Mikolov, I. Sutskever, K. Chen, G. Corrado, J. Dean, Distributed representations of words and phrases and their compositionality, in Proceedings of the 26th International Conference on Neural Information Processing Systems (NIPS), 2013, pp. 3111–3119

    Google Scholar 

  24. N.M. Neykov, P. Filzmoser, P.N. Neytchev, Robust joint modeling of mean and dispersion through trimming. Comput. Stat. Data Anal. 56(1), 34–48 (2012)

    Article  Google Scholar 

  25. A. Panagiotelis, C. Czado, H. Joe, J. Stoeber, Model selection for discrete regular vine copulas. Comput. Stat. Data Anal. 106, 138–152 (2017)

    Article  Google Scholar 

  26. H. Qiu, F. Han, H. Liu, B. Caffo, Robust portfolio optimization, in Neural Information Processing Systems (NIPS), 2015, pp. 46–54

    Google Scholar 

  27. S.T. Rachev, S.V. Stoyanov, A. Biglova, F.J. Fabozzi, An empirical examination of daily stock return distributions for U.S. Stocks, in Data Analysis and Decision Support (Springer, Berlin/Heidelberg, 2005), pp. 269–281

    Google Scholar 

  28. D. Tran, D.M. Blei, E.M. Airoldi, Copula variational inference, in Advances in Neural Information Processing Systems (NIPS) (Springer, Cham, 2015), pp. 3564–3572

    Google Scholar 

  29. R.R. Trippi, J.K. Lee, Artificial Intelligence in Finance & Investing (Irwin Professional Publishing, Chicago, 1996)

    Google Scholar 

  30. R.E. Welsch, X. Zhou, Application of robust statistics to asset allocation models. Revstat Stat. J. 5(1), 97–114 (2007)

    Google Scholar 

  31. F.Z. Xing, E. Cambria, R.E. Welsch, Growing semantic vines for robust asset allocation. Knowl. Based Syst. 165, 297–305 (2019)

    Article  Google Scholar 

  32. L. Zhang, C. Aggarwal, G.-J. Qi, Stock price prediction via discovering multi-frequency trading patterns, in The 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2017, pp. 2141–2149

    Google Scholar 

  33. Z. Zhu, R.E. Welsch, Robust dependence modeling for high-dimensional covariance matrices with financial applications. Ann. Appl. Stat. 12(2), 1228–1249 (2018)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Xing, F., Cambria, E., Welsch, R. (2019). Computational Semantics for Asset Correlations. In: Intelligent Asset Management. Socio-Affective Computing, vol 9. Springer, Cham. https://doi.org/10.1007/978-3-030-30263-4_4

Download citation

Publish with us

Policies and ethics