Factor Integration Based on Neural Networks for Factor Investing

Lu, Zhichen; Long, Wen; Zhang, Jiashuai; Tian, Yingjie

doi:10.1007/978-3-030-22744-9_22

Conference paper
First Online: 08 June 2019

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 11538)

Abstract

Factor investing is a quantitative investing methodology that constructs portfolios based on factors. Factors with different styles are extracted from multiple sources such as market data, fundamental information from financial statements, and sentiment information from the Internet. Numerous style factors are defined by the Barra model proposed by Morgan Stanley Capital International (MSCI) to explain the return of a portfolio. In practice, multiple factors are usually integrated linearly, which keeps the integration process stable and enhances the effectiveness of the integrated factors. In this work, we integrate factors with machine learning and deep learning methods to explore deeper information among the multiple style factors defined by the MSCI Barra model. Multi-factor indexes are compiled using the Smart Beta Index methodology proposed by MSCI. The results show that non-linear integration by a deep neural network can enhance the profitability and stability of the index compiled from the integrated factor.

1 Introduction

The definition of factors in factor investing originates from the “Arbitrage pricing theory” proposed by Ross [10], which holds that the expected return of a financial asset can be modeled as a function of various macroeconomic factors or theoretical market indexes. Researchers have since tried to use specific factors to model the return of stocks. The three-factor model [4] was among the first, modeling the excess return of stocks with fundamentals such as book value and earnings. Further research verified that a series of factors can be used to explain the return of investing in stocks, and these factors can be summarised into three main categories: macroeconomic, statistical, and fundamental. In the risk model developed by the Barra team at MSCI, factor returns are estimated through cross-sectional regression [8]. In the Fama-French approach, factor portfolios are built according to target factors to construct factor returns [1, 4]. Similarly, the Smart Beta Index from MSCI [2, 3] is compiled according to target factors to reflect the style and performance of specific factors under different market conditions. In practice, multiple factors usually need to be integrated. A common way to integrate them is a linearly weighted sum, where the weight of each factor is calculated by solving an optimization problem with a subjectively defined target [3]. In recent years, non-linear methods such as Support Vector Machine, Logistic Regression, Random Forest, Neural Networks and deep learning have been widely used in financial time series modeling, yet most existing works focus on stock price prediction. They learn model parameters by fitting training samples and presume that the training set and test set follow the same distribution in the feature space [9, 13,14,15]. Only a few works address cross-sectional modeling and feature integration [5, 6].

In this work, we introduce neural networks to the task of cross-sectional factor integration, extracting factors according to the definitions from Barra [8]. We use the Smart Beta Index methodology to compile factor indexes that reflect the performance and style of these factors in the Chinese market. Experimental results show that the index compiled from factors integrated by neural networks achieves better profitability and stability.

2 Factors and Factor Indexes

Changes in stock prices are not just a result of historical market behavior; they are also affected by information from multiple sources, such as the macroeconomy and the financial situation of the corresponding listed company. Indicators can be selected and defined to capture this information for use in investment practice, and they are called factors. Factors are extracted from three main sources: technical indicators from market samples, fundamental indicators from financial statements, and macroeconomic indicators.

In market practice, stocks are ranked and selected according to scores calculated from one or multiple factors. Factors that have proven robust over long periods are summarized by the Barra risk model. Table 1 presents the definitions of the factors. The original indicators are extracted from the market data of stocks and the financial statements of their corresponding listed companies. Factors are usually sampled at monthly frequency when used.

Table 1. BARRA style factors

To reflect the performance of factors in market practice, factor indexes are compiled according to the methodology proposed by MSCI. At the beginning of each quarter, the component stocks of the benchmark CSI 800 are sorted by factor score, and the top 100 are selected as components of the factor index and weighted according to their market value. For single-factor indexes, component stocks are sorted by the single target factor. For multi-factor indexes, the weights of component stocks are calculated by solving an optimization problem whose objective is to maximize the exposure to multiple target factors:

$$
\begin{aligned}
\max \quad & \sum_{k=1}^{K} \sum_{i=1}^{n} \omega_i X_{ik}^{target} \\
\text{s.t.} \quad & \sum_{i=1}^{n} \omega_i X_{ik}^{non\text{-}target} \ge \sum_{i=1}^{n} \omega_i^{benchmark} X_{ik}^{non\text{-}target} - 0.25 \cdot \mathrm{std}(X_{k}^{non\text{-}target}), \quad k = 1, 2, 3, \ldots, \tilde{K} \\
& \sum_{i=1}^{n} \omega_i X_{ik}^{non\text{-}target} \le \sum_{i=1}^{n} \omega_i^{benchmark} X_{ik}^{non\text{-}target} + 0.25 \cdot \mathrm{std}(X_{k}^{non\text{-}target}), \quad k = 1, 2, 3, \ldots, \tilde{K} \\
& \max(0, \omega_i^{benchmark} - 2\%) \le \omega_i \le \max(10\,\omega_i^{benchmark}, \omega_i^{benchmark} + 2\%), \quad i = 1, 2, 3, \ldots, n
\end{aligned}
$$
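The optimization above is linear in the weights, so it can be handed to a generic linear-programming solver. The following is a minimal sketch using `scipy.optimize.linprog`, not the MSCI or the authors' implementation. It assumes that the target and non-target exposures are arrays with one row per stock, that std is the population standard deviation across stocks, and that the weights sum to one. The budget constraint, the function name `multi_factor_weights`, and the synthetic data are assumptions that do not appear in the formula.

```python
import numpy as np
from scipy.optimize import linprog


def multi_factor_weights(x_target, x_non_target, w_benchmark, band=0.25):
    """Solve the multi-factor weighting problem as a linear program.

    x_target:     (n, K) exposures to the target factors
    x_non_target: (n, K~) exposures to the non-target factors
    w_benchmark:  (n,) benchmark weights
    """
    x_target = np.asarray(x_target, dtype=float)
    x_non_target = np.asarray(x_non_target, dtype=float)
    w_benchmark = np.asarray(w_benchmark, dtype=float)
    n = len(w_benchmark)

    # linprog minimizes, so negate the summed target exposure of each stock.
    cost = -x_target.sum(axis=1)

    # Non-target exposure must stay within +/- band * std of the benchmark's.
    bench_exposure = x_non_target.T @ w_benchmark
    tolerance = band * x_non_target.std(axis=0)
    a_ub = np.vstack([x_non_target.T, -x_non_target.T])
    b_ub = np.concatenate([bench_exposure + tolerance,
                           -(bench_exposure - tolerance)])

    # Per-stock bounds from the last constraint of the formula (2% = 0.02).
    lower = np.maximum(0.0, w_benchmark - 0.02)
    upper = np.maximum(10.0 * w_benchmark, w_benchmark + 0.02)

    # Assumption (not in the formula): the portfolio is fully invested.
    result = linprog(cost, A_ub=a_ub, b_ub=b_ub,
                     A_eq=np.ones((1, n)), b_eq=[1.0],
                     bounds=list(zip(lower, upper)), method="highs")
    if not result.success:
        raise ValueError(result.message)
    return result.x


# Synthetic example: 50 stocks, 2 target and 3 non-target factors.
rng = np.random.default_rng(0)
x_t = rng.normal(size=(50, 2))
x_nt = rng.normal(size=(50, 3))
w_b = rng.random(50)
w_b /= w_b.sum()
weights = multi_factor_weights(x_t, x_nt, w_b)
```

Under these assumptions the benchmark weights satisfy every constraint, so the problem is always feasible and the solution has at least the benchmark's target exposure.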

Following this methodology, we compile single-factor indexes and multi-factor indexes targeting Momentum, Size, Value, and Dividend, in line with the MSCI documentation. Figure 1 shows the back-test results of the factor indexes from 2010 to 2017. Factors present different styles under different market conditions. The profitability and risk of each factor are evaluated by the indicators listed in Table 2, which show that the factor indexes reach higher returns and Sharpe ratios than the benchmark, verifying the effectiveness of these factors in the Chinese market. However, setting the optimization objective for factor integration subjectively may lead to unsatisfactory profitability and risk, since factors perform differently in different market conditions.
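As an illustration of how return, volatility, and Sharpe ratio can be derived from an index's net value series, here is a minimal sketch. The exact indicator definitions behind Table 2 are not given in the text, so the annualization convention, the zero risk-free rate, and the function name `performance_indicators` are assumptions.

```python
import numpy as np


def performance_indicators(net_value, periods_per_year=12, risk_free_rate=0.0):
    """Annualized return, annualized volatility and Sharpe ratio of a net value series."""
    nav = np.asarray(net_value, dtype=float)
    # Simple return of each period.
    returns = nav[1:] / nav[:-1] - 1.0
    years = len(returns) / periods_per_year
    # Compound annual growth rate over the whole series.
    annual_return = (nav[-1] / nav[0]) ** (1.0 / years) - 1.0
    # Sample standard deviation of period returns, scaled to one year.
    annual_volatility = returns.std(ddof=1) * np.sqrt(periods_per_year)
    sharpe = (annual_return - risk_free_rate) / annual_volatility
    return {
        "annual_return": annual_return,
        "annual_volatility": annual_volatility,
        "sharpe": sharpe,
    }


# Toy series with three periods per "year" so the numbers are easy to check by hand.
stats = performance_indicators([1.0, 1.2, 1.08, 1.296], periods_per_year=3)
```

For the toy series the period returns are 20%, -10%, and 20%, which gives an annual return of 29.6% and an annualized volatility of 30%.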

Table 2. Smart Beta Index simulation results based on CSI 800

3 Neural Networks for Factor Integration

Deep learning has been explored for stock price prediction [7, 11, 12], where deep neural networks are designed to extract features from time series samples. Portfolio construction is another kind of market practice, one that provides cross-sectional samples. In this work, we introduce the multi-layer perceptron (MLP) to handle cross-sectional factors. Traditional machine learning models and linear regression are also applied in the experiment for comparison.
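To make the idea of an MLP over cross-sectional factors concrete, the following is a minimal sketch of a one-hidden-layer perceptron that regresses a target on a factor vector. The text does not specify the network architecture or training setup, so the class name `TinyMLP`, the hidden size, the tanh activation, the squared-error loss, the learning rate, and the synthetic data are all assumptions chosen for illustration.

```python
import numpy as np


class TinyMLP:
    """One-hidden-layer perceptron regressor trained by full-batch gradient descent."""

    def __init__(self, n_features, n_hidden=16, seed=0):
        rng = np.random.default_rng(seed)
        self.W1 = rng.normal(0.0, 1.0 / np.sqrt(n_features), (n_features, n_hidden))
        self.b1 = np.zeros(n_hidden)
        self.W2 = rng.normal(0.0, 1.0 / np.sqrt(n_hidden), (n_hidden, 1))
        self.b2 = np.zeros(1)

    def _forward(self, X):
        hidden = np.tanh(X @ self.W1 + self.b1)
        return hidden, (hidden @ self.W2 + self.b2).ravel()

    def predict(self, X):
        return self._forward(np.asarray(X, dtype=float))[1]

    def fit(self, X, y, epochs=300, lr=0.02):
        X = np.asarray(X, dtype=float)
        y = np.asarray(y, dtype=float)
        n = len(y)
        for _ in range(epochs):
            hidden, pred = self._forward(X)
            # Gradient of the mean squared error with respect to the output.
            grad_out = (2.0 / n) * (pred - y)[:, None]
            grad_W2 = hidden.T @ grad_out
            grad_b2 = grad_out.sum(axis=0)
            # Back-propagate through tanh: d tanh(z)/dz = 1 - tanh(z)^2.
            grad_hidden = (grad_out @ self.W2.T) * (1.0 - hidden ** 2)
            grad_W1 = X.T @ grad_hidden
            grad_b1 = grad_hidden.sum(axis=0)
            self.W1 -= lr * grad_W1
            self.b1 -= lr * grad_b1
            self.W2 -= lr * grad_W2
            self.b2 -= lr * grad_b2
        return self


def mse(model, X, y):
    return float(np.mean((model.predict(X) - y) ** 2))


# Synthetic cross-section: 200 "stocks", 3 "factors", a non-linear target.
rng = np.random.default_rng(1)
X = rng.normal(size=(200, 3))
y = X[:, 0] * X[:, 1] + 0.5 * X[:, 2]
model = TinyMLP(n_features=3)
loss_before = mse(model, X, y)
model.fit(X, y)
loss_after = mse(model, X, y)
```

The prediction for each stock plays the role of an integrated factor score: a single number that combines all input factors and can be used for ranking.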

We use the factors of each component stock of the CSI 800 index from 2008 to 2017 for the experiment. Models are trained at the start of every year using monthly samples $\{\chi _{t}^i,y_t^i\}$ from the previous 3 years, where $\chi _t^i$ denotes the factors listed in Table 1 for stock i at time t, and $y_t^i$ denotes the return of stock i from t to $t+1$. At the start of each month, the factors of each stock are integrated by the model trained at the start of that year, stocks are sorted by the integrated factor, and the top 100 stocks are used for index compilation, weighted according to their market size.
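The yearly retraining and monthly selection described above can be sketched as follows. This is an illustrative outline rather than the authors' code: the function names `walk_forward_schedule` and `top_n_market_weighted` are hypothetical, and taking 2011 as the first scored year is an assumption based on data starting in 2008 and a 3-year training window.

```python
import numpy as np


def walk_forward_schedule(first_year, last_year, lookback_years=3):
    """List, for each year, the months used for training and for scoring."""
    schedule = []
    for year in range(first_year, last_year + 1):
        # Monthly samples from the previous `lookback_years` calendar years.
        train = [(y, m) for y in range(year - lookback_years, year)
                 for m in range(1, 13)]
        # The model trained at the start of `year` scores every month of it.
        score = [(year, m) for m in range(1, 13)]
        schedule.append({"year": year, "train": train, "score": score})
    return schedule


def top_n_market_weighted(integrated_factor, market_size, top_n=100):
    """Pick the top_n stocks by integrated factor and weight them by market size."""
    integrated_factor = np.asarray(integrated_factor, dtype=float)
    market_size = np.asarray(market_size, dtype=float)
    # Sort in descending order of the integrated factor score.
    order = np.argsort(-integrated_factor, kind="stable")
    selected = order[:top_n]
    weights = np.zeros(len(integrated_factor))
    weights[selected] = market_size[selected] / market_size[selected].sum()
    return selected, weights


schedule = walk_forward_schedule(2011, 2017)
selected, weights = top_n_market_weighted(
    [0.1, 0.9, 0.5, 0.7], [10.0, 30.0, 20.0, 10.0], top_n=2
)
```

In the toy call, the two highest-scoring stocks (indices 1 and 3) are selected and receive weights of 0.75 and 0.25 in proportion to their market size.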

The results of the indexes compiled from integrated factors are shown in Fig. 2, from which we can see that the net value of most model-based integrated factor indexes outperforms the benchmark during most of the back-test period. We further evaluate each index with the same performance indicators, listed in Table 3. From these indicators, we can conclude that: (1) Factors integrated by neural networks and linear regression show better profitability and stability than the multi-factor index. This implies that model-based integration can potentially mine the relationship between the factors of stocks and their future performance. The neural network and linear regression based indexes show higher returns than the multi-factor index, while the volatility of the multi-factor index is higher, which means higher risk. Their higher Sharpe ratios likewise imply higher stability. (2) Neural networks perform better than linear regression, which suggests that the non-linear relationship between factors can be used to enhance the performance of integrated factors.

Table 3. Integrated factor indexes simulation results based on CSI 800

4 Conclusion

Factor indexes reflect the performance of factors for factor investing, so that robust factors can be filtered. The filtered factors then need to be integrated. Our work introduces deep neural networks and other supervised models to integrate factors, supervised by future return, and compiles indexes from the integrated factors to evaluate their performance. Experimental results show that supervised integration by a model can enhance the effectiveness of integrated factors compared with integration by optimization with a subjectively defined objective. The neural network is found to be more effective, since it is able to mine deep non-linear relationships between factors and the future performance of stock prices.

References

1. Ang, A.: A five-factor asset pricing model. Fama-Miller Working Paper (2014)
2. Bender, J., Briand, R., Melas, D., Subramanian, R.: Foundations of factor investing (2013)
3. Bender, J., Briand, R., Melas, D., Subramanian, R.A., Subramanian, M.: Deploying multi-factor index allocations in institutional portfolios. In: Risk-Based and Factor Investing, pp. 339–363. Elsevier (2015)
4. Fama, E.F., French, K.R.: The cross-section of expected stock returns. J. Finance 47(2), 427–465 (1992)
5. Gu, S., Kelly, B.T., Xiu, D.: Empirical asset pricing via machine learning. SSRN (2018). https://doi.org/10.2139/ssrn.3159577
6. Krauss, C., Do, X.A., Huck, N.: Deep neural networks, gradient-boosted trees, random forests: statistical arbitrage on the S&P 500. Eur. J. Oper. Res. 259(2), 689–702 (2017)
7. Long, W., Lu, Z., Cui, L.: Deep learning-based feature engineering for stock price movement prediction. Knowl.-Based Syst. 164, 163–173 (2019). http://www.sciencedirect.com/science/article/pii/S0950705118305264
8. Menchero, J., Orr, D., Wang, J.: The Barra US equity model (USE4) methodology notes. MSCI Model Insight (2011)
9. Rivest, R.L.: Learning decision lists. Mach. Learn. 2(3), 229–246 (1987)
10. Ross, S.A.: The arbitrage theory of capital asset pricing. In: Handbook of the Fundamentals of Financial Decision Making: Part I, pp. 11–30. World Scientific (2013)
11. Shen, F., Chao, J., Zhao, J.: Forecasting exchange rate using deep belief networks and conjugate gradient method. Neurocomputing 167, 243–253 (2015)
12. Singh, R., Srivastava, S.: Stock prediction using deep learning. Multimed. Tools Appl. 76(18), 18569–18584 (2017)
13. Valiant, L.G.: A theory of the learnable. Commun. ACM 27(11), 1134–1142 (1984)
14. Xiong, T., Li, C., Bao, Y., Hu, Z., Zhang, L.: A combination method for interval forecasting of agricultural commodity futures prices. Knowl.-Based Syst. 77(C), 92–102 (2015)
15. Zhou, T., Gao, S., Wang, J., Chu, C., Todo, Y., Tang, Z.: Financial time series prediction using a dendritic neuron model. Knowl.-Based Syst. 105(C), 214–224 (2016)

Acknowledgement

This research was partly supported by grants from the National Natural Science Foundation of China (No. 71771204, 71331005, 91546201).

Author information

Authors and Affiliations

School of Economics and Management, University of Chinese Academy of Sciences, Beijing, 100190, People’s Republic of China
Zhichen Lu, Wen Long & Yingjie Tian
Research Center on Fictitious Economy & Data Science, Chinese Academy of Sciences, Beijing, 100190, People’s Republic of China
Zhichen Lu, Wen Long, Jiashuai Zhang & Yingjie Tian
School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing, 100190, People’s Republic of China
Jiashuai Zhang & Yingjie Tian

Corresponding author

Correspondence to Wen Long.

Editor information

Editors and Affiliations

University of Algarve, Faro, Portugal
João M. F. Rodrigues
University of Algarve, Faro, Portugal
Pedro J. S. Cardoso
University of Algarve, Faro, Portugal
Jânio Monteiro
University of Algarve, Faro, Portugal
Roberto Lam
University of Amsterdam, Amsterdam, The Netherlands
Valeria V. Krzhizhanovskaya
University of Amsterdam, Amsterdam, The Netherlands
Michael H. Lees
University of Tennessee at Knoxville, Knoxville, TN, USA
Jack J. Dongarra
University of Amsterdam, Amsterdam, The Netherlands
Peter M.A. Sloot

About this paper

Cite this paper

Lu, Z., Long, W., Zhang, J., Tian, Y. (2019). Factor Integration Based on Neural Networks for Factor Investing. In: Rodrigues, J.M.F., et al. Computational Science – ICCS 2019. ICCS 2019. Lecture Notes in Computer Science, vol 11538. Springer, Cham. https://doi.org/10.1007/978-3-030-22744-9_22

DOI: https://doi.org/10.1007/978-3-030-22744-9_22
Published: 08 June 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-22743-2
Online ISBN: 978-3-030-22744-9
eBook Packages: Computer Science, Computer Science (R0)
