Skip to main content

Improving GMM Efficiency in Dynamic Models for Panel Data with Mean Stationarity

  • Chapter
  • First Online:
Applications in Statistical Computing

Abstract

Estimation of dynamic panel data models largely relies on the generalized method of moments (GMM), and adopted sets of moment conditions exploit information up to the second moment of the variables. However, in many microeconomic applications, the variables of interest are skewed (typical examples are individual wages, size of the firms, number of employees, etc.); therefore, third moments might provide useful information for the estimation process. In this paper, we propose a moment condition, to be added to the set of conditions customarily exploited in GMM estimation of dynamic panel data models, that exploits third moments. The moment condition we propose is based on the data generating process that, under mean stationarity, characterizes the initial observation \(y_{i0}\) and the long-run mean of the dependent variable. In the literature on dynamic panel data models and in the way how Monte Carlo simulations are implemented therein for mean stationary processes, this condition is always fulfilled, but never explicitly exploited for estimation. Monte Carlo experiments show remarkable efficiency improvements when the distribution of individual effects, and thus of \(y_{i0}\), is indeed skewed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    The zero mean of the individual effect restriction is imposed without loss of generality. In case \(E(\alpha _i) \ne 0\), a model with an intercept can be considered: \(y_{it} = \beta _1 + \beta _2 y_{it-1} + \alpha _i + e_{it} \) with \(E(\alpha _i)=0\) (see e.g. Baltagi 2013).

  2. 2.

    Other approaches are also available for estimation of linear dynamic panel data models. Hsiao et al. (2002) developed a transformed likelihood approach for the estimation of linear dynamic model within a fixed effect framework. Kiviet (1995) proposed a method to correct the small sample bias of the least squares dummy variable estimator. The performance of simulation methods (indirect inference) has also been analysed (Gouriéroux et al. 2010). This paper focuses on GMM.

  3. 3.

    Ahn and Schmidt (1995) show that the difference GMM estimator is not fully exploiting all the information in the data: under the standard assumptions, they propose additional moment conditions that allow improving efficiency (see also Schmidt et al. 1992).

  4. 4.

    Of course, longer lags could be considered, but these conditions are redundant when the moment conditions (13.2) are included for estimation (Arellano and Bover 1995; Blundell and Bond 1998).

  5. 5.

    In addition, moment conditions (13.4) and (13.2) are shown to encompass the moment conditions proposed by Ahn and Schmidt (1995), Blundell and Bond (1998).

  6. 6.

    However, the data generating process for \(y_{i0}\) in (13.5) is more restrictive than condition (13.3). As a result, the condition we propose is more restrictive than assumptions needed for consistency of the system GMM estimator. For instance, condition (13.3) would hold even if \(y_{i0} = \gamma _i \alpha _i + e_{i0}\) with a random \(\gamma _i\) independent of \(\alpha _i\) and \(E(\gamma _i) = 1/(1-\beta )\). Instead, the data generating process in (13.5) also assumes \(V(\gamma _i) =0\) (\(\gamma _i = 1/(1-\beta )\) for each i). Furthermore, \(\alpha _i\) and \(e_{i0}\) are generated as independent random variables.

  7. 7.

    We have also explored moment conditions that involve the fourth-order moment of \(\alpha _i\), but without obtaining any significant benefit.

  8. 8.

    In the case of log-normal and \(\chi ^2\) distributions, which are characterized by a non-zero mean, we first simulated the value of \(\alpha _i\) and then considered its demeaned transformation in the data generating process. We also multiplied it by an appropriate constant to get the chosen value for the unconditional variance.

  9. 9.

    When the ratio of unconditional variances equals 1, the two error components give equally weighted contribution to the unconditional variance of \(y_{it}\). A ratio equal to four mimics the condition of a variance of \(\alpha _i\) much larger than the variance of \(e_{it}\), identified by Bun and Windmeijer (2010) as responsible for large bias in small samples.

  10. 10.

    No closed-form solution is available for the two-step version of the GMM estimator that exploits the additional moment condition, and for the continuously updated version of GMM (also for system GMM estimation). Algorithms for the minimization of the GMM function are therefore required. Computation efficiency might therefore be an issue, and ‘efficient computation algorithm could be implemented by mixing the use of the approximations to the Hessian and of the exact Hessian’ (Calzolari et al. 1987, p. 299).

  11. 11.

    The CUE is known to have a problem of possible multiple minima. In order to control for this issue, we considered two different initial values (i.e. OLS estimator and the two-step system GMM estimator) and disregarded those simulation runs in which we found multiple minima. In most experiments, less than 1% of estimated values is disregarded with the exception of the simulation runs with \(T+1=4\) and \(\rho =0.9\) in which we disregarded about 2.5% of cases. The issue of multiple minima in panel data model estimation also arises within a static framework, when an autoregressive process characterizes the idiosyncratic component (Calzolari and Magazzini 2012). The model is closely related to the dynamic specification considered in this paper.

  12. 12.

    Asymptotically, any consistent estimator of the weighting matrix should provide the same final (second step) result. However, experimentally, on small- or medium-sized samples, this was the best choice.

  13. 13.

    Better performance of the estimator we propose also arises with normally distributed \(y_{i0}\) and \(\beta = 0.9\). However, in unreported Monte Carlo experiments, this result vanishes with a larger N, and therefore we do not place too much emphasis on this result.

  14. 14.

    The full set of results, comprising also normally and chi-squared distributed error terms, is reported in Appendix A.

  15. 15.

    The hypothesis of common factor restriction is never rejected at the 5% level of significance. See Blundell and Bond (2000) for details.

  16. 16.

    The equality can be obtained by relying on the fact that \(\varLambda _1^{-1} = \varLambda _1^{*} / \det (\varLambda _1)\) where \(\varLambda _1^{*}\) is adjugate of \(\varLambda _1\). Furthermore, for the way in which they are computed, both \(\varLambda \) and \(\varLambda _1\) are positive definite matrices, so their determinant will be positive.

References

  • Ahn, S. C., & Schmidt, P. (1995). Efficient Estimation of models for dynamic panel data. Journal of Econometrics, 68(1), 5–27.

    Article  MathSciNet  MATH  Google Scholar 

  • Anderson, T., & Hsiao, C. (1981). Estimation of dynamic models with error components. Journal of the American Statistical Association, 76, 598–606.

    Article  MathSciNet  MATH  Google Scholar 

  • Anderson, T., & Hsiao, C. (1982). Formulation and estimation of dynamic models using panel data. Journal of Econometrics, 18, 47–82.

    Article  MathSciNet  MATH  Google Scholar 

  • Arellano, M., & Bond, S. (1991). Some tests of specification for panel data: Monte Carlo evidence and an application to employment equations. The Review of Economic Studies, 58(2), 277–297.

    Article  MATH  Google Scholar 

  • Arellano, M., & Bover, O. (1995). Another look at the instrumental variables estimation of error components models. Journal of Econometrics, 68, 29–51.

    Article  MATH  Google Scholar 

  • Baltagi, B. H. (2013). Econometric analysis of panel data (5th ed.). New York: Wiley.

    MATH  Google Scholar 

  • Blundell, R., & Bond, S. (1998). Initial conditions and moment restrictions in dynamic panel data models. Journal of Econometrics, 87(1), 115–143.

    Article  MATH  Google Scholar 

  • Blundell, R., & Bond, S. (2000). GMM estimation with persistent panel data: An application to production functions. Econometric Reviews, 19(3), 321–340.

    Article  MATH  Google Scholar 

  • Blundell, R., Bond, S., & Windmeijer, F. (2000). Estimation in dynamic panel data models: Improving on the performance of the standard GMM estimator. In B. H. Baltagi, T. B Fomby, & C. Hill (Eds.), Nonstationary panels, panel cointegration, and dynamic panels. Advances in econometrics, 15.

    Google Scholar 

  • Bun, M. J. C., & Windmeijer, F. (2010). The weak instrument problem of the system GMM estimator in dynamic panel data models. Econometrics Journal, 13(1), 95–126.

    Article  MathSciNet  MATH  Google Scholar 

  • Calzolari, G., & Magazzini, L. (2012). Autocorrelation and masked heterogeneity in panel data models estimated by maximum likelihood. Empirical Economics, 43(1), 145–152.

    Article  Google Scholar 

  • Calzolari, G., Panattoni, L., & Weihs, C. (1987). Computational efficiency of FIML estimation. Journal of Econometrics, 36(3), 299–310.

    Article  MATH  Google Scholar 

  • Gouriéroux, C., Phillips, P. C. B., & Yu, J. (2010). Indirect inference for dynamic panel models. Journal of Econometrics, 157(1), 68–77.

    Article  MathSciNet  MATH  Google Scholar 

  • Hahn, J. (1999). How informative is the initial condition in the dynamic panel model with fixed effects? Journal of Econometrics, 93, 309–326.

    Article  MathSciNet  MATH  Google Scholar 

  • Hansen, L. P. (1982). Large sample properties of generalized method of moments estimators. Econometrica, 50, 1029–1054.

    Article  MathSciNet  MATH  Google Scholar 

  • Hansen, L. P., Heaton, J., & Yaron, A. (1996). Finite-sample properties of some alternative GMM estimators. Journal of Business and Economic Statistics, 14(3), 262–280.

    Google Scholar 

  • Hayakawa, K. (2007). Small sample bias properties of the system GMM estimator in dynamic panel data models. Economics Letters, 95, 32–38.

    Article  MathSciNet  MATH  Google Scholar 

  • Hsiao, C., Pesaran, M. H., & Tahmiscioglu, A. K. (2002). Maximum likelihood estimation of fixed effects dynamic panel data models covering short time periods. Journal of Econometrics, 109(1), 107–150.

    Article  MathSciNet  MATH  Google Scholar 

  • Kiviet, J. F. (1995). On bias, inconsistency, and efficiency of various estimators in dynamic panel data models. Journal of Econometrics, 68(1), 53–78.

    Article  MathSciNet  MATH  Google Scholar 

  • Roodman, D. (2009). A note on the theme of too many instruments. Oxford Bulletin of Economics and Statistics, 71(1), 135–158.

    Article  Google Scholar 

  • Schmidt, P., Ahn, S. C., & Wyhowski, D. (1992). Comment. Journal of Business and Economic Statistics, 10, 10–14.

    Google Scholar 

  • Windmeijer, F. (2005). A finite sample correction for the variance of linear efficient two-step GMM estimators. Journal of Econometrics, 126(1), 25–51.

    Article  MathSciNet  MATH  Google Scholar 

Download references

Acknowledgements

We gratefully acknowledge comments and suggestions from two anonymous Reviewers, Francesca Mantese, and conference participants at the Fifth and Sixth Italian Congress of Econometrics and Empirical Economics.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Laura Magazzini .

Editor information

Editors and Affiliations

Appendices

Appendix A: Detailed Results of Monte Carlo Experiments

In this Appendix, detailed tables reporting the results of the Monte Carlo experiments are presented. Table 13.4 reports the full set of results for the Monte Carlo experiments related to the pure dynamic model (Sect. 13.3.1; Table 13.1), whereas Table 13.5 reports results for the experiments including a simultaneously determined regressor (Sect. 13.3.2; Table 13.2). The tables show the mean, variance, median and interquartile range (IQR) of Monte Carlo estimates.

Table 13.4 Detailed results of Monte Carlo experiments within a pure dynamic framework (Sect. 13.3.1; Table 13.1)
Table 13.5 Detailed results of Monte Carlo experiments with an endogenous regressor (Sect. 13.3.2; Table 13.2)

Appendix B: Variance Comparison

In this Appendix, we derive the variance of the estimator we propose, based on moment conditions (13.2), (13.4) and (13.6), and we compare it with the variance of the traditional system GMM estimator.

We denote the set of R moment conditions in the population as \(E(\mathbf {f}(\beta ))=0\). In order to simplify computations, we take into account the case of model (13.1), so that \(\beta \) (and the variance of its estimator) is a scalar.

From general GMM theory (Hansen 1982), the asymptotic variance of a GMM estimator is given by

$$ V(\hat{\beta }) = (B'\varLambda ^{-1} B)^{-1} $$

with \(B= \partial \mathbf {f}(\beta ) / \partial \beta \) and \(\varLambda = E(\mathbf {f}(\beta ) \mathbf {f}(\beta )')\) the efficient weighting matrix of the GMM procedure.

In this case, we can partition B into two components (and \(\varLambda \) accordingly): (i) the elements corresponding to the moment conditions that characterize the traditional system GMM estimator, and (ii) the one additional moment condition we propose in this paper. Assuming that the moment condition we propose is the last moment condition in the full set composed of (13.2), (13.4) and (13.6), we can write

$$ B = \left( \begin{array}{c} B_1 \\ B_2 \end{array} \right) \; \; \text { and } \; \; \varLambda = \left( \begin{array}{cc} \varLambda _1 &{} \varLambda _2 \\ \varLambda _3 &{} \varLambda _4 \end{array} \right) $$

with \(B_1\) related to the moment conditions in (13.2) and (13.4), and \(B_2\) related to the moment condition in (13.6). \(\varLambda \) is partitioned accordingly (with \(\varLambda _3=\varLambda _2'\)). Therefore, we have

$$\begin{aligned} V(\hat{\beta }) = (B'\varLambda ^{-1} B)^{-1} = \left\{ \left( \begin{array}{cc} B_1'&B_2' \end{array} \right) \left( \begin{array}{cc} \varLambda _1 &{} \varLambda _2 \\ \varLambda _3 &{} \varLambda _4 \end{array} \right) ^{-1} \left( \begin{array}{c} B_1 \\ B_2 \end{array} \right) \right\} ^{-1} \end{aligned}$$
(13.14)

By using the formula of the inverse of a partitioned matrix, we get

$$\begin{aligned} V(\hat{\beta }) = (B_1' \varLambda _1^{-1} B_1 + (\varLambda _3 \varLambda _1^{-1} B_1 - B_2)' A (\varLambda _3 \varLambda _1^{-1} B_1 - B_2))^{-1} \end{aligned}$$
(13.15)

with \(A=(\varLambda _4 - \varLambda _3 \varLambda _1^{-1} \varLambda _2)^{-1} = (\varLambda _4 - \varLambda _2' \varLambda _1^{-1} \varLambda _2)^{-1}\) (a scalar). Note that the first element in (13.15) corresponds to the inverse of the variance of the system GMM estimator of \(\beta \). Therefore, the variance of the GMM estimator based on (13.2), (13.4), and the additional moment condition (13.6) will be smaller than the variance of the system GMM estimator if

$$ (\varLambda _3 \varLambda _1^{-1} B_1 - B_2)' A (\varLambda _3 \varLambda _1^{-1} B_1 - B_2) >0 $$

As the scalar \(A = (\det (\varLambda ) / \det (\varLambda _1))^{-1} > 0\),Footnote 16 this condition will be always satisfied unless

$$\begin{aligned} \varLambda _3 \varLambda _1^{-1} B_1 - B_2 = 0 \end{aligned}$$
(13.16)

No efficiency gain will be obtained adding the moment condition we propose when condition (13.16) is satisfied. On the contrary, if condition (13.16) is not satisfied (\(\varLambda _3 \varLambda _1^{-1} B_1 - B_2 \ne 0\)), efficiency gains are obtained by adding the moment condition we propose to the moment conditions that characterize the system GMM estimator.

Fig. 13.1
figure 1

Ratio of the asymptotic variance of the estimator we propose with respect to the asymptotic variance of the system GMM estimator, for different (i) levels of asymmetry in the distribution of \(\alpha _i\), \(e_{i0}\); (ii) time periods; (iii) ratios of unconditional variance of \(\alpha \) with respect to e; (iv) values of \(\beta \)

By relying on further simulations (not reported in the main text of the paper) with \(T+1=3\), it is possible to show that \(\varLambda _3 \varLambda _1^{-1} B_1 - B_2 \ne 0\) if \(\alpha _i\) or \(e_{i0}\) have an asymmetric distribution (the third moment is not zero). Coherently, the Monte Carlo experiments reported in Sect. 13.3 show gains in performance in the cases in which the error components are assumed to have an asymmetric (log-normal or chi-squared) distribution.

Figure 13.1 shows the ratio of the asymptotic variance of the estimator we propose with respect to the asymptotic variance of the system GMM estimator. The baseline experiment consider \(\beta =0.5\), \(T+1=3\), a \(\chi ^2_3\) distribution for \(e_{i0}\) and \(\alpha _i\), and equal unconditional variances of \(\alpha _i\) and \(e_{it}\) (both equal to 1). From the baseline experiment, we changed: (i) the level of asymmetry in the distribution of \(\alpha _i\), \(e_{i0}\), accomplished by changing the number of degrees of freedom of the \(\chi ^2\) distribution; (ii) the number of time periods \(T+1\) from 3 to 10; (iii) the ratio of unconditional variance of \(\alpha \) with respect to e (from 1 to 9; we changed the value of the unconditional variance of \(\alpha _i\), leaving the unconditional variance of e fixed at 1); (iv) the values of \(\beta \) from 0.1 to 0.9. Coherently with the results in the Monte Carlo experiments in Sect. 13.3, the gains in performance led by the additional moment condition we propose are stronger for larger asymmetry in the distribution of \(\alpha _i\) and \(e_{i0}\) (thus, of \(y_{i0}\)), shorter observation in time, and values of \(\beta \) that approaches 1. Also, the gains in performance decrease when the unconditional variance of \(\alpha _i\) increases with respect to the unconditional variance of \(e_{it}\).

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Calzolari, G., Magazzini, L. (2019). Improving GMM Efficiency in Dynamic Models for Panel Data with Mean Stationarity. In: Bauer, N., Ickstadt, K., Lübke, K., Szepannek, G., Trautmann, H., Vichi, M. (eds) Applications in Statistical Computing. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Cham. https://doi.org/10.1007/978-3-030-25147-5_13

Download citation

Publish with us

Policies and ethics