Model averaging estimators for the stochastic frontier model

Parmeter, Christopher F.; Wan, Alan T. K.; Zhang, Xinyu

doi:10.1007/s11123-019-00547-8

Model averaging estimators for the stochastic frontier model

Published: 04 April 2019

Volume 51, pages 91–103, (2019)
Cite this article

Journal of Productivity Analysis Aims and scope Submit manuscript

632 Accesses
12 Citations
Explore all metrics

Abstract

Model uncertainty is a prominent feature in many applied settings. This is certainty true in the efficiency analysis realm where concerns over the proper distributional specification of the error components of a stochastic frontier model is, generally, still open along with which variables influence inefficiency. Given the concern over the impact that model uncertainty is likely to have on the stochastic frontier model in practice, the present research proposes two distinct model averaging estimators, one which averages over nested classes of inefficiency distributions and another that has the ability to average over distinct distributions of inefficiency. Both of these estimators are shown to produce optimal weights when the aim is to uncover conditional inefficiency at the firm level. We study the finite-sample performance of the model average estimator via Monte Carlo experiments and compare with traditional model averaging estimators based on weights constructed from model selection criteria and present a short empirical application.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Model uncertainty and efficiency measurement in stochastic frontier analysis with generalized errors

Article Open access 19 May 2022

The conditional mode in parametric frontier models

Article 18 September 2023

The wrong skewness problem in stochastic frontier analysis: a review

Article 17 October 2023

Notes

For example, Lai and Huang (2010) include years of education of the primary decision maker in the household in their study of Indian farming. It is not theoretically clear if, and how, this variable should enter the production structure.
Note that the model averaging estimator of Hansen and Racine’s (2012) can accommodate nonlinearity of the unknown conditional mean through a sequence of bases such as orthogonal polynomials of varying order, splines of varying order and so forth, but their construction of weights is designed around a quadratic objective function with parameters which enter the model linearly. Here our focus is on the construction of weights when we have parameters which enter the model in a nonlinear fashion and/or the objective function is not quadratic.
The use of J ≫ 1 is to reduce the number of leave-one-out samples that need to be constructed to average over making the estimation more streamlined.
A similar strategy, in the context of productivity measurement across countries, appears in Sickles et al. (2015).
See also Shang (2015).
See Parmeter and Kumbhakar (2014) for a detailed account of this model.
Given that the error term $\varepsilon _i^ \ast$ is heteroskedastic, $Var(\varepsilon ^ \ast |{\boldsymbol{x}}_i,{\boldsymbol{z}}_{u,i}) = \sigma _v^2 + \sigma _u^{2 \ast }e^{2{\boldsymbol{z}}_{u,i}^\prime {\boldsymbol{\delta }}^u}$, where $\sigma _v^2 = Var(v_i)$ and $\sigma _u^{2 \ast } = Var(u^ \ast )$, a generalized nonlinear least squares algorithm (though this requires distributional assumptions to disentangle $\sigma _v^2$ and $\sigma _u^{2 \ast }$) or heteroscedasticity robust standard errors would be required to conduct valid inference.
See Coelli et al. (2005, Appendix 2) for a more detailed description of the data.

References

Aigner DJ, Lovell CAK, Schmidt P (1977) Formulation and estimation of stochastic frontier production functions. J Econom 6(1):21–37
Article Google Scholar
Alvarez A, Amsler C, Orea L, Schmidt P (2006) Interpreting and testing the scaling property in models where inefficiency depends on firm characteristics. J Prod Anal 25(2):201–212
Article Google Scholar
Battese GE, Coelli TJ (1988) Prediction of firm-level technical efficiencies with a generalized frontier production function and panel data. J Econom 38:387–399
Article Google Scholar
Buckland ST, Burnham KP, Augustin NH (1997) Model selection: an integral part of inference. Biometrics 53(4):603–618
Article Google Scholar
Coelli TJ, Rao DP, O’Donnell CJ, Battese GE (2005) An Introduction to Efficiency and Productivity Analysis. Springer, New York
Google Scholar
Hansen BE (2007) Least squares model averaging. Econometrica 75(4):1175–1189
Article Google Scholar
Hansen BE, Racine JS (2012) Jackknife model averaging. J Econom 167(1):38–46
Article Google Scholar
Huang CJ, Lai H-P (2012) Estimation of stochastic frontier models based on multimodel inference. J Prod Anal 38:273–284
Article Google Scholar
Jondrow J, Lovell CAK, Materov IS, Schmidt P (1982) On the estimation of technical efficiency in the stochastic frontier production function model. J Econom 19(2/3):233–238
Article Google Scholar
Kneip A, Simar L, Van Keilegom I (2015) Frontier estimation in the presence of measurement error with unknown variance. J Econom 184:379–393
Article Google Scholar
Kumbhakar SC, Parmeter CF, Tsionas E (2013) A zero inefficiency stochastic frontier estimator. J Econom 172(1):66–76
Article Google Scholar
Lai H-P, Huang CJ (2010) Likelihood ratio tests for model selection of stochastic frontier models. J Prod Anal 34(1):3–13
Article Google Scholar
Mallows CL (1973) Some comments on cp. Tehcnometrics 15:661–675
Google Scholar
Meeusen W, van den Broeck J (1977) Efficiency estimation from Cobb-Douglas production functions with composed error. Int Econ Rev 18(2):435–444
Article Google Scholar
Olesen OB, Ruggiero J (2018) An improved Afriat-Diewert-Parkan nonparametric production function estimator. Eur J Operat Res 264:1172–1188
Article Google Scholar
Parmeter CF, Kumbhakar SC (2014) Efficiency analysis: a primer on recent advances. Found Trends Econom 7(3-4):191–385
Article Google Scholar
Parmeter CF, Wang H-J, Kumbhakar SC (2017) Nonparametric estimation of the determinants of inefficiency. J Prod Anal 47(3):205–221
Article Google Scholar
Rho S, Schmidt P (2015) Are all firms inefficient? J Prod Anal 43(3):327–349
Article Google Scholar
Shang C (2015) Essays on the use of duality, robust empirical methods, panel treatments, and model averaging with applications to housing price index construction and world productivity growth, PhD thesis, Rice University
Sickles RC (2005) Panel estimators and the identification of firm-specific efficiency levels in parametric, semiparametric and nonparametric settings. J Econom 126(2):305–334
Article Google Scholar
Sickles RC, Hao J, Shang C (2014) Panel data and productivity measurement: an analysis of Asian productivity trends. J Chin Econ Bus Stud 12(3):211–231
Article Google Scholar
Sickles RC, Hao J, Shang C (2015) Panel data and productivity measurement. In: Baltagi B (ed) Ch 17, Oxford Handbook fo Panel Data. Oxford University Press, New York, pp 517–547
Simar L, Lovell CAK, van den Eeckaut P (1994) Stochastic frontiers incorporating exogenous inuences on efficiency. Discussion Papers No. 9403, Institut de Statistique, Universite de Louvain
Stone M (2002) How not to measure the efficiency of public services (and how one might). J R Stat Soc Ser A 165:405–434
Google Scholar
Tsionas EG (2017) “When, where and how” of efficiency estimation: Improved procedures for stochastic frontier modeling. J Am Stat Assoc 112:948–965
Article Google Scholar
Wan ATK, Zhang X, Zou G (2010) Least squares model averaging by Mallows criterion. J Econom 156(4):277–283
Article Google Scholar
Wang H-J, Schmidt P (2002) One-step and two-step estimation of the effects of exogenous variables on technical efficiency levels. J Prod Anal 18:129–144
Article Google Scholar
White H (1982) Maximum likelihood estimation of misspecified models. Econometrica 50(1):1–25
Article Google Scholar

Download references

Acknowledgements

We thank participants at the New York Camp Econometrics X, the 14th European Workshop on Efficiency and Productivity Analysis, LECCEWEPA 2015, the CEPA Workshop on Economic Measurement and the 2016 North American Productivity Workshop for valuable insight. Xinyu Zhang acknowledges the support from National Natural Science Foundation of China (Grant numbers 71522004, 11471324 and 71631008). The usual disclaimer applies.

Author contributions

All three authors contributed equally to this work and the order of authorship has nothing other than alphabetical significance.

Author information

Authors and Affiliations

Department of Economics, University of Miami, Miami, FL, USA
Christopher F. Parmeter
Department of Management Sciences, City University of Hong Kong, Kowloon, Hong Kong
Alan T. K. Wan
Academy of Mathematics and Systems Science, Chinese Academy of Sciences, 100190, Beijing, China
Xinyu Zhang

Authors

Christopher F. Parmeter
View author publications
You can also search for this author in PubMed Google Scholar
Alan T. K. Wan
View author publications
You can also search for this author in PubMed Google Scholar
Xinyu Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Christopher F. Parmeter.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

A.1. Proof of Theorem 1. We first decompose C(w) as follows:

$$\begin{array}{c}C({\boldsymbol{w}}) = \left\Vert {\widehat {\boldsymbol{\rho }}({\boldsymbol{w}}) - \widehat {\boldsymbol{\rho }}_{{\mathrm{full}}}} \right\Vert^2\, + \,n^{1/2}\log (n){\boldsymbol{k}}\prime {\boldsymbol{w}}\\ = \left\Vert {\widehat {\boldsymbol{\rho }}({\boldsymbol{w}}) - {\boldsymbol{\rho }}} \right\Vert^2\, + \,\left\Vert {\widehat {\boldsymbol{\rho }}_{{\mathrm{full}}} - {\boldsymbol{\rho }}} \right\Vert^2 - 2\left\{ {\widehat {\boldsymbol{\rho }}({\boldsymbol{w}}) - {\boldsymbol{\rho }}} \right\}\prime \left\{ {\widehat {\boldsymbol{\rho }}_{{\mathrm{full}}} - {\boldsymbol{\rho }}} \right\} \\+\,n^{1/2}\log (n){\boldsymbol{k}}\prime {\boldsymbol{w}}.\end{array}$$

(A.1)

From the $\sqrt n$-consistency property of MLE and Assumption C.1, we have, for any $s^ \ast \in \{ {\cal O} \cup \{ t\} \}$,

$$\left\Vert {\widehat {\boldsymbol{\rho }}_{s^ \ast } - {\boldsymbol{\rho }}} \right\Vert^2 = \mathop {\sum}\limits_{i = 1}^n {\left\Vert {\frac{{\partial \widehat {\boldsymbol{\rho }}_{s^ \ast ,i}}}{{\partial \widehat {\boldsymbol{\theta }}_{s^ \ast }^\prime }}\mid _{\widehat {\boldsymbol{\theta }}_{s^ \ast } = \widetilde {\boldsymbol{\theta }}_{s^ \ast ,i}}(\widehat {\boldsymbol{\theta }}_{s^ \ast } - {\boldsymbol{\theta }}_{s^ \ast })} \right\Vert^2} = O_p(1),$$

(A.2)

where $\widetilde {\boldsymbol{\theta }}_{s^ \ast ,i}$ lies between $\widehat {\boldsymbol{\theta }}_{s^ \ast }$ and ${\boldsymbol{\theta }}_{s^ \ast }$. By the definition of $\widehat {\boldsymbol{w}}$ in Eq. (14), we have

$$\begin{array}{c}C(\widehat {\boldsymbol{w}}) \le \left\Vert {\widehat {\boldsymbol{\rho }}_{s^ \ast } - \widehat {\boldsymbol{\rho }}_{{\mathrm{full}}}} \right\Vert^2\, + \,n^{1/2}\log(n)k_{s^ \ast }\\ \le 2\left\Vert {\widehat {\boldsymbol{\rho }}_{s^ \ast } - {\boldsymbol{\rho }}} \right\Vert^2\, + \,2\left\Vert {\widehat {\boldsymbol{\rho }}_{{\mathrm{full}}} - {\boldsymbol{\rho }}} \right\Vert^2\, + \,n^{1/2}\log(n)k_{s^ \ast },\end{array}$$

(A.3)

which, along with (A.1), implies that

$$\begin{array}{lcc}\left\Vert {\widehat {\boldsymbol{\rho }}(\widehat {\boldsymbol{w}}) - {\boldsymbol{\rho }}} \right\Vert^2 - 2\left\{ {\widehat {\boldsymbol{\rho }}(\widehat {\boldsymbol{w}}) - {\boldsymbol{\rho }}} \right\}^{\prime} \left\{ {\widehat {\boldsymbol{\rho }}_{{\mathrm{full}}} - {\boldsymbol{\rho }}} \right\} \le 2\left\Vert {\widehat {\boldsymbol{\rho }}_{s^ \ast } - {\boldsymbol{\rho }}} \right\Vert^2\\ +\, \left\Vert {\widehat {\boldsymbol{\rho }}_{{\mathrm{full}}} - {\boldsymbol{\rho }}} \right\Vert^2\, + \,O(n^{1/2}\log(n))\end{array},$$

and thus

$$\begin{array}{lcc}\left\Vert {\widehat {\boldsymbol{\rho }}(\widehat {\boldsymbol{w}}) - {\boldsymbol{\rho }}} \right\Vert^2\, \le 2\left\Vert {\widehat {\boldsymbol{\rho }}_{s^ \ast } - {\boldsymbol{\rho }}} \right\Vert^2\, + \,\left\Vert {\widehat {\boldsymbol{\rho }}_{{\mathrm{full}}} - {\boldsymbol{\rho }}} \right\Vert^2 \\\,+ \,O(n^{1/2}\log(n)) + 2\left\Vert {\widehat {\boldsymbol{\rho }}(\widehat {\boldsymbol{w}}) - {\boldsymbol{\rho }}} \right\Vert\left\Vert {\widehat {\boldsymbol{\rho }}_{{\mathrm{full}}} - {\boldsymbol{\rho }}} \right\Vert\end{array}.$$

Therefore,

$$\begin{array}{lcc}\left\{ {\left\Vert {\widehat {\boldsymbol{\rho }}(\widehat {\boldsymbol{w}}) - {\boldsymbol{\rho }}} \right\Vert - \left\Vert {\widehat {\boldsymbol{\rho }}_{{\mathrm{full}}} - {\boldsymbol{\rho }}} \right\Vert} \right\}^2\, \le 2\left\Vert {\widehat {\boldsymbol{\rho }}_{s^ \ast } - {\boldsymbol{\rho }}} \right\Vert^2 \\+ \,O(n^{1/2}\log(n)) + 2\left\Vert {\widehat {\boldsymbol{\rho }}_{{\mathrm{full}}} - {\boldsymbol{\rho }}} \right\Vert^2\end{array}.$$

(A.4)

The full model belongs to ${\cal O}_1$ provided that any one candidate model belongs to ${\cal O}_1$. Hence we can obtain Eq. (15) from (A.2) and (A.4).

We next prove Eq. (16). Let

$$a_{s,m} = \left( {\widehat {\boldsymbol{\rho }}_s - \widehat {\boldsymbol{\rho }}_{{\mathrm{full}}}} \right)\prime \left( {\widehat {\boldsymbol{\rho }}_m - \widehat {\boldsymbol{\rho }}_{{\mathrm{full}}}} \right)$$

(A.5)

and Φ be an S × S matrix with its smth element given by

$$\Phi _{s,m} = a_{s,m} + n^{1/2}\log (n)(k_s + k_m)/2.$$

(A.6)

It can be easily shown that for any ${\boldsymbol{w}} \in {\cal W}$, C(w) = w′Φw. Now, define

$$\widetilde{\boldsymbol{w}} = \left({\widehat{w}}_1, \ldots, {\widehat{w}}_{s_o - 1},{\widehat{w}}_{s_o} + {\widehat{w}}_{m^ \ast }, {\widehat{w}}_{s_o + 1}, \ldots ,{\widehat{w}}_{m^ \ast - 1},0,{\widehat{w}}_{m^ \ast + 1}, \ldots ,{\widehat{w}}_S\right){\prime}.$$

Then we have

$$\begin{array}{c}0 \le C(\widetilde {\boldsymbol{w}}) - C(\widehat {\boldsymbol{w}})\\ = \widetilde {\boldsymbol{w}}\prime \Phi \widetilde {\boldsymbol{w}} - \widehat {\boldsymbol{w}}\prime {\bf{\Phi }}\widehat {\boldsymbol{w}}\\ = \left( {\widetilde {\boldsymbol{w}} + \widehat {\boldsymbol{w}}} \right)\prime {\bf{\Phi }}(\widetilde {\boldsymbol{w}} - \widehat {\boldsymbol{w}})\\ = \{ 2\widehat {\boldsymbol{w}}\prime - (0, \ldots ,0,\widehat w_{m^ \ast },0, \ldots ,0, - \widehat w_{m^ \ast },0, \ldots ,0)\} {\bf{\Phi }}\left( {0, \ldots ,0,\widehat w_{m^ \ast },0, \ldots ,0, - \widehat w_{m^ \ast },0, \ldots ,0} \right)\prime \\ = \widehat w_{m^ \ast }^2(2\Phi _{s_o,m^ \ast } - \Phi _{s_o,s_o} - \Phi _{m^ \ast ,m^ \ast }) + 2\widehat {\boldsymbol{w}}\prime {\bf{\Phi }}\left( {0, \ldots ,0,\widehat w_{m^ \ast },0, \ldots ,0, - \widehat w_{m^ \ast },0, \ldots ,0} \right)\prime \\ = \widehat w_{m^ \ast }^2(2\Phi _{s_o,m^ \ast } - \Phi _{s_o,s_o} - \Phi _{m^ \ast ,m^ \ast }) + 2\widehat w_{m^ \ast }\widehat {\boldsymbol{w}}\prime \left( {\Phi _{1,s_o} - \Phi _{1,m^ \ast }, \ldots ,\Phi _{S,s_o} - \Phi _{S,m^ \ast }} \right)\prime \\ = \widehat w_{m^ \ast }^2(2\Phi _{s_o,m^ \ast } - \Phi _{s_o,s_o} - \Phi _{m^ \ast ,m^ \ast }) + 2\widehat w_{m^ \ast }\mathop {\sum}\limits_{s = 1}^s {\widehat w_s} (\Phi _{s,s_o} - \Phi _{s,m^ \ast })\\ = \widehat w_{m^ \ast }^2O_p(1) + 2\widehat w_{m^ \ast }\mathop {\sum}\limits_{s = 1}^S {\widehat w_s} \left\{ {O_p(n^{1/2}) + n^{1/2}\log(n)(k_{s_o} - k_{m^ \ast })/2} \right\}\\ = \widehat w_{m^ \ast }^2O_p(1) + 2\widehat w_{m^ \ast }O_p(n^{1/2}) + 2\widehat w_{m^ \ast }n^{1/2}\log(n)(k_{s_o} - k_{m^ \ast })/2,\end{array}$$

where the seventh equality expression is obtained using (A.2), (A.5) and (A.6). This yields

$$\widehat {w}_{m^ {\ast} }n^{1/2}\log(n)(k_{m^ {\ast} } - k_{s_o})/2 \le \widehat {w}_{m^ \ast }^2O_p(1) + \widehat {w}_{m^ {\ast} }O_p(n^{1/2})$$

and hence $\widehat w_{m^ \ast } = O_p(\log ^{ - 1}(n))$, which is Eq. (16).

A.2. Proof of Theorem 2. Write

$$\begin{array}{c}\left\Vert {\widehat {\boldsymbol{\rho }}({\boldsymbol{w}}) - {\boldsymbol{\rho }}} \right\Vert^2\, =\left\Vert {\widehat {\boldsymbol{\rho }}^ \ast ({\boldsymbol{w}}) - {\boldsymbol{\rho }}} \right\Vert^2\, + \,\left\Vert {\widehat {\boldsymbol{\rho }}({\boldsymbol{w}}) - \widehat {\boldsymbol{\rho }}^ \ast ({\boldsymbol{w}})} \right\Vert^2\\ \qquad\qquad\qquad+ \;2\left\{ {\widehat {\boldsymbol{\rho }}^ \ast ({\boldsymbol{w}}) - {\boldsymbol{\rho }}} \right\}\prime \left\{ {\widehat {\boldsymbol{\rho }}({\mathbf{w}}) - \widehat {\boldsymbol{\rho }}^ \ast ({\mathbf{w}})} \right\}.\end{array}$$

(A.7)

From (A.1), (A.7), Assumption C.2, and the proof of Theorem 1′ in Wan et al. (2010), Theorem 2 holds provided that the following conditions hold:

$$\mathop {{\sup }}\limits_{{\boldsymbol{w}} \in {\cal W}} \frac{{\left\Vert {\widehat {\boldsymbol{\rho }}({\boldsymbol{w}}) - \widehat {\boldsymbol{\rho }}^ \ast ({\boldsymbol{w}})} \right\Vert^2}}{{\left\Vert {\widehat {\boldsymbol{\rho }}^ \ast ({\boldsymbol{w}}) - {\boldsymbol{\rho }}} \right\Vert^2}}\, = o_p(1),$$

(A.8)

$$\mathop {{\sup }}\limits_{{\boldsymbol{w}} \in {\cal W}} \frac{{\left| {\left\{ {\widehat {\boldsymbol{\rho }}^ \ast ({\boldsymbol{w}}) - {\boldsymbol{\rho }}} \right\}^\prime \left\{ {\widehat {\boldsymbol{\rho }}({\boldsymbol{w}}) - \widehat {\boldsymbol{\rho }}^ \ast ({\boldsymbol{w}})} \right\}} \right|}}{{\left\Vert {\widehat {\boldsymbol{\rho }}^ \ast ({\boldsymbol{w}}) - {\boldsymbol{\rho }}} \right\Vert^2}}\, = o_p(1),$$

(A.9)

and

$$\mathop {{\sup }}\limits_{{\boldsymbol{w}} \in {\cal W}} \frac{{\left| {\left\{ {\widehat {\boldsymbol{\rho }}({\boldsymbol{w}}) - {\boldsymbol{\rho }}} \right\}^\prime \left\{ {\widehat {\boldsymbol{\rho }}_{{\mathrm{full}}} - {\boldsymbol{\rho }}} \right\}} \right|}}{{\left\Vert {\widehat {\boldsymbol{\rho }}^ \ast ({\boldsymbol{w}}) - {\boldsymbol{\rho }}} \right\Vert^2}} = o_p(1).$$

(A.10)

From Eq. (17) and Assumption C.1, we have

$$\begin{array}{l}\mathop {\sup}\limits_{{\boldsymbol{w}} \in {\cal W}} \frac{{\left\Vert {\widehat {\boldsymbol{\rho }}({\boldsymbol{w}})\, - \,\widehat {\boldsymbol{\rho }}^ {\ast} ({\boldsymbol{w}})} \right\Vert^{2}}}{{\left\Vert {\widehat {\boldsymbol{\rho }}^ \ast ({\boldsymbol{w}}) - \,{\boldsymbol{\rho }}} \right\Vert^{2}}}\\{\le} \xi _n^{ - 1}\mathop {\sum}\limits_{i = 1}^{n} {\mathop {\sup}\limits_{{\boldsymbol{w}} \in {\cal W}} } \left\{ {\widehat \rho _i({\boldsymbol{w}}) - \widehat \rho _i^ {\ast} ({\boldsymbol{w}})} \right\}^{2}\\ = \xi _n^{ - 1}\mathop {\sum}\limits_{i = 1}^{n} {\mathop {\sup}\limits_{{\boldsymbol{w}} \in {\cal W}} } \left\{ {\mathop {\sum}\limits_{s = 1}^{S} {w_s} (\widehat \rho _{s,i} - \widehat \rho _{s,i}^{\ast} )} \right\}^{2}\\ \le \xi _n^{ - 1}\mathop {\sum}\limits_{i = 1}^{n} {\mathop {sup}\limits_{1 \le s \le S} } (\widehat \rho _{s,i} - \widehat \rho _{s,i}^{\ast})^{2}\\ = \xi _{n}^{ - 1}\mathop {\sum}\limits_{i = 1}^{n} {\mathop {\sup}\limits_{1 \le s \le S} } \left\{ {\frac{{\partial \widehat \rho _{s,i}}}{{\partial \widehat {\boldsymbol{\theta }}_s{\prime} }}|_{\widehat {\boldsymbol{\theta }}_s = \widetilde {\boldsymbol{\theta }}_{s,i}}(\widehat {\boldsymbol{\theta }}_s - {\boldsymbol{\theta }}_s^{\ast} )} \right\}^{2}\\ = O_p(\xi _n^{ - 1}).\end{array}$$

(A.11)

It follows from (A.11) and Assumption C.2 that (A.8) holds. In a similar way, we can prove that (A.9) and (A.10) hold. This proves Theorem 2.

A.3. Proof of Theorem 3. It can be seen that

$$\begin{array}{c}CV_J({\boldsymbol{w}}) = \left\Vert {\widetilde {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{y}}} \right\Vert^2\\ = \left\Vert {\widehat {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}} + \widetilde {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}^ \ast ({\boldsymbol{w}}) - (\widehat {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}^ \ast ({\boldsymbol{w}})) + {\boldsymbol{b}} - {\boldsymbol{y}}} \right\Vert^2\\ \le \left\Vert {\widehat {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}} \right\Vert^2 + \left\Vert {\widetilde {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}^ \ast ({\boldsymbol{w}})} \right\Vert^2 + \left\Vert {\widehat {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}^ \ast ({\boldsymbol{w}})} \right\Vert^2 + \left\Vert {{\boldsymbol{b}} - {\boldsymbol{y}}} \right\Vert^2\\ + \left\Vert {\widehat {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}} \right\Vert\left\Vert {\widetilde {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}^ \ast ({\boldsymbol{w}})} \right\Vert + \left\Vert {\widehat {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}} \right\Vert\left\Vert {\widehat {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}^ \ast ({\boldsymbol{w}})} \right\Vert + \left| {(\widehat {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}){\prime}({\boldsymbol{b}} - {\boldsymbol{y}})} \right|\\ + \left\Vert {\widetilde {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}^ \ast ({\boldsymbol{w}})} \right\Vert\left\Vert {\widehat {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}^ \ast ({\boldsymbol{w}})} \right\Vert + \left\Vert {\widetilde {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}^ \ast ({\boldsymbol{w}})} \right\Vert\left\Vert {{\boldsymbol{b}} - {\boldsymbol{y}}} \right\Vert + \left\Vert {\widehat {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}^ \ast ({\boldsymbol{w}})} \right\Vert\left\Vert {{\boldsymbol{b}} - {\boldsymbol{y}}} \right\Vert\\ \le \left\Vert {\widehat {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}} \right\Vert^2 + \left\Vert {\widetilde {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}^ \ast ({\boldsymbol{w}})} \right\Vert^2 + \left\Vert {\widehat {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}^ \ast ({\boldsymbol{w}})} \right\Vert^2\\ + \left\Vert {\widehat {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}^ \ast ({\boldsymbol{w}})} \right\Vert\left\Vert {\widetilde {\boldsymbol{b}}({\mathbf{w}}) - {\boldsymbol{b}}^ \ast ({\boldsymbol{w}})} \right\Vert + \left\Vert {{\boldsymbol{b}}^ \ast ({\boldsymbol{w}}) - {\boldsymbol{b}}} \right\Vert\left\Vert {\widetilde {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}^ \ast ({\boldsymbol{w}})} \right\Vert\\ + \left\Vert {\widehat {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}^ \ast ({\boldsymbol{w}})} \right\Vert\left\Vert {\widehat {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}^ \ast ({\boldsymbol{w}})} \right\Vert + \left\Vert {{\boldsymbol{b}}^ \ast ({\boldsymbol{w}}) - {\boldsymbol{b}}} \right\Vert\left\Vert {\widehat {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}^ \ast ({\boldsymbol{w}})} \right\Vert\\ + \left\Vert {\widehat {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}^ \ast ({\boldsymbol{w}})} \right\Vert\left\Vert {{\boldsymbol{b}} - {\boldsymbol{y}}} \right\Vert + \left| {({\boldsymbol{b}}^ \ast ({\boldsymbol{w}}) - {\boldsymbol{b}}){\prime}({\boldsymbol{b}} - {\boldsymbol{y}})} \right| + \left\Vert {\widetilde {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}^ \ast ({\boldsymbol{w}})} \right\Vert\left\Vert {\widehat {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}^ \ast ({\boldsymbol{w}})} \right\Vert\\ + \left\Vert {\widetilde {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}^ \ast ({\boldsymbol{w}})} \right\Vert\left\Vert {{\boldsymbol{b}} - {\boldsymbol{y}}} \right\Vert + \left\Vert {\widehat {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}^ \ast ({\boldsymbol{w}})} \right\Vert\left\Vert {{\boldsymbol{b}} - {\boldsymbol{y}}} \right\Vert + \left\Vert {{\boldsymbol{b}} - {\boldsymbol{y}}} \right\Vert^2\\ \equiv \left\Vert {\widehat {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}} \right\Vert^2 + {\Pi}_n({\boldsymbol{w}}) + \left\Vert {{\boldsymbol{b}} - {\boldsymbol{y}}} \right\Vert^2\end{array}$$

and

$$\begin{array}{c}\left\Vert \,{\widehat {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}} \right\Vert^2 = \left\Vert {\widehat {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}^ \ast ({\boldsymbol{w}}) + {\boldsymbol{b}}^ \ast ({\boldsymbol{w}}) - {\boldsymbol{b}}} \right\Vert^2\\ = \left\Vert {{\boldsymbol{b}}^ \ast ({\boldsymbol{w}}) - {\boldsymbol{b}}} \right\Vert^2 + \left\Vert \,{\widehat {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}^ \ast ({\boldsymbol{w}})} \right\Vert^2 + \,2\left( {\widehat {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}^ \ast ({\boldsymbol{w}})} \right)\prime ({\boldsymbol{b}}^ \ast ({\boldsymbol{w}}) - {\boldsymbol{b}})\\ = \left\Vert {{\boldsymbol{b}}^ \ast ({\boldsymbol{w}}) - {\boldsymbol{b}}} \right\Vert^2 + \,\Xi _n({\boldsymbol{w}}).\end{array}$$

Hence to prove Theorem 3, it suffices to show that

$$\mathop {{\sup }}\limits_{{\boldsymbol{w}} \in {\cal W}} \frac{{\Xi _n({\boldsymbol{w}})}}{{\left\Vert {{\boldsymbol{b}}^ \ast ({\boldsymbol{w}}) - {\boldsymbol{b}}} \right\Vert^2}} = o_p(1)$$

(A.12)

and

$$\mathop {{\sup }}\limits_{{\boldsymbol{w}} \in {\cal W}} \frac{{{\Pi}_n({\boldsymbol{w}})}}{{\left\Vert {{\boldsymbol{b}}^ \ast ({\boldsymbol{w}}) - {\boldsymbol{b}}} \right\Vert^2}}\, = o_p(1).$$

(A.13)

Similar to the proof of (A.2), by Eq. (17) and Assumption C.4, we have

$$\begin{array}{l}\mathop {{\sup }}\limits_{{\boldsymbol{w}} \in {\cal W}} \left\Vert {\widehat {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}^ \ast ({\boldsymbol{w}})} \right\Vert^2\, = O_p(1)\quad \\{\mathrm{and}}\\ \mathop {{\sup }}\limits_{{\boldsymbol{w}} \in {\cal W}} \left\Vert {\widetilde {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}^ \ast ({\boldsymbol{w}})} \right\Vert^2\, = O_p(1)\end{array}.$$

(A.14)

It is readily seen that

$$\left\Vert {{\boldsymbol{b}} - {\boldsymbol{y}}} \right\Vert^2\, = O_p(n)$$

(A.15)

and

$$\mathop {{\sup }}\limits_{{\boldsymbol{w}} \in {\cal W}} \frac{{\left\Vert {{\boldsymbol{b}}^ \ast ({\boldsymbol{w}}) - {\boldsymbol{b}}} \right\Vert\left\Vert {\widehat {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}^ \ast ({\boldsymbol{w}})} \right\Vert}}{{\left\Vert {{\boldsymbol{b}}^ \ast ({\boldsymbol{w}}) - {\boldsymbol{b}}} \right\Vert^2}}, = \mathop {{\sup }}\limits_{{\boldsymbol{w}} \in {\cal W}} \frac{{\left\Vert {\widehat {\boldsymbol{b}}({\boldsymbol{w}}) - {\boldsymbol{b}}^ \ast ({\boldsymbol{w}})} \right\Vert}}{{\left\Vert {{\boldsymbol{b}}^ \ast ({\boldsymbol{w}}) - {\boldsymbol{b}}} \right\Vert}}.$$

(A.16)

For any δ > 0,

$$\begin{array}{l}Pr\{ \mathop {{\sup }}\limits_{{\boldsymbol{w}} \in {\cal W}} \zeta _n^{ - 1}|({\boldsymbol{b}}^ \ast ({\boldsymbol{w}}) - {\boldsymbol{b}}){\prime}({\boldsymbol{b}} - {\boldsymbol{y}})| \,{>}\, \delta \} \\ \le Pr\{ \mathop {{\sup }}\limits_{{\boldsymbol{w}} \in {\cal W}} \zeta _n^{ - 1}\mathop {\sum}\limits_{s = 1}^S {w_s} |({\boldsymbol{b}}_m^ \ast - {\boldsymbol{b}}){\prime}({\boldsymbol{b}} - {\boldsymbol{y}})| \,{>}\, \delta \} \\ = Pr\{ \mathop {{\max }}\limits_s |({\boldsymbol{b}}_s^ \ast - {\boldsymbol{b}}){\prime}({\boldsymbol{b}} - {\boldsymbol{y}})| \,{>}\, \zeta _n\delta \} \\ \le \mathop {\sum}\limits_{s = 1}^S {Pr} \{ |({\boldsymbol{b}}_s^ \ast - {\boldsymbol{b}}){\prime}({\boldsymbol{b}} - {\boldsymbol{y}})| \,{>}\, \zeta _n\delta \} \\ \le \mathop {\sum}\limits_{s = 1}^S E \{ \zeta _n^{ - 2}\delta ^{ - 2}({\boldsymbol{b}}_s^ \ast - {\boldsymbol{b}}){\prime}({\boldsymbol{b}} - {\boldsymbol{y}})\} ^2\\ = \mathop {\sum}\limits_{s = 1}^S E \left[ {E\{ \zeta _n^{ - 2}\delta ^{ - 2}({\boldsymbol{b}}_s^ \ast - {\boldsymbol{b}}){\prime}({\boldsymbol{b}} - {\boldsymbol{y}})|{\boldsymbol{x}},{\boldsymbol{z}}\} ^2} \right]\\ = \mathop {\sum}\limits_{s = 1}^S E \left[ {\zeta _n^{ - 2}\delta ^{ - 2}({\boldsymbol{b}}_s^ \ast - {\boldsymbol{b}}){\prime}{\mathrm{var}}({\boldsymbol{b}} - {\boldsymbol{y}}|{\boldsymbol{x}},{\boldsymbol{z}})({\boldsymbol{b}}_s^ \ast - {\boldsymbol{b}})} \right]\\ \le \mathop {\sum}\limits_{s = 1}^S E \left[ {\zeta _n^{ - 2}\delta ^{ - 2}\sigma ^2\left\Vert {{\boldsymbol{b}}_s^ \ast - {\boldsymbol{b}}} \right\Vert^2} \right],\end{array}$$

where x = (x₁, …, x_n)′ and z = (z₁, …, z_n)′, and σ² is defined in the line above Assumption C.3. Together with Assumption C.3, this implies

$$\mathop {{\sup }}\limits_{{\boldsymbol{w}} \in {\cal W}} \frac{{|({\boldsymbol{b}}^ \ast ({\boldsymbol{w}}) - {\boldsymbol{b}}){\prime}({\boldsymbol{b}} - {\boldsymbol{y}})|}}{{\left\Vert {{\boldsymbol{b}}^ \ast ({\boldsymbol{w}}) - {\boldsymbol{b}}} \right\Vert^2}} = o_p(1).$$

(A.17)

By combining (A.14)–(A.17) and Assumption C.3, we can obtain (A.12) and (A.13) and hence Eq. (21). This completes the proof of Theorem 3.

A.4. Simulation Results with Smaller Hold Out Sample. Similar to Tables 2 and 4 presents the mean, median and standard deviation of $\left\Vert {{\boldsymbol{b}}(\widehat {\boldsymbol{w}}) - {\boldsymbol{b}}} \right\Vert^2/n_{}^{}$, where b = E[y|x, z], based on AIC and BIC model selection, as well as s-AIC and s-BIC model averaging, the full (correctly specified model) and two variants of JCVMA across 1000 replications. JCVMA1 uses ${\boldsymbol{b}}(\widehat {\boldsymbol{w}}) = \widehat {\boldsymbol{b}}(\widehat {\boldsymbol{w}})$, as defined in Eq. (11), the fitted values from S candidate models not using hold-out samples, while JCVMA2 uses ${\boldsymbol{b}}(\widehat {\boldsymbol{w}}) = \widetilde {\boldsymbol{b}}(\widehat {\boldsymbol{w}})$, as defined in Eq. (12), the leave-J-observations-out fitted values from the estimated models. Note that both JCVMA1 and JCVMA2 use the same weights, $\widehat {\boldsymbol{w}}$, obtained from Eq. (20), they just conduct the averaging over different sets of fitted values. For all the simulations we leave out 2.5% of the sample size for our hold out prediction (i.e., for n = 200, we hold out 5 observations at a time, for n = 400 we hold out 10 observations at a time, etc.).

Table 4 Simulation results for the JCVMA estimator—1000 simulations. Holdout sample is 2.5% of n

Full size table

Several insights are immediate from Table 4 relative to the results from Table 2. JCVMA2 still outperforms JCVMA1. JCVMA2 always outperforms the other methods in terms of mean risk, and also has equal standard deviation of risk. Comparing mean and median risk, it does not appear that the size of the hold out sample has much effect on the performance of either of the JCVMA estimators.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Parmeter, C.F., Wan, A.T.K. & Zhang, X. Model averaging estimators for the stochastic frontier model. J Prod Anal 51, 91–103 (2019). https://doi.org/10.1007/s11123-019-00547-8

Download citation

Published: 04 April 2019
Issue Date: 01 June 2019
DOI: https://doi.org/10.1007/s11123-019-00547-8

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Model averaging estimators for the stochastic frontier model

Abstract

Access this article

Similar content being viewed by others

Model uncertainty and efficiency measurement in stochastic frontier analysis with generalized errors

The conditional mode in parametric frontier models

The wrong skewness problem in stochastic frontier analysis: a review

Notes

References

Acknowledgements

Author contributions

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Appendices

Appendices

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Model averaging estimators for the stochastic frontier model

Abstract

Access this article

Similar content being viewed by others

Model uncertainty and efficiency measurement in stochastic frontier analysis with generalized errors

The conditional mode in parametric frontier models

The wrong skewness problem in stochastic frontier analysis: a review

Notes

References

Acknowledgements

Author contributions

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Appendices

Appendices

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation