Background

The vast majority of poor households living in the developing areas rely on agriculture for their food, income, and livelihood (Minten and Barrett 2008; Dethier and Effenberger 2012; Larsen and Lilleør 2014). Hence, the growth and development of the agriculture sector is considered to be the main pathway out of poverty and food insecurity and to have a pro-poor economic development in those areas (Diao et al. 2010; Kassie et al. 2013; Collier and Dercon 2014; Dawson et al. 2016).

The agriculture sector is the mainstay of the Ethiopian economy as it accounts for nearly 45% of GDP and employs about 85% of labor forces (Dercon et al. 2012). Despite its strategic role for the country’s economy, the sector is dominated by subsistence and semi-subsistence farming system (Alemu et al. 2006; Anley et al. 2007; Francesconi and Heerink 2011; Teshome et al. 2016). Smallholders own on average less than 1 ha land per holder account for about 95% of land covered by crops on which they produce about 90% of agricultural outputs in the country (CSA, 2014). Moreover, the ever-increasing population of the country is reducing the farm sizes rapidly (Bezu and Holden 2014; Headey et al. 2014). This population growth coupled with a legal restriction on agricultural land market hindered farm expansion that made farms smaller and hiring labor superfluous, which created a significant level of unemployment in the rural part of the country (Bezu and Holden 2014). Hence, there has to be a mechanism to support the livelihood of the agricultural communities and absorb the excess labor in the rural economy. As emphasized by Lanjouw and Lanjouw (2001), the main options available for the unemployed rural community in this scenario are either migration to urban areas or engagement in off-farm activities in rural areas. Actually, smallholder farmers in developing countries rarely rely on a full-time agricultural work; rather, they often maintain a portfolio of activities in which off-farm activities are an important contributions to their well-being (Barrett et al. 2001; Foster and Rosenzweig 2004; Smith et al. 2005; Wouterse and Taylor 2008; Lanjouw and Murgai 2009; Zezza et al. 2009; Davis et al. 2010; Haggblade et al. 2010).

Empirical findings consistently show that incomes generated from off-farm activities ease the burden on agriculture as it enables households to have better incomes. Hence, they enhance food security as they manage food consumption fluctuations better than a household without such an activity (de Janvry and Sadoulet 2001; Ruben 2001; Barrett et al. 2001; Babatunde and Qaim 2010; Haggblade et al. 2010; Owusu et al. 2011; Bezu et al. 2012; Hoang et al. 2014; Mishra et al. 2015). Especially when the agricultural production fails due to climate change, pest, deceases, or other shocks, off-farm employment provides a risk management tool to reduce the income variability and will fill the gap that will be created between farm income and household consumption El-Osta et al (2008). By engaging in off-farm employment, farmers also become self-insured (Alasia et al. 2009) and they may invest in a risky but high-returning agricultural business.

Off-farm income can also enhance agricultural production by relaxing liquidity and credit constraints to purchase productivity enhancing agricultural technologies such as improved seed, fertilizer, machineries, and hiring labor (Ruben 2001; Lamb 2003; Matshe and Young 2004; Kilic et al. 2009; Oseni and Winter 2009; Anriquez and Daidone 2010). This is particularly true in developing countries where farmers are facing credit constraints (Stampini and Davis 2009).

Though there are ample empirical evidences on the impact of off-farm income on adoption of productivity-enhancing agricultural inputs, its impact on farm efficiency is mixed. For example, Mochebelele and Winter-Nelson (2000), Tijani (2006), Haji (2007), Pfeiffer et al. (2009), Bojnec and Ferto (2013), and Babatunde (2013) found a positive significant impact of off-farm income on farm efficiency. There are also cases (such as Goodwin and Mishra (2004), Chang and Wen (2011), and Kilic et al. (2009)) in which participation in off-farm activities has an adverse effect on the agriculture. They argued that if the income from the off-farm activities is more attractive than the agriculture, farmers might give less attention for the agriculture and they might devote more family labor and time for off-farm activities. There are also studies (such as Chavas et al. 2005, Bozoğlu and Ceyhan 2007; Lien et al. 2010, Feng et al. 2010; Chang and Wen 2011) that found no significant association between the two variables. Table 1 summarizes some of the empirical works that examined the role of off-farm activities on technical efficiency.

Table 1 Summary of the studies which examined the relationship between off-farm income and technical efficiency

As it can be observed from Table 1, the findings are not consistent. This disagreement between empirical results could be due to the fact that the role of off-farm activities on farm productivity is different from context to context. The other important thing learnt from the above empirical works is that selectivity bias is not addressed in most of the cases; therefore, the reported impact and associated technical efficiency (TE) scores are likely to be biased (Bravo-Ureta et al. 2012; González-Flores et al. 2014). Other studies like Abebe (2014) relied on a single frontier equation for both those who participated in off-farm activities and non-participants without giving due attention for potential differences between the two groups.

Hence, unlike previous works, this paper aims to estimate the impact of participation in off-farm activities on TE of smallholder maize producers of eastern Ethiopia by combining propensity score matching with the Greene (2010) model to control for both observed and unobserved heterogeneities.

Therefore, the results of this study adds to the existing literature (e.g., Abate et al. 2016; Barrett et al. 2001; Beyene 2008; Feng et al. 2010) through identifying the factors affecting the participation in off-farm activities. As the study evaluates the impact of participation in off-farm activities on the TE of maize producers, the study indicates the relationship between off-farm and on-farm activities and indicates whether there exists trade-off or complementarity between the two income-generating activities.

Methods

Econometric framework and estimation strategies

Estimation of technical efficiency

Technical efficiency (TE) refers to the ability of a decision-making unit (DMU) to produce the maximum feasible output from a given bundle of inputs (Farrell 1957; Xiaogang et al. 2005). Any deviation from this maximal output is considered as technical inefficiency (Coelli et al. 2005). TE can be measured by using either parametric or non-parametric approaches (Simar and Wilson 2015). The main difference between the two approaches is that the non-parametric approach assumes that the DMU has full control on the production process and all deviations from the frontier are associated with inefficiency. The parametric approach on the other hand distinguishes inefficiency from deviations that are caused by factors beyond the control of the DMU. Given the intrinsic variability of agricultural production due to factors like climatic change, plant pathology, and insect and pests, the assumption that all deviations from the frontier are associated with inefficiency, as assumed in the non-parametric approaches, is difficult to accept (Bauman et al. 2016). Hence, this study has adopted a parametric approach, specifically stochastic frontier analysis (SFA). Studies that utilized SFA to measure TE include Binam et al. 2004; Balcombe et al. 2007; Bozoğlu and Ceyhan 2007; Gedara et al. 2012; and Xu et al. 2015. The stochastic production frontier (SPF) function was independently proposed by Aigner et al. 1977 and Meeusen and Von den Broeck 1977). This model can be expressed in the following form.

$$ {Y}_i=F\left({X}_i;\beta \right)\exp \left({V}_i-{U}_i\right)\kern1.25em I=1,2,3\dots n $$
(1)

where Y i is the observed maize production of the ith farmer, X i is a vector of inputs used by the ith farmer, β is a vector of unknown parameters, V i is the stochastic effect beyond the farmer’s control, measurement errors, and other statistical noises which are assumed to be \( N\left(0,{\sigma}_v^2\right) \)and independent of the U i which is a non-negative random variable assumed to account for technical inefficiency in production. As SFA requires prior specification of functional form of the production function, a log-likelihood ratio (LR) test was conducted to choose between the Cobb-Douglas and translog functional forms using the following formula given by (Coelli 1998).

$$ LR=-2\times \left(\ln {L}_{TL}-\left(\ln {L}_{CB}\right)\right) $$
(2)

where lnL TL and lnL CB represent the log-likelihood function values obtained from the translog and the Cobb-Douglas production function, respectively. The LRFootnote 1 test result indicated that Cobb-Douglas production function is a more appropriate functional form for this study than the alternative translog production function. Several studies have utilized Cobb-Douglas production including Bozoğlu and Ceyhan 2007; Pfeiffer et al. 2009, and González-Flores et al. 2014.

Impact evaluation and efficiency estimation

Following the works of González-Flores et al. (2014) and Bravo-Ureta et al. (2012), we measured the impact of participating in off-farm activities on TE by combining propensity score matching (PSM) technique with the Greene (2010) model to correct biases from observed characteristics and for selectivity bias arising from unobserved variables, respectively. The first step in PSM is to predict the propensity score which is equal to the probability of receiving treatment, considering both treated and non-treated groups based on a given set of predetermined covariates, using a binary choice model (Cameron and Trivedi 2005). This step is followed by imposing the common support region, which is the area within the minimum and maximum propensity scores of treated and comparison groups, respectively (Caliendo and Kopeinig 2008). Several recent studies have applied PSM within the impact evaluation literature (e.g., Becerril and Abdulai 2010; Mishra et al. 2016; Abate et al. 2016; Chagwiza et al. 2016).

Nevertheless, PSM works only if selection is solely based on observable characteristics and potential outcomes are independent of treatment assignment. If unobservable characteristics affect the outcome variable, PSM is not the appropriate technique (Takahashi and Barrett 2013; Khonje et al. 2015). Hence, to handle biases from the unobserved characteristics, we used the model introduced by Greene (2010). According to this model, the sample selection and SPF models, along with their error structures, can be expressed as follows:

$$ {\displaystyle \begin{array}{l}\mathrm{Sample}\ \mathrm{selection}:{F}_i=1\left[{\alpha}^{\prime }{\mathbf{z}}_i+{\omega}_i>0\right],{\omega}_i\sim N\ \left[0,1\right]\\ {}\kern1.75em \mathrm{SPF}:\kern3.5em {y}_i={\beta}^{\prime }{\mathbf{x}}_i+{\varepsilon}_i,{\varepsilon}_i\sim N\ \left[0,{\sigma}_{\varepsilon}^2\right]\ \\ {}\kern7em \left({y}_i,{\mathbf{x}}_i\right)\ \mathrm{observed}\ \mathrm{only}\ \mathrm{when}\ {F}_{\mathrm{i}}=1\kern1em \end{array}} $$
(4)
$$ {\displaystyle \begin{array}{l}\mathrm{Error}\ \mathrm{structure}:{\varepsilon}_i={v}_i-{u}_i\\ {}\kern6.5em {u}_i=\kern0.5em \left|{\sigma}_u{U}_i\right|={\sigma}_u\left|{U}_i\right|,\kern0.5em \mathrm{where}\kern0.5em {U}_i\sim N\left[0,1\right]\\ {}\kern6.75em {v}_i={\sigma}_v{V}_i,\kern0.5em \mathrm{where}\kern0.5em {V}_i\sim N\left[0,1\right]\\ {}\kern6.25em \left({\omega}_i,{v}_i\right)\sim {N}_2\ \left[0,1\right),\kern0.5em \left(1,{\rho \sigma}_v,{\sigma}_v^2\right)\Big]\end{array}} $$

where F is a binary variable equal to one for participants on off-farm activities and zero for control, y is amount of maize produced, z is a vector of covariates included in the sample selection equation, and x is a vector of inputs in the production frontier; α and β are parameters to be estimated while ε i denotes the characters in the error structure corresponding to the typical characterization of a stochastic frontier model and ρ captures the presence or absence of selectivity bias.

Study areas and sampling technique

This study is undertaken in the eastern part of Ethiopia specifically in the East Hararghe Zone of Oromia National Regional State. The Zone is classified into three major climatic categories namely, temperate tropical highlands, semi-temperate, and semi-arid. This wide range of agroecology allowed the area to produce different types of products including cereals, pulses, oilseed, vegetables, fruits, and cash crops such as coffee and chat. Among the cereal crops, maize is the dominant crop as both the size of land allocated to it and the number of households producing it was the highest compared to the other cereal crops cultivated in the zone (CSA 2014). From the selected zone, two districts namely Haramaya and Girawa were selected for this study based on their extent of maize production. Next, four rural kebelesFootnote 2 were randomly picked from both districts. Finally, 355 households (76 (which is equivalent to 21.41%) participants and 279 non-participants) were selected, proportional to the size of maize-producing farmers using simple random sampling technique with replacement.Footnote 3 Then, the primary data were collected using structured questionnaires administered by trained enumerators from February to March 2016.

Results and discussions

Descriptive statistics

Before embarking to the econometrics results, it is important to give brief information regarding the sample respondents and variables used in the econometrics model. Accordingly, Table 2 presents descriptive statistics of the variablesFootnote 4 used for this study. As it is indicated in the table, 21% of the respondents have been participating in off-farm income-generating activities, which include selling firewood, renting assets, trading, and remittance. The mean age of the sample respondents is about 38 years. Nearly 90% of the sample households are headed by male. Concerning their educational status, 63.1% of the respondents and 24.2% of the spouses are literate. The mean family size of the respondents measured by adult equivalent (AE)Footnote 5 is 5.36. As far as the asset ownership is concerned, on average, they have 2.79 quxiFootnote 6 of land and 3.01 units of livestock measured by tropical livestock units (TLUs). The sample respondents, on average, travel about 35 min to reach the nearest market, and about 22% of respondents are members of agricultural cooperatives. The t test indicated that there is a statistical mean difference between off-farm participant and non-participant households in terms of educational status of the household head and the spouse, sex of the household head, and agricultural cooperative membership. Table 2 also indicated the input and output variables used to estimate the stochastic frontier production functions.

Table 2 Descriptive statistics

Econometrics results

Determinants of participation in off-farm activities

This sub-section presents the result of the probit regression model, which was used to estimate the propensity score for matching the off-farm participants with non-participants. The model sufficiently fitted the data at one significant level (LR χ2 (12) = 42.34; Prob > χ2 = 0.00). The result, presented in Table 3, reveals that sex of the household head, literacy of the spouse, agricultural cooperative membership, family size, and access to market information have significant effect on farmers’ participation in off-farm activities. Specifically, the result implies that being a female-headed household increases the probability of participation in off-farm activities.

Table 3 Result of the Probit model of factors determining participation in off-farm activities

Female-headed households are more likely to participate in off-farm activities than the male-headed counterparts because female-headed families engage in off-farm activities to offset their relative lower farm income compared to male-headed ones. However, this is against the result found by Beyene (2008).

Literacy of the spouse influences the participation in off-farm activities positively. Educated people have lower incentive to obtain income from own farming because educated people often have access to higher paying off-farm jobs (Lanjouw 2001; Satriawan and Swinton 2007) compared to uneducated ones. Therefore, the educated spouse would engage in higher earning off-farm activities so as to improve the livelihood of their family.

Agricultural cooperative membership has a positive influence on farmers’ participation in off-farm activities. When a farmer becomes a cooperative member, he/she can get relatively more information from his/her fellows about the available off-farm works compared to non-members. Hence, his/her participation in off-farm activities would be more likely than the non-members.

Family size measured in terms of adult equivalent is an indicator of labor availability, and it has a positive influence to participate in off-farm activities. This is consistent with the results by Tassew et al. (2000) in such a way that large family size increases the households’ participation in off-farm works since a larger family size requires relatively higher marginal income. This means that participation in off-farm activities can maintain the burden of large family size. Thus, households with large family size would have abundant labor and send some of the family members to off-farm activities.

Access to market information has a positive influence on farmers’ participation in off-farm activities. Access to information about the availability of high-earning off-farm activities would give an opportunity to participate in those activities.

After estimating the probit model, we predicted the propensity scores and the common support region. The values of propensity scores of both treatment and comparison groups are found between 0.04806759 and 0.71167307. This leads to the total matched sample size of 344 respondents of whom 76 are participants and 268 are non-participants. Figure 1 shows the density estimates of the distribution of propensity scores for each group, along with the areas with and without common support.

Fig. 1
figure 1

The distribution of propensity scores for treated and untreated groups

Estimation of the SPF and measuring technical efficiency

Once the matched samples are constructed, the next step is to determine if the conventional SPF should be run for the whole sample or if separate frontiers are necessary for participant and non-participant farmers. To determine this, we conducted a LR test based on following specification:

$$ LR=-2\times \left(\ln {L}_P-\left(\ln {L}_d+\ln {L}_C\right)\right) $$
(5)

where lnL P , lnL d , and lnL C represent the log-likelihood function values obtained from the pooled model, the participant, and the non-participant subsamples, respectively. Hence, we first estimated a SPF with pooled data by including a binary variable, Off_farm, as a regressor, which indicates whether the household participates or not in off-farm activities, and two separate SPF models, one for households participating in off-farm activities and the other for non- participants. The pooled SPF models indicated that there is no significant difference between the two groups of farmers in their TE score as the variable Off_farm are insignificant under both matched and unmatched SPF. However, this result is rejected by the LR test, which favors separate frontiers indicating the two groups have different production function. Then, to correct for the possible bias from observable heterogeneities, we re-estimated the above three frontiers (pooled, participants, and non-participants) using the matched data set.Footnote 7 However, results of LR test again supported separate frontiers for each groups. Finally, to correct for the possible bias from unobservables, two separate SPF models are re-estimated using Greene’s (2010) selection correction framework. The results of the SPF models are presented in Table 4 for the unmatched samples and in Table 5 for the matched samples.

Table 4 Conventional and selection SPF based on unmatched observation
Table 5 Conventional and selection SPF based on matched observations

The estimated value of sigma are significant at less than 1% probability level for all frontier functions indicating the conventional average production function is not an adequate representation of the data. The coefficients of rho, which indicate the presence of selection bias, are also significant in all selection frontiers except for the non-participants in the case of the unmatched data. This result suggests the presence of selection bias, thus lending support to the use of a sample selection framework to estimate separate SPFs for the beneficiaries and control groups.

As it is presented in the tables, the signs of all significant variables are positive as we expected. However, different input-output responses are found between these two groups of farmers. The result indicates that expenditure on inorganic fertilizer and seed are significant in all frontiers, while labor is found to be insignificant for non-participant respondents, which indicate that the amount of labor has no impact in determining the production level of maize in this subsample. Almost in all frontiers, the average production elasticity of seed and land allocated for maize are the highest.

Impact of participation in off-farm activities on technical efficiency

After estimating the SPF corrected for both observed and unobserved heterogeneities, we predicted the TE score of each sample respondent. Figure 2 presents the distribution of TE score for both participant and non-participant farmers. The TE scores range between 39.27 and 99.56 for the entire population. It indicates that the TE scores of 37.21% percent of the maize producers considered for this study were below the mean efficiency level. Considering only the participants, the TE score is ranged between 59.39 and 99.56 and the corresponding figure for the nonparticipants is found between 39.27 and 99.06.

Fig. 2
figure 2

The distribution of TE score for participant and non-participant farmers

Table 6 summarizes the TE mean difference between the two groups after correcting for bias from both observable and non-observable heterogeneities. The results indicate a significant mean difference between the two groups. Specifically, the result reveals farmers who are participating in off-farm income-generating activities have 6.23% of technical efficiency gain compared with their non-participant counterparts. The result is consistent with Pfeiffer et al. (2009).

Table 6 TE scores after bias correction

As indicated in Table 5, the mean TE for pooled sample respondents was found to be 81.44% and the corresponding figure for households participating in off-farm activities and non-participants are 86.29 and 80.06%, respectively.

Finally, we examined which of the two groups (participants vs. nonparticipants) has higher output after controlling for biases from observed and unobserved variables. For this purpose, we compared the predicted frontier output of the two groups at three different input levels: (1) at the average for the smallest matched pair of farms, (2) at the average for the entire sample, and (3) at the average for the largest matched pair. The result is indicated along with a test of the mean difference in Table 7. As it is indicated in the table, households who are participating in off-farm activities are significantly more productive than the non-participant farmers. Thus, the analysis suggests that participants do not only exhibit higher TE but also higher total output.

Table 7 Predicted frontier output after bias correction

Conclusion

This study analyzed the determinants of participation in off-farm activities and examined the impact of participation in off-farm activities on technical efficiency of maize production in eastern Ethiopia using data collected from 355 households. The probit model results indicate that literacy of the spouse, agricultural cooperative membership, family size, and access to market information had positively and significantly affected the participation in off-farm activities whereas male household head had negative and significant effect on participation in off-farm activities. The stochastic frontier production function was used to estimate technical efficiency scores, and propensity score matching was combined with Green (2010) to control sample selection bias. The result indicated that participation in off-farm activities have a positive influence on both production and productivity of maize. The result suggests that participants in off-farm activities are more technically efficient in maize production than the non-participants. The finding of this study indicated the existence of complementarity between on-farm and off-farm activities. This result challenges the perception that participation in off-farm activities may lead to a reduction in on-farm production due to competition between farm and off-farm works for family labor. The result indicates that livelihood diversification improves the agriculture sector in addition to creation of more employment opportunities as it provides a medium for family labor reallocation. This confirms the fact that off-farm income enhances agricultural production and productivity by providing cash to purchase agricultural technologies and efficiently utilize those resources.

Hence, given the complementarities between off-farm and farm activities, providing services like education and market information to the farmers and strengthening agricultural cooperatives should be facilitated to increase the participation of farmers in off-farm activities and in turn to improve their livelihoods.