Quality of Life Research

, Volume 25, Issue 5, pp 1137–1149 | Cite as

Does reporting behaviour bias the measurement of social inequalities in self-rated health in Indonesia? An anchoring vignette analysis

Open Access
Article

Abstract

Purpose

Studies on self-rated health outcomes are fraught with problems when individuals’ reporting behaviour is systematically biased by demographic, socio-economic, or cultural factors. Analysing the data drawn from the Indonesia Family Life Survey 2007, this paper aims to investigate the extent of differential health reporting behaviour by demographic and socio-economic status among Indonesians aged 40 and older (\(N = 3735\)).

Methods

Interpersonal heterogeneity in reporting style is identified by asking respondents to rate a number of vignettes that describe varying levels of health status in targeted health domains (mobility, pain, cognition, sleep, depression, and breathing) using the same ordinal response scale that is applied to the self-report health question. A compound hierarchical ordered probit model is fitted to obtain health differences by demographic and socio-economic status. The obtained regression coefficients are then compared to the standard ordered probit model.

Results

We find that Indonesians with more education tend to rate a given health status in each domain more negatively than their less-educated counterparts. Allowing for such differential reporting behaviour results in relatively stronger positive education effects.

Conclusion

There is a need to correct for differential reporting behaviour using vignettes when analysing self-rated health measures in older adults in Indonesia. Unless such an adjustment is made, the salutary effect of education will be underestimated.

Keywords

Self-rated health Socio-economic status Reporting heterogeneity Anchoring vignette Indonesia 

Introduction

Both resource constraints and the multidimensionality of health concepts being studied often necessitate the collection of self-rated health (SRH) data. SRH measures, which ask individuals to report their health status either in general or on a specific health domain using an ordinal response scale, require no specialist intervention during data collection, are relatively cheap and quick to obtain, and are feasible to implement in large-scale surveys. In addition to the belief that SRH can capture aspects of health that cannot be tapped by objective measure [35], research has shown that SRH is highly correlated with assessments provided by health professionals [9] and that is also a strong predictor of mortality [15] as well as health care utilisation [30].

Notwithstanding these benefits, the use of SRH in the study of socio-economic inequalities in health becomes fraught with serious problems when individuals have different expectations, knowledge, or standards of what constitutes a good health. For example, when experiencing an identically severe health problem, poor individuals may paradoxically report better health than their richer counterparts (Fig. 1) simply because the poor have a much higher tolerance to health problems than the rich [28]. This is known in the literature as ‘reporting heterogeneity’ [29], ‘differential item functioning’ [19], ‘response category cut-point shift’ [22], ‘scale of reference bias’ [11], or simply ‘differential reporting’ [20].
Fig. 1

Problem of response-scale heterogeneity

To address this problem, the anchoring vignette method has been proposed [18, 19, 32, 36]. By means of this method, researchers can identify the individual-specific reporting style by asking respondents to rate a number of vignettes (hypothetical scenarios) that describe varying levels of health status in a health domain using the same ordinal response scale that is applied to the self-report health. Then, if one is willing to assume that, apart from random error, each vignette is perceived in the same way by all respondents (vignette equivalence assumption) and that they apply exactly the same standard to judge both their own health status and those of the vignettes (response consistency assumption), one can fit a compound hierarchical ordered probit (CHOPIT) model [19] to identify health inequalities that are free from bias due to heterogeneous reporting style.

Using anchoring vignette, it has been shown that among older individuals in eight European countries, there is strong evidence for the existence of differential health reporting by education level. Bago d’Uva et al. [3] found that highly educated older Europeans tend to have higher expectation of health than their less-educated peers and suggested that accounting for differences in the reporting of health is important because ‘measured health inequalities by education are often underestimated, and even go undetected, if no account is taken of these reporting differences’ [3, p. 1375]. However, when the authors analysed data from three most populous developing countries (China, India, and Indonesia), they found that in Indonesia and India, ‘there are either no differences in reporting by education or the better educated are more likely to report very good health’ [2, p. 362]. This finding defies conventional expectation; the authors then speculated that perhaps the Chinese sample, which has a higher level of education than the Indonesian and Indian, were more able to comprehend the vignette exercise.

Motivated by these mixed findings, this paper aims to investigate whether there is evidence for differential reporting behaviour by demographic and socio-economic status (SES) among Indonesians. We analyse the data from the fourth wave of the Indonesia Family Life Survey (IFLS 2007), which is among the very few population studies conducted in developing countries that employed a vignette rating module. The present study departs from the existing application of anchoring vignette method in Indonesia [2] in its use of a newer data set and of fewer and simpler vignettes, as well as in its analysis of a more homogeneous age group.

Methods

Study population

The data are drawn from the IFLS 2007, which is a multi-purpose household longitudinal study that collects information from more than 30,000 individuals from 12,000 households living in 260 districts in Indonesia and is representative of about 83 % of the entire population [25]. The IFLS 2007 is the only IFLS wave that has vignette module. Because the module was administered to only a fraction of study participants, the sample of this study is, by design, limited to 3735 adults aged 40 and older. These individuals were asked to report their self-assessment of health, but only one-third of them (1245 individuals) were subjected to the vignette rating questionnaire. Further details regarding sampling and ethical procedure are available in the IFLS’s documentation [25].

Measures

Survey respondents were asked to evaluate their own health in six health domains (mobility, pain, cognition, sleep, depression, and breathing) using the question ‘Overall in the last 30 days, how much of a problem did you have with \(\ldots\)?’. Responses were recorded using a five-category ordinal scale: (1) none, (2) mild, (3) moderate, (4) severe, and (5) extreme. In addition to this self-assessment, randomly selected respondents were also asked to evaluate the health status of hypothetical persons described in the vignettes. For each domain, three vignettes of varying severity were presented; respondents were then asked to think about these persons’ experiences as if they were their own and to rate the health status of the persons portrayed in the hypothetical scenarios in the same way they had rated their own health earlier. Vignettes were presented in the order of mild–moderate–severe health problem, and responses were recorded using the same response scale applied to the SRH. For ease of understanding, we reverse-coded the response scale so that a score of 5 represents very good health and a score of 1 represents very poor health.

The SES variables are education (entered as a dummy variable representing those who completed the 9-year compulsory education) and the logarithm of per capita household asset value. We opted to use these SES indicators rather than the usual indicators of income or expenditure because many respondents were already at the retirement age (56 or older). In this case, education is particularly relevant because it is probably the best measure of SES for older adults [12]. In later life, education serves as a good proxy for permanent income and is less endogenous than income as it is usually fixed early in life [12]. Per capita household asset value was measured from the total value of land, property, vehicles, poultry, livestock, fish ponds, hard stem plants, household appliances, household furniture and utensils, savings, deposit, stocks, receivables, and jewellery owned by the household members. Like education, assets are also considered as less endogenous than income due to their accumulative nature [21].

We also include respondents’ age groups (40–49, 50–59, 60–69, 70+), gender, marital status (married and unmarried), family size (dummy variable for those living with more than four household members), and urban or rural residential location.

Data analysis

For each health domain, we first fit an ordered probit (OPROBIT) model [10] to estimate the effect of demographic and SES variables on health. Then, we refit the same specification with a CHOPIT model [19] that generalises the OPROBIT by allowing cut points or thresholds to be different across individuals.

The CHOPIT model is comprised of two components: the self-assessment and the vignette rating component. In the self-assessment equation, we write the unobserved perceived level of health as:
$$\begin{aligned} y_{i}^{*}\sim N(\mu _{i},1) \end{aligned}$$
(1)
$$\begin{aligned} \mu _{i}= X_{i}\beta \end{aligned}$$
(2)
with subscript i denotes individuals responding to SRH questionnaire. Individuals’ actual health level \(\mu _{i}\) varies as a linear function of observed covariates \(X_{i}\) with parameter vector \(\beta\). Respondents then turn their perceived level of health \(y_{i}^{*}\) into reported ordinal category \(y_{i}\) via the following observation mechanism:
$$y_{i}=k\quad {\mathrm {if}}\; \tau _{i}^{k-1}<y_{i}^{*}<\tau _{i}^{k},\quad k=1,\ldots ,K$$
(3)
where
$$\begin{aligned} -\infty =\tau _{i}^{0}<\tau _{i}^{1}<\tau _{i}^{2}<\cdots <\tau _{i}^{K}=\infty \end{aligned}$$
(4)
To allow for individual-specific response category cut-point shift, thresholds \(\tau _{i}\) are modelled as a linear function of observed covariates \(X_{i}\) with parameter vector \(\gamma\) and are identified in the model using information obtained from the vignette rating exercise.
$$\tau _{i}^{1}= X_{i}\gamma ^{1}$$
(5)
$$\tau _{i}^{k}= \tau _{i}^{k-1}+X_{i}\gamma ^{k},\quad {\mathrm {for}}\quad k=2,\ldots ,K$$
(6)
In the vignette rating equation, we write the perceived level of health of the person described in vignette j evaluated by survey respondent i as:
$$z_{ij}^{*}\sim N(\theta _{j},\sigma ^{2})$$
(7)
The actual health level of the person described in the vignette (\(\theta _{j}\)) is assumed to be identical for every respondent, hence formalising the ‘vignette equivalence’ assumption. As in the self-assessment part of the model, respondents then turn the perceived level of health \(z_{ij}^{*}\) into the same K ordinal category via similar mechanism:
$$\begin{aligned} z_{ij}= & {} k\quad {\mathrm {if}}\;\tau _{ij}^{k-1}<z_{ij}^{*}<\tau _{ij}^{k},\quad k=1,\ldots ,K \end{aligned}$$
(8)
Thresholds in the vignette rating equation are determined by the same \(\gamma\) parameter as in the self-assessment part, but note that the sample used in each model component need not be identical. The appearance of the same \(\gamma\) parameter vector in both self-assessment and vignette rating components thus formalises the ‘response consistency’ assumption.

For identification and model comparability purposes, the standard ordered probit normalisation restriction (intercept is fixed at zero; variance is set to one) [37] is imposed upon both OPROBIT and CHOPIT models. Then, formal tests of reporting homogeneity (\(H_{0}{:}\;{\mathrm {all}}\;\gamma =0\)) and parallel cut-point shift (\(H_{0}{:}\;\gamma ^{1}=\gamma ^{2}=\cdots =\gamma ^{K-1}\)) [16] are performed after acquiring the estimate of the CHOPIT model, accompanied by graphical illustrations when necessary. To facilitate interpretation, we also compute the partial effect of relevant variables on the probability of reporting very good health [16].

Only complete observations are used in the modelling exercise, yielding a sample size of 3069 individuals in the SRH equations (82 % of the original sample) and 939–1130 individuals in the vignette rating equations (75–90 % of the original sample).

Results

We begin with a description of the sample. The mean age is 53.95 (SD = 10.81, median = 52, IQR = 16); half of the sample (52.8 %) are female and 20 % are unmarried. The majority of the sample (77.4 %) live with at least five household members; about half (49.18 %) live in urban area and only one-third (37.92 %) completed the 9-year compulsory education. Per capita household asset value is log-normally distributed with a mean equal to USD 1660 (SD = 3800, median = 721, IQR = 1368). The well-behaved histograms in Fig. 2 show that respondents seem to understand the vignette rating exercise very well: the ratings of moderate health problems are symmetrically distributed, while those of mild and severe health problems are left- and right-skewed, respectively. Overall, there is no marked difference between the characteristics of the SRH sample and those of the vignette sample.
Fig. 2

Distribution of vignette ratings (1 extreme, 2 severe, 3 moderate, 4 mild, 5 none)

The regression coefficients obtained from the OPROBIT model are represented by hollow circles plotted in the left panel of both Figs. 3 and 4. Assuming that respondents apply identical thresholds, the results suggest a general trend that (1) health deteriorates with age in a possibly nonlinear fashion (except in the depression domain), (2) women report worse health than men (except in the breathing domain), and (3) the better educated are healthier than those with minimal education attainment (except in the depression domain). Being unmarried is associated with lower health status in the sleep and depression domains, but there is no evidence for such association in other domains. The models show that there seems to be no statistically discernible effect of family size and urban–rural residential location on health in all six domains. Wealth, however, seems to have a positive impact on health in the mobility, cognition, sleep, and depression domains if only to a very small degree. This can be understood as monetary welfare is no longer a good indicator of SES in later life.
Fig. 3

Estimation results for mobility, pain, and cognition domains [main coefficients (\(\beta\)) in left panel, threshold coefficients (\(\gamma\)) in right panel, intercepts in threshold equation not shown]

Fig. 4

Estimation results for sleep, depression, and breathing domains [main coefficients (\(\beta\)) in left panel, threshold coefficients (\(\gamma\)) in right panel, intercepts in threshold equation not shown]

Fig. 5

Estimated location of vignette rating \((\theta _{j})\)

What happen when we relax the reporting homogeneity assumption by fitting a CHOPIT model? Regression coefficients predicting the latent health index in each domain (\(\beta\)) are shown using solid circles in the left panels of Figs. 3 and 4, while those predicting the individual-specific thresholds (\(\gamma\)) are shown using numbers in the right panels of the figures. An omnibus test of reporting homogeneity in each domain (Table 1) rejects the joint null hypothesis that all coefficients in the threshold equation are equal to zero at conventional significance levels, indicating that respondents do not necessarily apply identical cut points when transferring their latent health indices onto the ordinal categories. In other words, there seems to be disagreement as to what constitutes good health among the respondents; some may have higher or lower standards than others. The statistically significant results of a global test of parallel cut-point shift in each domain (except in mobility and cognition; see Table 1) further indicate that respondents’ reporting behaviour depends on the covariates in a complex way. The relationship between the thresholds and the covariates is not necessarily characterised by a simple linear function. Respondents, however, seem to agree on the levels of health described in the vignettes. As shown in Fig. 5, the estimated vignette locations in the latent health space are in concordance with the intended ordering. This confirms the earlier exploratory analysis presented in Fig. 2.
Table 1

Test of reporting homogeneity and parallel cut-point shift

Test

Mobility

Pain

Cognition

Sleep

Depression

Breathing

Reporting homogeneity

50.70*

93.86

82.28

99.03

105.46

98.81

Parallel cut-point shift

32.16

66.99

33.40

53.06

67.98

46.12

Reported are \(\chi ^2\) statistic with 36 degrees of freedom (reporting homogeneity) and 27 degrees of freedom (parallel cut-point shift)

\(p<0.10; {}^{\dagger}\,p<0.05; {}^{\ddagger}\,p<0.01\)

Allowing for interpersonal differences in reporting style does alter the point estimate of each \(\beta\) coefficient (Figs. 3, 4), but with the exception of that of education, the correction is practically negligible. In fact, when we test for reporting homogeneity by each covariate, only education variable is consistently statistically significant in all six health domains (Table 2). After adjusting for reporting heterogeneity, the 95 % confidence intervals of age, gender, family size, wealth, and urban/rural residential location still overlap largely with those of the OPROBIT model, and their interpretation remains. For marital status, the adjustment brings significant change in the sleep and depression domains where the health-protective effect of being married diminishes after correcting for the lower expectation of health among married individuals.
Table 2

Test of reporting homogeneity by each covariate

Variable

Mobility

Pain

Cognition

Sleep

Depression

Breathing

Age 50–59

      

Age 60–69

\({\circledcirc }\)

\({\circ }\)

 

\({\circledcirc }\)

\({\circledcirc }\)

 

Age 70+

 

\({\circledcirc }\)

 

\({\circledcirc }\)

  

Female

 

\({\circ }\)

 

\({\circledcirc }\)

  

Unmarried

\({\circ }\)

 

\({\circ }\)

\({\circledcirc }\)

\(\triangle\)

\({\circ }\)

Big family

   

\(\triangle\)

  

Educated

\({\circ }\)

\(\triangle\)

\(\triangle\)

\({\circ }\)

\(\triangle\)

\(\triangle\)

Log(asset)

  

\(\triangle\)

   

Urban

 

\({\circ }\)

 

\(\triangle\)

\(\triangle\)

\(\triangle\)

\(^\circ \,p<0.10;\; ^\circledcirc \,p<0.05; \;^\triangle \,p<0.01\)

Table 3

Partial effects of education on the probability of reporting very good health

Domain

OPROBIT

CHOPIT

Mobility

0.03 ± 0.01

0.04 ± 0.02

Pain

0.06 ± 0.02

0.08 ± 0.02

Cognition

0.03 ± 0.02

0.09 ± 0.02

Sleep

0.04 ± 0.02

0.06 ± 0.02

Depression

−0.00 ± 0.02

0.03 ± 0.02*

Breathing

0.03 ± 0.01

0.06 ± 0.01

\(p<0.10;\,{}^{\dagger}\,p<0.05;\,{}^{\ddagger}\,p<0.01\)

Fig. 6

Effect of education on vignette ratings’ cut points

A significant correction is observed with regard to education. The positive education effect in some threshold equations across health domains (shown in the right panels of Figs. 3, 4) suggests that Indonesians with high levels of educational attainment tend to rate a given health status more negatively than their less-educated counterparts. This is consistent with the educated being better informed; they have higher health standards. Thus, adjusting for this difference magnifies the positive effect of education on health status in all domains (Table 3). Most notably, the adjustment raises the estimated difference in the probability of reporting very good health between the well- and less-educated Indonesians in cognition and breathing domains by two- to threefold. The CHOPIT coefficients also tend to be more precisely estimated. Figure 6 shows how education level alters respondents’ thresholds, which are used to transfer the latent health index onto the ordinal categories. The plots suggest that reporting behaviour depends on education in a rather complex way, reiterating the results of the test of parallel cut-point shift (Tables 1, 4). Finally, following the method of Voňková and Hullegie [34], we test whether or not the adjustment to reporting heterogeneity is sensitive to the choice of vignettes used in the model by refitting the CHOPIT model with a single vignette at a time, predicting the latent health index and then calculating the Pearson's correlation coefficient between pairs of predicted values in each domain. As shown in Fig. 7, the strong correlations suggest that the adjustment is insensitive to the choice of hypothetical scenarios.

Discussion

Applying anchoring vignette methodology to a sample of older Indonesians, this study investigates the extent of differential reporting behaviour by demographic and socio-economic status in six health domains. We find that allowing for interpersonal heterogeneity in response style consistently magnifies the positive effect of education on health in all domains. One plausible interpretation of this finding is that educated Indonesians, who are likely to be well informed and aware of their well-being, have higher standards or expectations with regard to health than their less-educated counterparts. This indicates that health disparity by education might actually be wider than it is usually reported. Unless an adjustment is made for this systematic differential, the salutary effect of education will be underestimated. This finding is in line with an earlier observation in Europe [3], but it contradicts a previous study showing the overestimation of education effect among the general population in Indonesia [2]. Such a divergence might result from our (1) use of fewer and simpler vignettes, (2) analysis of a more homogeneous age group, and/or (3) use of a newer data set. We also find significant modification in the effect of marital status in the sleep and depression domains. The detrimental effect in these domains of being unmarried diminishes after correcting for the higher expectations of health prevalent among unmarried individuals. Otherwise, we find little difference when calibrating the effects of other demographic variables. Overall, these findings suggest that policy-maker cannot only rely on people’s perception of health when attempting to measure the reality. Studies on self-reported health outcomes particularly in developing countries should consider administering vignettes and using them to arrive at unbiased report on health inequality.
Fig. 7

Correlations among pairs of predicted health index in each domain

The generalisability of this study is limited by the restricted age group being analysed as well as by the small sample size. Future studies may collect more extensive vignette data so that statistical inferences can be extended to general population and so that stratified analysis by age, gender, or urban/rural residential location can be performed. We also note that the validity of the anchoring vignette method hinges critically on the maintenance of both vignette equivalence and response consistency assumptions. A number of studies have investigated the plausibility of these assumptions; some have found positive supports [19, 26, 31], while others report possible violations [5, 6, 8, 13]. In this study, there is always the possibility that these assumptions are violated. Vignette equivalence, for example, might not hold if some individuals perceive one of the vignettes more as being in a serious health condition because he or she has experienced or taken care of a family member who went through similar illness. Also, unmeasured respondents’ past experience of adverse events, surgery, or major illness, could have an effect on their perception of the vignettes as well as on their response to SRH questionnaire. While we have not provided a direct test for these assumptions, we are at least reassured that our analysis is insensitive to the choice of vignettes used in the model. Furthermore, by asking survey respondents to rate the vignettes as if they assess their own health condition, the IFLS study has at least tried to reinforce the response consistency assumption during data collection stage.

Anchoring vignette is a promising method that offers a direct way of handling interpersonal incomparability in self-report measure. Although methodologists have extended the original anchoring vignette method [19] to accommodate more complex situations [4, 6, 17, 23, 24, 31, 36], adequate attention should also be given to the fundamental matters of question wording [1, 13] and ordering [7, 14]. We believe that, given its cost-effectiveness and feasibility in large-scale surveys, SRH and anchoring vignette have the potential to play a greater role in public health research in now-decentralised Indonesia, where more than 500 local administrations must struggle with a scarcity of competent health workers [27, 33] as well as with the high cost of collecting objective health measures.

Notes

Acknowledgments

Wulung Anggara Hanandita’s research is supported by the Economic and Social Research Council, UK [ES/J500094/1] (PhD project: ‘Essays on Poverty and Health in Indonesia’).

Author contributions

W.H. conceived the study, analysed and interpreted the data, and wrote and edited the manuscript. G.T. contributed to the conceptualisation of the study, interpretation of the results, and editing of the manuscript. The authors had no conflicts of interest to declare.

Compliance with ethical standards

The authors declare that ethics approval is not required for this study since it uses secondary data sources. A reference regarding ethical and sampling procedures is provided in the methodology section of the paper.

Conflict of interest

The authors have no conflicts of interest to declare.

References

  1. 1.
    Au, N., & Lorgelly, P. K. (2014). Anchoring vignettes for health comparisons: An analysis of response consistency. Quality of Life Research, 23(6), 1721–1731.CrossRefPubMedGoogle Scholar
  2. 2.
    Bago d’Uva, T., van Doorslaer, E., Lindeboom, M., & O’Donnell, O. (2008a). Does reporting heterogeneity bias the measurement of health disparities? Health Economics, 17(3), 351–375.CrossRefPubMedGoogle Scholar
  3. 3.
    Bago d’Uva, T., O’Donnell, O., & van Doorslaer, E. (2008b). Differential health reporting by education level and its impact on the measurement of health inequalities among older Europeans. International Journal of Epidemiology, 37(6), 1375–1383.CrossRefPubMedCentralGoogle Scholar
  4. 4.
    Bago d’Uva, T., Lindeboom, M., O’Donnell, O., & van Doorslaer, E. (2011a). Education-related inequity in healthcare with heterogeneous reporting of health. Journal of the Royal Statistical Society: Series A (Statistics in Society), 174(3), 639–664.CrossRefGoogle Scholar
  5. 5.
    Bago d’Uva, T., Lindeboom, M., O’Donnell, O., & van Doorslaer, E. (2011b). Slipping anchor? Testing the vignettes approach to identification and correction of reporting heterogeneity. Journal of Human Resources, 46(4), 875–906.CrossRefGoogle Scholar
  6. 6.
    Bolt, D. M., Lu, Y., & Kim, J. S. (2014). Measurement and control of response styles using anchoring vignettes: A model-based approach. Psychological Methods, 19(4), 528–541.CrossRefPubMedGoogle Scholar
  7. 7.
    Buckley, J. (2008). Survey context effects in anchoring vignettes. Research note. http://polmeth.wustl.edu/media/Paper/surveyartifacts.pdf.
  8. 8.
    Datta Gupta, N., Kristensen, N., & Pozzoli, D. (2010). External validation of the use of vignettes in cross-country health studies. Economic Modelling, 27(4), 854–865.CrossRefGoogle Scholar
  9. 9.
    Ferraro, K. F. (1980). Self-ratings of health among the old and the old–old. Journal of Health and Social Behavior, 21(4), 377–383.CrossRefPubMedGoogle Scholar
  10. 10.
    Greene, W. A., & Hensher, D. A. (2010). Modeling ordered choices: A primer. Cambridge: Cambridge University Press.CrossRefGoogle Scholar
  11. 11.
    Groot, W. (2000). Adaptation and scale of reference bias in self-assessments of quality of life. Journal of Health Economics, 19(3), 403–420.CrossRefPubMedGoogle Scholar
  12. 12.
    Grundy, E., & Holt, G. (2001). The socioeconomic status of older adults: How should we measure it in studies of health inequalities? Journal of Epidemiology and Community Health, 55(12), 895–904.CrossRefPubMedPubMedCentralGoogle Scholar
  13. 13.
    Hirve, S., Gómez-Olivé, X., Oti, S., Debpuur, C., Juvekar, S., Tollman, S., et al. (2013). Use of anchoring vignettes to evaluate health reporting behavior amongst adults aged 50 years and above in Africa and Asia—Testing assumptions. Global Health Action, 6(21), 064.Google Scholar
  14. 14.
    Hopkins, D. J., & King, G. (2010). Improving anchoring vignettes designing surveys to correct interpersonal incomparability. Public Opinion Quarterly, 74(2), 201–222.CrossRefGoogle Scholar
  15. 15.
    Idler, E. L., & Benyamini, Y. (1997). Self-rated health and mortality: A review of twenty-seven community studies. Journal of Health and Social Behavior, 38(1), 21–37.CrossRefPubMedGoogle Scholar
  16. 16.
    Jones, A. M., Rice, N., Bago d’Uva, T., & Balia, S. (2013). Applied health economics (2nd ed.). Oxon: Routledge.Google Scholar
  17. 17.
    Kapteyn, A., Smith, J. P., & van Soest, A. (2007). Vignettes and self-reports of work disability in the United States and the Netherlands. American Economic Review, 97(1), 461–473.CrossRefGoogle Scholar
  18. 18.
    King, G., & Wand, J. (2007). Comparing incomparable survey responses: Evaluating and selecting anchoring vignettes. Political Analysis, 15(1), 46–66.CrossRefGoogle Scholar
  19. 19.
    King, G., Murray, C. J. L., Solomon, J. A., & Tandon, A. (2004). Enhancing the validity and cross-cultural comparability of measurement in survey research. American Political Science Review, 98(1), 191–207.CrossRefGoogle Scholar
  20. 20.
    Lindeboom, M., & van Doorslaer, E. (2004). Cut-point shift and index shift in self-reported health. Journal of Health Economics, 23(6), 1083–1099.CrossRefPubMedGoogle Scholar
  21. 21.
    Mu, R. (2014). Regional disparities in self-reported health: Evidence from Chinese older adults. Health Economics, 23(5), 529–549.CrossRefPubMedGoogle Scholar
  22. 22.
    Murray, C. J. L., Tandon, A., Solomon, J. A., Mathers, C. D., & Sadana, R. (2002). Cross-population comparability of evidence for health policy. Global Programme on Evidence for Health Policy Discussion Paper No. 46, World Health Organization.Google Scholar
  23. 23.
    Paccagnella, O. (2011). Anchoring vignettes with sample selection due to non-response. Journal of the Royal Statistical Society: Series A (Statistics in Society), 174(3), 665–687.CrossRefGoogle Scholar
  24. 24.
    Peracchi, F., & Rosetti, C. (2012). Heterogeneity in health responses and anchoring vignettes. Empirical Economics, 42(2), 513–538.CrossRefGoogle Scholar
  25. 25.
    RAND. (2007). IFLS-4 survey description. http://www.rand.org/labor/FLS/IFLS/ifls4.html. Accessed May 1, 2014.
  26. 26.
    Rice, N., Robone, S., & Smith, P. (2011). Analysis of the validity of the vignette approach to correct for heterogeneity in reporting health system responsiveness. European Journal of Health Economics, 12(2), 141–162.CrossRefPubMedGoogle Scholar
  27. 27.
    Rokx, C., Giles, J., Satriawan, E., Marzoeki, P., Harimurti, P., & Yavuz, E. (2010). New insights into the provision of health services in Indonesia: A health workforce study. Washington, DC: World Bank.CrossRefGoogle Scholar
  28. 28.
    Sen, A. (2002). Health: Perception versus observation. BMJ, 324(7342), 860–861.CrossRefPubMedPubMedCentralGoogle Scholar
  29. 29.
    Shmueli, A. (2002). Reporting heterogeneity in the measurement of health and health-related quality of life. PharmacoEconomics, 20(6), 405–412.CrossRefPubMedGoogle Scholar
  30. 30.
    van Doorslaer, E., Koolman, X., & Jones, A. M. (2004). Explaining income-related inequalities in doctor utilisation in Europe. Health Economics, 13(7), 629–647.CrossRefPubMedGoogle Scholar
  31. 31.
    van Soest, A., Delaney, L., Harmon, C., Kapteyn, A., & Smith, J. P. (2011). Validating the use of anchoring vignettes for the correction of response scale differences in subjective questions. Journal of the Royal Statistical Society: Series A (Statistics in Society), 174(3), 575–595.CrossRefGoogle Scholar
  32. 32.
    Tandon, A., Murray, C. J. L., Solomon, J. A., & King, G. (2002). Statistical models for enhancing cross-population comparability. Global Programme on Evidence for Health Policy Discussion Paper No. 42, World Health Organization.Google Scholar
  33. 33.
    Thabrany, H. (2006). Human resources in decentralized health systems in Indonesia: Challenges for equity. Regional Health Forum, 10(1), 75–88.Google Scholar
  34. 34.
    Voňková, H., & Hullegie, P. (2011). Is the anchoring vignette method sensitive to the domain and choice of the vignette? Journal of the Royal Statistical Society: Series A (Statistics in Society), 174(3), 597–620.CrossRefGoogle Scholar
  35. 35.
    Wallace, R. B., & Herzog, A. R. (1995). Overview of the health measures in the health and retirement study. Journal of Human Resources, 30, 84–107.CrossRefGoogle Scholar
  36. 36.
    Wand, J. (2013). Credible comparisons using interpersonally incomparable data: Nonparametric scales with anchoring vignettes. American Journal of Political Science, 57(1), 249–262.CrossRefGoogle Scholar
  37. 37.
    Wand, J., King, G., & Lau, O. (2011). Anchors: Software for anchoring vignette data. Journal of Statistical Software, 42(3), 1–25.Google Scholar

Copyright information

© The Author(s) 2015

Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Authors and Affiliations

  1. 1.Cathie Marsh Institute for Social ResearchUniversity of ManchesterManchesterUK

Personalised recommendations