Optimizing variance-bias trade-off in the TWANG package for estimation of propensity scores

Parast, Layla; McCaffrey, Daniel F.; Burgette, Lane F.; de la Guardia, Fernando Hoces; Golinelli, Daniela; Miles, Jeremy N. V.; Griffin, Beth Ann

doi:10.1007/s10742-016-0168-2

Optimizing variance-bias trade-off in the TWANG package for estimation of propensity scores

Published: 26 December 2016

Volume 17, pages 175–197, (2017)
Cite this article

Health Services and Outcomes Research Methodology Aims and scope Submit manuscript

Layla Parast¹,
Daniel F. McCaffrey³,
Lane F. Burgette²,
Fernando Hoces de la Guardia¹,
Daniela Golinelli⁴,
Jeremy N. V. Miles¹ &
…
Beth Ann Griffin²

817 Accesses
13 Citations
Explore all metrics

Abstract

While propensity score weighting has been shown to reduce bias in treatment effect estimation when selection bias is present, it has also been shown that such weighting can perform poorly if the estimated propensity score weights are highly variable. Various approaches have been proposed which can reduce the variability of the weights and the risk of poor performance, particularly those based on machine learning methods. In this study, we closely examine approaches to fine-tune one machine learning technique [generalized boosted models (GBM)] to select propensity scores that seek to optimize the variance-bias trade-off that is inherent in most propensity score analyses. Specifically, we propose and evaluate three approaches for selecting the optimal number of trees for the GBM in the twang package in R. Normally, the twang package in R iteratively selects the optimal number of trees as that which maximizes balance between the treatment groups being considered. Because the selected number of trees may lead to highly variable propensity score weights, we examine alternative ways to tune the number of trees used in the estimation of propensity score weights such that we sacrifice some balance on the pre-treatment covariates in exchange for less variable weights. We use simulation studies to illustrate these methods and to describe the potential advantages and disadvantages of each method. We apply these methods to two case studies: one examining the effect of dog ownership on the owner’s general health using data from a large, population-based survey in California, and a second investigating the relationship between abstinence and a long-term economic outcome among a sample of high-risk youth.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Power comparison for propensity score methods

Article 15 November 2018

Overview of Propensity Score Methods

On Propensity Score Methodology

References

Austin, P.C.: The performance of different propensity score methods for estimating marginal odds ratios. Stat. Med. 26(16), 3078–3094 (2007)
Article PubMed Google Scholar
Austin, P.C.: Balance diagnostics for comparing the distribution of baseline covariates between treatment groups in propensity-score matched samples. Stat. Med. 28, 3083–3107 (2009)
Article PubMed PubMed Central Google Scholar
Austin, P.C., Stuart, E.A.: Moving towards best practice when using inverse probability of treatment weighting (IPTW) using the propensity score to estimate causal treatment effects in observational studies. Stat. Med. 34(28), 3661–3679 (2015)
Article PubMed PubMed Central Google Scholar
Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001)
Article Google Scholar
Breiman, L., Friedman, J., Stone, C.J., Olshen, R.A.: Classification and regression trees. CRC Press, New York (1984)
Google Scholar
Brookhart, M.A., Schneeweiss, S., Rothman, K.J., Glynn, R.J., Avorn, J., Stürmer, T.: Variable selection for propensity score models. Am. J. Epidemiol. 163(12), 1149–1156 (2006)
Article PubMed PubMed Central Google Scholar
Burgette, L., McCaffrey, D.F., Griffin, B.A.: Propensity score estimation with boosted regression. In: Pan, W. (ed.) Propensity Score Analysis: Fundamentals and Developments. Guilford Publications, New York (2015)
Google Scholar
California Health Interview Survey (CHIS): CHIS 2003 Methodology Report Series. UCLA Center for Health Policy Research, Los Angeles, CA (2005)
Dennis, M.L.: Overview of the Global Appraisal of Individual Needs (Gain): Summary. Chestnut Health Systems, Bloomington, IL (1999)
Google Scholar
Golinelli, D., Ridgeway, G., Rhoades, H., Tucker, J., Wenzel, S.: Bias and variance trade-offs when combining propensity score weighting and regression: with an application to hiv status and homeless men. Health Serv. Outcomes Res. Method. 12(2–3), 104–118 (2012)
Article Google Scholar
Griffin, B.A., Ramchand, R., Edelen, M.O., McCaffrey, D.F., Morral, A.R.: Associations between abstinence in adolescence and economic and educational outcomes seven years later among high-risk youth. Drug Alcohol Depend. 113(2), 118–124 (2011)
Article PubMed Google Scholar
Griffin, B.A., Eibner, C., Bird, C.E., Jewell, A., Margolis, K., Shih, R., Slaughter, M.E., Whitsel, E.A., Allison, M., Escarce, J.J.: The relationship between urban sprawl and coronary heart disease in women. Health Place 20, 51–61 (2013)
Article PubMed Google Scholar
Hankey, B.F., Myers, M.H.: Evaluating differences in survival between two groups of patients. J. Chronic Dis. 24(9), 523–531 (1971)
Article CAS PubMed Google Scholar
Hansen, B.B.: The prognostic analogue of the propensity score. Biometrika 95(2), 481–488 (2008)
Article Google Scholar
Harder, V.S., Stuart, E.A., Anthony, J.C.: Propensity score techniques and the assessment of measured covariate balance to test causal associations in psychological research. Psychol. Methods 15(3), 234 (2010)
Article PubMed PubMed Central Google Scholar
Hernán, M.Á., Brumback, B., Robins, J.M.: Marginal structural models to estimate the causal effect of zidovudine on the survival of HIV-positive men. Epidemiology 11(5), 561–570 (2000)
Article PubMed Google Scholar
Higashi, T., Shekelle, P.G., Adams, J.L., Kamberg, C.J., Roth, C.P., Solomon, D.H., Reuben, D.B., Chiang, L., MacLean, C.H., Chang, J.T., et al.: Quality of care is associated with survival in vulnerable older patients. Ann. Intern. Med. 143(4), 274–281 (2005)
Article PubMed Google Scholar
Hill, J.L.: Bayesian nonparametric modeling for causal inference. J. Comput. Graph. Stat. 20(1), 217–240 (2011)
Article Google Scholar
Imai, K., Ratkovic, M.: Covariate balancing propensity score. J. R. Stat. Soc. Ser. B (Stat. Method.) 76(1), 243–263 (2014)
Article Google Scholar
Imbens, G.W.: The role of the propensity score in estimating dose-response functions. Biometrika 87(3), 706–710 (2000)
Article Google Scholar
Imbens, G.W., Rubin, D.B.: Causal Inference in Statistics, Social, and Biomedical Sciences. Cambridge University Press, Cambridge (2015)
Book Google Scholar
Kaestner, R.: The effect of illicit drug use on the wages of young adults. Tech. rep., National Bureau of Economic Research (1990)
Kaestner, R.: New estimates of the effect of marijuana and cocaine use on wages. Ind. Labor Relat. Rev. 47(3), 454–470 (1994)
Article Google Scholar
Kang, J.D., Schafer, J.L.: Demystifying double robustness: a comparison of alternative strategies for estimating a population mean from incomplete data. Statistical Science, pp. 523–539 (2007)
Lee, B.K., Lessler, J., Stuart, E.A.: Improving propensity score weighting using machine learning. Stat. Med. 29(3), 337–346 (2010)
PubMed PubMed Central Google Scholar
Lee, B.K., Lessler, J., Stuart, E.A.: Weight trimming and propensity score weighting. PloS One 6(3), e18,174 (2011)
Article CAS Google Scholar
Lee, S., Brown, E.R., Grant, D., Belin, T.R., Brick, J.M.: Exploring nonresponse bias in a health survey using neighborhood characteristics. Am. J. Public Health 99(10), 1811 (2009)
Article PubMed PubMed Central Google Scholar
Liaw, A., Wiener, M.: Classification and regression by randomforest. R News 2(3), 18–22 (2002)
Google Scholar
McCaffrey, D.F., Ridgeway, G., Morral, A.R.: Propensity score estimation with boosted regression for evaluating causal effects in observational studies. Psychol. Methods 9(4), 403 (2004)
Article PubMed Google Scholar
McConnell, A.R., Brown, C.M., Shoda, T.M., Stayton, L.E., Martin, C.E.: Friends with benefits: on the positive consequences of pet ownership. J. Personal. Soc. Psychol. 101(6), 1239 (2011)
Article Google Scholar
Morral, A.R., McCaffrey, D.F., Ridgeway, G.: Effectiveness of community-based treatment for substance-abusing adolescents: 12-month outcomes of youths entering phoenix academy or alternative probation dispositions. Psychol. Addict. Behav. 18(3), 257 (2004)
Article PubMed Google Scholar
Normand, S.L.T., Landrum, M.B., Guadagnoli, E., Ayanian, J.Z., Ryan, T.J., Cleary, P.D., McNeil, B.J.: Validating recommendations for coronary angiography following acute myocardial infarction in the elderly: a matched analysis using propensity scores. J. Clin. Epidemiol. 54(4), 387–398 (2001)
Article CAS PubMed Google Scholar
Pirracchio, R., Petersen, M.L., van der Laan, M.: Improving propensity score estimators’ robustness to model misspecification using super learner. Am. J. Epidemiol. 181(2), 108–119 (2015)
Article PubMed Google Scholar
Ponce, N.A., Lavarreda, S.A., Yen, W., Brown, E.R., DiSogra, C., Satter, D.E.: The california health interview survey 2001: translation of a major survey for california’s multiethnic population. Public Health Rep. 119(4), 388 (2004)
Article PubMed PubMed Central Google Scholar
Register, C.A., Williams, D.R.: Labor market effects of marijuana and cocaine use among young men. Ind. Labor Relat. Rev. 45(3), 435–448 (1992)
Article Google Scholar
Ridgeway, G.: gbm: Generalized Boosted Regression Models. R package version 2.1.1. Retrieved from cran.r-project.org (2015)
Ridgeway, G., McCaffrey, D., Morral, A., Griffin, B.A., Burgette, L.: Twang: Toolkit for Weighting and Analysis of Nonequivalent Groups. R package version 9.5. Retrieved from cran.r-project.org (2016)
Ringel, J.S., Collins, R.L., Ellickson, P.L.: Time trends and demographic differences in youth exposure to alcohol advertising on television. J. Adolesc. Health 39(4), 473–480 (2006)
Article PubMed Google Scholar
Ringel, J.S., Ellickson, P.L., Collins, R.L.: High school drug use predicts job-related outcomes at age 29. Addict. Behav. 32(3), 576–589 (2007)
Article PubMed Google Scholar
Robins, J.M., Hernán, M.Á., Brumback, B.: Marginal structural models and causal inference in epidemiology. Epidemiology 11(5), 550–560 (2000)
Article CAS PubMed Google Scholar
Rosenbaum, P.R.: Various practical issues in matching. In: Design of Observational Studies, pp. 187–195. Springer, New York (2010)
Rosenbaum, P.R., Rubin, D.B.: Assessing sensitivity to an unobserved binary covariate in an observational study with binary outcome. J. R. Stat. Soc. Ser. B (Methodol.) 45(2), 212–218 (1983a)
Google Scholar
Rosenbaum, P.R., Rubin, D.B.: The central role of the propensity score in observational studies for causal effects. Biometrika 70(1), 41–55 (1983b)
Article Google Scholar
Rosenbaum, P.R., Rubin, D.B.: Reducing bias in observational studies using subclassification on the propensity score. J. Am. Stat. Assoc. 79(387), 516–524 (1984)
Article Google Scholar
Rubin, D.B.: On principles for modeling propensity scores in medical research. Pharmacoepidemiol. Drug Saf. 13(12), 855–857 (2004)
Article PubMed Google Scholar
Stuart, E.A., Lee, B.K., Leacy, F.P.: Prognostic score-based balance measures can be a useful diagnostic for propensity score methods in comparative effectiveness research. J. Clin. Epidemiol. 66(8), S84–S90 (2013)
Article PubMed PubMed Central Google Scholar
Survey, C.H.I.: Technical Paper No. 1: The chis 2001 Sample: Response Rate and Representativeness. Ucla Center for Health Policy Research, Los Angeles, CA (2003)
Tibshirani, R.: Regression shrinkage and selection via the lasso. J. R. Stat. Soc. Ser. B (Methodol.) 58(1), 267–288 (1996)
Google Scholar
van der Laan, M.J.: Targeted estimation of nuisance parameters to obtain valid statistical inference. Int. J. Biostat. 10(1), 29–57 (2014)
PubMed Google Scholar
van der Laan, M.J., Polley, E.C., Hubbard, A.E.: Super learner. Stat. Appl. Genet. Mol. Biol. (2007). doi:10.2202/1544-6115.1309
Wells, D.L.: Associations between pet ownership and self-reported health status in people suffering from chronic fatigue syndrome. J. Altern. Complement. Med. 15(4), 407–413 (2009a)
Article PubMed Google Scholar
Wells, D.L.: The effects of animals on human health and well-being. J. Soc. Issues 65(3), 523–543 (2009b)
Article Google Scholar
Westreich, D., Cole, S.R., Funk, M.J., Brookhart, M.A., Stürmer, T.: The role of the c-statistic in variable selection for propensity score models. Pharmacoepidemiol. Drug Saf. 20(3), 317–320 (2011)
Article PubMed Google Scholar

Download references

Funding

This study was funded by National Institutes of Health grant 1R01DA034065-01A1 and National Institute of Child Health and Human Development grant R01HD066591.

Author information

Authors and Affiliations

RAND Corporation, 1776 Main Street, Santa Monica, CA, 90403, USA
Layla Parast, Fernando Hoces de la Guardia & Jeremy N. V. Miles
RAND Corporation, 1200 South Hayes Street, Arlington, VA, 22202, USA
Lane F. Burgette & Beth Ann Griffin
Educational Testing Service, 660 Rosedale Road, Princeton, NJ, 08541, USA
Daniel F. McCaffrey
Mathematica Policy Research, 1100 1st Street, NE, Washington, DC, 20002, USA
Daniela Golinelli

Authors

Layla Parast
View author publications
You can also search for this author in PubMed Google Scholar
Daniel F. McCaffrey
View author publications
You can also search for this author in PubMed Google Scholar
Lane F. Burgette
View author publications
You can also search for this author in PubMed Google Scholar
Fernando Hoces de la Guardia
View author publications
You can also search for this author in PubMed Google Scholar
Daniela Golinelli
View author publications
You can also search for this author in PubMed Google Scholar
Jeremy N. V. Miles
View author publications
You can also search for this author in PubMed Google Scholar
Beth Ann Griffin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Layla Parast.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Ethical approval and informed consent

This study used only secondary de-identified datasets.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Parast, L., McCaffrey, D.F., Burgette, L.F. et al. Optimizing variance-bias trade-off in the TWANG package for estimation of propensity scores. Health Serv Outcomes Res Method 17, 175–197 (2017). https://doi.org/10.1007/s10742-016-0168-2

Download citation

Received: 04 April 2016
Revised: 17 November 2016
Accepted: 09 December 2016
Published: 26 December 2016
Issue Date: December 2017
DOI: https://doi.org/10.1007/s10742-016-0168-2

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Optimizing variance-bias trade-off in the TWANG package for estimation of propensity scores

Abstract

Access this article

Similar content being viewed by others

Power comparison for propensity score methods

Overview of Propensity Score Methods

On Propensity Score Methodology

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Ethical approval and informed consent

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Optimizing variance-bias trade-off in the TWANG package for estimation of propensity scores

Abstract

Access this article

Similar content being viewed by others

Power comparison for propensity score methods

Overview of Propensity Score Methods

On Propensity Score Methodology

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Ethical approval and informed consent

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation