Abstract
This is the first of four chapters emphasizing the application of descriptive analytics to characterize public and occupational health risks. Much of risk analysis addresses basic descriptive information: how big is a risk now, how is it changing over time or with age, how does it differ for people or situations with different characteristics, on what factors does it depend, with what other risks or characteristics does it cluster? Such questions arise not only for public and occupational health and safety risks, but also for risks of failures or degraded performance in engineering infrastructure or technological systems, financial systems, political systems, or other “systems of systems” (Guo and Haimes 2016). Simply knowing how large a risk is now and whether it is increasing, staying steady, or decreasing may be enough to decide whether a proposed costly intervention to reduce it is worth considering further. This chapter shows how to use basic tools of descriptive analytics, especially interaction plots (showing the conditional expected value of one variable at different levels of one or more other variables), together with more advanced methods from Chap. 2, such as regression trees, partial dependence plots, Bayesian networks (BNs), to describe risks and how they vary with other factors. A brief discussion and motivation of these methods is given for readers who have skipped Chap. 2. Chapter 4 introduces additional descriptive techniques, including plots that use non-parametric regression to pass smooth curves or surfaces through data clouds. It shows how they can be used, together with simple mathematical analysis, to resolve a puzzle that has occasioned some debate among toxicologists: that some studies have concluded that workers form disproportionately high levels of benzene metabolites at very low occupational exposure concentrations compared to higher concentrations, while other studies conclude that metabolism of benzene at low concentrations is approximately linear, and proportional to concentrations in inhaled air. Chapter 5 emphasizes the value of descriptive plots, upper-bounding analyses, and qualitative assumptions, as well as more quantitative risk assessment modeling, in bounding the size of human health risks from use of antibiotics in food animals. Chapter 6 calculates plausible bounds on the sizes of the quantitative risks to human health of infection with a drug-resistant “super-bug” from swine farming operations. Together, these chapters illustrate how descriptive analytics can be used to obtain and present useful quantitative characterizations of human health risks despite realistic scientific uncertainties about the details of relevant causal processes.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Aliferis CE, Statnikov A, Tsamardinos I, Mani S, Koutsoukos XS (2010) Local causal and Markov blanket induction for causal discovery and feature selection for classification part I: algorithms and empirical evaluation. J Mach Learn Res 11:171–234
Burte E, Nadif R, Jacquemin B (2016) Susceptibility factors relevant for the association between long-term air pollution exposure and incident asthma. Curr Environ Health Rep 3(1):23–39
Carlsson AC, Li X, Holzmann MJ, Wändell P, Gasevic D, Sundquist J, Sundquist K (2016) Neighbourhood socioeconomic status and coronary heart disease in individuals between 40 and 50 years. Heart 102(10):775–782
Chen Y, Wong GW, Li J (2016) Environmental exposure and genetic predisposition as risk factors for asthma in China. Allergy Asthma Immunol Res 8(2):92–100. https://doi.org/10.4168/aair.2016.8.2.92
Coogan PF, Castro-Webb N, Yu J, O’Connor GT, Palmer JR, Rosenberg L (2016) Neighborhood and individual socioeconomic status and asthma incidence in African American women. Ethn Dis 26(1):113–122
Dupre ME, George LK, Liu G, Peterson ED (2015) Association between divorce and risks for acute myocardial infarction. Circ Cardiovasc Qual Outcomes 8(3):244–251
Frey L, Fisher D, Tsamardinos I, Aliferis CF, Statnikov A (2003) Identifying Markov blankets with decision tree induction. In: Proceedings of the third IEEE international conference on data mining, Melbourne, 19–22 Nov 2003, pp 59–66
Furqan MS, Siyal MY (2016) Random forest Granger causality for detection of effective brain connectivity using high-dimensional data. J Integr Neurosci 15(1):55–66
Guo Z, Haimes YY (2016) Risk assessment of infrastructure system of systems with precursor analysis. Risk Anal 36(8):1630–1643. https://doi.org/10.1111/risa.12559
Halliday DM, Senik MH, Stevenson CW, Mason R (2016) Non-parametric directionality analysis—extension for removal of a single common predictor and application to time series. J Neurosci Methods 268:87–97
Heinzerling A, Hsu J, Yip F (2016) Respiratory health effects of ultrafine particles in children: a literature review. Water Air Soil Pollut 227:32
Hill J (2016) Atlantic causal inference conference competition: is your SATT where it’s at? http://jenniferhill7.wixsite.com/acic-2016/competition
Janzing D, Balduzzi D, Grosse-Wentrup M, Scholkopf B (2013) Quantifying causal influences. Ann Stat 41(5):2324–2358
Krstić G (2011) Asthma prevalence associated with geographical latitude and regional insolation in the United States of America and Australia. PLoS One 6(4):e18492. https://doi.org/10.1371/journal.pone.0018492
NIPS (Neural Information Processing Society) 2013 workshop on causality. http://clopinet.com/isabelle/Projects/NIPS2013/
Oliveira A, Barros H, Maciel MJ, Lopes C (2007) Tobacco smoking and acute myocardial infarction in young adults: a population-based case-control study. Prev Med 44(4):311–316. Epub 2007 Jan 17
Pearl J (2010) An introduction to causal inference. Int J Biostat 6(2):Article 7
Shah RS, Cole JW (2010) Smoking and stroke: the more you smoke the more you stroke. Expert Rev Cardiovasc Ther 8(7):917–932
Strickland MJ, Hao H, Hu X, Chang HH, Darrow LA, Liu Y (2016) Pediatric emergency visits and short-term changes in PM2.5 concentrations in the U.S. State of Georgia. Environ Health Perspect 124(5):690–696
To T, Zhu J, Villeneuve PJ, Simatovic J, Feldman L, Gao C, Williams D, Chen H, Weichenthal S, Wall C, Miller AB (2015) Chronic disease prevalence in women and air pollution—a 30-year longitudinal cohort study. Environ Int 80:26–32
Zahran HS, Bailey C (2013) Factors associated with asthma prevalence among racial and ethnic groups—United States, 2009-2010 behavioral risk factor surveillance system. J Asthma 50(6):583–589
Zou C, Feng J (2009) Granger causality vs. dynamic Bayesian network inference: a comparative study. BMC Bioinformatics 10:122. https://doi.org/10.1186/1471-2105-10-122
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this chapter
Cite this chapter
Cox Jr., L.A., Popken, D.A., Sun, R.X. (2018). Descriptive Analytics for Public Health: Socioeconomic and Air Pollution Correlates of Adult Asthma, Heart Attack, and Stroke Risks. In: Causal Analytics for Applied Risk Analysis. International Series in Operations Research & Management Science, vol 270. Springer, Cham. https://doi.org/10.1007/978-3-319-78242-3_3
Download citation
DOI: https://doi.org/10.1007/978-3-319-78242-3_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-78240-9
Online ISBN: 978-3-319-78242-3
eBook Packages: Business and ManagementBusiness and Management (R0)