Abstract
In the last years, there is movement towards simulation-based inference (e.g. bootstrapping and randomization tests) in order to improve students’ understanding of statistical reasoning as well as a call to introduce statistical computing and reproducible analysis within the curriculum. With the help of R mosaic and the concept of minimal R, we were able to include all this in an introductory statistics course for people studying while working a business-related major. Moreover, this also paves the road towards methods and concepts like data wrangling or algorithmic modelling, more related to data science than to classical statistics.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
This questionnaire was used in our first surveys 2015 and 2016 already.
- 2.
Additionally, to the CAOS items, we used a multi-item scale based on the Unified Theory of Acceptance and Use of Technology (UTAUT2 Venkatesh et al. 2012) which is not part of the investigation referred to in this paper.
References
Allaire, J., Xie, Y., McPherson, J., Luraschi, J., Ushey, K., Atkins, A., et al. (2018). rmarkdown: Dynamic documents for R. https://CRAN.R-project.org/package=rmarkdown; R package version 1.10.
Angrist, J. D., & Pischke, J. S. (2017). Undergraduate econometrics instruction: Through our classes, darkly. Journal of Economic Perspectives, 31(2), 125–44. https://doi.org/10.1257/jep.31.2.125..
Baumer, B., Cetinkaya-Rundel, M., Bray, A., Loi, L., & Horton, N. J. (2014). R markdown: Integrating a reproducible analysis tool into introductory statistics. Technological Innovations in Statistics Education, 8(1).
Breiman, L. (2001). Statistical modeling: The two cultures (with comments and a rejoinder by the author). Statistical Science, 16(3), 199–231.
Chance, B., Wong, J., & Tintle, N. (2016). Student performance in curricula centered on simulation-based inference: A preliminary report. Journal of Statistics Education, 24(3), 114–126. https://doi.org/10.1080/10691898.2016.1223529..
Chang, W., Cheng, J., Allaire, J., Xie, Y., & McPherson, J. (2018). Shiny: Web application framework for R. https://CRAN.R-project.org/package=shiny; R package version 1.1.0.
Cobb, G. (2015). Mere renovation is too little too late: We need to rethink our undergraduate curriculum from the ground up. The American Statistician, 69(4), 266–282. https://doi.org/10.1080/00031305.2015.1093029..
Cobb, G. W. (2007). The introductory statistics course: A ptolemaic curriculum? Technology Innovations in Statistics Education, 1(1).
Dabbish, L., Stuart, C., Tsay, J., & Herbsleb, J. (2012). Social coding in github: transparency and collaboration in an open software repository. In Proceedings of the ACM 2012 Conference on Computer Supported Cooperative Work (pp. 1277–1286). ACM
De Veaux, R. D., & Velleman, P. F. (2008). Math is music; statistics is literature (or, why are there no six-year-old novelists?). Amstat News, 375, 54–58.
Delmas, G., Joan, G., Ooms, A., & Chance, B. (2007). Assessing students’ conceptual understanding after a first course in statistics. Statistics Education Research Journal, 6(2).
Doi, J., Potter, G., Wong, J., Alcaraz, I., & Chi, P. (2016). Web application teaching tools for statistics using R and shiny. Technology Innovations in Statistics Education, 9(1).
Donoho, D. (2017). 50 years of data science. Journal of Computational and Graphical Statistics, 26(4), 745–766. https://doi.org/10.1080/10618600.2017.1384734..
Ernst, M. D. (2004). Permutation methods: A basis for exact inference. Statistical Science, 19(4), 676–685.
Fox, J. (2017). Using the R commander: A point-and-click interface for R. Boca Raton FL: Chapman and Hall/CRC Press. http://socserv.mcmaster.ca/jfox/Books/RCommander/
Hardin, J., Hoerl, R., Horton, N. J., Nolan, D., Baumer, B., Hall-Holt, O., et al. (2015). Data science in statistics curricula: Preparing students to “think with data”. The American Statistician, 69(4), 343–353. https://doi.org/10.1080/00031305.2015.1077729..
Hesterberg, T. C. (2015). What teachers should know about the bootstrap: Resampling in the undergraduate statistics curriculum. The American Statistician, 69(4), 371–386.
Hildreth, L. A., Robison-Cox, J., & Schmidt, J. (2018). Comparing student success and understanding in introductory statistics under consensus and simulation-based curricula. Statistics Education Research Journal, 17(1).
Horton, N. J., Baumer, B. S., Wickham, H. (2014). Teaching precursors to data science in introductory and second courses in statistics. arXiv:1401.3269
Kaplan, D.: Statistical modeling: A fresh approach (2 edn). Project MOSAIC Books.
Kaplan, D. (2018). Teaching stats for data science. The American Statistician, 72(1), 89–96. https://doi.org/10.1080/00031305.2017.1398107..
Kaplan, D., & Pruim, R. (2018). ggformula: Formula Interface to the Grammar of Graphics. https://CRAN.R-project.org/package=ggformula; R package version 0.9.0.
Kim, A. Y., Escobedo-Land, A. (2015). Okcupid data for introductory statistics and data science courses. Journal of Statistics Education, 23(2), null (2015). https://doi.org/10.1080/10691898.2015.11889737.
Lesser, L. M., Pearl, D. K., III, J. J. W. (2016). Assessing fun items’ effectiveness in increasing learning of college introductory statistics students: Results of a randomized experiment. Journal of Statistics Education, 24(2), 54–62 (2016). https://doi.org/10.1080/10691898.2016.1190190, https://doi.org/10.1080/10691898.2016.1190190.
McGowan, H. M., & Gunderson, B. K. (2010). A randomized experiment exploring how certain features of clicker use effect undergraduate students’ engagement and learning in statistics. Technology Innovations in Statistics Education, 4(1).
Neumann, D. L., Hood, M., & Neumann, M. M. (2013). Using real-life data when teaching statistics: student perceptions of this strategy in an introductory statistics course. Statistics Education Research Journal, 12(2).
Nolan, D., & Lang, D. T. (2010). Computing in the statistics curricula. The American Statistician, 64(2), 97–107. https://doi.org/10.1198/tast.2010.09132..
Pearl, J. (2009). Causal inference in statistics: An overview. Statistics surveys, 3, 96–146.
Pruim, R., Kaplan, D.T., & Horton, N.J. (2017). The mosaic package: Helping students to ‘think with data’ using R. The R Journal, 9(1), 77–102 (2017). https://journal.r-project.org/archive/2017/RJ-2017-024/index.html.
R Core Team. (2018). R: A language and environment for statistical computing. R foundation for statistical computing, Vienna, Austria. https://www.R-project.org/
Ridgway, J. (2016). Implications of the data revolution for statistics education. International Statistical Review, 84(3), 528–549. https://doi.org/10.1111/insr.12110..
Ridsdale, C., Rothwell, J., Smit, M., Ali-Hassan, H., Bliemel, M., Irvine, D., et al. (2015). Strategies and best practices for data literacy education: Knowledge synthesis report (2015). https://doi.org/10.13140/RG.2.1.1922.5044
Schau, C.: Survey of attitudes toward statistiks (2017). https://www.evaluationandstatistics.com/
Schloerke, B., Allaire, J., & Borges, B. (2018). learnr: Interactive Tutorials for R. https://CRAN.R-project.org/package=learnr; R package version 0.9.2.1.
Schulte, F. P. (2015). Die Bedeutung und Erfassung des Erwerbs von Theorie-Praxis-/Praxis-Theorie-Transferkompetenz im Rahmen eines dualen Studiums. http://stifterverband.de/pdf/hds-essen-transferkompetenz.pdf
Sole, M. A., & Weinberg, S. L. (2017). What’s brewing? A statistics education discovery project. Journal of Statistics Education, 25(3), 137–144. https://doi.org/10.1080/10691898.2017.1395302..
Stigler, J. W., Son, J. Y. (2018). Modeling first: A modeling approach to teaching introductory statistics. In M. A. Sorto, A. White, L. Guyot (Eds.), Looking back, looking forward. Proceedings of the Tenth International Conference on Teaching Statistics. https://iase-web.org/icots/10/proceedings/pdfs/ICOTS10_9C3.pdf.
Tintle, N., Chance, B. L., Cobb, G. W., Rossman, A.J., Roy, S., Swanson, T., et al. (2016). Introduction to statistical investigations. Wiley Online Library.
Tintle, N., Clark, J., Fischer, K., Chance, B., Cobb, G., Roy, S., et al. (2018). Assessing the association between precourse metrics of student preparation and student performance in introductory statistics: Results from early data on simulation-based inference vs. nonsimulation-based inference. Journal of Statistics Education, 26(2), 103–109 (2018). https://doi.org/10.1080/10691898.2018.1473061, https://doi.org/10.1080/10691898.2018.1473061.
Tukey, J. W. (1977). Exploratory data analysis. Mass: Reading.
Venkatesh, V., Morris, M. G., Davis, G. B., & Davis, F. D. (2003). User acceptance of information technology: Toward a unified view. MIS Quarterly, 27(3), 425–478.
Venkatesh, V., Thong, J. Y., & Xu, X. (2012). Consumer acceptance and use of information technology: extending the unified theory of acceptance and use of technology. MIS Quarterly (pp. 157–178).
Wang, X., Rush, C., & Horton, N. J. (2017). Data visualization on day one: Bringing big ideas into intro stats early and often. Technology Innovations in Statistics Education, 10(1).
Weihs, C., & Ickstadt, K. (2018). Data science: The impact of statistics. International Journal of Data Science and Analytics,. https://doi.org/10.1007/s41060-018-0102-5..
Wickham, H. (2016). ggplot2: Elegant Graphics for data analysis. New York: Springer. http://ggplot2.org
Wickham, H., François, R., Henry, L., Müller, K. (2018). dplyr: A grammar of data manipulation. https://CRAN.R-project.org/package=dplyr; R package version 0.7.6.
Wild, C. J., & Pfannkuch, M. (1999). Statistical thinking in empirical enquiry. International Statistical Review, 67(3), 223–248. https://doi.org/10.1111/j.1751-5823.1999.tb00442.x..
Wild, C. J., Pfannkuch, M., Regan, M., & Parsonage, R. (2017). Accessible conceptions of statistical inference: Pulling ourselves up by the bootstraps. International Statistical Review, 85(1), 84–107. https://doi.org/10.1111/insr.12117..
Wild, C. J., Utts, J. M., Horton, N. J. (2018). What Is statistics? (pp. 5–36). Cham: Springer International Publishing. https://doi.org/10.1007/978-3-319-66195-7_1, https://doi.org/10.1007/978-3-319-66195-7_1
Wood, B. L., Mocko, M., Everson, M., Horton, N. J., & Velleman, P. (2018). Updated guidelines, updated curriculum: The gaise college report and introductory statistics for the modern student. CHANCE, 31(2), 53–59. https://doi.org/10.1080/09332480.2018.1467642..
Xie, Y. (2015) Dynamic Documents with R and knitr (2nd Edn). Chapman and Hall/CRC, Boca Raton, Florida. https://yihui.name/knitr/. ISBN 978-1498716963
Acknowledgements
We thank Oliver Gansser, Bianca Krol, Sebastian Sauer, and numerous other colleagues for their contribution in the proposed change of the curriculum and for helpful comments in order to improve the teaching materials. Also, we thank Nathan Tintle for his support with the CAOS inventory. The remarks of Nicholas Horton, Randall Pruim, and two reviewers helped to improve this paper a lot. We gratefully acknowledge that our work was supported by an internal teaching innovation grant by our institution.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Lübke, K., Gehrke, M., Markgraf, N. (2019). Statistical Computing and Data Science in Introductory Statistics. In: Bauer, N., Ickstadt, K., Lübke, K., Szepannek, G., Trautmann, H., Vichi, M. (eds) Applications in Statistical Computing. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Cham. https://doi.org/10.1007/978-3-030-25147-5_9
Download citation
DOI: https://doi.org/10.1007/978-3-030-25147-5_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-25146-8
Online ISBN: 978-3-030-25147-5
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)