Advertisement

Stochastic Process Models for Describing Computer Simulator Output

  • Thomas J. Santner
  • Brian J. Williams
  • William I. Notz
Chapter
Part of the Springer Series in Statistics book series (SSS)

Abstract

Recall from Chap.  1 that \(\boldsymbol{x}\) denotes a generic input to our computer simulator and \(y(\boldsymbol{x})\) denotes the associated output. This chapter will introduce several classes of random function models for \(y(\boldsymbol{x})\) that will serve as the core building blocks for the interpolators, experimental designs, calibration, and tuning methodologies that will be introduced in later chapters. The reason that the random function approach is so useful is that accurate prediction based on black box computer simulator output requires a rich class of \(y(\boldsymbol{x})\) options when only a minimal amount might be known about the output function. Indeed, regression mean modeling of simulator output is usually based on a rather arbitrarily selected parametric form.

References

  1. Abrahamsen P (1997) A review of Gaussian random fields and correlation functions. Technical report 917, Norwegian Computing Center, Box 114, Blindern, N0314 OsloGoogle Scholar
  2. Adler RJ (1981) The geometry of random fields. Wiley, New York, NYzbMATHGoogle Scholar
  3. Adler RJ (1990) An introduction to continuity, extrema, and related topics for general Gaussian processes. Institute of Mathematical Statistics, Hayward, CAGoogle Scholar
  4. Banerjee S, Gelfand AE, Finley AO, Sang H (2008) Gaussian predictive process models for large spatial data sets. J R Stat Soc Ser B 70(4):825–848MathSciNetCrossRefGoogle Scholar
  5. Bayarri MJ, Berger JO, Paulo R, Sacks J, Cafeo JA, Cavendish J, Lin CH, Tu J (2007) A framework for validation of computer models. Technometrics 49(2):138–154MathSciNetCrossRefGoogle Scholar
  6. Berger JO (1985) Statistical decision theory and Bayesian analysis. Springer, New York, NYCrossRefGoogle Scholar
  7. Berger JO, De Oliveira V, Sansó B (2001) Objective Bayesian analysis of spatially correlated data. J Am Stat Assoc 96:1361–1374MathSciNetCrossRefGoogle Scholar
  8. Bliznyuk N, Ruppert D, Shoemaker C, Regis R, Wild S, Mugunthan P (2008) Bayesian calibration and uncertainty analysis for computationally expensive models using optimization and radial basis function approximation. J Comput Graph Stat 17:270–294MathSciNetCrossRefGoogle Scholar
  9. Bochner S (1955) Harmonic analysis and the theory of probability. University of California Press, Berkeley, CAzbMATHGoogle Scholar
  10. Bornn L, Shaddick G, Zidek JV (2012) Modeling nonstationary processes through dimension expansion. J Am Stat Assoc 107:281–289MathSciNetCrossRefGoogle Scholar
  11. Chakraborty A, Mallick BK, McClarren RG, Kuranz CC, Bingham D, Grosskopf MJ, Rutter EM, Stripline HF, Drake RP (2013) Spline-based emulators for radiative shock experiments with measurement error. J Am Stat Assoc 108:411–428MathSciNetCrossRefGoogle Scholar
  12. Chen PH, Dean A, Santner T (2014a) Multivariate Gaussian process interpolators with varying-parameter covariance—with an application to Pareto front estimation. Poster Presentation, 2014 meeting of the American Statistical AssociationGoogle Scholar
  13. Chipman H, George E, McCulloch R (1998) Bayesian CART model search (with discussion). J Am Stat Assoc 93:935–960CrossRefGoogle Scholar
  14. Chipman H, George E, McCulloch R (2002) Bayesian treed models. Mach Learn 48:303–324CrossRefGoogle Scholar
  15. Christiansen CL, Morris CN (1997) Hierarchical Poisson regression modeling. J Am Stat Assoc 92:618–632MathSciNetCrossRefGoogle Scholar
  16. Conti S, O’Hagan A (2010) Bayesian emulation of complex multi-output and dynamic computer models. J Stat Plann Inf 140(3):640–651MathSciNetCrossRefGoogle Scholar
  17. Cramér H, Leadbetter MR (1967) Stationary and related stochastic processes. Wiley, New York, NYzbMATHGoogle Scholar
  18. Cressie N, Wikle CK (2011) Statistics for spatio-temporal data. Wiley, New York, NYzbMATHGoogle Scholar
  19. Cressie NA (1993) Statistics for spatial data. Wiley, New York, NYzbMATHGoogle Scholar
  20. Currin C, Mitchell TJ, Morris MD, Ylvisaker D (1991) Bayesian prediction of deterministic functions, with applications to the design and analysis of computer experiments. J Am Stat Assoc 86:953–963MathSciNetCrossRefGoogle Scholar
  21. Doob JL (1953) Stochastic processes. Wiley, New York, NYzbMATHGoogle Scholar
  22. Fricker TE, Oakley JE, Urban NM (2013) Multivariate Gaussian process emulators with nonseparable covariance structures. Technometrics 55:47–56MathSciNetCrossRefGoogle Scholar
  23. Gelfand AE, Schmidt AM, Banerjee S, Sirmans CF (2004) Nonstationary multivariate process modeling through spatially varying coregionalization (with discussion). Test 13:263–312MathSciNetCrossRefGoogle Scholar
  24. Gneiting T (2002) Compactly supported correlation functions. J Multivar Anal 83(2):493–508MathSciNetCrossRefGoogle Scholar
  25. Gneiting T, Kleiber W, Schlather M (2010) Matérn cross-covariance functions for multivariate random fields. J Am Stat Assoc 105:1167–1177CrossRefGoogle Scholar
  26. Gramacy RB, Lee HKH (2008) Bayesian treed Gaussian process models with an application to computer modeling. J Am Stat Assoc 103:1119–1130MathSciNetCrossRefGoogle Scholar
  27. Guttorp P, Sampson PD (1994) Methods for estimating heterogeneous spatial covariance functions with environmental applications. In: Patil GP, Rao CR (eds) Handbook of statistics, vol 12. Elsevier Science B.V., Amsterdam, pp 661–689Google Scholar
  28. Guttorp P, Meiring W, Sampson PD (1994) A space-time analysis of ground-level ozone data. Environmetrics 5:241–254CrossRefGoogle Scholar
  29. Haas TC (1995) Local prediction of a spatio-temporal process with an application to wet sulfate deposition. J Am Stat Assoc 90:1189–1199CrossRefGoogle Scholar
  30. Han G, Santner TJ, Notz WI, Bartel DL (2009a) Prediction for computer experiments having quantitative and qualitative input variables. Technometrics 51(2):278–288MathSciNetCrossRefGoogle Scholar
  31. Handcock MS, Stein ML (1993) A Bayesian analysis of kriging. Technometrics 35:403–410CrossRefGoogle Scholar
  32. Handcock MS, Wallis JR (1994) An approach to statistical spatial-temporal modeling of meterological fields. J Am Stat Assoc 89:368–390CrossRefGoogle Scholar
  33. Hastie T, Tibshirani R, Friedman J (2001) The elements of statistical learning: data mining, inference, and prediction. Springer, New York, NYCrossRefGoogle Scholar
  34. Higdon D, Gattiker J, Williams B, Rightley M (2008) Computer model calibration using high dimensional output. J Am Stat Assoc 103:570–583MathSciNetCrossRefGoogle Scholar
  35. Higdon DM (1998) A process-convolution approach to modelling temperatures in the North Atlantic Ocean (with discussion). Environ Ecol Stat 5:173–192CrossRefGoogle Scholar
  36. Higdon DM, Swall J, Kern JC (1999) Non-stationary spatial modeling. In: Bernardo JM, Berger JO, Dawid AP, Smith AFM (eds) Bayesian statistics, vol 6. Oxford University Press, Oxford, pp 761–768Google Scholar
  37. Journel AG, Huijbregts CJ (1978) Mining geostatistics. Academic, San Diego, CAGoogle Scholar
  38. Kaufman C, Bingham D, Habib S, Heitmann K, Frieman J (2011) Efficient emulators of computer experiments using compactly supported correlation functions, with an application to cosmology. Ann Appl Stat 5:2470–2492MathSciNetCrossRefGoogle Scholar
  39. Kennedy MC, O’Hagan A (2000) Predicting the output from a complex computer code when fast approximations are available. Biometrika 87:1–13MathSciNetCrossRefGoogle Scholar
  40. Kreyszig E (1999) Advanced engineering mathematics. Wiley, New York, NYzbMATHGoogle Scholar
  41. Loeppky JL, Sacks J, Welch WJ (2009) Choosing the sample size of a computer experiment: a practical guide. Technometrics 51(4):366–376MathSciNetCrossRefGoogle Scholar
  42. MacDonald B, Ranjan P, Chipman H (2015) GPfit: an R package for Gaussian process model fitting using a new optimization algorithm. J Stat Softw 64(12):1–23CrossRefGoogle Scholar
  43. Matérn B (1960) Spatial variation. PhD thesis, Meddelanden fran Statens Skogsforskningsinstitut, vol 49, Num 5Google Scholar
  44. Matérn B (1986) Spatial variation, 2nd edn. Springer, New York, NYCrossRefGoogle Scholar
  45. Matheron G (1963) Principles of geostatistics. Econ Geol 58:1246–1266CrossRefGoogle Scholar
  46. McMillan NJ, Sacks J, Welch WJ, Gao F (1999) Analysis of protein activity data by Gaussian stochastic process models. J Biopharm Stat 9:145–160CrossRefGoogle Scholar
  47. Mitchell TJ, Morris MD, Ylvisaker D (1990) Existence of smoothed stationary processes on an interval. Stoch Process Appl 35:109–119MathSciNetCrossRefGoogle Scholar
  48. Mitchell TJ, Morris MD, Ylvisaker D (1994) Asymptotically optimum experimental designs for prediction of deterministic functions given derivative information. J Stat Plann Inf 41:377–389MathSciNetCrossRefGoogle Scholar
  49. Mockus J, Eddy W, Mockus A, Mockus L, Reklaitis G (1997) Bayesian heuristic approach to discrete and global optimization: algorithms, visualization, software, and applications. Kluwer Academic, New York, NYCrossRefGoogle Scholar
  50. Morris MD (2012) Gaussian surrogates for computer models with time-varying inputs and outputs. Technometrics 54(1):42–50MathSciNetCrossRefGoogle Scholar
  51. Morris MD (2014) Maximin distance optimal designs for computer experiments with time-varying inputs and outputs. J Stat Plann Inf 144:63–68MathSciNetCrossRefGoogle Scholar
  52. Morris MD, Mitchell TJ, Ylvisaker D (1993) Bayesian design and analysis of computer experiments: use of derivatives in surface prediction. Technometrics 35:243–255MathSciNetCrossRefGoogle Scholar
  53. Oakley JE (2002) Eliciting Gaussian process priors for complex computer codes. Statistician 51:81–97MathSciNetGoogle Scholar
  54. O’Hagan A (1994) Kendall’s advanced theory of statistics. Volume 2B: Bayesian inference. Cambridge University Press, CambridgeGoogle Scholar
  55. Ong K, Santner T, Bartel D (2008) Robust design for acetabular cup stability accounting for patient and surgical variability. J Biomech Eng 130(031001):1–11Google Scholar
  56. Parzen E (1962) Stochastic processes. Holden-Day, San Francisco, CAzbMATHGoogle Scholar
  57. Qian PZG, Wu H, Wu CFJ (2008) Gaussian Process models for computer experiments with qualitative and quantitative factors. Technometrics 50(3):383–396MathSciNetCrossRefGoogle Scholar
  58. Reese CS, Wilson AG, Hamada M, Martz HF, Ryan KJ (2004) Integrated analysis of computer and physical experiments. Technometrics 46(2):153–164MathSciNetCrossRefGoogle Scholar
  59. Ripley BD (1981) Spatial statistics. Wiley, New York, NYCrossRefGoogle Scholar
  60. Rodríguez-Iturbe I, Mejía JM (1974) The design of rainfall networks in time and space. Water Resour Res 10:713–728CrossRefGoogle Scholar
  61. Sacks J, Welch WJ, Mitchell TJ, Wynn HP (1989b) Design and analysis of computer experiments. Stat Sci 4:409–423MathSciNetCrossRefGoogle Scholar
  62. Sampson PD, Guttorp P (1992) Nonparametric estimation of nonstationary covariance structure. J Am Stat Assoc 87:108–119CrossRefGoogle Scholar
  63. Stein ML (1999) Interpolation of spatial data: some theory for kriging. Springer, New York, NYCrossRefGoogle Scholar
  64. Stein ML (2008) A modeling approach for large spatial datasets. J Korean Stat Soc 37:3–10MathSciNetCrossRefGoogle Scholar
  65. Svenson J, Santner T (2016) Multiobjective optimization of expensive-to-evaluate deterministic computer simulator models. Comput Stat Data Anal 94:250–264MathSciNetCrossRefGoogle Scholar
  66. Vecchia AV (1988) Estimation and identification for continuous spatial processes. J R Stat Soc Ser B 50:297–312MathSciNetGoogle Scholar
  67. Ver Hoef JM, Barry RP (1998) Constructing and fitting models for cokriging and multivariable spatial prediction. J Stat Plann Inf 69:275–294MathSciNetCrossRefGoogle Scholar
  68. Wallstrom TC (2007) Estimating replicate variation. Technical report LA-UR-07-4412, Los Alamos National Laboratory, Los Alamos, NMGoogle Scholar
  69. Yaglom AM (1986) Correlation theory of stationary and related random functions (Volume II). Springer, New York, NYGoogle Scholar
  70. Zhang Y (2014) Computer experiments with both quantitative and qualitative inputs. PhD thesis, Department of Statistics, The Ohio State University, Columbus, OHGoogle Scholar
  71. Zhou Q, Qian PZG, Wu H, Zhou S (2011) A simple approach to emulation for computer models with qualitative and quantitative factors. Technometrics 53:266–273MathSciNetCrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  • Thomas J. Santner
    • 1
  • Brian J. Williams
    • 2
  • William I. Notz
    • 1
  1. 1.Department of StatisticsThe Ohio State UniversityColumbusUSA
  2. 2.Statistical Sciences GroupLos Alamos National LaboratoryLos AlamosUSA

Personalised recommendations