Bayesian Subset Selection Methods for Finding Engineering Design Values: an Application to Lumber Strength

Kondo, Yumi; Zidek, James V; Taylor, Carolyn G; van Eeden, Constance

doi:10.1007/s13171-018-00157-w

Bayesian Subset Selection Methods for Finding Engineering Design Values: an Application to Lumber Strength

Published: 03 December 2018

Volume 80, pages 146–172, (2018)
Cite this article

Sankhya A Aims and scope Submit manuscript

Yumi Kondo¹,
James V Zidek ORCID: orcid.org/0000-0002-0584-9068²,
Carolyn G Taylor² &
…
Constance van Eeden²

42 Accesses
Explore all metrics

Abstract

The paper concerns a random property T of a manufactured product that must with high probability e.g. P^* = 95% exceed a specified quantity η_a called the characteristic value (CV). However the product comes from any one of K different subpopulations that may represent such things as manufacturers, regions or countries; the distribution of T will generally differ from one subpopulation to another and so will the associated CV η_ka, = 1,…,K. Moreover in applications such as the one we focus on in this paper where the subpopulations are species, the subpopulation of origin will, for both strategic or practical reasons, not be known. The problem confronted in this paper is the creation of a single CV for the population consisting of the union of all the subpopulations. A solution proposed long ago in the application concerning manufactured lumber that is addressed in this paper, selects a subset of the subpopulations using random samples of the T s, called the subset of controlling species CS, that includes the smallest of the {η_ka} with high probability. The estimated CV for the entire population is then found by combining and treating as one, the samples for the subpopulations in CS. That method has been published in an ASTM standards document for the lumber industry to ensure the structural engineering strength of manufactured lumber. However this published method has been shown to have some unexpected and undesirable properties, leading to the search for an alternative and this paper. The paper presents and compares three subset selection methods. The simplest of the three methods is an extension of a classical nonparametric method for subset selection. The remaining two, which are more complex, are variations of nonparametric Bayesian methods. Each of the three is seen as a possible candidate for consideration by ASTM committees as a possible replacement for the ASTM method for lumber species depending on what criterion is ultimately used for its selection. But they may well apply in other contexts as well.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

On Splitting Training and Validation Set: A Comparative Study of Cross-Validation, Bootstrap and Systematic Sampling for Estimating the Generalization Performance of Supervised Learning

Article Open access 01 July 2018

A Guide for Sparse PCA: Model Comparison and Applications

Article Open access 29 June 2021

Recent advances and applications of surrogate models for finite element method computations: a review

Article 17 July 2022

References

ASTM Standard D1990 (2007). Standard practice for establishing allowable properties for visually-graded dimension lumber from in-grade tests of full-size specimens. Technical Report DOI: 10.1520/D1990-07, ASTM International. https://doi.org/10.1520/D1990-07.
Berger, J.O. and Deely, J. (1988). A Bayesian approach to ranking and selection of related means with alternatives to analysis-of-variance methodology. J. Am. Stat. Assoc. 83, 364–373.
Article MathSciNet MATH Google Scholar
Berger, R.L. et al. (1979). Minimax subset selection for loss measured by subset size. Ann. Stat. 7, 6, 1333–1338.
Article MathSciNet MATH Google Scholar
Blackwell, D. and MacQueen, J.B. (1973). Ferguson distributions via polya urn schemes. Ann. Stat. 1, 353–355.
Article MATH Google Scholar
Caflisch, R.E. (1998). Monte carlo and quasi-monte carlo methods. Acta Numerica 7, 1–49.
Article MathSciNet MATH Google Scholar
Chakraborty, D. (2008). Statistical decision theory. estimation, testing and selection. Investigación Operacional 29, 2, 184–185.
Google Scholar
Evans, J., Kretschmann, D., Herian, V.L. and Green, D. (2001). Procedures for developing allowable properties for a single species under ASTM D1990 and computer programs useful for the calculations. General technical report FPL, 126. Madison, WI : U.S. Dept of Agriculture, Forest Service, Forest Products Laboratory.
Ferguson, T.S. (1973). A Bayesian analysis of some nonparametric problem. Ann. Statist. 1, 209–230.
Article MathSciNet MATH Google Scholar
Ferguson, T.S. (1983). Bayesian density estimation by mixtures of normal distributions. In: Recent advances in statistics, pp. 287–302. Elsevier.
Fong, K.H. and Berger, J. (1993). Ranking, estimation and hypothesis testing in unbalanced models – a Bayesian approach. Statist. Decisions 11, 1–24.
MathSciNet MATH Google Scholar
Ishwaran, H. and James, L.F. (2001). Gibbs sampling methods for stick-breaking priors. J. Am. Stat. Assoc. 96, 161–173.
Article MathSciNet MATH Google Scholar
Johnson, R.A., Evans, J.W. and Green, D.W. (1999). Nonparametric Bayesian predictive distributions for future order statistics. Statist. Probab. Lett. 41, 247–254.
Article MathSciNet MATH Google Scholar
Johnson, R.A. and Lu, W. (2007). Proof load designs for estimation of dependence in a bivariate weibull model. Statist. Probab. Lett. 77, 1061–1069.
Article MathSciNet MATH Google Scholar
Jones, E. (1988). In-grade testing of structural lumber. In: Proceedings of the Workshop on the In-Grade Testing of Structural Lumber, pp. 11–14. Forest Products Research Society.
Kondo, Y. and Zidek, J.V. (2013). Bayesian nonparametric subset selection procedures with Weibull components. Technical Report 273, University of British Columbia.
Kottas, A. (2006). Nonparametric Bayesian Survival Analysis using Mixtures of Weibull Distribution. Journal of Statistical Planning and Inference 136, 3, 578–596.
Article MathSciNet MATH Google Scholar
Liu, Y., Salibiàn-Barrera, M., Zamar, R. and Zidek, J.V. (2019). Using artificial censoring to improve extreme tail quantile estimates. Applied Statistics. To appear.
McDonald, G.C. (2016). Applications of subset selection procedures and bayesian ranking methods in analysis of traffic fatality data. Wiley Interdiscip. Rev. Comput. Stat. 8, 6, 222–237.
Article MathSciNet Google Scholar
Rizvi, M. and Sobel, M. (1967). Nonparametric procedures for selecting a subset containing the population with the largest a-quantile. Ann. Math. Stat. 38, 6, 1788–1803.
Article MathSciNet MATH Google Scholar
Sethuraman, J. (1994). A constructive definition of Dirichlet priors. Stat. Sin. 4, 639–650.
MathSciNet MATH Google Scholar
van Eeden, C. and Zidek, J.V. (2012). Subset selection – extended Rizvi–Sobel for unequal sample sizes and its implementation. Journal of Nonparametric Statistics 24, 299–315. https://doi.org/10.1080/10485252.2012.660482.
Article MathSciNet MATH Google Scholar

Download references

Acknowledgements

We are indebted to Conroy Lum from FPInnovations for introducing the second author to the topic addressed in this report and for many helpful discussions during the course of the work. Thanks also to Kyle Hambrook and John Petkau for helpful discussions during the course of the work.

Author information

Authors and Affiliations

Data Mining Services and Solutions, Robert Bosch LLC, Stuttgart, Germany
Yumi Kondo
Department of Statistics, University of British Columbia, 2329 West Mall, Vancouver, BC V6T 1Z4, Canada
James V Zidek, Carolyn G Taylor & Constance van Eeden

Authors

Yumi Kondo
View author publications
You can also search for this author in PubMed Google Scholar
James V Zidek
View author publications
You can also search for this author in PubMed Google Scholar
Carolyn G Taylor
View author publications
You can also search for this author in PubMed Google Scholar
Constance van Eeden
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to James V Zidek.

Additional information

Supported by a Collaborative Research and Development grant from the Natural Sciences and Engineering Research Council of Canada.

Electronic supplementary material

Below is the link to the electronic supplementary material.

(TEX 17.2 KB)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kondo, Y., Zidek, J.V., Taylor, C.G. et al. Bayesian Subset Selection Methods for Finding Engineering Design Values: an Application to Lumber Strength. Sankhya A 80 (Suppl 1), 146–172 (2018). https://doi.org/10.1007/s13171-018-00157-w

Download citation

Received: 10 February 2018
Published: 03 December 2018
Issue Date: 30 December 2018
DOI: https://doi.org/10.1007/s13171-018-00157-w

Keywords and phrases

AMS (2000) subject classification

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Bayesian Subset Selection Methods for Finding Engineering Design Values: an Application to Lumber Strength

Abstract

Access this article

Similar content being viewed by others

On Splitting Training and Validation Set: A Comparative Study of Cross-Validation, Bootstrap and Systematic Sampling for Estimating the Generalization Performance of Supervised Learning

A Guide for Sparse PCA: Model Comparison and Applications

Recent advances and applications of surrogate models for finite element method computations: a review

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Electronic supplementary material

(TEX 17.2 KB)

Rights and permissions

About this article

Cite this article

Keywords and phrases

AMS (2000) subject classification

Navigation

Bayesian Subset Selection Methods for Finding Engineering Design Values: an Application to Lumber Strength

Abstract

Access this article

Similar content being viewed by others

On Splitting Training and Validation Set: A Comparative Study of Cross-Validation, Bootstrap and Systematic Sampling for Estimating the Generalization Performance of Supervised Learning

A Guide for Sparse PCA: Model Comparison and Applications

Recent advances and applications of surrogate models for finite element method computations: a review

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Electronic supplementary material

(TEX 17.2 KB)

Rights and permissions

About this article

Cite this article

Share this article

Keywords and phrases

AMS (2000) subject classification

Search

Navigation