Dynamic sample budget allocation in model-based optimization

Hu, Jiaqiao; Chang, Hyeong Soo; Fu, Michael C.; Marcus, Steven I.

doi:10.1007/s10898-009-9490-3

Published: 20 November 2009

Volume 50, pages 575–596 (2011)

Journal of Global Optimization

Abstract

Model-based search methods are a class of optimization techniques that search the solution space by sampling from an underlying probability distribution “model,” which is updated iteratively after evaluating the performance of the samples at each iteration. This paper aims to improve the sampling efficiency of model-based methods by considering a generalization where a population of distribution models is maintained and subsequently propagated from generation to generation. A key issue in the proposed approach is how to efficiently allocate the sampling budget among the population of models to maximize the algorithm performance. We formulate this problem as a generalized max k-armed bandit problem, and derive an efficient dynamic sample allocation scheme based on Markov decision theory to adaptively allocate computational resources. The proposed allocation scheme is then further used to update the current population to produce an improving population of models. Our preliminary numerical results indicate that the proposed procedure may considerably reduce the number of function evaluations needed to obtain high quality solutions, and thus further enhance the value of model-based methods for optimization problems that require expensive function evaluations for performance evaluation.
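
The setting described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the paper derives its allocation rule from Markov decision theory, whereas the sketch below substitutes a simple epsilon-greedy heuristic on each model's best observed value (the quantity a max k-armed bandit cares about), and it refits each model independently rather than propagating the population as the paper proposes. The objective `sphere`, the Gaussian model form, and all function names and parameter values are assumptions made for illustration.

```python
import random

def sphere(x):
    # Hypothetical toy objective (to be maximized); optimum is 0 at the origin.
    return -sum(v * v for v in x)

def allocate_budget(objective, models, budget, epsilon, rng):
    """Spend `budget` evaluations across the models, one sample at a time.

    Heuristic stand-in (an assumption, not the paper's MDP-based scheme):
    try each model once, then with probability epsilon sample a random model,
    otherwise sample the model whose best observed value is largest.
    """
    samples = [[] for _ in models]  # per-model list of (value, point)
    for _ in range(budget):
        untried = [i for i, s in enumerate(samples) if not s]
        if untried:
            i = untried[0]
        elif rng.random() < epsilon:
            i = rng.randrange(len(models))
        else:
            i = max(range(len(models)),
                    key=lambda j: max(v for v, _ in samples[j]))
        mean, std = models[i]
        x = [rng.gauss(m, s) for m, s in zip(mean, std)]
        samples[i].append((objective(x), x))
    return samples

def update_model(model, scored, elite_frac=0.3, min_std=1e-3):
    """Cross-entropy-style refit of one Gaussian model to its elite samples."""
    if len(scored) < 2:
        return model  # too few samples to refit; keep the model unchanged
    ranked = sorted(scored, key=lambda t: t[0], reverse=True)
    elite = [x for _, x in ranked[:max(2, int(elite_frac * len(ranked)))]]
    dim = len(model[0])
    mean = [sum(x[d] for x in elite) / len(elite) for d in range(dim)]
    std = [max(min_std,
               (sum((x[d] - mean[d]) ** 2 for x in elite) / len(elite)) ** 0.5)
           for d in range(dim)]
    return mean, std

def population_search(objective, models, budget_per_iter=60, iters=20,
                      epsilon=0.2, seed=0):
    """Iterate allocation and model updates; track the best point seen."""
    rng = random.Random(seed)
    best_val, best_x = float("-inf"), None
    counts_log = []  # samples given to each model, per iteration
    for _ in range(iters):
        samples = allocate_budget(objective, models, budget_per_iter,
                                  epsilon, rng)
        counts_log.append([len(s) for s in samples])
        for scored in samples:
            for v, x in scored:
                if v > best_val:
                    best_val, best_x = v, x
        models = [update_model(m, s) for m, s in zip(models, samples)]
    return best_val, best_x, counts_log

# Three hypothetical starting models, each a (mean, std) pair in two dimensions.
initial_models = [([3.0, 3.0], [2.0, 2.0]),
                  ([-4.0, 1.0], [2.0, 2.0]),
                  ([0.5, -0.5], [1.0, 1.0])]
best_val, best_x, counts_log = population_search(sphere, initial_models)
```

Inspecting `counts_log` shows how the per-iteration budget is split among the models; under this heuristic most evaluations tend to go to whichever model has produced the best single sample so far, which is the behavior a max k-armed bandit formulation rewards.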

Author information

Authors and Affiliations

Department of Applied Mathematics and Statistics, State University of New York, Stony Brook, NY, 11794, USA
Jiaqiao Hu
Department of Computer Science and Engineering (200811037), Sogang University, Seoul, Korea
Hyeong Soo Chang
Robert H. Smith School of Business and The Institute for Systems Research, University of Maryland, College Park, MD, USA
Michael C. Fu
Department of Electrical and Computer Engineering and The Institute for Systems Research, University of Maryland, College Park, MD, USA
Steven I. Marcus

Corresponding author

Correspondence to Michael C. Fu.

About this article

Cite this article

Hu, J., Chang, H.S., Fu, M.C. et al. Dynamic sample budget allocation in model-based optimization. J Glob Optim 50, 575–596 (2011). https://doi.org/10.1007/s10898-009-9490-3

Received: 13 January 2009
Accepted: 03 November 2009
Published: 20 November 2009
Issue Date: August 2011
DOI: https://doi.org/10.1007/s10898-009-9490-3
