Skip to main content

Learning and Optimization with Perturbation Analysis

  • Chapter
Stochastic Learning and Optimization
  • 1917 Accesses

Abstract

As shown in Chapter 2, performance derivatives for Markov systems depend heavily on performance potentials. In this chapter, we first discuss the numerical methods and sample-path-based algorithms for estimating performance potentials, and we then derive the sample-path-based algorithms for estimating performance derivatives. In performance optimization, the process of estimating the potentials and performance derivatives from a sample path is called learning.

You can observe a lot just by watching.

Berra’s Law - Yogi Berra, US baseball player, coach, and manager (1925 – )

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. P. Bratley, B. L. Fox, and L. E. Schrage, A Guide to Simulation, Second Edition, Springer-Verlag, New York, 1987.

    Google Scholar 

  2. P. Bremaud, “Maximal Coupling and Rare Perturbation Sensitivity Analysis,” Queueing Systems: Theory and Applications, Vol. 11, 307-333, 1992.

    Article  MATH  MathSciNet  Google Scholar 

  3. L. Dai, “Rate of Convergence for Derivative Estimation of Discrete-Time Markov Chains Via Finite-Difference Approximations with Common Random Numbers,” SIAM Journal on Applied Mathematics, Vol. 57, 731-751, 1997.

    Article  MATH  MathSciNet  Google Scholar 

  4. L. Dai, “Perturbation Analysis via Coupling,” IEEE Transactions on Automatic Control, Vol. 45, 614-628, 2000.

    Article  MATH  Google Scholar 

  5. P. W. Glynn, “Regenerative Structure of Markov Chains Simulated Via Common Random Numbers,” Operations Research Letters, Vol. 4, 49-53, 1985.

    Article  MATH  MathSciNet  Google Scholar 

  6. P. Heidelberger and D. L. Iglehart, “Comparing Stochastic Systems Using Regenerative Simulation with Common Random Numbers,” Advances in Applied Probability, Vol. 11, 804-819, 1979.

    Article  MathSciNet  Google Scholar 

  7. P. L’Ecuyer, “Convergence Rate for Steady-State Derivative Estimators,” Annals of Operations Research, Vol. 39, 121-136, 1992.

    Article  MATH  MathSciNet  Google Scholar 

  8. P. L’Ecuyer and G. Perron, “On the Convergence Rates of IPA and FDC Derivative Estimators,” Operations Research, Vol. 42, 643-656, 1994.

    Article  MATH  MathSciNet  Google Scholar 

  9. G. Pflug, Optimization of Stochastic Models: The Interface between Simulation and Optimization, Kluwer Academic Publishers, Boston, Massachusetts, 1996.

    MATH  Google Scholar 

  10. G. Pflug and X. R. Cao, unpublished manuscript.

    Google Scholar 

  11. P. Marbach and T. N. Tsitsiklis, “Simulation-Based Optimization of Markov Reward Processes,” IEEE Transactions on Automatic Control, Vol. 46, 191-209, 2001.

    Article  MATH  MathSciNet  Google Scholar 

  12. J. Baxter and P. L. Bartlett, “Infinite-Horizon Policy-Gradient Estimation,” Journal of Artificial Intelligence Research, Vol. 15, 319-350, 2001.

    Article  MATH  MathSciNet  Google Scholar 

  13. J. Baxter, P. L. Bartlett, and L. Weaver, “Experiments with Infinite-Horizon, Policy-Gradient Estimation,” Journal of Artificial Intelligence Research, Vol. 15, 351-381, 2001.

    MATH  MathSciNet  Google Scholar 

  14. D. P. Bertsekas, Nonlinear Programming, Athena Scientific, Belmont, Massachusetts, 1995.

    Google Scholar 

  15. M. Baglietto, F. Davoli, M. Marchese, and M. Mongelli, “Neural Approximation of Open-Loop Feedback Rate Control in Satellite Networks,” IEEE Transactions on Neural Networks, Vol. 16, 1195-1211, 2005.

    Article  Google Scholar 

  16. P. Bremaud, R. P. Malhame, and L. Massoulie, “A Manufacturing System with General Stationary Failure Process: Stability and IPA of Hedging Control Policies,” IEEE Transactions on Automatic Control, Vol. 42, 155-170, 1997.

    Article  MATH  MathSciNet  Google Scholar 

  17. C. A. Brooks and P. Varaiya, “Using Augmented Infinitesimal Perturbation Analysis for Capacity Planning in Intree ATM Networks,” Discrete Event Dynamic Systems: Theory and Applications, Vol. 7, 377-390, 1997.

    Article  MATH  Google Scholar 

  18. C. G. Cassandras, G. Sun, C. G. Panayiotou, and Y. Wardi, “Perturbation Analysis and Control of Two-Class Stochastic Fluid Models for Communication Networks,” IEEE Transactions on Automatic Control, Vol. 48, 770-782, 2003.

    Article  MathSciNet  Google Scholar 

  19. F. Davoli, M. Marchese, and M. Mongelli, “Resource Allocation in Satellite Networks: Certainty Equivalent Approaches Versus Sensitivity Estimation Algorithms,” International Journal of Communication Systems, Vol. 18, 3-36, 2005.

    Article  Google Scholar 

  20. Y. C. Ho, M. A. Eyler, and T. T. Chien, “A Gradient Technique for General Buffer Storage Design in A Production Line,” International Journal of Production Research, Vol. 17, 557-580, 1979.

    Article  Google Scholar 

  21. Y. C. Ho, M. A. Eyler, and T. T. Chien, “A New Approach to Determine Parameter Sensitivities of Transfer Lines,” Management Science, Vol. 29, 700-714, 1983.

    Article  Google Scholar 

  22. J. Q. Hu, S. Nananukul, and W. B. Gong, “A New Approach to (s, S) Inventory Systems,” Journal of Applied Probability, Vol. 30, 898-912, 1993.

    Article  MATH  MathSciNet  Google Scholar 

  23. R. Kapuscinski, and S. Tayur, “A Capacitated Production-inventory Model with Periodic Demand,” Operations Research, Vol. 46, 899-911, 1998.

    Article  MATH  MathSciNet  Google Scholar 

  24. D. C. Lee, “Applying Perturbation Analysis to Traffic Shaping,” Computer Communications, Vol. 24, 798-810, 2001.

    Article  Google Scholar 

  25. G. Liberopoulos and M. Caramanis, “Infinitesimal Perturbation Analysis for Second Derivative Estimation and Design of Manufacturing Flow Controllers,” Journal of Optimization Theory and Applications, Vol. 81, 297-327, 1994.

    Article  MATH  MathSciNet  Google Scholar 

  26. G. Liberopoulos and M. Caramanis, “Dynamics and Design of A Class of Parameterized Manufacturing Flow Controllers,” IEEE Transactions on Automatic Control, Vol. 40, 1018-1028, 1995.

    Article  MATH  MathSciNet  Google Scholar 

  27. N. B. Mandayam and B. Aazhang, “Gradient Estimation for Sensitivity Analysis and Adaptive Multiuser Interference Rejection in Code-Division Multiple-Access Systems,” IEEE Transactions on Communications, Vol. 45, 848-858, 1997.

    Article  MATH  Google Scholar 

  28. M. Marchese, A. Garibbo, F. Davoli, and M. Mongelli, “Equivalent Bandwidth Control for the Mapping of Quality of Service in Heterogeneous Networks,” IEEE International Conference on Communications, Vol. 4, 1948-1952, 2004.

    Google Scholar 

  29. M. Marchese and M. Mongelli, “On-Line Bandwidth Control for Quality of Service Mapping over Satellite Independent Service Access Points,” Computer Networks, Vol. 50, 2088-2111, 2006.

    Article  MATH  Google Scholar 

  30. N. Miyoshi, “Application of IPA to the Sensitivity Analysis of the Leaky-Bucket Filter with Stationary Gradual Input,” Probability in the Engineering and Informational Sciences, Vol. 14, 219-241, 2000.

    Article  MATH  MathSciNet  Google Scholar 

  31. C. Panayiotou, C. G. Cassandras, G. Sun, and Y. Wardi, “Control of Communication Networks Using Infinitesimal Perturbation Analysis of Stochastic Fluid Models,” Advances in Communication Control Networks, Lecture Notes in Control and Information Sciences, Vol. 308, 1-26, 2004.

    MathSciNet  Google Scholar 

  32. C. Panayiotou and C. G. Cassandras, “Infinitesimal Perturbation Analysis and Optimization for Make-to-Stock Manufacturing Systems Based on Stochastic Fluid Models,” Discrete Event Dynamic Systems: Theory and Applications, Vol. 16, 109-142, 2006.

    Article  MATH  MathSciNet  Google Scholar 

  33. J. M. Proth, N. Sauer, Y. Wardi, and X. L. Xie, “Marking Optimization of Stochastic Timed Event Graphs Using IPA,” Discrete Event Dynamic Systems: Theory and Applications, Vol. 6, 221-239, 1996.

    Article  MATH  Google Scholar 

  34. H. Salehfar and S. Trihadi, “Application of Perturbation Analysis to Sensitivity Computations of Generating Units and System Reliability,” IEEE Transactions on Power Systems, Vol. 13, 152-158, 1998.

    Article  Google Scholar 

  35. U. Savagaonkar, E. K. P. Chong, and R. L. Givan, “Online Pricing for Bandwidth Provisioning in Multi-class Networks,” Computer Networks, Vol. 44, 835-853, 2004.

    Article  Google Scholar 

  36. G. A. Rummery and M. Niranjan, “On-Line Q-Learning Using Connectionist Systems,” Technical Report CUED/F-INFENG/TR 166, Engineering Department, Cambridge University, 1994.

    Google Scholar 

  37. Q. Y. Tang and E. K. Boukas, “Adaptive Control for Manufacturing Systems Using Infinitesimal Perturbation Analysis,” IEEE Transactions on Automatic Control, Vol. 44, 1719-1725, 1999.

    Article  MATH  MathSciNet  Google Scholar 

  38. A. C. Williams and R. A. Bhandiwad, “A Generating Function Approach to Queueing Network Analysis of Multiprogrammed Computers,” Networks, Vol. 6, 1-22, 1976.

    Article  MATH  MathSciNet  Google Scholar 

  39. H. Yu, and C. G. Cassandras, “Perturbation Analysis for Production Control and Optimization of Manufacturing Systems,” Automatica, Vol. 40, 945-956, 2004.

    Article  MATH  MathSciNet  Google Scholar 

  40. H. Yu and C. G. Cassandras, “Perturbation Analysis of Communication Networks with Feedback Control Using Stochastic Hybrid Models,” Nonlinear Analysis - Theory Methods and Applications, Vol. 65, 1251-1280, 2006.

    Article  MATH  MathSciNet  Google Scholar 

  41. J. G. Kemeny and J. L. Snell, “Potentials for Denumerable Markov Chains,” Journal of Mathematical Analysis and Applications, Vol. 3, 196-260, 1960.

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xi-Ren Cao PhD .

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer Science+Business Media, LLC

About this chapter

Cite this chapter

Cao, XR. (2007). Learning and Optimization with Perturbation Analysis. In: Stochastic Learning and Optimization. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-69082-7_3

Download citation

  • DOI: https://doi.org/10.1007/978-0-387-69082-7_3

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-0-387-36787-3

  • Online ISBN: 978-0-387-69082-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics