A Gray-Box Approach for Curriculum Learning

Foglino, Francesco; Leonetti, Matteo; Sagratella, Simone; Seccia, Ruggiero

doi:10.1007/978-3-030-21803-4_72

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 991))

Included in the following conference series:

World Congress on Global Optimization

1754 Accesses
1 Citations
1 Altmetric

Abstract

Curriculum learning is often employed in deep reinforcement learning to let the agent progress more quickly towards better behaviors. Numerical methods for curriculum learning in the literature provides only initial heuristic solutions, with little to no guarantee on their quality. We define a new gray-box function that, including a suitable scheduling problem, can be effectively used to reformulate the curriculum learning problem. We propose different efficient numerical methods to address this gray-box reformulation. Preliminary numerical results on a benchmark task in the curriculum learning literature show the viability of the proposed approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Gpyopt: a bayesian optimization framework in python. http://github.com/SheffieldML/GPyOpt (2016)
Belotti, P., Kirches, C., Leyffer, S., Linderoth, J., Luedtke, J., Mahajan, A.: Mixed-integer nonlinear optimization. Acta Numer. 22, 1–131 (2013)
Google Scholar
Bergstra, J.: Hyperopt: distributed asynchronous hyperparameter optimization in python (2013)
Google Scholar
Bergstra, J., Yamins, D., Cox, D.D.: Making a science of model search: hyperparameter optimization in hundreds of dimensions for vision architectures (2013)
Google Scholar
Bergstra, J.S., Bardenet, R., Bengio, Y., Kégl, B.: Algorithms for hyper-parameter optimization. In: Advances in Neural Information Processing Systems, pp. 2546–2554 (2011)
Google Scholar
Custódio, A.L., Scheinberg, K., Nunes Vicente, L.: Methodologies and software for derivative-free optimization. In: Advances and Trends in Optimization with Engineering Applications, pp. 495–506 (2017)
Google Scholar
Di Pillo, G., Liuzzi, G., Lucidi, S., Piccialli, V., Rinaldi, F.: A DIRECT-type approach for derivative-free constrained global optimization. Comput. Optim. Appl. 65(2), 361–397 (2016)
Google Scholar
Foglino, F., Leonetti, M.: An optimization framework for task sequencing in curriculum learning (2019). arXiv preprint arXiv:1901.11478
Frazier, P.I.: A tutorial on bayesian optimization (2018). arXiv preprint arXiv:1807.02811
Leonetti, M., Kormushev, P., Sagratella, S.: Combining local and global direct derivative-free optimization for reinforcement learning. Cybern. Inf. Technol. 12(3), 53–65 (2012)
Google Scholar
Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, A.A., Veness, J., Bellemare, M.G., Graves, A., Riedmiller, M., Fidjeland, A.K., Ostrovski, G., et al.: Human-level control through deep reinforcement learning. Nature 518(7540), 529 (2015)
Google Scholar
Rasmussen, C.E.: Gaussian processes in machine learning. In: Advanced Lectures on Machine Learning, pp. 63–71. Springer (2004)
Google Scholar
Shahriari, B., Swersky, K., Wang, Z., Adams, R.P., De Freitas, N.: Taking the human out of the loop: a review of bayesian optimization. Proc. IEEE 104(1), 148–175 (2016)
Google Scholar
Snoek, J., Larochelle, H., Adams, R.P.: Practical bayesian optimization of machine learning algorithms. In: Advances in Neural Information Processing Systems, pp. 2951–2959 (2012)
Google Scholar
Svetlik, M., Leonetti, M., Sinapov, J., Shah, R., Walker, N., Stone, P.: Automatic curriculum graph generation for reinforcement learning agents. In: AAAI, pp. 2590–2596 (2017)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computing, University of Leeds, Leeds, UK
Francesco Foglino & Matteo Leonetti
Department of Computer, Control and Management Engineering Antonio Ruberti, Sapienza, University of Rome, Via Ariosto 25, 00185, Roma, Italy
Simone Sagratella & Ruggiero Seccia

Authors

Francesco Foglino
View author publications
You can also search for this author in PubMed Google Scholar
Matteo Leonetti
View author publications
You can also search for this author in PubMed Google Scholar
Simone Sagratella
View author publications
You can also search for this author in PubMed Google Scholar
Ruggiero Seccia
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Simone Sagratella .

Editor information

Editors and Affiliations

Computer science and Applications Department, LGIPM, University of Lorraine, Metz Cedex 03, France
Hoai An Le Thi
Computer Science and Applications Department, LGIPM, University of Lorraine, Metz Cedex 03, France
Hoai Minh Le
Laboratory of Mathematics, National Institute for Applied Sciences (INSA)-Rouen Normadie, Saint-Étienne-du-Rouvray Cedex, France
Tao Pham Dinh

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Foglino, F., Leonetti, M., Sagratella, S., Seccia, R. (2020). A Gray-Box Approach for Curriculum Learning. In: Le Thi, H., Le, H., Pham Dinh, T. (eds) Optimization of Complex Systems: Theory, Models, Algorithms and Applications. WCGO 2019. Advances in Intelligent Systems and Computing, vol 991. Springer, Cham. https://doi.org/10.1007/978-3-030-21803-4_72

Download citation

DOI: https://doi.org/10.1007/978-3-030-21803-4_72
Published: 15 June 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-21802-7
Online ISBN: 978-3-030-21803-4
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics