Faster Teaching by POMDP Planning

Rafferty, Anna N.; Brunskill, Emma; Griffiths, Thomas L.; Shafto, Patrick

doi:10.1007/978-3-642-21869-9_37

Anna N. Rafferty²³,
Emma Brunskill²³,
Thomas L. Griffiths²³ &
…
Patrick Shafto²⁴

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6738))

Included in the following conference series:

International Conference on Artificial Intelligence in Education

4360 Accesses
28 Citations
1 Altmetric

Abstract

Both human and automated tutors must infer what a student knows and plan future actions to maximize learning. Though substantial research has been done on tracking and modeling student learning, there has been significantly less attention on planning teaching actions and how the assumed student model impacts the resulting plans. We frame the problem of optimally selecting teaching actions using a decision-theoretic approach and show how to formulate teaching as a partially-observable Markov decision process (POMDP) planning problem. We consider three models of student learning and present approximate methods for finding optimal teaching actions given the large state and action spaces that arise in teaching. An experimental evaluation of the resulting policies on a simple concept-learning task shows that framing teacher action planning as a POMDP can accelerate learning relative to baseline performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Resolving Efficiency Bottleneck of the Bellman Equation in Adaptive Teaching

Empirically Evaluating the Effectiveness of POMDP vs. MDP Towards the Pedagogical Strategies Induction

Learning Teaching in Teaching: Online Reinforcement Learning for Intelligent Tutoring

References

Barnes, T., Stamper, J.: Toward automatic hint generation for logic proof tutoring using historical student data. In: Woolf, B.P., Aïmeur, E., Nkambou, R., Lajoie, S. (eds.) ITS 2008. LNCS, vol. 5091, pp. 373–382. Springer, Heidelberg (2008)
Chapter Google Scholar
Brunskill, E., Garg, S., Tseng, C., Pal, J., Findlater, L.: Evaluating an adaptive multi-user educational tool for low-resource regions. In: Proceedings of the International Conference on Information and Communication Technologies and Development (2010)
Google Scholar
Brunskill, E., Russell, S.: RAPID: A reachable anytime planner for imprecisely-sensed domains. In: Proceedings of the 26th Conference on Uncertainty in Artificial Intelligence (2010)
Google Scholar
Chang, K.-m., Beck, J.E., Mostow, J., Corbett, A.T.: A bayes net toolkit for student modeling in intelligent tutoring systems. In: Ikeda, M., Ashley, K.D., Chan, T.-W. (eds.) ITS 2006. LNCS, vol. 4053, pp. 104–113. Springer, Heidelberg (2006)
Chapter Google Scholar
Chi, M., Jordan, P., VanLehn, K., Hall, M.: Reinforcement learning-based feature selection for developing pedagogically effective tutorial dialogue tactics. In: Proceedings of the 1st International Conference on Educational Data Mining (2008)
Google Scholar
Conati, C., Muldner, K.: Evaluating a decision-theoretic approach to tailored example selection. In: Proceedings of the 20th International Joint Conference on Artificial Intelligence (2007)
Google Scholar
Corbett, A., Anderson, J.: Knowledge tracing: Modeling the acquisition of procedural knowledge. User Modeling and User-Adapted Interaction 4(4), 253–278 (1995)
Article Google Scholar
Doucet, A., de Freitas, N., Gordon, N.: Sequential Monte Carlo Methods in Practice. Springer, New York (2001)
Book MATH Google Scholar
Folsom-Kovarik, J., Sukthankar, G., Schatz, S., Nicholson, D.: Scalable POMDPs for diagnosis and planning in intelligent tutoring systems. In: AAAI Fall Symposium on Proactive Assistant Agents (2010)
Google Scholar
Murray, R., Vanlehn, K., Mostow, J.: Looking ahead to select tutorial actions: A decision-theoretic approach. International Journal of Artificial Intelligence in Education 14(3), 235–278 (2004)
Google Scholar
Restle, F.: The selection of strategies in cue learning. Psychological Review 69(4), 329–343 (1962)
Article Google Scholar
Ross, S., Chaib-draa, S., Pineau, J.: Bayesian reinforcement learning in continuous POMDPs with application to robot navigation. In: Proceedings of the International Conference on Robotics and Automation (2008)
Google Scholar
Ross, S., Pineau, J., Paquet, S., Chaib-draa, B.: Online planning algorithms for POMDPs. Journal of Artificial Intelligence Research 32(1), 663–704 (2008)
MathSciNet MATH Google Scholar
Sondik, E.J.: The Optimal Control of Partially Observable Markov Processes. Ph.D. thesis. Stanford University (1971)
Google Scholar
Tenenbaum, J.: Rules and similarity in concept learning. Advances in Neural Information Processing Systems 12 (2000)
Google Scholar
Theocharous, G., Beckwith, R., Butko, N., Philipose, M.: Tractable POMDP planning algorithms for optimal teaching in “SPAIS”. In: IJCAI PAIR Workshop (2009)
Google Scholar
Villano, M.: Probabilistic student models: Bayesian belief networks and knowledge space theory. In: Proceedings of the Second International Conference on Intelligent Tutoring Systems (1992)
Google Scholar

Download references

Author information

Authors and Affiliations

University of California, Berkeley, CA, 94720, USA
Anna N. Rafferty, Emma Brunskill & Thomas L. Griffiths
University of Louisville, KY, 40292, USA
Patrick Shafto

Authors

Anna N. Rafferty
View author publications
You can also search for this author in PubMed Google Scholar
Emma Brunskill
View author publications
You can also search for this author in PubMed Google Scholar
Thomas L. Griffiths
View author publications
You can also search for this author in PubMed Google Scholar
Patrick Shafto
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

EECS Department/ISIS, Vanderbilt University, TN 37235, Nashville, USA
Gautam Biswas
Electronic, Electrical and Computer Engineering, University of Birmingham, U.K.
Susan Bull
School of Information Technologies, University of Sydney, 1 Cleveland Street, 2006, Sydney, Australia
Judy Kay
College of Engineering, Department of Computer Science and Software Engineering, University of Canterbury, Private Bag 4800, 8140, Christchurch, New Zealand
Antonija Mitrovic

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Rafferty, A.N., Brunskill, E., Griffiths, T.L., Shafto, P. (2011). Faster Teaching by POMDP Planning. In: Biswas, G., Bull, S., Kay, J., Mitrovic, A. (eds) Artificial Intelligence in Education. AIED 2011. Lecture Notes in Computer Science(), vol 6738. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21869-9_37

Download citation

DOI: https://doi.org/10.1007/978-3-642-21869-9_37
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21868-2
Online ISBN: 978-3-642-21869-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Faster Teaching by POMDP Planning

Abstract

Access this chapter

Preview

Similar content being viewed by others

Resolving Efficiency Bottleneck of the Bellman Equation in Adaptive Teaching

Empirically Evaluating the Effectiveness of POMDP vs. MDP Towards the Pedagogical Strategies Induction

Learning Teaching in Teaching: Online Reinforcement Learning for Intelligent Tutoring

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Faster Teaching by POMDP Planning

Abstract

Access this chapter

Preview

Similar content being viewed by others

Resolving Efficiency Bottleneck of the Bellman Equation in Adaptive Teaching

Empirically Evaluating the Effectiveness of POMDP vs. MDP Towards the Pedagogical Strategies Induction

Learning Teaching in Teaching: Online Reinforcement Learning for Intelligent Tutoring

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation