Abstract
We explore the subject of uniting the control-theoretic with the factorization-based approach to recommendation, arguing that tensor factorization may be employed to vanquish combinatorial complexity impediments related to more sophisticated MDP models that take a history of previous states rather than one single state into account. Specifically, we introduce a tensor representation of transition probabilities of Markov-k-processes and devise a Tucker-based approximation architecture that relies crucially on the notion of an aggregation basis described in Chap. 6. As our method requires a partitioning of the set of state transition histories, we are left with the challenge of how to determine a suitable partitioning, for which we propose a genetic algorithm.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bertsekas, D.P., Tsitsiklis, J.N.: Neuro-Dynamic Programming. Athena Scientific, Belmont (1996)
Holland, J.H.: Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence. The MIT Press, Cambridge (1992)
Paprotny, A.: Multilevel Methods for Dynamic Programming: Deterministic and Stochastic Iterative Methods with Application to Recommendation Engines. AVM – Akademische Verlagsgemeinschaft, München (2011)
Tsitsiklis, J.N., Roy, B.V.: An analysis of temporal-difference learning with function approximation. IEE Trans. Autom. Control 42(5), 674–690 (1997)
Zimmermann K.-H.: Diskrete Mathematik (in German). Books on Demand, Norderstedt (2006)
Ziv, O.: Algebraic multigrid for reinforcement learning. Master’s Thesis, Technion (2004)
Ziv, O., Shimkin, N.: Multigrid methods for policy evaluation and reinforcement learning. In: 2005 International Symposium on Intelligent Control (2005)
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Paprotny, A., Thess, M. (2013). The Big Picture: Toward a Synthesis of RL and Adaptive Tensor Factorization. In: Realtime Data Mining. Applied and Numerical Harmonic Analysis. Birkhäuser, Cham. https://doi.org/10.1007/978-3-319-01321-3_10
Download citation
DOI: https://doi.org/10.1007/978-3-319-01321-3_10
Published:
Publisher Name: Birkhäuser, Cham
Print ISBN: 978-3-319-01320-6
Online ISBN: 978-3-319-01321-3
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)