Agreement in Teams and the Dynamic Programming Approach Under Information Constraints

Yüksel, Serdar; Başar, Tamer

doi:10.1007/978-1-4614-7085-4_12

Serdar Yüksel³ &
Tamer Başar⁴

Part of the book series: Systems & Control: Foundations & Applications ((SCFA))

2342 Accesses
3 Citations

Abstract

This chapter presents the notions of agreement and common knowledge, and addresses the question of how to achieve common knowledge. It presents a general framework for obtaining solutions to dynamic team problems under decentralized information structures based on dynamic programming and an evolving common knowledge, and applies this primarily in the context of the belief sharing information pattern. Information rates required for tractability of optimal solutions are also presented. Finally, the chapter introduces a team cost-rate function, which provides the minimum cost subject to a rate constraint on the information exchange among members of a team.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Acemoglu, D., Ozdaglar, A.: Opinion dynamics and learning in social networks. Dyn. Games Appl. 1, 3–49 (2010)
Article MathSciNet Google Scholar
Aicardi, M., Davoli, F., Minciardi, R.: Decentralized optimal control of Markov chains with a common past information set. IEEE Trans. Autom. Control 32, 1028–1031 (1987)
Article MathSciNet MATH Google Scholar
Athans, M.: Survey of decentralized control methods. In: 3rd NBER/FRB Workshop on Stochastic Control, Washington D.C, 1974
Google Scholar
Aumann, R.J.: Agreeing to disagree. Ann. Stat. 4, 1236–1239 (1976)
Article MathSciNet MATH Google Scholar
Başar, T.: An equilibrium theory for multi-person decision making with multiple probabilistic models. IEEE Trans. Autom. Control 30, 118–132 (1985)
Article MATH Google Scholar
Bamieh, B., Voulgaris, P.: A convex characterization of distributed control problems in spatially invariant systems with communication constraints. Syst. Control Lett. 54, 575–583 (2005)
Article MathSciNet MATH Google Scholar
Barron, A.R.: Information theory and martingales. In: IEEE International Symposium on Information Theory, Recent Results Session, Budapest, Hungary, 1991
Google Scholar
Barta, S.M.: On linear control of decentralized stochastic systems. Ph.D. Thesis, Department of EECS, MIT, Cambridge (1978)
Google Scholar
Blackwell, D., Dubins, L.: Merging of opinions with increasing information. Ann. Math. Stat. 33, 882–887 (1962)
Article MathSciNet MATH Google Scholar
Borkar, V.S.: Probability Theory: An Advanced Course. Springer, New York (1995)
MATH Google Scholar
Borkar, V.S., Varaiya, P.: Asymptotic agreement in distributed estimation. IEEE Trans. Autom. Control 27, 650–655 (1982)
Article MathSciNet MATH Google Scholar
Boyd, S., Vandenberghe, L.: Convex Optimization. Cambridge University Press, Cambridge (2004)
MATH Google Scholar
Brandenburger, A., Dekel, E.: Common knowledge with probability 1. J. Math. Econ. 16, 237–245 (1987)
Article MathSciNet MATH Google Scholar
Burnashev, M.V.: Data transmission over a discrete channel with feedback. Random transmission time. Prob. Inform. Transm. 12, 10–30 (1976)
MATH Google Scholar
Castanon, D.A., Teneketzis, D.: Distributed estimation algorithms for nonlinear systems. IEEE Trans. Autom. Control 30, 418–425 (1985)
Article MathSciNet MATH Google Scholar
Castanon, D.A., Teneketzis, D.: Further results on the asymptotic agreement problem. IEEE Trans. Autom. Control 33, 515–523 (1988)
Article MathSciNet MATH Google Scholar
Chong, C.Y., Athans, M.: On the periodic coordination of linear stochastic systems. Automatica 12, 321–335 (1976)
Article MathSciNet MATH Google Scholar
Como, G., Fagnani, F.: Scaling limits for continuous opinion dynamics systems. Ann. Appl. Probab. 21, 1537–1567 (2011)
Article MathSciNet MATH Google Scholar
Cover, T.M., Thomas, J.A.: Elements of Information Theory. Wiley, New York (1991)
Book MATH Google Scholar
Csiszar, I., Korner, J.: Information Theory: Coding Theorems for Discrete Memoryless Channels. Akademiai Kiado, Budapest (1981)
MATH Google Scholar
Diaconis, P., Freedman, D.: On the consistency of bayes estimates. Ann. Stat. 14, 1–26 (1986)
Article MathSciNet MATH Google Scholar
Fardad, M., Lin, F., Jovanovic, M.R.: Optimal control of vehicular formations with nearest neighbor interactions. IEEE Trans. Autom. Control 57, 2203–2218 (2012)
Article MathSciNet Google Scholar
Foster, D.P., Young, H.P.: On the impossibility of predicting the behavior of rational agents. Proc. Nat.Acad. Sci. 22, 12848–12853 (2001)
Article MathSciNet Google Scholar
Frey, B.J.: Graphical Models for Machine Learning and Digital Communication. MIT Press, Cambridge (1998)
Google Scholar
El Gamal, A., Kim, Y.H.: Network Information Theory. Cambridge University Press, Cambridge (2012)
Google Scholar
Geanakoplos, J.: Common knowledge. In: Aumann, R.J., Hart, S. (eds.) Handbook of Game Theory, Chapter 40, pp. 1437–1496. Elsevier, Amsterdam (1994)
Google Scholar
Geanakoplos, J., Polemarchakis, H.M.: We can’t disagree forever. J. Econ. Theor. 28, 192–200 (1982)
Article MATH Google Scholar
Gharesifard, B., Cortés, J.: Distributed continuous-time convex optimization on weight-balanced digraphs, IEEE Transaction on Automatic Control, to appear (2013)
Google Scholar
Giridhar, A., Kumar, P.R.: Toward a theory of in-network computation in wireless sensor networks. IEEE Commun. Mag. 44, 98–107 (2006)
Article Google Scholar
Goldberg, A., Yüksel, S.: On the value of information in decentralized control. Queen’s University Technical Report, August 2012
Google Scholar
György, A., Linder, T.: Optimal entropy-constrained scalar quantization of a uniform source. IEEE Trans. Inf. Theor. 46, 2704–2711 (2000)
Article MATH Google Scholar
Huang, M., Caines, P.E., Malhamé, R.P.: Large-population cost-coupled LQG problems with nonuniform agents: individual-mass behavior and decentralized ε-Nash equilibria. IEEE Trans. Autom. Control 52, 1560–1571 (2007)
Article Google Scholar
Huang, M., Malhamé, R.P., Caines, P.E.: Large population stochastic dynamic games: closed-loop McLean-Vlasov systems and the Nash certainty equivalence principle (Special issue in honour of the 65th birthday of Tyrone Duncan). Commun. Inf. Syst. 6, 221–252 (2006)
MathSciNet MATH Google Scholar
Jadbabaie, A., Lin, J., Morse, A.S.: Coordination of groups of mobile autonomous agents using nearest neighbor rules. IEEE Trans. Autom. Control 48, 988–1001 (2003)
Article MathSciNet Google Scholar
Jovanovic, M.R.: On the optimality of localized distributed controllers. Int. J. Syst. Control Commun. 2, 82–99 (2010)
Article MathSciNet Google Scholar
Kalai, E., Lehrer, E.: Weak and strong merging of opinions. J. Math. Econ. 23, 73–100 (1994)
Article MathSciNet MATH Google Scholar
Lasry, J.M., Lions, P.L.: Mean field games. Jpn. J. Math. 2, 229–260 (2007)
Article MathSciNet MATH Google Scholar
Lessard, L., Lall, S.: A state-space solution to the two-player decentralized optimal control problem. In: Annual Allerton Conference on Communication, Control and Computing, Monticello, IL, 2011
Google Scholar
Li, S., Başar, T.: Asymptotic agreement and convergence of asynchronous stochastic algorithms. IEEE Trans. Autom. Control 32, 612–618 (1987)
Article MATH Google Scholar
Ma, N., Ishwar, P.: Some results on distributed source coding for interactive function computation. IEEE Trans. Inf. Theor. 57, 6180–6195 (2011)
Article MathSciNet Google Scholar
Mahajan, A.: Sequential decomposition of sequential teams: applications to real-time communication and networked control systems. Ph.D. Dissertation, University of Michigan, Ann Arbor (2008)
Google Scholar
Mahajan, A., Martins, N.C., Rotkowitz, M., Yüksel, S.: Information structures in optimal decentralized control. In: IEEE Conference on Decision and Control, Hawaii, USA, 2012
Google Scholar
Mahajan, A., Yüksel, S.: Measure and cost dependent properties of information structures. In: Proceedings of the American Control Conference, Baltimore, MD, USA, 2010
Google Scholar
Mahmoud, M.S.: Decentralized Systems with Design Constraints. Springer, London (2011)
Book MATH Google Scholar
Mooij, M., Kappen, H.J.: Sufficient conditions for convergence of the sum-product algorithm. IEEE Trans. Inf. Theor. 53, 4422–4437 (2007)
Article MathSciNet Google Scholar
Nachbar, J.H.: Prediction, optimization, and learning in games. Econometrica 65, 275–309 (1997)
Article MathSciNet MATH Google Scholar
Nair, G.N.: A nonstochastic information theory for communication and state estimation. IEEE Trans. Autom. Control (2013, to appear)
Google Scholar
Nayyar, A., Mahajan, A., Teneketzis, D.: Decentralized stochastic control with partial sharing information structures: a common information approach. IEEE Transactions on Automatic Control, to appear (2013)
Google Scholar
Nayyar, A., Mahajan, A., Teneketzis, D.: Optimal control strategies in delayed sharing information structures. IEEE Trans. Autom. Control 56, 1606–1620 (2011)
Article MathSciNet Google Scholar
Nayyar, A., Mahajan, A., Teneketzis, D.: The common-information approach to decentralized stochastic control. In: Como, G., Bernhardsson, B., Rantzer, A. (eds.) Information and Control in Networks. Springer, New York (2013)
Google Scholar
Nedić, A., Özdaglar, A., Parrilo, P.A.: Constrained consensus and optimization in multi-agent networks. IEEE Trans. Autom. Control 55, 922–938 (2010)
Article Google Scholar
Nemirovsky, A.S., Yudin, D.B.: Problem Complexity and Method Efficiency in Optimization. Wiley-Interscience, New York (1983)
MATH Google Scholar
Nielsen, L.: Common knowledge, communication and convergence of beliefs. Math. Soc. Sci. 8, 1–14 (1984)
Article MATH Google Scholar
Olshevsky, A.: Efficient information aggregation methods in distributed control and signal processing. Ph.D. Thesis, Department of EECS, MIT, Cambridge, MA (2010)
Google Scholar
Ooi, J., Verbout, S., Ludwig, J., Wornell, G.: A separation theorem for periodic sharing information patterns in decentralized control. IEEE Trans. Autom. Control 42, 1546–1550 (1997)
Article MathSciNet MATH Google Scholar
Orlitsky, A., Roche, J.R.: Coding for computing. IEEE Trans. Inf. Theor. 47, 903–917 (2001)
Article MathSciNet MATH Google Scholar
Raginsky, M., Rakhlin, A.: Information-based complexity, feedback and dynamics in convex programming. IEEE Trans. Inf. Theor. 57, 7036– 7056 (2011)
Article MathSciNet Google Scholar
Rotkowitz, M., Lall, S.: A characterization of convex problems in decentralized control. IEEE Trans. Autom. Control 51, 274–286 (2006)
Article MathSciNet Google Scholar
Sabau, S., Martins, N.C.: Stabilizability and norm-optimal control design subject to sparsity constraints. IEEE Trans. Autom. Control, under review (2013)
Google Scholar
Singh, M.G.: Decentralised Control. North Holland, Amsterdam (1981)
MATH Google Scholar
Srikant, R.: The Mathematics of Internet Congestion Control. Birkhäuser-Springer, USA (2004)
Book MATH Google Scholar
Sundaram, S., Hadjicostis, C.N.: Distributed function calculation and consensus using linear iterative strategies. IEEE J. Sel. Areas Commun.: Issue Control Commun. 26, 650–660 (2008)
Article Google Scholar
Tatikonda, S., Mitter, S.: Control under communication constraints. IEEE Trans. Autom. Control 49(7), 1056–1068 (2004)
Article MathSciNet Google Scholar
Teneketzis, D.: Communication in decentralized control. Ph.D. Dissertation, M.I.T. (1979)
Google Scholar
Teneketzis, D., Varaiya, P.: Consensus in distributed estimation. In: Poor, H.V. (ed.) Advances in Statistical Signal Processing, pp. 361–386. JAI Press, Greenwich (1988)
Google Scholar
Touri, B., Nedić, A.: On ergodicity, infinite flow and consensus in random models. IEEE Trans. Autom. Control 56, 1593–1605 (2011)
Article Google Scholar
Tsitsiklis, J.N.: Problems in decentralized decision making and computation. Ph.D. Thesis, Department of EECS, MIT, Cambridge, MA (1984)
Google Scholar
Tsitsiklis, J.N.: Extremal properties of likelihood-ratio quantizers. IEEE Trans. Commun. 41, 550–558 (1993)
Article MathSciNet MATH Google Scholar
Tsitsiklis, J.N., Athans, M.: Convergence and asymptotic agreement in distributed decision problems. IEEE Trans. Autom. Control 29, 42–50 (1984)
Article MathSciNet MATH Google Scholar
Tsitsiklis, J.N., Athans, M.: On the complexity of decentralized decision making and detection problems. IEEE Trans. Autom. Control 30, 440–446 (1985)
Article MathSciNet MATH Google Scholar
Tsitsiklis, J.N., Luo, Z.-Q.: Communication complexity of convex optimization. J. Complex. 3, 231–243 (1987)
Article MathSciNet MATH Google Scholar
van Zandt, T.: Decentralized Information Processing in the Theory of Organizations, in Contemporary Economic Development Reviewed, vol. 4: The Enterprise and its Environment. Murat Sertel, ed. London: MacMillan. (1997)
Google Scholar
Witsenhausen, H.S.: The zero-error side information problem and chromatic numbers. IEEE Trans. Inf. Theor. 22, 592–593 (1976)
Article MathSciNet MATH Google Scholar
Wong, W.S.: Control communication complexity of distributed control systems. SIAM J. Control Optim. 48, 1722–1742 (2009)
Article MathSciNet MATH Google Scholar
Wong, W.S., Baillieul, J.: Control communication complexity of distributed actions. IEEE Trans. Autom. Control. 57, 2731–2345 (2012)
Article MathSciNet Google Scholar
Yao, A.C.C.: Some complexity questions related to distributive computing. In: Proceedings of the of the 11th Annual ACM Symposium on Theory of Computing, 1979
Google Scholar
Yoshikawa, T.: Dynamic programming approach to decentralized control problems. IEEE Trans. Autom. Control 20, 796–797 (1975)
Article MathSciNet MATH Google Scholar
Yüksel, S.: Stochastic nestedness and the belief sharing information pattern. IEEE Trans. Autom. Control 54, 2773–2786 (2009)
Article Google Scholar
Yüksel, S., Başar, T.: Minimum rate coding for LTI systems over noiseless channels. IEEE Trans. Autom. Control 51(12), 1878–1887 (2006)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Mathematics and Statistics, Queen’s University, Kingston, Ontario, Canada
Serdar Yüksel
Department of Electrical and Computer Engineering, Coordinated Science Laboratory, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
Tamer Başar

Authors

Serdar Yüksel
View author publications
You can also search for this author in PubMed Google Scholar
Tamer Başar
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Yüksel, S., Başar, T. (2013). Agreement in Teams and the Dynamic Programming Approach Under Information Constraints. In: Stochastic Networked Control Systems. Systems & Control: Foundations & Applications. Birkhäuser, New York, NY. https://doi.org/10.1007/978-1-4614-7085-4_12

Download citation

DOI: https://doi.org/10.1007/978-1-4614-7085-4_12
Published: 09 April 2013
Publisher Name: Birkhäuser, New York, NY
Print ISBN: 978-1-4614-7084-7
Online ISBN: 978-1-4614-7085-4
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics