Abstract
The concept of “information structures” in decentralized control is a formalization of the notion of “who knows what and when do they know it.” Even seemingly simple problems with simply stated information structures can be extremely hard to solve. Perhaps the simplest of such unsolved problem is the celebrated Witsenhausen counterexample, formulated by Hans Witsenhausen in 1968. This entry discusses how the information structure of the Witsenhausen counterexample makes it hard and how an information-theoretic approach, which explores the knowledge gradient due to a nonclassical information pattern, has helped obtain insights into the problem.
Access provided by Autonomous University of Puebla. Download reference work entry PDF
Similar content being viewed by others
Keywords
Introduction
Modern control systems often comprise of multiple decentralized control agents that interact over communication channels (Fig. 1). What characteristic distinguishes a centralized control problem from a decentralized one? One fundamental difference is a “knowledge gradient”: agents in a decentralized team often observe, and hence know, different things. This observation leads to the idea of information patterns (Witsenhausen 1971), a formalization of the notion of “who knows what and when do they know it” (Ho et al. 1978; Mitter and Sahai 1999)
.
The information pattern is said to be classical if all agents in the team receive the same information and have perfect recall (so they do not forget it). What is so special about classical information patterns? For these patterns, the presence of external communication links has no effect on the optimal costs! After all, what could the agents use the communication links for, when there is no knowledge gradient? More interesting, therefore, are the problems for which the information pattern is nonclassical. These problems sit at the intersection of communication and control: communication between agents can help reduce the knowledge differential that exists between them, helping them perform the control task. Intellectually and practically, the concept of nonclassical information patterns motivates a lot of formulations at the control-communication intersection. Many of these formulations – including some discussed in this Encyclopedia (e.g., Data Rate of Nonlinear Control Systems and Feedback Entropy; Information and Communication Complexity of Networked Control Systems Quantized Control and Data Rate Constraints; Networked Control Systems: Architecture and Stability Issues; and Networked Control Systems: Estimation and Control Over Lossy Networks) – intellectually ask the question: for a realistic channel that is constrained by noise, bandwidth, and speed, what is the optimal communication and control strategy?
One could ask the question of optimal control strategy even for decentralized control problems where no external channel is available to bridge this knowledge gradient. Why could these problems be of interest? First, these problems are limiting cases of control with communication constraints. Second, and perhaps more importantly, they bring out an interesting possibility that can allow the agents to “communicate,” i.e., exchange information, even when the external channel is absent. It is possible to use control actions to communicate through changing the system state! We now introduce this form of communication using a simple toy example.
Communicating Using Actions: An Example
To gain intuition into when communication using actions could be useful, consider the inverted pendulum example shown in Fig. 2. The goal of the two agents is to bring the pendulum as close to the origin as possible. Both controllers have their strengths and weaknesses. The “weak” controller C w has little energy, but has perfect state observations. On the other hand, the “blurry” controller C b has infinite energy, but noisy observations. They act one after the other, and their goal is to move the pendulum close to the center from any initial state. The information structure of the problem is nonclassical: the C w , but not C b , knows the initial state of the pendulum, and C w does not know the precise (noisy) observation of C b using which C b takes actions.
A possible strategy: A little thought reveals an interesting strategy – the weak controller, having perfect observations, can move the state to the closest of some predecided points in space, effectively quantizing the state. If these quantization points are sufficiently far from each other, they can be estimated accurately (through the noise) by the blurry controller, which can then use its energy to push the pendulum all the way to zero. In this way, the weak controller expends little energy, but is able to “communicate” the state through the noise to the blurry controller, by making it take values on a finite set. Once the blurry controller has received the state through the noise, it can use its infinite energy to push the state to zero.
The Witsenhausen Counterexample
The above two-controller inverted-pendulum example is, in fact, motivated by what is now known as “the Witsenhausen counterexample,” formulated by Witsenhausen in 1968 (see Fig. 3). In the counterexample, two controllers (denoted here by C w for “weak” and C b for “blurry”) act one after the other in two time-steps to minimize a quadratic cost function. The system state is denoted by x t , where t is the time index. u w and u b denote the inputs generated by the two controllers. The cost function is \(k^{2}\mathbb{E}\left [u_{w}^{2}\right ] + \mathbb{E}\left [x_{2}^{2}\right ]\) for some constant k. The initial state x0 and the noise z at the input of the blurry controller are assumed to be Gaussian distributed and independent, with variances σ02 and 1 respectively. The problem is a “linear-quadratic-Gaussian” (LQG) problem, i.e., the state evolution is linear, the costs are quadratic, and the primitive random variables are Gaussian.
Why is the problem called a “counterexample”? The traditional “certainty-equivalence” principle (Bertsekas 1995) shows that for all centralized LQG problems, linear control laws are optimal. Witsenhausen (1968) provided a nonlinear strategy for the Witsenhausen problem which outperforms all linear strategies. Thus, the counterexample showed that the certainty-equivalence doctrine does not extend to decentralized control.
What is this strategy of Witsenhausen that outperforms all linear strategies? It is, in fact, a quantization-based strategy, as suggested in our inverted-pendulum story above. Further, it was shown by Mitter and Sahai (1999) that multipoint quantization strategies can outperform linear strategies by an arbitrarily large factor! This observation, combined with the simplicity of the counterexample, makes the problem very important in decentralized control. This simple two-time-step two-controller LQG problem needs to be understood to have any hope of understanding larger and more complex problems.
While the optimal costs for the problem are still unknown (even though it is known that an optimal strategy exists (Witsenhausen 1968; Wu and Verdú 2011)), there exists a wealth of understanding of the counterexample that has helped address more complicated problems. A body of work, starting with that of Baglietto et al. (1997), numerically obtained solutions that could be close to optimal (although there is no mathematical proof thereof). All these solutions have a consistent form (illustrated in Fig. 4), with slight improvements in the optimal cost. Because the discrete version of the problem, appropriately relaxed, is known to be NP-complete (Papadimitriou and Tsitsiklis 1986), this approach cannot be used to understand the entire parameter space and hence has focused on one point: \(k^{2} = 0.5,\sigma _{0}^{2} = 5\). Nevertheless, the approach has proven to be insightful: a recent information-theoretic body of work shows that the strategies of Fig. 4 can be thought of as information-theoretic strategies of “dirty-paper coding” Costa (1983) that is related to the idea of embedding information in the state. The question here is: how do we embed the information about the state in the state itself ?
An information-theoretic view of the counterexample: This information-theoretic approach that culminated in Grover et al. (2013) also obtained the first approximately optimal solutions to the Witsenhausen counterexample as well as its vector extensions. The result is established by analyzing information flows in the counterexample that work toward minimizing the knowledge gradient, effectively an information pattern in which C w can predict the observation of C b more precisely. The analysis provides an information-theoretic lower bound on cost that holds irrespective of what strategy is used. For the original problem, this characterizes the optimal costs (with associated strategies) within a factor of 8 for all problem parameters (i.e., k and σ02). For any finite-length extension, uniform finite-ratio approximations also exist (Grover et al. 2013). The asymptotically infinite-length extension has been solved exactly (Choudhuri and Mitra 2012).
The problem has also driven delineation of decentralized LQG control problems with optimal linear solutions and those with nonlinear optimal solutions. This led to the development and understanding of many variations of the counterexample (Bansal and Başar 1987; Başar 2008; Ho et al. 1978; Rotkowitz 2006) and understanding that can extend to larger decentralized control problems. More recent work shows that the promise of the Witsenhausen counterexample was not a misplaced one: the information-theoretic approach that provides approximately optimal solutions to the counterexample (Grover et al. 2013) yields solutions to other more complex (e.g., multi-controller, more time-steps) problems as well (Grover 2010; Park and Sahai 2012).
Summary and Future Directions
Even simple problems with nonclassical information structures can be hard to solve using classical techniques, as is demonstrated by the Witsenhausen counterexample. However, nonclassical information pattern for some simple problems – starting with the counterexample – has recently been explored via an information-theoretic lens, yielding the first optimal or approximately optimal solutions to these problems. This approach is promising for larger decentralized control problems as well. It is now important to explore what is the simplest decentralized control problem that cannot be solved (exactly or approximately) using ideas developed for the counterexample. In this manner, the Witsenhausen counterexample can provide a platform to unify the more modern (i.e., external-channel centric approaches, see Quantized Control and Data Rate Constraints; Data Rate of Nonlinear Control Systems and Feedback Entropy; Networked Control Systems: Architecture and Stability Issues; Networked Control Systems: Estimation and Control Over Lossy Networks; Information and Communication Complexity of Networked Control Systems; in the encyclopedia) with the more classical decentralized LQG problems, leading to enriching and useful formulations.
Bibliography
Baglietto M, Parisini T, Zoppoli R (1997) Nonlinear approximations for the solution of team optimal control problems. In: Proceedings of the IEEE conference on decision and control (CDC), San Diego, pp 4592–4594
Bansal R, Başar T (1987) Stochastic teams with nonclassical information revisited: when is an affine control optimal? IEEE Trans Autom Control 32: 554–559
Başar T (2008) Variations on the theme of the Witsenhausen counterexample. In: Proceedings of the 47th IEEE conference on decision and control (CDC), Cancun, pp 1614–1619
Bertsekas D (1995) Dynamic programming. Athena Scientific, Belmont
Costa M (1983) Writing on dirty paper. IEEE Trans Inf Theory 29(3):439–441
Choudhuri C, Mitra U (2012) On Witsenhausen’s counterexample: the asymptotic vector case. In: IEEE information theory workshop (ITW), Lausanne, pp 162–166
Grover P (2010) Actions can speak more clearly than words. Ph.D. dissertation, UC Berkeley, Berkeley
Grover P, Sahai A (2010) Vector Witsenhausen counterexample as assisted interference suppression. Int J Syst Control Commun 2:197–237. Special issue on Information Processing and Decision Making in Distributed Control Systems
Grover P, Park S-Y, Sahai A (2013) Approximately-optimal solutions to the finite-dimensional Witsenhausen counterexample. IEEE Trans Autom Control 58(9):2189
Ho YC, Kastner MP, Wong E (1978) Teams, signaling, and information theory. IEEE Trans Autom Control 23(2):305–312
Mitter SK, Sahai A (1999) Information and control: Witsenhausen revisited. In: Yamamoto Y, Hara S (eds) Learning, control and hybrid systems. Lecture notes in control and information sciences, vol 241. New York, NY: Springer, pp 281–293
Papadimitriou CH, Tsitsiklis JN (1986) Intractable problems in control theory. SIAM J Control Optim 24(4):639–654
Park SY, Sahai A (2012) It may be easier to approximate decentralized infinite-horizon LQG problems. In: 51st IEEE Conference on Decision and Control (CDC), Maui, pp 2250–2255
Rotkowitz M (2006) Linear controllers are uniformly optimal for the Witsenhausen counterexample. In: Proceedings of the 45th IEEE conference on decision and control (CDC), San Diego, pp 553–558
Witsenhausen HS (1968) A counterexample in stochastic optimum control. SIAM J Control 6(1): 131–147
Witsenhausen HS (1971) Separation of estimation and control for discrete time systems. Proc IEEE 59(11):1557–1566
Wu Y, Verdú S (2011) Witsenhausen’s counterexample: a view from optimal transport theory. In: IEEE conference on decision and control and European control conference (CDC-ECC), Orlando, pp 5732–5737
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer-Verlag London
About this entry
Cite this entry
Grover, P. (2015). Information Structures, the Witsenhausen Counterexample, and Communicating Using Actions. In: Baillieul, J., Samad, T. (eds) Encyclopedia of Systems and Control. Springer, London. https://doi.org/10.1007/978-1-4471-5058-9_148
Download citation
DOI: https://doi.org/10.1007/978-1-4471-5058-9_148
Published:
Publisher Name: Springer, London
Print ISBN: 978-1-4471-5057-2
Online ISBN: 978-1-4471-5058-9
eBook Packages: EngineeringReference Module Computer Science and Engineering