Keywords

1 Introduction

Information theory has been adapted to control problems in various aspects including optimal metric, bound constraint and system explanation. It is well known that information theory and cybernetics share in a common theoretical and engineering background, both of which are concerned with signals and dynamic systems. Most traditional control system analysis assumes that the observed signal is available to controller in its entire decision period [1]. In this paper, the information theory modeling of FIR actuator is a commonly used convolutional structure.

There exists numerous work using information theory to explain and resolve problems in optimal control [2, 3]. Touchette and Lloyd [4] proved that the one-step reduction in entropy of the final state is upper bounded by MI between control variables and current state of the system. Minimum of the one-step reduction in entropy can be found by dynamic programming, but the computational complexity of Dynamic Programming (DP) grows exponentially with the number of time steps and control variables. In fact, a quantization problem arises when information is exchanged between the controller and the controlled dynamic object [7,8,9]. An actuator channel can only transmit a finite number of states at any given time [10]. There is a trade-off between the fineness of the control process and the time delay required to send the data over the channel. J.Baillieul demonstrated in [12] that adequate data quantization necessitates a bigger channel capacity or more data transmission time. In [13], directed information quantity is adopted as a measure of information transmission in feedback control system. However, the specific expressions for distortion measures other than quadratic distortion is rarely studied and researchers paid little attention to signal distribution in non-Gaussian PDF cases. Recently, authors in [14] have conducted some fundamental limit research in control field using information theory. Information theory is adopted as the main mathematical tool to obtain generic bounds on the variance of estimation errors in time series analysis in [15, 16].

In this paper, we will demonstrate how to derive the entropy-rate function about a given actuator with noise, which is based on traditional rate-distortion theory. Then we define the conditional entropy as a distortion metric in an actuator system. Finally, we derive and simulate the MI between system state and output.

This paper is organized as follows. Two preliminaries of our analysis process, entropy-rate function and Sampling A Posterior (SAP) method are introduced in Sect. 2. The modeling of FIR actuator and derivation of the posterior estimation are given in Sect. 3. We derive and calculate the MI between system state and output in Sect. 4. Simulation of a non causal track system is given in Sect. 5.

2 Preliminaries

Throughout the paper, we consider real-valued continuous random variables and random vectors, as well as discrete time stochastic processes. All random variables, random vectors, and stochastic processes are zero mean. Given a stochastic process \(\{x_k\}\), we assume the excitation of the control system be a set of finite-length K-dimensional independent and identically distributed Gaussian vectors, the system is linear and has memory. The output signal is an N-dimensional Gaussian vector Y.

While classical linear system theory uses frequency response theory to characterize noise and system transfer functions, modern control theory uses linear algebra and state space to establish dynamic equations. Specifically, we consider the basic system with state-space model given by

$$\begin{aligned} \left\{ \begin{aligned} \mathbf {\hat{x}}_{k}&= \textbf{x}_{k}+\textbf{w}_{k} \\ \mathbf {\hat{y}}_{k}&= {\textbf {A}} \mathbf {\hat{x}}_{k} \end{aligned} \right. \end{aligned}$$
(1)

where A is the FIR actuator matrix, \(\hat{y}_{k}\) is real output at step k, \(x_{k}\) is system state at step k. The impluse response sequence h(d) has the length of d. The sequence is then filled with zero to the length of \(N_{S}=N_{cp}+d\), where \(N_{cp}\) is the length of circulant prefix. The finite implues response \(\textbf{A}\) during the \(i^{th}\) symbol, which is N \(\times \) N toeplitz [17],

$$\begin{aligned} \begin{aligned}&\textbf{A}=\textbf{R}_{cp}\textbf{H}\textbf{T}_{cp}\\&=\left[ \begin{array}{ccccccc} h(0) &{} 0 &{} \ldots &{} 0 &{} h(d-1) &{} \ldots &{} h(1) \\ h(1) &{} h(0) &{} 0 &{} \ldots &{} \ldots &{} \ldots &{} \ldots \\ \ldots &{} \ldots &{} \ddots &{} \ldots &{} \ldots &{} \ldots &{} h(d-1) \\ h(d-1) &{} \ldots &{} \ldots &{} \ddots &{} \ldots &{} \ldots &{} 0 \\ 0 &{} \ldots &{} \ldots &{} \ldots &{} \ddots &{} \ldots &{} \ldots \\ \ldots &{} \ldots &{} \ldots &{} \ldots &{} \ldots &{} \ddots &{} 0 \\ 0 &{} \ldots &{} 0 &{} h(d-1) &{} \ldots &{} \ldots &{} h(0) \end{array}\right] , \end{aligned} \end{aligned}$$
(2)

where \(\textbf{R}_{c p}=\left[ \begin{array}{ll} 0_{N \times \left( N_{S}-N\right) }&I_{N} \end{array}\right] \) is \(N \times N_{S}\), \(\textbf{T}_{c p}=\left[ \begin{array}{cc} 0_{N-N c p} &{} I_{N c p} \\ &{} I_{N} \end{array}\right] \) is \( N_{S} \times N.\) \(\mathbf {{y}}_{k}\) is defined as a sequence of designed inputs over some number of timesteps k, \(\mathbf {\hat{y}}_{k}\) is the real actuator output. It is reasonable to assume that direct control over designed input \(\mathbf {{y}}_{k}\) exists, but not over \(\mathbf {\hat{y}}_{k}\). This is a common noise assumption for many robotic systems in which command input must pass through a lower-level controller. A prototypical example is the steering and throttle inputs for a car which are then used as set-point targets for low level servomotor controllers.

The MI calculation method we used in this paper is called SAP method. It is based on a random sampling of the posterior probability generated by each received signal. The statistical distribution of this estimate will naturally approximate the theoretical posterior probability distribution of the parameter to be estimated. The main constraint is that the PDFs must be regenerative, eg. Gaussian, Cauchy, Gamma, Poisson, Exponentional distribution. The sampling posterior probability estimation method is described in Algorithm 1. Firstly, generate the received signal vector, then match conditional PDF of received signal vector. The key step is generating an estimate by random sampling of the posterior PDF. Through this way the empirical entropy and the average estimation performance will be calculated.

figure a

3 FIR Actuator Model and Posteriori Estimation

Assuming that the channel is constant (slow fading) during one symbol period, its finite impulse response sequence in the \(i_{th}\) symbol period sequence is

$$\begin{aligned} \begin{aligned} y(n)&= \hat{x}(n)*h(n)=\sum _{l}^{n} \hat{x}(l) h(n-l) \\&=\hat{x}(n) h(0)+\sum _{l}^{n-1}\hat{x}(l)h(n-l), \end{aligned} \end{aligned}$$
(3)

where \(l=n-d+1\).

The FIR system state Eq. (1) can be extended as

$$\begin{aligned} \left\{ \begin{array}{l} \textbf{x}_{k}=A \textbf{x}_{k-1}+B \textbf{e}_{k-1}+\textbf{v}_{k} \\ \textbf{y}_{k}=C \textbf{x}_{k}+D \textbf{e}_{k}+\textbf{w}_{k} \end{array}\right. , \end{aligned}$$
(4)

where Gaussian noise \(v_{k}\), \(w_{k}\) are independent. Combine (4) with (1)(2), we give the whole system model, the notation \(e_{k}\) denotes the input of FIR system, \(y_{k}\) denotes the referred signal, \(\hat{y}_{k}\) denotes the output signal. Based on (3), we write the state equation as

Fig. 1.
figure 1

System Input Design with SAP method

$$\begin{aligned} \left\{ \begin{array}{l} p\left( \textbf{y}_{k} \mid \textbf{e}_{k}, \textbf{x}_{k}\right) =f_{w}\left( \textbf{y}_{k}-C \textbf{x}_{k}-D \textbf{e}_{k}\right) \\ p\left( \textbf{x}_{k} \mid \textbf{x}_{k-1}, \textbf{e}_{k-1}\right) =f_{v}\left( \textbf{x}_{k}-A \textbf{x}_{k-1}-B \textbf{e}_{k-1}\right) \end{array}\right. , \end{aligned}$$
(5)
$$\begin{aligned} \begin{aligned}&p\left( e_{k} \mid y_{k}, y^{k-1}, e^{k-1}, x^{k-1}, x_{k}\right) \\&=\frac{p\left( e_{k}, y_{k}, y^{k-1}, e^{k-1}, x^{k-1}, x_{k}\right) }{p\left( y_{k}, y^{k-1}, e^{k-1}, x^{k-1}, x_{k}\right) }. \end{aligned}\end{aligned}$$
(6)

Based on Bayes’ Rule, we continue to derive (6) and get the step estimation equation

$$\begin{aligned} \begin{aligned} p\left( e_{k} \mid \right.&\left. y_{k}, y^{k-1}, e^{k-1}, x^{k-1}, x_{k}\right) \\ =&\frac{f_{w} f_{v} p\left( y^{k-1}, e^{k-1}, x^{k-1}\right) p\left( e_{k}\right) }{E_{p\left( e_{k}\right) }\left[ f_{w} f_{v} p\left( y^{k-1}, e^{k-1}, x^{k-1}\right) \right] }. \end{aligned} \end{aligned}$$
(7)

The next stages are derived in the same way. Other PDF with property of regenerativity can also do this SAP algorithm.

4 Mutual Information of FIR Actuator

This section will examine the MI-distortion relation of FIR Actuator. The main problem is the processing of MI in this model. SAP algorithm based on sampling method is given to quantitatively figure out the MI between the system output and state.

Based on the actuator model shown in (4), it is reasonable to assume that the stochastic process in both sides of (4) share the same statistical characteristic, thus the auto correlation of a stochastic process \(\{\hat{x}_{k}\}\) is

$$\begin{aligned} \begin{aligned} R(d)&=E\left[ \hat{\textbf{x}} h(0)[\hat{\textbf{x}} h(0)]^{T}\right] \\&=\left[ \begin{array}{cccc} R_{x x}(0) &{} R_{x x}(-1) &{} \cdots &{} R_{x x}(-d+1) \\ R_{x x}(1) &{} R_{x x}(0) &{} \cdots &{} R_{x x}(-d+2) \\ \vdots &{} \vdots &{} \ddots &{} \vdots \\ R_{x x}(d-1) &{} R_{x x}(d-2) &{} \cdots &{} R_{x x}(0) \end{array}\right] , \\ \end{aligned} \end{aligned}$$
(8)

designed output \(y\sim N(0,\varSigma _{Y})\), distortion \((y-\hat{y})\sim N(0,\textbf{D})\). It is known that circulation matrix \(\textbf{A}\) in (2) can be diagonalized by Fourier matrices. Hence we have \(\hat{\textbf{Y}}=\varSigma \hat{\textbf{X}}\). To the same effect, the conditional probability distribution representing the relationship between finite precision output and ideal output is

$$\begin{aligned} \begin{aligned} p(\hat{\boldsymbol{y}} \mid \boldsymbol{y})=\frac{1}{\sqrt{(2 \pi )^{N}\left| R_{\delta }\right| }} \exp \left\{ -\frac{1}{2}(\hat{y}-\boldsymbol{y}) R_{\delta }^{-1}(\hat{\boldsymbol{y}}-\boldsymbol{y})^{T}\right\} , \end{aligned} \end{aligned}$$
(9)

based on the rate-distortion, we write the conditional entropy as the distortion metric constraint for minimization

$$\begin{aligned} \begin{aligned} \frac{1}{2} \log (2 \pi e)^{N} D=H(\textbf{Y} \mid \hat{\textbf{Y}}), \end{aligned}\end{aligned}$$
(10)

we have mutual information which indicates the information capacity of the FIR actuator,

$$\begin{aligned} \begin{aligned}&I(\textbf{y} ; \textbf{x})=\iint p(\textbf{x}, \textbf{y}) \ln \frac{p(\textbf{y} \mid \textbf{x})}{p(\textbf{y})} d \textbf{x} d \textbf{y}=h(\textbf{y})-h(\textbf{y} \mid \textbf{x}) \\&=\frac{1}{2} \ln (2 \pi )^{N}\left| \boldsymbol{\varSigma }_{\textbf{Y}}\right| +\frac{\sqrt{(2 \pi )^{N}\left( \left| \boldsymbol{\varSigma }_{\textbf{Y}}\right| +|\textbf{D}|\right) }}{|\textbf{D}|}\left[ \ln \frac{\sqrt{(2 \pi )^{N}\left( \left| \boldsymbol{\varSigma }_{\textbf{Y}}\right| +|\textbf{D}|\right) }}{(2 \pi )^{N}\left| \boldsymbol{\varSigma }_{\textbf{Y}}\right| |\textbf{D}|}-1\right] . \end{aligned} \end{aligned}$$
(11)

By using SAP algorithm described in Sect. 3 to calculate MI, the results are shown in Fig. 3. Other PDFs with regenerativity such as Cauchy, Gamma, Poisson, Exponentional distribution also apply to this deduction.

5 Simulation of a Non Causal Track System

We carry out the system simulation based on (5). The FIR system has 1 Hz bandwidth and it functions as a track system with reference input y.

Fig. 2.
figure 2

Simulation of the track system

Fig. 3.
figure 3

Mutual Information \(I(y;\hat{y})\) with different frequenciesies from 0.7 Hz 2 Hz

As can be shown in Fig. 2, the left column is the FFT of time series and the right column is the time domain series. The first row is the designed output y, and the second row is the actual output \(\hat{y}\). The third row is designed input signal e based on estimation Eq. (6) and the forth row is the noise w in system equations. This track system is a lowpass filter 1 Hz bandwidth. The MI between the first row y and the second row \(\hat{y}\) is shown in Fig. 3. It can be observed in Fig. 3 that as the tracked frequency y getting closer to the system’s cutoff 1 Hz, tracking process performs worse. The worse performance is reflected by the decrease of MI \(I(y;\hat{y})\) from 0.3 to 0.2.

MI result in Fig. 3 reflects the capacity of this track system. \(I(y;\hat{y})\) tells us the information from y to \(\hat{y}\) in Fig. 1. As the input frequency exceeds the allowed entropy-rate relation, the signal tracking process will not keep up. When the tracked frequency gradually 1 Hz, the MI decreases at the same time. The same phenomenon also exists in Mean Square Error (MSE), as input frequency getting closer to cutoff frequency, MSE between y and \(\hat{y}\) also decreases.

6 Conclusion

In this paper, we proposed an entropy-rate framework for the analysis of a FIR actuator model and gave the input estimation equation and posterior PDF recursive deduction. The simulation result about the MI of system is based on SAP method, and the calculation complexity of SAP method is reduced The MI metric can reflect the capacity of FIR actuator based on simulation results. Our entropy-rate relation can be a feasible framework to explain control system’s capacity. Other PDFs with regenerativity such as Cauchy, Gamma, Poisson, Exponentional distribution also apply to this analysis framework.