Structure theory for ensemble controllability, observability, and duality

Chen, Xudong

doi:10.1007/s00498-019-0237-5

Structure theory for ensemble controllability, observability, and duality

Original Article
Published: 18 June 2019

Volume 31, pages 1–40, (2019)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Mathematics of Control, Signals, and Systems Aims and scope Submit manuscript

Structure theory for ensemble controllability, observability, and duality

Download PDF

Xudong Chen ORCID: orcid.org/0000-0002-0135-0606¹

759 Accesses
19 Citations
Explore all metrics

Abstract

Ensemble control deals with the problem of using a finite number of control inputs to simultaneously steer a large population (in the limit, a continuum) of control systems. Dual to the ensemble control problem, ensemble estimation deals with the problem of using a finite number of measurement outputs to estimate the initial state of every individual system in the ensemble. We introduce in the paper a novel class of ensemble systems, termed distinguished ensemble systems, and establish sufficient conditions for controllability and observability of such systems. Every distinguished ensemble system has two key components, namely a set of distinguished control vector fields and a set of codistinguished observation functions. Roughly speaking, a set of vector fields is distinguished if it is closed (up to scaling) under Lie bracket, and moreover, every vector field in the set can be obtained by a Lie bracket of two vector fields in the same set. Similarly, a set of functions is codistinguished to a set of vector fields if the Lie derivatives of the functions along the given vector fields yield (up to scaling) the same set of functions. We demonstrate in the paper that the structure of a distinguished ensemble system can significantly simplify the analysis of ensemble controllability and observability. Moreover, such a structure can be used as a guiding principle for ensemble system design. We further address in the paper the problem about existence of a distinguished ensemble system for a given manifold. We provide an affirmative answer for the case where the manifold is a connected semi-simple Lie group. Specifically, we show that every such Lie group admits a set of distinguished vector fields, together with a set of codistinguished functions. The proof is constructive, leveraging the structure theory of semi-simple real Lie algebras and representation theory. Examples will be provided along the presentation of the paper illustrating key definitions and main results.

A theoretical investigation of Brockett’s ensemble optimal control problems

Article 06 September 2019

Coherence of Quantum Ensemble as a Dual to Uncertainty for a Single Observable

Article 18 October 2019

Quantifying the quantumness of ensembles via generalized α-z-relative rényi entropy

Article 12 July 2020

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

We address in the paper controllability and observability of a continuum ensemble of control systems. Roughly speaking, ensemble control deals with the problem of using a finite number of control inputs to simultaneously steer a large population (in the limit, a continuum) of control systems. These individual control systems may be structurally identical, but show variations in their tuning parameters. Dual to ensemble control, ensemble estimation deals with the problem of estimating the state of every individual control system in the ensemble using only a finite number of measurement outputs. We refer the reader to Fig. 1 for an illustration of a continuum ensemble of control systems indexed by a parameter of a two-dimensional surface. Note that any finite ensemble of control systems can be viewed as a proper subsystem of the continuum ensemble. Controllability (or observability) of the continuum ensemble will guarantee the controllability (or observability) of any such finite subsystem of it.

The framework of ensemble control and estimation naturally has many applications across various disciplines in engineering and science. The individual control systems in the ensemble can be used to model, for example, spin dynamics that are controlled by a magnetic field [14], molecules that respond to external stimuli such as light [35] and heat [33], or micro-robotics that are steered by a broadcast control signal [3]. We further note that an individual control system does not necessarily have only one single physical entity, but rather it can comprise multiple interacting components (or agents). In this case, every individual control system is itself a networked control system (or a multi-agent system). For example, a mathematical model for a continuum ensemble of multi-agent formation systems has recently been proposed and investigated in [5].

Many existing ensemble control and estimation theories deal only with linear ensembles (i.e., ensembles of linear control systems). For nonlinear ensembles, the literature is relatively sparse on controllability, and much less on observability. There is also a lack of methodologies for designing nonlinear dynamics of individual control systems so that an ensemble of such systems is controllable and observable. To address the above issues, we introduce in the paper a novel class of nonholonomic ensemble systems, termed distinguished ensembles. Every such system has two key components: a set of finely structured control vector fields, termed distinguished vector fields, and a set of costructured observations functions, termed codistinguished functions. Details about the structure of a distinguished ensemble will be provided below. We will demonstrate that controllability and/or observability of a distinguished ensemble system can be easily fulfilled under some mild assumption. The first half of the paper is devoted to establishing the fact. For the second half, we will investigate the problem about existence of a distinguished ensemble. We focus on the case where the state space of every individual system is a Lie group or its homogeneous space. We leverage existing results [6] and structure theory of Lie algebras to construct explicitly distinguished vector fields and codistinguished functions.

1.1 Mathematical models for ensemble control and estimation

The model of an ensemble system considered in the paper comprises two parts, namely ensemble control and ensemble estimation. We introduce these two parts subsequently.

Model for ensemble control We consider a continuum ensemble of control systems indexed by a parameter $\sigma \in \Sigma $, where $\Sigma $ is the parameterization space. We assume in the paper that $\Sigma $ is compact, real analytic, and path-connected. We allow $\Sigma $ to have boundary. If an individual control system in the ensemble is associated with index $\sigma $, then we call it system-$\sigma $. The state space of each individual system is the same, which we denote by M. We assume that M is real analytic. Further, let $x_\sigma (t)\in M$ be the state of system-$\sigma $ at time t. Then, in general, the control model of an ensemble system can be described by the following differential equation:

$$\begin{aligned} \dot{x}_\sigma (t) := \frac{\partial x_\sigma (t)}{\partial t}= f(x_\sigma (t), \sigma , u(t)), \quad x_\sigma \in M \text{ for } \text{ all } \sigma \in \Sigma , \end{aligned}$$

(1)

where u(t) is a finite-dimensional control input common to all of the individual control systems and f is an analytic vector field. Let

$$\begin{aligned} x_\Sigma (t):= \{x_\sigma (t)\mid \sigma \in \Sigma \} \end{aligned}$$

be the collection of system states. One can treat $x_\Sigma (t)$ as a function from $\Sigma $ to M. We call $x_\Sigma (t)$ a profile. Let $\mathrm{C}^\omega (\Sigma , M)$ be the space of real analytic functions from $\Sigma $ to M. We assume that for any given t, the profile $x_\Sigma (t)$ belongs to $\mathrm{C}^\omega (\Sigma , M)$. We call $\mathrm{C}^\omega (\Sigma , M)$ the profile space.

We focus in the paper on a special class of ensemble systems, namely systems such that the vector fields f are separable in state x, the parameter $\sigma $, and the control input u. Specifically, we consider the following type of ensemble system:

$$\begin{aligned} \dot{x}_\sigma (t) = f_0(x_\sigma (t), \sigma ) + \sum ^m_{i = 1} \sum ^{r}_{s = 1} u_{i,s}(t)\rho _{s}(\sigma ) f_i (x_\sigma (t)), \quad x_\sigma \in M \text{ and } \sigma \in \Sigma , \end{aligned}$$

(2)

where $f_0$ is a drifting term, the $f_i$’s are control vector fields depending only on $x_\sigma (t)$, the $\rho _i$’s are parameterization functions defined on $\Sigma $, and the $u_{i,s}$’s are scalar control inputs. We assume in the paper that all the vector fields and parameterization functions are analytic in their variables. All the control inputs are integrable functions over any finite time interval. For convenience, we let u(t) be the collection of all the $u_{i,s}(t)$’s.

Model for ensemble estimation We assume that there are l (scalar) measurement outputs $y^j(t)$, for $j = 1, \ldots , l$, at our disposal. Each $y^j(t)$ is a certain average of an observation function $\phi ^j(x_\sigma (t))$ over the parameterization space $\Sigma $. Specifically, we first let $\Sigma $ be equipped with a strictly positive Borel measure, i.e., $\int _U \mathrm{d}\sigma > 0$ for any nonempty open subset U of $\Sigma $. Next, let each $\phi ^j$, for $j = 1,\ldots , l$, be an analytic function defined on M. Then, the measurement outputs $\{y^j(t)\}^l_{j = 1}$ are described by

$$\begin{aligned} y^j(t) =\displaystyle \int _\Sigma \phi ^j(x_\sigma (t)) \mathrm{d}\sigma , \quad j = 1,\ldots , l. \end{aligned}$$

(3)

For convenience, let y(t) be the collection of the $y^j(t)$’s.

Model for an ensemble system Combining (2) and (3), we arrive at the following mathematical model of an ensemble system:

$$\begin{aligned} \left\{ \begin{array}{ll} \dot{x}_\sigma (t) = f_0(x_\sigma (t), \sigma ) + \sum \nolimits _{i = 1}^{m}\sum \nolimits _{s = 1}^{r} u_{i,s}(t)\rho _{s}(\sigma ) f_i (x_\sigma (t)), &{} \forall \sigma \in \Sigma , \\ y^j(t) =\displaystyle \int _\Sigma \phi ^j(x_\sigma (t)) \mathrm{d}\sigma , &{} \forall j = 1,\ldots , l. \\ \end{array} \right. \end{aligned}$$

(4)

Examples of the above system will be given along the presentation.

1.2 Distinguished structure and examples

A major contribution of the paper is to introduce a novel class of nonholonomic ensemble systems (4), termed distinguished ensembles. Every such ensemble system has two key components: a set of distinguished control vector fields $\{f_i\}^m_{i = 1}$ and a set of codistinguished observation functions $\{\phi ^j\}^l_{j = 1}$. Roughly speaking, a set of vector fields $\{f_i\}^m_{i = 1}$ is said to be distinguished if the Lie bracket of any two vector fields $f_i$ and $f_j$ is, up to scaling, another vector field $f_k$, i.e., $[f_i,f_j]= \lambda f_k$ for $\lambda $ a constant, and conversely, any vector field $f_k$ in the set can be obtained in this way. Such a structure is motivated by Li and Khaneja [26] for their earlier study on ensemble control of Bloch equations. Similarly, a set of functions $\{\phi ^j\}^l_{j = 1}$ is said to be codistinguished to the vector fields $\{f_i\}^m_{i = 1}$ if the Lie derivative of any $\phi ^j$ along any $f_i$ is, up to scaling, another function $\phi ^k$, i.e., $f_i\phi ^j = \lambda \phi ^k$ for $\lambda $ a constant, and conversely, any function $\phi ^k$ in the set can be obtained in this way (see Definitions 2 and 3, Sect. 3.1 for details).

We note here that although the notion of a “distinguished set” of a Lie algebra appears to be new, such set arises naturally in different areas. Here are a few examples:

(1):

When dealing with the rigid motions of a three-dimensional object with a fixed center, we have that the infinitesimal motions of rotations around three axes of an orthonormal frame $\Theta \in {\text {SO}}(3)$ are given by

$$\begin{aligned} f_1(\Theta ) = \Theta \Omega _{23}, \qquad f_2(\Theta ) := \Theta \Omega _{31}, \qquad f_3(\Theta ) := \Theta \Omega _{12}, \end{aligned}$$

where each $\Omega _{ij}$ is a skew-symmetric matrix with 1 on the ijth entry, $-1$ on the jith entry, and 0 elsewhere. By computation, $[f_i, f_j] = f_k$ where (i, j, k) is any cyclic rotation of (1, 2, 3). Thus, the above vector fields form a distinguished set.

(2):

In quantum mechanics, the Pauli spin matrices are used to represent angular momentum operators. We recall that they are given by

$$\begin{aligned} \sigma _1 := \begin{bmatrix} 0&\quad 1 \\ 1&\quad 0 \end{bmatrix} \qquad \sigma _2 := \begin{bmatrix} 0&\quad -\mathrm {i} \\ \mathrm {i}&\quad 0 \end{bmatrix}\qquad \sigma _3 := \begin{bmatrix} 1&\quad 0\\ 0&\quad -1 \end{bmatrix}, \end{aligned}$$

where $\mathrm {i}$ is the imaginary unit. Similarly, if (i, j, k) is a cyclic rotation of (1, 2, 3), then $[\sigma _i, \sigma _j ] = 2\mathrm {i}\sigma _k$. Although the constant $2\mathrm {i}$ is not real, one can multiple all the three matrices by $\mathrm {i}$ so that the new set $\{\mathrm {i}\sigma _i\}^3_{i = 1}$ now satisfies $[\mathrm {i}\sigma _i, \mathrm {i}\sigma _j ] = -2\mathrm {i}\sigma _k$. Note that the set $\{\mathrm {i}\sigma _i\}^3_{i = 1}$ belongs to $\mathfrak {su}(2)$ i.e., the special unitary Lie algebra. However, we shall note that $\mathfrak {su}(2)$ is isomorphic to $\mathfrak {so}(3)$.

(3):

We also note that the ladder operators represented by the following matrices in the special linear Lie algebra $\mathfrak {sl}(2,\mathbb {R})$:

$$\begin{aligned} H := \begin{bmatrix} 1&\quad 0 \\ 0&\quad -1 \end{bmatrix} \qquad X := \begin{bmatrix} 0&\quad 1 \\ 0&\quad 0 \end{bmatrix} \qquad Y:= \begin{bmatrix} 0&\quad 0 \\ 1&\quad 0 \end{bmatrix} \end{aligned}$$

satisfy the desired property: $[H, X] = 2X$, $[H, Y] = -2Y$, and $[X, Y] = H$.

The examples given above demonstrate the existence of distinguished sets in Lie algebras $\mathfrak {so}(3) \approx \mathfrak {su}(2)$ and $\mathfrak {sl}(2, \mathbb {R})$. In fact, we have shown in [6] that every semi-simple real Lie algebra has a distinguished set. We review such a fact in Sect. 4.1.

1.3 Literature review

Among related works about controllability of nonlinear ensembles, we first mention [25, 26] by Li and Khaneja in which the authors establish the controllability of an ensemble of Bloch equations parameterized by a pair of scalar parameters $(\sigma _1,\sigma _2)$ over a square $\Sigma :=[a_1,b_1]\times [a_2, b_2]$ in $\mathbb {R}^2$:

$$\begin{aligned} \dot{x}(t) = (\sigma _1 \Omega _{12} + u_1(t) \sigma _2 \Omega _{13} + u_2(t)\sigma _2 \Omega _{23} )x(t). \end{aligned}$$

Ensemble control of Bloch equations has also been addressed in [2] using tools from functional analysis. We further note that the controllability of a general ensemble of control-affine systems has been recently addressed in [1], in which the authors established an ensemble version of Rachevsky–Chow theorem via a Lie algebraic method. We do not to intend to reproduce in the paper the results established there, but rather our contribution related to ensemble controllability is to demonstrate that if the set of control vector fields $\{f_i\}^m_{i = 1}$ is distinguished, then the ensemble version of Rachevsky-Chow criterion can be easily verified in analysis and fulfilled in system design. For ensemble control of linear systems, we refer the reader to [19, 24, 11, Ch. 12] and references therein. We further refer the reader to [4, 7, 8] for optimal control of probability distributions evolving along linear systems.

Observability of a continuum ensemble system has been mostly addressed within the class of linear systems. We first refer the reader to [11, Ch. 12] where the following ensemble of linear systems is investigated:

$$\begin{aligned} \dot{x}_\sigma (t) = A(\sigma ) x_\sigma (t) \in \mathbb {R}^n, \quad y(t) = \int _\Sigma C(\sigma ) x_\sigma (t) \mathrm{d}\sigma \in \mathbb {R}^l. \end{aligned}$$

The authors addressed the observability of the above ensemble system using the duality between controllability and observability of infinite-dimensional linear systems [9]. We also refer the reader to [36] for a related observability problem about estimating the probability distribution of the initial state. Specifically, the authors there considered a single time-invariant linear system: $\dot{x}(t) = A x(t) + Bu(t)$ and $y(t) = C x(t)$. An initial probability distribution $p_0$ of $x\in \mathbb {R}^n$ induces a distribution ${\bar{p}}_t$ of y(t) for a given control input u(t). The observability problem addressed there is whether one is able to estimate $p_0$ given that the entire distributions ${\bar{p}}_t$ (which are infinite-dimensional), for all $t \ge 0$, are known. A key difference between our model (4) and theirs is that we only allow a finite-dimensional measurement output y(t). We further refer the reader to [10, 12, 13, 20, 34] for the study of observability of a single nonlinear system using the so-called observability codistribution.

1.4 Outline of contribution and organization of the paper

The technical contribution of the paper is twofold: (1) We establish a structure theory for controllability and observability of a distinguished ensemble system. (2) We prove the existence of distinguished ensemble systems over semi-simple Lie groups.

Structure theory We establish in Sect. 3 a sufficient condition for controllability and observability of a distinguished (and pre-distinguished) ensemble system. In particular, we demonstrate how distinguished vector fields and codistinguished functions can simplify the analysis and lead to ensemble controllability and observability. The structure theory established in the paper also provides a solution to the problem of ensemble system design—i.e., the problem of codesigning the control vector fields $f_i$’s, the observations functions $\phi ^j$’s, and the parameterization functions $\rho _s$’s so that system (4) is controllable and/or observable. In particular, it divides the problem into two independent subproblems—one is about finding a set of distinguished vector fields $\{f_i\}^m_{i = 1}$ and a set of codistinguished function $\{\phi ^j\}^l_{j = 1}$ over the given manifold M while the other is about finding a set of parameterization functions $\{\rho _s\}^r_{s =1}$ that separates points of the parameterization space $\Sigma $.

Existence of distinguished ensembles We prove in Sect. 4 that every semi-simple Lie group G admits a set of distinguished vector fields, together with a set of codistinguished functions. The proof of the existence result is constructive: (1) For distinguished vector fields, we leverage the result established in [6] where we have shown how to construct a distinguished set on the Lie algebra level. We then identify the distinguished set with the corresponding set of left- (or right-) invariant vector fields over the group G. (2) For codistinguished functions, we show how to generate these functions using representation theory. In particular, we show in Sect. 4.2 that a selected set of matrix coefficients associated with a finite-dimensional Lie group representation could be used as a set of codistinguished functions (with respect to a set of left-invariant vector fields). Then, in Sect. 4.3, we focus on a special representation, namely the adjoint representation. We show, in this case, that there indeed exists a set of matrix coefficients as codistinguished functions. In particular, if G is a matrix Lie group, then these matrix coefficients are simply given by $\phi ^{ij}(g) = {\text {tr}}(g X_j g^{-1} X_i^\top )$ where $X_i$ and $X_j$ are selected matrices out of the Lie algebra $\mathfrak {g}$ of G. We further address, in Sect. 4.5, the existence problem for homogeneous spaces.

We provide key definitions and notations in Sect. 2 and conclusions at the end.

2 Definitions and notations

(1) Manifolds Let M be a real analytic manifold. For a point $x\in M$, let $T_xM$ be the tangent space and $T^*_x M$ be the cotangent space of M at x. Let $TM:= \cup _{x\in M} T_xM$ be the tangent bundle and $T^*M := \cup _{x\in M} T^*_xM$ be the cotangent bundle.

Let $\mathrm{C}^\omega (M)$ be the set of real analytic functions on M. Denote by $\mathbf{1}_M\in \mathrm{C}^\omega (M)$ the constant function whose value is 1 everywhere. Let $\mathfrak {X}(M)$ be the set of real analytic vector fields over M. Let $\phi \in \mathrm{C}^\omega (M)$ and $f\in \mathfrak {X}(M)$. Denote by $f\phi \in \mathrm{C}^\omega (M)$ the Lie derivative of $\phi $ along f. If we embed M into a Euclidean space, then $f\phi $ is simply given by

$$\begin{aligned} (f\phi )(x) := \lim _{\epsilon \rightarrow 0} \frac{\phi (x + \epsilon f(x)) - \phi (x)}{\epsilon }, \quad \forall x\in M. \end{aligned}$$

For any $\phi \in \mathrm{C}^\omega (M)$, we let $\mathrm{d}\phi \in T^*M$ be a one-form defined as follows: Let $\mathrm{d}\phi _x\in T^*_x M$ be the evaluation of $\mathrm{d}\phi $ at x. Then, for any $f\in \mathfrak {X}(M)$, we have that $\mathrm{d}\phi _x(f(x)) = (f\phi )(x)$. For two vector fields $f_i, f_j\in \mathfrak {X}(M)$, we let $[f_i,f_j]$ be the Lie bracket, which is defined such that $[f_i, f_j] \phi = f_i f_j \phi - f_j f_i \phi $ for all $\phi \in \mathrm{C}^\omega (M)$.

Let $\{f_i\}^m_{i = 1}$ be a subset of $\mathfrak {X}(M)$. Let $\mathrm{w} = w_1\cdots w_k$ be a word over the alphabet $\{1,\ldots , m\}$ of length k. For a function $\phi \in \mathrm{C}^\omega (M)$, we define $f_{\mathrm{w}}\phi := f_{w_1}\cdots f_{w_k}\phi $. If $\mathrm{w} = \varnothing $, i.e., an empty word (of zero length), then we set $f_{\mathrm{w}}\phi := \phi $.

Let $\eta : M\rightarrow N$ be a diffeomorphism. Denote by $\eta _*: TM\rightarrow TN$ the derivative of $\eta $. For a vector field $f\in \mathfrak {X}(M)$, let $\eta _*f\in \mathfrak {X}(N)$ be the pushforward defined as $(\eta _*f)(y) := \eta _*(f(\eta ^{-1}y))$ for all $y\in N$. For a function $\phi \in \mathrm{C}^\omega (N)$, let $\eta ^*\phi \in \mathrm{C}^\omega (M)$ be the pullback defined as $(\eta ^*\phi )(x) := \phi (\eta (x))$ for all $x\in M$.

(2) Algebra of functions Let $\Sigma $ be an analytic, compact manifold and $\{\rho _s\}^r_{s = 1}$ be a set of real-valued functions on $\Sigma $. For any $k \ge 0$, let $\rho ^k_s(\sigma ) := \rho _s(\sigma )^k$. Note, in particular, that $\rho ^0_s =\mathbf{1}_\Sigma $. If $\rho _s$ is everywhere nonzero, then $\rho ^k_s$ is defined for all $k\in \mathbb {Z}$. We call $\prod ^r_{s = 1}\rho ^{k_s}_s$, for $k_s \ge 0$, a monomial. Its degree is defined by $k:= \sum ^r_{s = 1}k_s$. Let ${\mathcal P}$ be the collection of all monomials. We decompose ${\mathcal P} = \sqcup _{k \ge 0} {\mathcal P}(k)$, where ${\mathcal P}(k)$ is comprised of monomials of degree k. Denote by ${\mathcal S}$ the subalgebra generated by the set of functions $\{\rho _s\}^r_{s = 1}$. It is defined such that if $\mathrm{p}\in {\mathcal S}$, then $\mathrm{p}$ can be expressed as a linear combination of a finite number of monomials with real coefficients.

(3) Lie groups and Lie algebras Let G be a Lie group with e the identity element. Let $\mathfrak {g}$ be the associated Lie algebra, and $[\cdot , \cdot ]$ be the Lie bracket. We identify each element $X\in \mathfrak {g}$ with a left-invariant vector field $L_X$ over G, i.e., $L_X(g) = gX$ for any $g\in G$. Thus, $L_{[X, Y]} = [L_X, L_Y]$. Note that to each $X\in \mathfrak {g}$, there also corresponds a right-invariant vector field $R_X$. For any $X, Y\in \mathfrak {g}$, we have $R_{[X, Y]} = - [R_X, R_Y]$.

A subalgebra $\mathfrak {h}$ of $\mathfrak {g}$ is a vector subspace closed under Lie bracket, i.e., $[\mathfrak {h}, \mathfrak {h}] \subseteq \mathfrak {h}$. An ideal $\mathfrak {i}$ of $\mathfrak {g}$ is a subalgebra such that $[\mathfrak {i}, \mathfrak {g}] \subseteq \mathfrak {i}$. We say that $\mathfrak {g}$ is simple if it is not abelian and, moreover, the only ideals of $\mathfrak {g}$ are 0 and itself. Simple real Lie algebras have been completely classified (up to isomorphism) by Élie Cartan. A complete list of (non-complex) simple real Lie algebras can be found in [22, Thm. 6.105]. A semi-simple Lie algebra is a direct sum of simple Lie algebras. A Cartan subalgebra $\mathfrak {h}$ of $\mathfrak {g}$ is maximal among the abelian subalgebras $\mathfrak {h}'$ of $\mathfrak {g}$ such that the adjoint representation ${\text {ad}}(X)(\cdot ):= [X,\cdot ]$ is simultaneously diagonalizable (over $\mathbb {C}$) for all $X\in \mathfrak {h}'$.

(4) Representation Let V be a finite-dimensional vector space over $\mathbb {R}$. Let ${\text {Aut}}(V)$ and ${\text {End}}(V)$ be the sets of automorphisms and endomorphisms of V, respectively. A representation $\pi $ of G on V is a group homomorphism $\pi :G \rightarrow {\text {Aut}}(V)$, i.e., $\pi (e)$ is the identity map and $\pi (gh) = \pi (g)\pi (h)$.

Let $\langle \cdot , \cdot \rangle $ be an inner product on V. We say that the representation $\pi $ is $\mathrm{C}^k$ (i.e., kth continuously differentiable) if the map $\pi : (g, v) \in G\times V \mapsto \pi (g)v\in V$ is $\mathrm{C}^k$. A matrix coefficient is any $\mathrm{C}^k$-function on G defined as $ \langle v_i, \pi (g)v_j \rangle $ where $v_i, v_j$ belong to V. In particular, if the $v_i$’s form an orthonormal basis of V, then $ \langle v_i, \pi (g)v_j \rangle $ is exactly the ijth entry of the matrix $\pi (g)$ with respect to the given basis.

A group representation $\pi $ induces a Lie algebra homomorphism $\pi _*: \mathfrak {g}\rightarrow {\text {End}}(V)$, where $\pi _*$ is the derivative of $\pi $ at the identity $e\in G$. It satisfies the following condition:

$$\begin{aligned} \pi _*([X, Y]) = \pi _*(X)\pi _*(Y) - \pi _*(Y)\pi _*(X),\quad \forall X, Y\in \mathfrak {g}. \end{aligned}$$

We call $\pi _*$ a representation of $\mathfrak {g}$ on V, or simply a Lie algebra representation.

Let ${\text {Ad}}: G\rightarrow {\text {Aut}}(\mathfrak {g})$ be the adjoint representation, i.e., for each $g\in G$, ${\text {Ad}}(g): T_e G\rightarrow T_e G$ is the derivative of the conjugation $h\in G \mapsto ghg^{-1}\in G$ at the identity e. Denote by ${\text {ad}}:\mathfrak {g}\rightarrow {\text {End}}(\mathfrak {g})$ the induced Lie algebra representation of ${\text {Ad}}$, which is given by ${\text {ad}}(X)(\cdot ) = [X,\cdot ]$ for all $X\in \mathfrak {g}$.

(5) Lie products Let $A:= \{X_1,\ldots , X_k\}$ be a set of free generators. Let ${\mathcal L}_A$ be the collection of formal Lie products of the $X_i$’s in A. For a given element $\xi \in {\mathcal L}_A$, we let ${\text {dep}}(\xi )$ be the depth of $\xi $ defined as the number Lie brackets in $\xi $. For example, the depth of $[X_{i_1}, [X_{i_2},X_{i_3}]]$ is 2. We further decompose ${\mathcal L}_A =\sqcup _{k \ge 0} {\mathcal L}_A(k)$ where ${\mathcal L}_A(k)$ is comprised of Lie products of depth k.

(6) Miscellaneous Let $\{e_i\}^n_{i = 1}$ be the standard basis of $\mathbb {R}^n$. We denote by $\det (e_{i_1},\ldots , e_{i_n})$ the determinant of a matrix whose jth column is $e_{i_j}$ for $i_j \in \{1,\ldots , n\}$.

Let V be a vector space over $\mathbb {R}$. We denote by $V^*$ the dual space, i.e., it is the collection of all linear functions from V to $\mathbb {R}$.

The following definition will be frequently used throughout the paper:

Definition 1

Two subsets $V'$ and $V''$ of a real vector space V are said to be projectively identical if for any $v'\in V'$, there exists a $v''\in V''$ and a constant $c\in \mathbb {R}$ such that $v' = cv''$, and vice versa. We write $V' \equiv V''$ to indicate such equivalence relation.

Let S be an arbitrary set with an operation “$*$” defined so that $s_1*s_2$ belongs to S for all $s_1, s_2\in S$. For any two subsets $S'$ and $S''$ of S, we let $S'*S''$ be the subset of S comprised of the elements $s'*s''$ for all $s'\in S'$ and $s''\in S''$. Here are two examples in which such a notation will be used: (i) If S is a vector space and “$*$” is the addition “$+$”, then we write $S' + S''$. (ii) If S is the commutative algebra of analytic functions $\mathrm{C}^\omega (\Sigma )$ and “$*$” is the pointwise multiplication, then we simply write $S' S''$.

However, we note that the above notation does not apply to $[\mathfrak {g}_1, \mathfrak {g}_2]$ for $\mathfrak {g}_1$ and $\mathfrak {g}_2$ two subsets of a Lie algebra $\mathfrak {g}$. By convention, $[\mathfrak {g}_1,\mathfrak {g}_2]$ is the linear span of all $[X_1, X_2]$ with $X_1\in \mathfrak {g}_1$ and $X_2\in \mathfrak {g}_2$. We adopt such a convention in the paper as well.

For a general control system $\dot{x}(t) = f(x(t), u(t))$, we denote by u[0, T] the control input u(t) over the time interval [0, T] for $T > 0$. Correspondingly, we let x[0, T] be the trajectory of the control system generated by u[0, T].

3 Distinguished ensemble systems

3.1 Distinguished vector fields and codistinguished functions

We introduce in the section the class of (pre-)distinguished ensemble systems and establish controllability and observability of any such ensemble system. We start by introducing two key components of the system, namely distinguished vector fields and codistinguished functions. We first have the following definition:

Definition 2

(Distinguished vector fields) A set of vector fields $\{f_i\}^m_{i = 1}$ over an analytic manifold M is distinguished if the following hold:

(1):

For any $x\in M$, the set $\{f_i(x)\}^m_{i = 1}$ spans $T_x M$.

(2):

For any two $f_i$ and $f_j$, there exist an $f_k$ and a real number $\lambda $ such that

$$\begin{aligned}{}[f_i, f_j] = \lambda f_k; \end{aligned}$$

(5)

conversely, for any $f_k$, there exist $f_i$ and $f_j$ and a nonzero $\lambda $ such that (5) holds.

Recall that $\mathfrak {X}(M)$ is the Lie algebra of analytic vector fields over M, which is infinite-dimensional. However, if $F:= \{f_i\}^m_{i = 1}$ is distinguished, then by item 2 of Definition 2, the $\mathbb {R}$-span of the $f_i$’s, which we denote by $\mathbb {L}_F$, is a finite-dimensional subalgebra of $\mathfrak {X}(M)$. We note here that $\mathbb {L}_F$ is perfect, i.e., $[\mathbb {L}_F, \mathbb {L}_F] = \mathbb {L}_F$.

Let N be any manifold diffeomorphic to M, and $\eta : M\rightarrow N$ be the diffeomorphism. Recall that for a vector field f over M, we denote by $\eta _* f$ the pushforward of f as a vector field over N. We have the following fact:

Lemma 1

If $\{f_i\}^m_{i = 1}$ is distinguished over M, then $\{\eta _* f_i\}^m_{i = 1}$ is distinguished over N.

Proof

If $[f_i,f_j] = \lambda f_k$, then $[\eta _* f_i, \eta _* f_j] = \eta _* [f_i,f_j] = \lambda \eta _* f_k$. $\square $

We next introduce the definition of codistinguished functions:

Definition 3

(Codistinguished functions) A set of functions $\{\phi ^j\}^l_{j = 1}$ on M is codistinguished to a set of vector fields $\{f_i\}^m_{i = 1}$ if the following hold:

(1):

For any $x\in M$, the set of (exact) one-forms $\{\mathrm{d}\phi ^j_x\}$ spans $T^*_x M$.

(2):

For any $f_i$ and any $\phi ^j$, there exist a $\phi ^k$ and a real number $\lambda $ such that

$$\begin{aligned} f_i \phi ^j = \lambda \phi ^k; \end{aligned}$$

(6)

conversely, for any $\phi ^k$, there exist $f_i$, $\phi ^j$, and a nonzero $\lambda $ such that (6) holds.

(3):

For $x, x'\in M$, if $\phi ^j(x) = \phi ^j(x')$ for all $j = 1,\ldots , l$, then $x = x'$.

If $\{\phi ^j\}^l_{j = 1}$ satisfies only (1) and (2), then it is weakly codistinguished to $\{f_i\}^m_{i = 1}$.

Let $\tilde{\eta }: N\rightarrow M$ be a diffeomorphism. Recall that for a function $\phi $ on M, we denote by $\tilde{\eta }^* \phi $ the pullback of $\phi $ as a function on N. We have the following fact:

Lemma 2

If $\{\phi ^j\}^l_{j = 1}$ on M is codistinguished to $\{f_i\}^m_{i = 1}$, then $\{\tilde{\eta }^*\phi ^j\}^l_{j = 1}$ on N is codistinguished to $\{\tilde{\eta }_*^{-1}f_i\}^m_{i = 1}$.

Proof

If $f_i \phi ^j = \lambda \phi ^k$, then $ (\tilde{\eta }_*^{-1} f_i)(\tilde{\eta }^*\phi ^j) = \tilde{\eta }^* (f_i \phi ^j) = \lambda \tilde{\eta }^*\phi ^k. $ $\square $

We say that a set of vector fields $F:=\{f_i\}^m_{i = 1}$ and a set of functions $\Phi :=\{\phi ^j\}^l_{j = 1}$ are (weakly) jointly distinguished if F is distinguished and $\Phi $ is (weakly) codistinguished to F. Note that Lemmas 1 and 2 imply that the property of having a set of (weakly) jointly distinguished pair $(F, \Phi )$ is topologically invariant. Let F and $\Phi $ be (weakly) disjoined. Recall that $\mathbb {L}_F$ is a finite-dimensional Lie algebra spanned by F (since F is distinguished). Let $\mathbb {L}_\Phi $ be the $\mathbb {R}$-span of $\Phi $. Then, by the second item of Definition 3, the following map:

$$\begin{aligned} (f, \phi )\in \mathbb {L}_F\times \mathbb {L}_\Phi \mapsto f\phi \in \mathbb {L}_\Phi \end{aligned}$$

is a finite-dimensional Lie algebra representation of $\mathbb {L}_F$ on $\mathbb {L}_\Phi $.

For the remainder of the subsection, we provide an example about jointly distinguished vector fields and functions on ${\text {SO}}(3)$. These vector fields and functions will be further generalized in Sect. 4 so that they exist on any semi-simple Lie group.

Example 1

Let ${\text {SO}}(3)$ be the matrix Lie group of $3\times 3$ special orthogonal matrices, and $\mathfrak {so}(3)$ be the associated Lie algebra. We define a basis $\{X_i\}^3_{i = 1}$ of $\mathfrak {so}(3)$ as follows:

$$\begin{aligned} X_i:= e_je^\top _k - e_k e_j^\top \,\, \text{ where } \det (e_i,e_j,e_k) = 1, \quad \forall i = 1,2,3. \end{aligned}$$

Let $\{L_{X_i}\}^3_{i = 1}$ be the corresponding left-invariant vector fields. By computation,

$$\begin{aligned} {[}L_{X_i}, L_{X_j}] = -\det (e_i,e_j,e_k) L_{X_k}, \quad \forall i\ne j. \end{aligned}$$

(7)

It follows that $\{L_{X_i}\}^3_{i = 1}$ is distinguished.

Denote by ${\text {tr}}(\cdot )$ the trace of a square matrix. We next define functions $\{\phi ^{ij}\}^3_{i,j= 1}$ on ${\text {SO}}(3)$ as follows:

$$\begin{aligned} \phi ^{ij}(g) := {\text {tr}}(g X_j g^\top X_i^\top ), \quad 1\le i, j \le 3. \end{aligned}$$

We show below that $\{\phi ^{ij}\}^3_{i, j = 1}$ is codistinguished to $\{L_{X_i}\}^3_{i = 1}$. First, for any left-invariant vector field $L_X$ with $X\in \mathfrak {so}(3)$, we obtain by computation that

$$\begin{aligned} \mathrm{d}\phi ^{ij}_g(L_X(g)) = (L_X \phi ^{ij})(g) = {\text {tr}}(g[X,X_j]g^\top X_i^\top ). \end{aligned}$$

(8)

We now prove that the three items of Definition 3 are satisfied for $\{\phi ^{ij}\}^3_{i, j = 1}$ and $\{L_{X_i}\}^3_{i = 1}$:

(1):

We fix an arbitrary group element $g\in {\text {SO}}(3)$ and show that $\{\mathrm{d}\phi _g^{ij}\}^3_{i, j = 1}$ spans $T^*_g{\text {SO}}(3)$. For convenience, let ${\hat{X}}_{ij}:=[X_j, g^\top X_i^\top g]$. Then, by (8), we obtain that

$$\begin{aligned} \mathrm{d}\phi ^{ij}_g(L_X(g)) = {\text {tr}}(X [X_j, g^\top X_i^\top g]) = {\text {tr}}(X {\hat{X}}_{ij}). \end{aligned}$$

Note that ${\text {tr}}(\cdot , \cdot )$ is negative definite on $\mathfrak {so}(3)$. Thus, $\{\mathrm{d}\phi _g^{ij}\}^3_{i, j = 1}$ spans $T^*_g{\text {SO}}(3)$ if and only if $\{{\hat{X}}_{ij}\}^3_{i,j = 1}$ spans $\mathfrak {so}(3)$. It now suffices to show that $\{{\hat{X}}_{ij}\}^3_{i,j = 1}$ spans $\mathfrak {so}(3)$. But, this holds because both $\{X_j\}^3_{j = 1}$ and $\{g^\top X_i^\top g\}^3_{i = 1}$ span $\mathfrak {so}(3)$. Moreover, $\mathfrak {so}(3)$ is simple so that $[\mathfrak {so}(3), \mathfrak {so}(3)] =\mathfrak {so}(3)$.

(2):

For the second item, we combine (7) and (8) to obtain the following:

$$\begin{aligned} L_{X_i}\phi ^{i'j}= \left\{ \begin{array}{ll} -\det (e_i,e_j,e_k)\phi ^{i'k}, &{}\quad \text{ if } i \ne j, \\ 0, &{}\quad \text{ otherwise. } \end{array} \right. \end{aligned}$$

(3):

Finally, let g and $g'$ be such that $\phi ^{ij}(g) = \phi ^{ij}(g')$ for all $1 \le i,j \le 3$:

$$\begin{aligned} {\text {tr}}(g X_j g^\top X_i^\top ) = {\text {tr}}(g' X_j g'^\top X_i^\top ), \quad \forall i = 1,2,3. \end{aligned}$$

Because $\{X_i\}^3_{i =1}$ spans $\mathfrak {so}(3)$ and ${\text {tr}}(\cdot , \cdot )$ is negative definite on $\mathfrak {so}(3)$, we have that $ g X_j g^\top = g' X_j g'^\top $. Since this holds for all $X_j$, it follows that $g^{\top }g'$ belongs to the center of ${\text {SO}}(3)$. But the center is trivial. We thus conclude that $g= g'$. $\square $

3.2 Controllability and observability of distinguished ensemble system

We establish in the subsection a sufficient condition for controllability and observability of ensemble system (4). For convenience, we reproduce below the mathematical model of the ensemble system introduced in Sect. 1:

$$\begin{aligned} \left\{ \begin{array}{ll} \dot{x}_\sigma (t) = f_0(x_\sigma (t), \sigma ) + \sum \nolimits _{i = 1}^{m} \sum \nolimits _{s = 1}^{r} u_{i, s}(t)\rho _{s}(\sigma ) f_i (x_\sigma (t)), &{}\ \forall \sigma \in \Sigma ,\\ y^j(t) = \displaystyle \int _\Sigma \phi ^j(x_\sigma (t)) \mathrm{d}\sigma , &{} \ \forall j = 1,\ldots , l.\\ \end{array} \right. \end{aligned}$$

(9)

The common state space M is an analytic manifold, equipped with a Riemannian metric. We denote by ${\text {d}}_M(x_1,x_2)$ the distance between two points $x_1$ and $x_2$ in M. The parameterization space $\Sigma $ is analytic, compact, and path-connected. It is equipped with a strictly positive measure. All vector fields and parameterization functions are analytic. For any $T > 0$, the control inputs $u_{i,s}:[0,T]\rightarrow \mathbb {R}$ are integrable functions. We denote by u(t) (resp. y(t)) the collection of $u_{i,s}(t)$ (resp. $y^j(t)$).

We recall that $x_\Sigma (t)$ is the profile of system (9) at time t, which can be viewed as an analytic function $\Sigma $ to M. We also recall that $\mathrm{C}^\omega (\Sigma , M)$ is the profile space. Now, let $x_\Sigma [0,T]$ be the collection of trajectories of individual systems:

$$\begin{aligned} x_\Sigma [0,T]:=\{x_\sigma [0,T] \mid \sigma \in \Sigma \}. \end{aligned}$$

We call $x_\Sigma [0,T]$ a trajectory of profiles. We assume in the paper that $x_\Sigma [0,T]$ is continuous in time t. We now have the following definition for ensemble controllability.

Definition 4

(Ensemble controllability) System (9) is approximately ensemble path-controllable if for any initial profile $x_\Sigma (0)$, any target trajectory of profiles ${\hat{x}}_\Sigma [0,T]$ of class $\mathrm{C}^1$ with ${\hat{x}}_\Sigma (0) = x_\Sigma (0)$, and any error tolerance $\epsilon > 0$, there is a control input u(t) such that the trajectory $x_\Sigma [0,T]$ generated by u(t) satisfies

$$\begin{aligned} \mathrm{d}_M(x_\sigma (t), {\hat{x}}_\sigma (t)) < \epsilon , \quad \forall (t, \sigma )\in [0,T]\times \Sigma . \end{aligned}$$

If, further, the control input u(t) can always be of class $\mathrm{C}^k$, then system (9) is approximately ensemble path-controllable under $\mathrm{C}^k$-inputs.

Remark 1

The continuity of ${\hat{x}}_\Sigma [0,T]$ implies that any two profiles ${\hat{x}}_\Sigma (t_1)$ and ${\hat{x}}_\Sigma (t_2)$, for $t_1, t_2\in [0,T]$, are homotopic. Thus, the above definition concerns about capability of approximating a target trajectory of profiles within a homotopy class. In general, there may exist multiple homotopy classes. For example, if $\Sigma = S^n$, then all the homotopy classes of continuous functions from $S^n$ to M form the so-called nth homotopy group [17]. If, further, $M = S^n$, then the group is known to be $\mathbb {Z}$. The above arguments imply that given an initial profile $x_\Sigma (0)$ and a target profile ${\hat{x}}_\Sigma (T)$, there may not exist a continuous trajectory of profiles that connects $x_\Sigma (0)$ and ${\hat{x}}_\Sigma (T)$. $\square $

We next introduce the definition for ensemble observability. To proceed, we first have the following one about output equivalence, which straightforwardly generalizes the notion for a single nonlinear control system (see, for example, [20]):

Definition 5

(Output equivalence) Two initial profiles $x_\Sigma (0)$ and ${\bar{x}}_\Sigma (0)$ of system (9) are output equivalent, which we denote by $x_\Sigma (0)\sim {\bar{x}}_\Sigma (0)$, if for any $T> 0$ and any integrable function $u:[0,T]\rightarrow \mathbb {R}^m$ as a control input, the following holds:

$$\begin{aligned} \int _\Sigma \phi ^{j}(x_\sigma (t)) \mathrm{d}\sigma = \int _\Sigma \phi ^{j}({\bar{x}}_\sigma (t)) \mathrm{d}\sigma , \end{aligned}$$

for all $t\in [0,T]$ and for all $j = 1,\ldots , l$.

For a given $x_\Sigma (0)$, we let $O(x_\Sigma (0))$ be the collection of all initial profiles in $\mathrm{C}^{\omega }(\Sigma , M)$ that are output equivalent to $x_\Sigma (0)$, i.e.,

$$\begin{aligned} O(x_\Sigma (0)):= \left\{ {\bar{x}}_\Sigma (0) \mid {\bar{x}}_\Sigma (0)\sim x_\Sigma (0) \right\} . \end{aligned}$$

(10)

The set $O(x_\Sigma (0))$ can be viewed as a “measure of ambiguity” for the ensemble estimation problem. With the above definition of output equivalence, we now introduce the definition of ensemble observability.

Definition 6

(Ensemble observability) System (9) is weakly ensemble observable if for any profile $x_\Sigma (0)$, there is an $\epsilon > 0$ such that if ${\bar{x}}_\Sigma (0)\sim x_\Sigma (0)$ and ${\bar{x}}_\Sigma (0)\ne x_\Sigma (0)$, then $\mathrm{d}_M(x_\sigma (0),{\bar{x}}_\sigma (0)) \ge \epsilon $ for all $\sigma \in \Sigma $. Further, system (9) is ensemble observable if for any profile $x_\Sigma (0)$, the set $O(x_\Sigma (0)) = \{x_\Sigma (0)\}$ is a singleton.

We establish below a sufficient condition for ensemble controllability and observability of system (9). To state the condition, we need a few more preliminaries.

First, we say that the set of parameterization functions $\{\rho _s\}^r_{s = 1}$ defined on $\Sigma $ is a separating set if for any two distinct points $\sigma , \sigma ' \in \Sigma $, there exists a function $\rho _s$, for some $s \in \{1,\ldots , r\}$, such that $\rho _s(\sigma ) \ne \rho _s(\sigma ')$. Note that by Stone–Weierstrass theorem [30, Chp. 7], if $\{\rho _s\}^r_{s = 1}$ separates point and contains an everywhere nonzero function, then the subalgebra generated by $\{\rho _s\}^r_{s = 1}$ is dense in the space $\mathrm{C}^0(\Sigma )$ of continuous functions on $\Sigma $.

Next, for convenience, we let $\phi := (\phi ^1,\ldots , \phi ^l)$ be a vector-valued function on M. For a given $x\in M$, we let $[x]_\phi $ be the pre-image of $\phi (x)$, i.e., $[x]_\phi $ is the collection of all points $x'$ in M such that $\phi (x') = \phi (x)$. Note that if the set of one-forms $\{\mathrm{d}\phi ^j_x\}^l_{j = 1}$ spans $T^*_xM$ for all $x\in M$, then $[x]_\phi $ is a discrete set. Let $\chi _\phi $ be defined as follows:

$$\begin{aligned} \chi _\phi := \sup _{x\in M} \left| [x]_\phi \right| . \end{aligned}$$

If $\chi _\phi $ is unbounded, then we set $\chi _\phi :=\infty $. We have the following fact:

Lemma 3

If M is compact and the one-forms $\{\mathrm{d}\phi ^j_x\}^l_{j = 1}$ span $T^*_xM$ for all $x\in M$, then $\chi _\phi $ is a finite number.

Proof

First, note that for any $x\in M$, $|[x]_\phi |$ is a finite number because otherwise $[x]_\phi $ contains an accumulation point $x_*$ and the one-forms $\{\mathrm{d}\phi ^j_{x_*}\}^l_{j = 1}$ cannot span $T^*_{x_*}M$. In fact, since the one-forms $\{\mathrm{d}\phi ^j_x\}^l_{j = 1}$ span $T^*_xM$ for all $x\in M$, there is an open ball $B_{\epsilon (x)}(x)$ centered at x with radius $\epsilon (x)$ such that $|[x']_\phi | = |[x]_\phi |$ for all $x'\in B_{\epsilon (x)}(x)$. The collection of open balls $\{B_{\epsilon (x)}(x)\}_{x\in M}$ is an open cover of M. Since M is compact, there is a finite subcover $\{B_{\epsilon (x_i)}(x_i)\}^N_{i = 1}$. It then follows that $\chi _\phi := \max ^N_{i = 1} |[x_i]_\phi |$. $\square $

We are now in a position to state the first main result of the paper. The result establishes connections between the “distinguished” structure introduced in the previous subsection and ensemble controllability/observability of system (9):

Theorem 1

Consider ensemble system (9). Suppose that $\{\rho _s\}^r_{s = 1}$ is a separating set and contains an everywhere nonzero function; then, the following hold:

(1)
If the set of control vector fields $\{f_i\}^m_{i = 1}$ is distinguished, then system (9) is approximately ensemble path-controllable under $\mathrm{C}^1$-inputs.
(2)
If the set of observation functions $\{\phi ^j\}^l_{j = 1}$ is (weakly) codistinguished to $\{f_i\}^m_{i = 1}$, then system (9) is (weakly) ensemble observable. If, further, M is compact, then for any $x_\Sigma (0)$, the set $O(x_\Sigma (0))$ defined in (10) is finite and $|O(x_\Sigma (0))|\le \chi _\phi $.

Following the above theorem, we introduce the following definition:

Definition 7

An ensemble system (9) is distinguished if (1) the set of parameterization functions $\{\rho _s\}^r_{s = 1}$ separates points and contains an everywhere nonzero function, and (2) the set of control vector fields $\{f_i\}^m_{i = 1}$ and the set of observation functions $\{\phi ^j\}^l_{j = 1}$ are (weakly) jointly distinguished.

By Theorem 1, a distinguished ensemble system is approximately ensemble path-controllable and (weakly) ensemble observable. We provide below an example of a distinguished ensemble system:

Example 2

Recall that in Example 1, we have introduced jointly distinguished left-invariant vector fields $\{L_{X_i}\}^3_{i = 1}$ and functions $\{{\text {tr}}(g X_j g^\top X^\top _i)\}^3_{i,j = 1}$ on ${\text {SO}}(3)$. Now, consider a continuum ensemble of control systems defined on ${\text {SO}}(3)$, parameterized by a scalar parameter $\sigma $ over a closed interval [a, b] with $0< a < b$. Let $\rho (\sigma ) := \sigma $ be the parameterization function. The singleton $\{\rho \}$ is a separating set and $\rho $ is everywhere nonzero. Thus, the following ensemble system is distinguished:

$$\begin{aligned} \left\{ \begin{array}{ll} \dot{g}_\sigma (t) = f_0(g_\sigma (t), \sigma ) + \sum \nolimits _{i = 1}^{3} u_i(t)\sigma L_{X_i}(g_\sigma (t)), &{} \sigma \in [a, b], \\ y^{ij}(t) = \displaystyle \int _\Sigma {\text {tr}}(g_\sigma (t) X_j g_\sigma ^\top (t) X^\top _i) \mathrm{d}\sigma , &{} 1\le i, j \le 3.\\ \end{array} \right. \end{aligned}$$

Thus, it is approximately ensemble path-controllable and ensemble observable. $\square $

We have the following remark on the set of parameterization functions:

Remark 2

For any analytic manifold $\Sigma $, there exists a set of separating set. By the Nash embedding theorem [15, 29], the manifold $\Sigma $ can be isometrically embedded into a Euclidean space $\mathbb {R}^N$. We write $\sigma = (\sigma _1,\ldots ,\sigma _N)$ as the coordinate of a point $\sigma \in \Sigma $. Now, let $\rho _s(\sigma ) := \sigma _s$, for $s = 1,\ldots , N$, be the standard coordinate functions (more precisely, the restrictions of the coordinate functions to $\Sigma $). Further, let $\rho _{N + 1} := \mathbf{1}_\Sigma $ be the unit function. Then, $\{\rho _s\}^{N + 1}_{s = 1}$ satisfies the assumption of Theorem 1. $\square $

We establish Theorem 1. The proof will be divided into two parts: We deal with ensemble controllability and ensemble observability separately. The proofs will be given in Sects. 3.3 and 3.4, respectively.

3.3 Proof of approximate ensemble path controllability

We establish here the first item of Theorem 1. The proof relies on the use of the technique of Lie extension, the structure of distinguished vector fields, and the Stone-Weierstrass theorem. We provide details below.

3.3.1 On the use of Lie extension and distinguished vector fields

Recall that for an arbitrary single control-affine system:

$$\begin{aligned} \dot{x}(t) = f_0(x(t)) + \sum ^m_{i = 1}u_i(t)f_i(x(t)), \end{aligned}$$

(11)

the first-order Lie extension of the system is a new control-affine system given by

$$\begin{aligned} \dot{x}(t) = f_0(x(t)) + \sum ^m_{i = 1}u_i(t) f_i(x(t)) + \sum ^m_{i, j = 1}u_{ij}(t)[f_i,f_j](x(t)). \end{aligned}$$

By repeatedly applying Lie extensions, we obtain a family of control-affine systems with an increasing number of control vector fields. All of these control vector fields can be expressed as Lie products involving the $f_i$’s in (11). We make the statement precise below. First, for the given set of vector fields $F:= \{f_i\}^m_{i = 1}$, we use ${\mathcal L}_F$ to denote the collection of Lie products generated by F in which the $f_i$’s are treated as if they were “free” generators. For ease of notation, we will simply write ${\mathcal L}$ by omitting the subindex F. Decompose ${\mathcal L}:= \sqcup _{k \ge 0}{\mathcal L}(k)$ where each ${\mathcal L}(k)$ is comprised of Lie products of depth k. Then, the kth-order Lie extension of (11) is a control-affine system given by

$$\begin{aligned} \dot{x}(t) = f_0(x(t)) + \sum ^k_{l = 0}\sum _{\xi \in {\mathcal L}(l)}u_\xi (t)\xi (x(t)). \end{aligned}$$

(12)

By increasing the order k, we obtain an infinite family of Lie extended systems. Lie extension has been used in [23, 28, 32] for nonholonomic motion planning.

It is known that original control-affine system (11) is approximately path-controllable if and only if any of its Lie extended systems is. Specifically, we let $u^*(t)$ be the collection of control inputs $u_\xi (t)$ of Lie extended system (12). The following fact is established in [27, 32] by Sussmann and Liu:

Lemma 4

Given any order k of Lie extension and any control input $u^*[0,T]$ of class $\mathrm{C}^1$ for system (12), there exist a sequence of control inputs $\{u^{(j)}[0,T]\}^\infty _{j = 1}$ of class $\mathrm{C}^1$ for original system (11) such that the trajectory generated by $u^{(j)}$ converges uniformly to the trajectory of system (12) generated by $u^*$ over [0, T].

Remark 3

We note here that the above result is “formal” in a sense that the control sequence $\{u^{(j)}\}^\infty _{j = 1}$ depends only on $u^*$ but not on the vector fields $f_i$ [27, 32]—if one replaces $f_i$ with any other sufficiently smooth vector fields $g_i$, then the same control sequence $\{u^{(j)}[0,T]\}^\infty _{j = 1}$ can still be used to obtain the convergence result. $\square $

We now apply the technique of Lie extension to ensemble system (9). For convenience, we reproduce below the control part of the system:

$$\begin{aligned} \dot{x}_\sigma (t) = f_0(x_\sigma (t), \sigma ) + \sum ^m_{i = 1} \sum ^{r}_{s = 1} u_{i,s}(t)\rho _{s}(\sigma ) f_i (x_\sigma (t)), \quad \forall \sigma \in \Sigma . \end{aligned}$$

(13)

In this case, we have that for any individual system-$\sigma $, the control vector fields are $\rho _s(\sigma ) f_i(x_\sigma )$, for $1\le s \le r$ and $1\le i \le m$. Note that the Lie bracket of any two of these control vector fields is given by $[\rho _s(\sigma ) f_i, \rho _{s'}(\sigma ) f_j] =\rho _s(\sigma )\rho _{s'}(\sigma ) [f_i, f_j]$. Thus, the first-order Lie extension of (13) is given by

$$\begin{aligned} \dot{x}_\sigma (t)= & {} f_0(x_\sigma (t), \sigma ) + \sum ^m_{i = 1} \sum ^{r}_{s = 1} u_{i,s}(t)\rho _{s}(\sigma ) f_i (x_\sigma (t)) \\&+\sum ^m_{i, j =1}\sum ^r_{s,s' = 1} u_{ij, ss'}(t) \rho _s(\sigma )\rho _{s'}(\sigma )[f_{i}, f_{j}](x_\sigma (t)), \quad \forall \sigma \in \Sigma . \end{aligned}$$

The last term of the above expression can be simplified as follows:

$$\begin{aligned} \sum _{\xi \in {\mathcal L}(1)} \sum _{\mathrm{p}\in {\mathcal P}(2)} u_{\xi , \mathrm{p}}(t) \mathrm{p}(\sigma ) \xi (x_\sigma (t)), \end{aligned}$$

where ${\mathcal P}(2)$ is the collection of monomials $\rho _s\rho _{s'}$ of degree 2. In general, we obtain the following kth-order Lie extension of (13):

$$\begin{aligned} \dot{x}_\sigma (t) = f_0(x_\sigma (t), \sigma ) + \sum ^{k}_{l = 0}\sum _{\xi \in {\mathcal L}(l)} \sum _{\mathrm{p}\in {\mathcal P}(l + 1)} u_{\xi , \mathrm{p}}(t) \mathrm{p}(\sigma ) \xi (x_\sigma (t)), \quad \forall \sigma \in \Sigma . \end{aligned}$$

(14)

Recall that two arbitrary sets of vector fields $\{f_i\}^m_{i = 1}$ and $\{f'_{i'}\}^{m'}_{i'= 1}$ over M are said to be projectively identical, which we denote by $\{f_i\}^m_{i = 1} \equiv \{f'_{i'}\}^{m'}_{i' = 1}$, if for any $f_i$, there exist an $f'_{i'}$ and a real number $\lambda $ such that $f_i = \lambda f'_{i'}$, and vice versa. We will use such an equivalence relation in the following way: In original ensemble control system (13), the set of control vector fields $\{f_i\}^m_{i = 1}$ is, by assumption, distinguished. Thus, by the second item of Definition 2, if we evaluate the Lie products in each ${\mathcal L}(k)$, then

$$\begin{aligned} {\mathcal L}(k) \equiv \left\{ f_i \right\} ^m_{i = 1}, \quad \forall k\ge 0. \end{aligned}$$

(15)

Since every control vector field f in (14) is obtained by evaluating a Lie product involving the $f_i$’s, by using the above fact, we can simplify Lie extended system (14) as follows:

$$\begin{aligned} \dot{x}_\sigma (t) =f_0(x_\sigma (t), \sigma ) + \sum ^m_{i = 1}\sum ^{k}_{l = 0}\sum _{\mathrm{p}\in {\mathcal P}(l + 1)} \left( u_{i,\mathrm{p}}(t) \mathrm{p}(\sigma ) \right) f_i(x_\sigma (t)), \quad \forall \sigma \in \Sigma . \end{aligned}$$

(16)

The control inputs $u_{i,\mathrm{p}}(t)$ in the above expression are defined such that

$$\begin{aligned} u_{i,\mathrm{p}}(t):= \sum _{\xi } \lambda _{\xi } \, u_{\xi , \mathrm{p}}(t), \end{aligned}$$

where the summation is over Lie products $\xi $ of depth $(\deg (\mathrm{p}) - 1)$ such that $ \xi = \lambda _{\xi } f_{i} $. We now have the following fact (see, also, similar results in [1]):

Lemma 5

Original system (13) is approximately ensemble path-controllable under $\mathrm{C}^1$-inputs if and only if any of its Lie extended system (16) is.

Proof

By Lemma 4 and Remark 3, we know that for any control input $u^*[0,T]$ of class $\mathrm{C}^1$ for Lie extended system (16), there is a sequence of control inputs $\{u^{(j)}[0,T]\}^\infty _{j= 1}$ of class $\mathrm{C}^1$ for each individual system-$\sigma $ such that the trajectory $x^{(j)}_\sigma [0,T]$ generated by $u^{(j)}[0,T]$ converges uniformly to the trajectory $x^*_\sigma [0,T]$ of system (16) generated by $u^*[0,T]$. We now fix an arbitrary $\epsilon > 0$ and show that there exists an integer $j_\Sigma $ such that if $j\ge j_\Sigma $, then

$$\begin{aligned} \mathrm{d}_M(x^{(j)}_{\sigma }(t), x^*_{\sigma }(t)) < \epsilon , \quad \forall (t,\sigma )\in [0,T]\times \Sigma . \end{aligned}$$

(17)

To establish the fact, we first note that the initial profile $x_\Sigma (0)$ is analytic in $\sigma $. We next note that the drifting vector field $f_0$ and the monomials $\mathrm{p}$ are analytic functions. It follows that for any $\sigma \in \Sigma $, there exist an integer $j_\sigma $ and an open neighborhood $U_\sigma $ of $\sigma $ such that $\mathrm{d}_M(x^{(j')}_{\sigma '}(t), x^*_{\sigma '}(t)) < \epsilon $ for any $j' \ge j_\sigma $, any $\sigma '\in U_\sigma $, and any $t\in [0,T]$. All such $U_\sigma $ form an open cover of $\Sigma $. Since $\Sigma $ is compact, there is a finite subcover $\{U_{\sigma _i}\}^N_{i = 1}$. It then suffices to set $j_\Sigma := \max ^N_{i = 1} \{j_{\sigma _i}\}$ so that (17) holds. $\square $

3.3.2 On the use of Stone–Weierstrass theorem

By Lemma 5, it now suffices to establish controllability of system (16) for a certain order k with $\mathrm{C}^1$-control inputs $u_{i,\mathrm{p}}[0,T]$. We prove the fact below. Let ${\hat{x}}_\Sigma [0,T]$ be an arbitrary target trajectory of profiles. By the first item of Definition 2, we have that the set $\{f_i(x)\}^m_{i = 1}$ spans $T_xM$ for all $x\in M$. This, in particular, implies that there are functions $c_i(t, \sigma )$ continuous in both t and $\sigma $, for $i = 1,\ldots , m$, such that

$$\begin{aligned} \frac{\partial {\hat{x}}_\sigma (t)}{\partial t} - f_0({\hat{x}}_\sigma (t), \sigma ) = \sum ^m_{i = 1} c_i(t, \sigma )f_i({\hat{x}}_\sigma (t)), \quad \forall (t, \sigma )\in [0,T]\times \Sigma . \end{aligned}$$

(18)

To see this, we first note that for any given $(t,\sigma )$, there is an open neighborhood U of $(t,\sigma )$ in $[0,T]\times \Sigma $ such that local existence of such continuous functions $c^U_{i}(t,\sigma )$ is guaranteed over U. All such open neighborhoods U form an open cover of $[0,T]\times \Sigma $. Since $[0,T]\times \Sigma $ is compact, there is a finite subcover $\{U_j\}^N_{j = 1}$. Let $\{h_j\}^N_{j = 1}$ be a partition of unity [31] subordinate to $\{U_j\}^N_{j = 1}$. We then define $c_i:= \sum ^N_{j = 1}h_j c^{U_j}_i$.

Comparing (16) with (18), we see that if there exist an order $k\ge 0$ and a set of control inputs $u_{i,\mathrm{p}}$, for $i = 1, \ldots , m$ and for $\mathrm{p}$ a monomial with $1\le \deg (\mathrm{p}) \le k + 1$, such that the following holds:

$$\begin{aligned} c_i(t, \sigma ) = \sum ^{k}_{l = 0}\sum _{\mathrm{p}\in {\mathcal P}(l + 1)} u_{i,\mathrm{p}}(t) \mathrm{p}(\sigma ), \quad \forall (t, \sigma )\in [0,T]\times \Sigma \, \text{ and } \, \forall i = 1,\ldots , m, \end{aligned}$$

(19)

then the trajectory of profiles $x_\Sigma [0,T]$ generated by system (16), with $x_\Sigma (0) = {\hat{x}}_\Sigma (0)$, will be exactly ${\hat{x}}_\Sigma [0,T]$. Said in another way, if (19) holds, then one can steer the kth-order Lie extended system (16) to follow the trajectory ${\hat{x}}_\Sigma [0, T]$.

But, in general, equality (19) cannot be satisfied by a finite sum. Nevertheless, we show below that the two sides of the expression can be made arbitrarily close to each other provided that k is sufficiently large, i.e.,

$$\begin{aligned} \left| \sum ^{k}_{l = 0}\sum _{\mathrm{p}\in {\mathcal P}(l + 1)} u_{i,\mathrm{p}}(t) \mathrm{p}(\sigma ) - c_i(t, \sigma ) \right| < \delta , \end{aligned}$$

(20)

for all $(t, \sigma )\in [0,T]\times \Sigma $ and for all $i = 1,\ldots , m$. This essentially follows from the Stone-Weierstrass theorem. We provide details below. Note that if (20) holds for any given $\delta > 0$, then one can apply Grönwall type inequalities [16] to show that the distance $\Vert x_\sigma (t) - {\hat{x}}_\sigma (t)\Vert $ or, in general, $\mathrm{d}_M(x_\sigma (t), {\hat{x}}_\sigma (t))$ can be made uniformly and arbitrarily small for all $(t,\sigma )\in [0,T]\times \Sigma $.

We now establish (20) for any given $\delta > 0$. By the assumption of Theorem 1, the set $\{\rho _s\}^r_{s = 1}$ is a separating set and contains an everywhere nonzero function. Without loss of generality, we let $\rho _1$ be such a function, i.e., $\rho _1(\sigma ) \ne 0$ for all $\sigma \in \Sigma $. It follows from the Stone-Weierstrass theorem that the subalgebra generated by the set $\{\rho _s\}^r_{s = 1}$ is dense in $\mathrm{C}^0(\Sigma )$. In particular, we have the following fact: For any given $\delta ' > 0$, there exist an integer $k \ge 0$ and a set of smooth functions $u'_{i, \mathrm{p}'}: [0,T]\rightarrow \mathbb {R}$, for $i = 1,\ldots , m$ and for $\mathrm{p}'$ a monomial with $0\le \deg (\mathrm{p}') \le k$, such that

$$\begin{aligned} \left| \sum ^{k}_{l = 0}\sum _{\mathrm{p}'\in {\mathcal P}(l)} u'_{i,\mathrm{p}'}(t) \mathrm{p}'(\sigma ) - \rho ^{-1}_1(\sigma ) c_i(t, \sigma ) \right| < \delta ', \end{aligned}$$

(21)

for all $(t, \sigma )\in [0,T]\times \Sigma $ and for all $i = 1,\ldots , m$. To see this, we first note that for any given $t\in [0,T]$, there is an open neighborhood $\mathcal {I}$ of t such that the local existence of such functions $u'_{i,\mathrm{p}'}:\mathcal {I}\rightarrow \mathbb {R}$ is guaranteed by the Stone-Weierstrass theorem. Then, by applying smooth partition of unity for the closed interval [0, T], we obtain desired functions $u'_{i,\mathrm{p}'}$ defined globally over the entire [0, T].

Let $\gamma :=\max \{ |\rho ^{-1}_1(\sigma )| \mid \sigma \in \Sigma \}$ > 0. Note that $\gamma $ exists because $\rho _1$ is everywhere nonzero and $\Sigma $ is compact. Now, given an arbitrary $\delta > 0$, we define $\delta ':= \delta / \gamma $ and let inequality (21) be satisfied. By the definition of $\gamma $, we have that

$$\begin{aligned} \left| \sum ^{k}_{l = 0}\sum _{\mathrm{p}'\in {\mathcal P}(l)} u'_{i,\mathrm{p}'}(t) (\rho _1(\sigma )\mathrm{p}'(\sigma )) - c_i(t, \sigma ) \right| < \gamma \delta ' = \delta , \end{aligned}$$

for all $(t, \sigma )\in [0,T]\times \Sigma $ and for all $i = 1,\ldots , m$. Note that each $\rho _1\mathrm{p}'$ in the above expression is a monomial and $1\le \deg (\rho _1\mathrm{p}') \le k + 1$. Next, for any $i = 1,\ldots , m$ and any monomial $\mathrm{p}$ with $1\le \deg (\mathrm{p}) \le k + 1$, we let the corresponding control input $u_{i,\mathrm{p}}(t)$ be defined such that for any $t\in [0,T]$,

$$\begin{aligned}u_{i,\mathrm{p}}(t):= \left\{ \begin{array}{ll} u'_{i,\mathrm{p}'}(t) &{} \text{ if } \mathrm{p}= \rho _1\mathrm{p}' \text{ with } 0\le \deg (\mathrm{p}') \le k, \\ 0 &{} \text{ otherwise }. \end{array} \right. \end{aligned}$$

With the above-defined control inputs $u_{i,\mathrm{p}}(t)$, we conclude that (20) is satisfied. $\square $

3.4 Proof of ensemble observability

We will now establish the second item of Theorem 1. Let a profile ${\bar{x}}_\Sigma (0)$ be chosen such that it is output equivalent to $x_\Sigma (0)$. The majority of effort will be devoted to proving the following fact: If $\{\phi ^j\}^l_{j = 1}$ is weakly codistinguished to $\{f_i\}^m_{i = 1}$, then there is an open neighborhood U of $x_\Sigma (0)$ in $\mathrm{C}^\omega (\Sigma ,M)$ such that if ${\bar{x}}_\Sigma (0)$ intersects U, then ${\bar{x}}_\Sigma (0) = x_\Sigma (0)$. The proof relies on the use of a special class of control inputs, namely piecewise constant control inputs and the structure of codistinguished functions.

3.4.1 On the use of piecewise constant control inputs

We first introduce a few key notations that will be used in the proof. For an arbitrary differential equation $\dot{x}(t) = f(x(t))$, we denote by $e^{tf} x(0)$ the solution of the equation at time t with initial condition x(0). We will use such a notation to denote a solution $x_\sigma (t)$, for any $\sigma \in \Sigma $, of system (9). Next, we recall that u(t) is the collection of the control inputs $u_{i,s}(t)$, for $1 \le i \le m$ and $1\le s \le r$, in system (9). We introduce a notation for a piecewise constant control input u(t) over [0, T] as follows:

$$\begin{aligned} u[0, T] := (i_{1}, s_1, \nu _1, t_1)\cdots (i_{k}, s_k, \nu _k, t_k), \end{aligned}$$

(22)

where $0< t_1< \cdots < t_{k} = T$ is an increasing sequence of switching times, $\nu _p$’s are real numbers, and $(i_p, s_p)$’s are pairs of indices chosen out of $\{1,\ldots , m\} \times \{1,\ldots , r\}$. The piecewise constant control input u[0, T] is defined such that if $t \in [t_{p - 1}, t_{p})$, then

$$\begin{aligned}u_{i,s}(t) = \left\{ \begin{array}{ll} \nu _p, &{}\quad \text{ if } (i, s) = (i_p, s_p),\\ 0, &{}\quad \text{ otherwise }. \end{array}\right. \end{aligned}$$

Note, in particular, that at any time $t\in [0,T]$, there is at most one nonzero scalar control input $u_{i,s}(t)$ in u(t).

We will now apply piecewise constant control input (22) to excite system (9). For convenience, we define

$$\begin{aligned}\tau _p:= t_{p} - t_{p-1}, \quad \forall p = 1, \ldots , k,\end{aligned}$$

with $t_0 := 0$. We further define a set of vector fields $\{{\tilde{f}}_p\}^k_{p = 1}$ as follows:

$$\begin{aligned} {\tilde{f}}_{p} := \nu _p \rho _{s_p} f_{i_p} + f_0, \quad \forall p = 1,\ldots , k, \end{aligned}$$

where we have omitted all the arguments in the expression. Since $x_\Sigma (0)\sim {\bar{x}}_\Sigma (0)$, we have that for all $j = 1,\ldots , l$,

$$\begin{aligned} \int _\Sigma \phi ^j\left( e^{\tau _k {\tilde{f}}_{k}}\cdots e^{\tau _1{\tilde{f}}_{1}}x_\sigma (0) \right) \mathrm{d}\sigma = \int _\Sigma \phi ^j\left( e^{\tau _k {\tilde{f}}_{k}}\cdots e^{\tau _1{\tilde{f}}_{1}}{\bar{x}}_\sigma (0)\right) \mathrm{d}\sigma . \end{aligned}$$

Moreover, the above equality holds for any $\tau _p$ and $\nu _p$, with $p = 1,\ldots , k$.

We next take the partial derivative $\nicefrac {\partial ^{k}}{\partial \tau _1\cdots \partial \tau _{k} }$ on both sides of the above expression and evaluate the derivatives at $\tau _1 = \cdots = \tau _k =0$. By computation, we obtain that

$$\begin{aligned} \int _\Sigma \left( {\tilde{f}}_{1} \cdots {\tilde{f}}_{k} \phi ^j \right) (x_\sigma (0)) \mathrm{d}\sigma = \int _\Sigma \left( {\tilde{f}}_{1} \cdots {\tilde{f}}_{k} \phi ^j \right) ({\bar{x}}_\sigma (0)) \mathrm{d}\sigma . \end{aligned}$$

We further take the partial derivative $\nicefrac {\partial ^k}{\partial \nu _1\cdots \partial \nu _k}$ and evaluate at $\nu _1 = \cdots = \nu _k = 0$. By computation, we obtain that

$$\begin{aligned} \int _\Sigma (f_{\mathrm{w}} \phi ^j)(x_\sigma (0)) \mathrm{p}(\sigma ) \mathrm{d}\sigma = \int _\Sigma ( f_{\mathrm{w}} \phi ^j)({\bar{x}}_\sigma (0))\mathrm{p}(\sigma ) \mathrm{d}\sigma , \end{aligned}$$

(23)

where $\mathrm{w} := i_1\cdots i_k$ is a word and $\mathrm{p}:= \rho _{s_1}\cdots \rho _{s_k}$ is a monomial.

3.4.2 On the use of codistinguished functions

Note that $\{\phi ^j\}^l_{j = 1}$ is (weakly) codistinguished to $\{f_i\}^m_{i = 1}$. By the second item of Definition 3, we have that for any $j' = 1,\ldots , l$, there exist a word $\mathrm{w}$ over the alphabet $\{1,\ldots , m\}$ of length k, a function $\phi ^{j}$, and a nonzero $\lambda $ such that $f_{\mathrm{w}}\phi ^{j} = \lambda \phi ^{j'}$. Since (23) holds for all words $\mathrm{w}$ of length k for k arbitrary, we obtain that

$$\begin{aligned} \int _\Sigma \phi ^j(x_\sigma (0)) \mathrm{p}(\sigma ) \mathrm{d}\sigma = \int _\Sigma \phi ^j({\bar{x}}_\sigma (0))\mathrm{p}(\sigma ) \mathrm{d}\sigma , \end{aligned}$$

(24)

for all $j = 1,\ldots , l$ and for all monomials $\mathrm{p}\in \mathcal {P}$.

We now let $\mathrm{L}^2(\Sigma )$ be the Hilbert space of all square-integrable functions on $\Sigma $, where the inner product is defined as follows:

$$\begin{aligned} \langle \mathrm{q}_1, \mathrm{q}_2 \rangle _{\mathrm{L}^2} : = \int _\Sigma \mathrm{q}_1(\sigma )\mathrm{q}_2(\sigma ) \mathrm{d}\sigma , \quad \forall \mathrm{q}_1, \mathrm{q}_2\in \mathrm{L}^2(\Sigma ). \end{aligned}$$

Note that $\Sigma $ is compact. By the assumption of Theorem 1, the set of parameterization functions $\{\rho _s\}^r_{s = 1}$ separates points and contains an everywhere nonzero function, so the subalgebra generated by the set is dense in $\mathrm{L}^2(\Sigma )$. Thus, if there is a function $\mathrm{q}\in \mathrm{L}^2(\Sigma )$ such that $\langle \mathrm{q}, \mathrm{p}\rangle _{\mathrm{L}^2} = 0$ for all monomials $\mathrm{p}\in {\mathcal P}$, then $\mathrm{q}$ is zero almost everywhere (it differs from the identically zero function over a set of measure zero). In the case here, we define for each $j = 1,\ldots , l$ the following function:

$$\begin{aligned} \mathrm{q}^j(\sigma ) := \phi ^j(x_\sigma (0)) - \phi ^j({\bar{x}}_\sigma (0)). \end{aligned}$$

Then, one can rewrite (24) as follows:

$$\begin{aligned} \langle \mathrm{q}^j, \mathrm{p}\rangle _{\mathrm{L}^2} = 0, \quad \forall \mathrm{p}\in {\mathcal P} \,\, \text{ and } \,\, \forall j = 1,\ldots , l. \end{aligned}$$

Because $x_\sigma (0), {\bar{x}}_\sigma (0)$ are analytic in $\sigma $ and each $\phi ^j(x)$ is analytic in x, we have that each $\mathrm{q}^j(\sigma )$ is analytic in $\sigma $. Furthermore, since $\Sigma $ is equipped with a strictly positive Borel measure, we have that each $\mathrm{q}^j$ is identically zero, i.e.,

$$\begin{aligned} \phi ^j(x_\sigma (0)) = \phi ^j({\bar{x}}_\sigma (0)), \quad \forall \sigma \in \Sigma \,\, \text{ and } \,\, \forall j = 1,\ldots ,l. \end{aligned}$$

(25)

Since $\{\phi ^j\}^l_{j = 1}$ is (weakly) codistinguished to $\{f_i\}^m_{i = 1}$, by the first item of Definition 3, the set of one-forms $\{\mathrm{d}\phi ^j_{x}\}^l_{j = 1}$ spans the cotangent space $T^*_{x}M$ for all $x\in M$. It follows that for any $x\in M$, there is an open ball $B_{\epsilon (x)}(x)$ centered at x with radius $\epsilon (x) > 0$ such that if ${\bar{x}}\in B_{\epsilon (x)}(x)$ and $\phi ^j(x) = \phi ^j({\bar{x}})$ for all $j = 1,\ldots , l$, then ${\bar{x}} = x$. Furthermore, since each $\phi ^j$ is analytic, for any fixed $x\in M$, the radius $\epsilon (x)$ of the open ball can be chosen such that it is locally continuous around x. Since the initial profile $x_\Sigma (0)$ is analytic in $\sigma $, the above arguments have the following implication: For each $\sigma \in \Sigma $, there is an open neighborhood $V_\sigma $ of $\sigma $ in $\Sigma $ and a positive number $\epsilon _\sigma $ such that if $\sigma '\in V_\sigma $ and ${\bar{x}}_{\sigma '}(0)$ belongs to the open ball $B_{\epsilon _\sigma }(x_{\sigma '}(0))$ with $\phi ^j(x_{\sigma '}(0)) = \phi ^j({\bar{x}}_{\sigma '}(0))$ for all $j = 1,\ldots , l$, then ${\bar{x}}_{\sigma '}(0) = x_{\sigma '}(0)$.

The collection of the above open sets $\{V_\sigma \}_{\sigma \in \Sigma }$ is an open cover of $\Sigma $. Since $\Sigma $ is compact, there exists a finite subcover $\{V_{\sigma _{i}}\}^N_{i =1} $ of $\Sigma $. We then let

$$\begin{aligned} \epsilon := \min \left\{ \epsilon _{\sigma _i} \mid i = 1,\ldots , N\right\} > 0. \end{aligned}$$

We show below that if there is a certain $\sigma \in \Sigma $ such that $\mathrm{d}_M(x_\sigma (0), {\bar{x}}_\sigma (0)) < \epsilon $, then ${\bar{x}}_\Sigma (0) = x_\Sigma (0)$. This, in particular, implies weak ensemble observability of system (9).

To establish the fact, we first note that by the construction of $\epsilon $, ${\bar{x}}_\sigma (0) = x_\sigma (0)$. Now, let $\sigma '$ be any other point of $\Sigma $. We need to show that ${\bar{x}}_{\sigma '}(0) = x_{\sigma '}(0)$. Because $\Sigma $ is path-connected, there is a continuous path $p:[0,1] \rightarrow \Sigma $ with $p(0) = \sigma $ and $p(1) = \sigma '$. Again, by the definition of $\epsilon $, we have that for any $\lambda \in [0,1]$, there are only two cases: Either ${\bar{x}}_{p(\lambda )}(0) = x_{p(\lambda )}(0)$ or $\mathrm{d}_M(x_{p(\lambda )}, {\bar{x}}_{p(\lambda )}) \ge \epsilon $. On the other hand, the profile ${\bar{x}}_\Sigma (0)$ is continuous in $\sigma $ and $p(\lambda )$ is continuous in $\lambda $, so ${\bar{x}}_{p(\lambda )}(0)$ is continuous in $\lambda $ as well. But then, since ${\bar{x}}_{p(0)}(0) = x_{p(0)}(0)$, it follows that ${\bar{x}}_{p(\lambda )}(0) = x_{p(\lambda )}(0)$ for all $\lambda \in [0,1]$. In particular, ${\bar{x}}_{\sigma '}(0) = x_{\sigma '}(0)$.

We now show that if, further, M is compact, then $|O(x_\Sigma (0))| \le \chi _\phi $. Recall that $\phi := (\phi ^1,\ldots ,\phi ^l)$ and $[x]_\phi $ is the pre-image of $\phi (x)$. Because any two different profiles in $O(x_\Sigma (0))$ are completely disjoint, it suffices to show that $|[x_\sigma (0)]_\phi |\le \chi _\phi $ for some (and, hence, any) $\sigma \in \Sigma $. But, this follows from the definition of $\chi _\phi $ and Lemma 3.

Finally, note that if $\{\phi ^j\}^l_{j = 1}$ is codistinguished to $\{f_i\}^m_{i = 1}$ (and, hence, the third item of Definition 3 is satisfied), then by (25), $x_\sigma (0) = {\bar{x}}_\sigma (0)$ for all $\sigma \in \Sigma $, i.e., $O(x_\Sigma (0)) = \{ x_\Sigma (0) \}$. Thus, system (9) is ensemble observable. This completes the proof. $\square $

3.5 Pre-distinguished ensemble system

We consider in the subsection a scenario where the set of control vector fields $\{f_i(x)\}^m_{i = 1}$ (resp. the set of one-forms $\{\mathrm{d}\phi ^j(x)\}^l_{j = 1}$) in system (9) does not necessarily span the tangent space $T_xM$ (resp. the cotangent space $T^*_xM$). Nevertheless, the two sets $\{f_i\}^m_{i = 1}$ and $\{\phi ^j\}^l_{j = 1}$ together can “generate” (weakly) jointly distinguished vector fields and functions. We make the statement precise below.

To proceed, we first introduce a few definitions and notations. Let $F:= \{f_i\}^m_{i = 1}$ and ${\mathcal L}_F$ be the collection of Lie products generated by F (the $f_i$’s are treated as “free” generators). We say that ${\mathcal L}_F$ is projectively finite if there is a finite set of vector fields ${\bar{F}}:= \{{\bar{f}}_i\}^{{\bar{m}}}_{i = 1}$ over M such that if one evaluates the Lie products in ${\mathcal L}_F$, then ${\mathcal L}_F\equiv {\bar{F}}$.

Next, let ${\mathcal W}$ be the collection of all words over the alphabet $\{1,\ldots , m\}$. Recall that for a given word $\mathrm{w} = i_1\cdots i_k$ and an analytic function $\phi $ on M, we use $f_{\mathrm{w}} \phi $ to denote $f_{i_1}\cdots f_{i_k}\phi $. If $\mathrm{w} = \varnothing $, then $f_{\mathrm{w}}\phi = \phi $. Given a set function $\Phi :=\{\phi ^j\}^l_{j = 1}$ on M and the set of vector fields F, we define

$$\begin{aligned} F_{\mathcal W} \Phi := \{f_{\mathrm{w}} \phi ^j \mid \mathrm{w} \in {\mathcal W} \text{ and } j = 1,\ldots , l \}. \end{aligned}$$

Similarly, we say that $F_{\mathcal W}\Phi $ is projectively finite if there is a finite subset ${\bar{\Phi }} :=\{{\bar{\phi }}^j\}^{{\bar{l}}}_{j = 1}$ of $\mathrm{C}^\omega (M)$ such that $F_{\mathcal W}\Phi \equiv {\bar{\Phi }}$. Note, in particular, that F and $\Phi $ are, up to scaling, subsets of ${\bar{F}}$ and ${\bar{\Phi }}$, respectively. We now have the following definition:

Definition 8

A set of vector fields $F := \{f_i\}^m_{i = 1}$ over M is pre-distinguished if there exists a distinguished set ${\bar{F}}$ of vector fields such that $ {\mathcal L}_F \equiv {\bar{F}}$. Similarly, a set of functions $\Phi := \{\phi ^j\}^l_{j = 1}$ on M is (weakly) pre-codistinguished to F if there exists a finite set ${\bar{\Phi }}$ of functions, (weakly) codistinguished to F, such that $F_{\mathcal W} \Phi \equiv {\bar{\Phi }}$.

Note that given a pair of jointly distinguished sets F and $\Phi $, one can look for (proper) subsets $F'\subseteq F$ and $\Phi ' \subseteq \Phi $ so that ${\mathcal L}_{F'} \equiv F$ and ${F'}_{\mathcal W}\Phi ' \equiv \Phi $, i.e., $F'$ and $\Phi '$ are jointly pre-distinguished. In particular, we say that $(F', \Phi ')$ is minimal if removal of any element out of $F'$ or $\Phi '$ will violate the condition in the above definition. We do not intend to characterize here minimal pairs for a given jointly distinguished pair $(F,\Phi )$. But instead, we provide below an example for illustration.

Example 3

We consider again the vector fields $F := \{L_{X_i}\}^3_{i = 1}$ and the functions $\Phi := \{ \phi _{ij} = {\text {tr}}(gX_j g^\top X^\top _i)\}^3_{i, j = 1}$ introduced in Example 1. We have shown that F and $\Phi $ are jointly distinguished on ${\text {SO}}(3)$. Now, we define for each $i = 1,2, 3$, a subset $F_i:= F - \{L_{X_i}\}$ and for each $j = 1,2,3$, a subset $\Phi ^j:= \{\phi ^{ij}\}^3_{i = 1}$. Recall that we have the following relationships:

$$\begin{aligned} {[}L_{X_i},L_{X_j}] = \det (e_i,e_j,e_k)L_{X_k} \quad \text{ and } \quad L_{X_i} \phi ^{i'j} = - \det (e_i,e_j,e_k) \phi ^{i'k}. \end{aligned}$$

It follows that ${\mathcal L}_{F_i} \equiv F$ for all $i = 1,2,3$, and $F_{{i}_{\mathcal W}} \Phi ^j \equiv \Phi $ for all $1\le i, j \le 3$. Moreover, every such pair $(F_i, \Phi ^j)$ is minimal. $\square $

With the above definition, we state the following fact which generalizes Theorem 1:

Theorem 2

Consider ensemble system (9). Suppose that $\{\rho ^2_s\}^r_{s = 1}$ is a separating set and contains an everywhere nonzero function; then, the following hold:

(1)
If the set of control vector fields $\{f_i\}^m_{i = 1}$ is pre-distinguished, then system (9) is approximately ensemble path-controllable under $\mathrm{C}^1$-inputs.
(2)
If the set of observation functions $\{\phi ^j\}^l_{j = 1}$ is (weakly) pre-codistinguished to $\{f_i\}^m_{i = 1}$, then system (9) is (weakly) ensemble observable. If, further, M is compact, then for any initial profile $x_\Sigma (0)$, the set $O(x_\Sigma (0))$ is finite and $|O(x_\Sigma (0))| \le \chi _\phi $.

We establish Theorem 2 in the following subsection. Similar to Definition 7, we have the following definition:

Definition 9

An ensemble system (9) is a pre-distinguished if (1) the set $\{\rho ^2_s\}^r_{s = 1}$ separates points and contains an everywhere nonzero function, and (2) the set of control vector fields $\{f_i\}^m_{i = 1}$ and the set of observation functions $\{\phi ^j\}^l_{j = 1}$ are (weakly) jointly pre-distinguished.

It follows from Theorem 2 that if a system is pre-distinguished, then it is approximately ensemble path-controllable and (weakly) ensemble observable. We next have the following remark on the existence of a desired set of parameterization functions that satisfies the assumption of Theorem 2 (compared to Remark 2):

Remark 4

We first note that if $\{\rho ^2_s\}^r_{s = 1}$ is a separating set, then, for any positive integer k, $\{\rho ^k_s\}^r_{s = 1}$ is also a separating set. Conversely, if $\{\rho _s\}^r_{s = 1}$ is a separating set and each $\rho _s$ is nonnegative (i.e., $\rho _s(\sigma ) \ge 0$ for all $\sigma \in \Sigma $), then $\{\rho ^2_s\}^r_{s = 1}$ will be a separating set. Such a set $\{\rho _s\}^r_{s = 1}$ exists for any analytic, compact manifold $\Sigma $. To see this, we again embed $\Sigma $ into a Euclidean space $\mathbb {R}^N$. Since $\Sigma $ is compact, one can translate the coordinates, if necessary, such that $\Sigma $ is embedded in the positive orthant of $\mathbb {R}^N$. Then, by restricting the coordinate functions of $\mathbb {R}^N$ to $\Sigma $, we obtain a separating set $\{\rho _i(\sigma ):= \sigma _i\}^N_{i = 1}$ comprised of all positive functions. $\square $

3.6 Analysis and proof of Theorem 2

3.6.1 Indicator sequences

Let ${\bar{F}} =\{{\bar{f}}_i\}^{{\bar{m}}}_{i =1}$ be such that ${\bar{F}} \equiv {\mathcal L}_F$. Decompose ${\mathcal L}_F:= \sqcup _{k\ge 0}{\mathcal L}_F(k)$ where ${\mathcal L}_F(k)$ is comprised of Lie products of depth k. In contrast to (15), we do not necessarily have that ${\mathcal L}(k)\equiv {\bar{F}}$ for all $k \ge 0$. It is possible that each ${\mathcal L}_F(k)$ is, up to scaling, a proper subset of ${\bar{F}}$ (see Example 4). To tackle the issue, we first introduce the following definitions:

Definition 10

Let ${\mathcal L}_F$ be projectively finite and ${\bar{F}} =\{{\bar{f}}_i\}^{{\bar{m}}}_{i =1}$ be such that ${\bar{F}} \equiv {\mathcal L}_F$. For each $i = 1,\ldots , {\bar{m}}$, define a sequence of natural numbers $\mathbb {N}_i$ as follows: If $k\in \mathbb {N}_i$, then there exist a Lie product $\xi \in {\mathcal L}_F(k)$ and a real number $\lambda $ such that by evaluating $\xi $, we have ${\bar{f}}_i = \lambda \xi $. We call every such sequence $\mathbb {N}_i$ an indicator sequence for ${\bar{f}}_i$.

Similarly, we have the following counterpart of the above definition:

Definition 11

Let $F_{\mathcal W}\Phi $ be projectively finite and ${\bar{\Phi }} = \{{\bar{\phi }}^j\}^{{\bar{l}}}_{j = 1}$ be such that ${\bar{\Phi }} \equiv F_{\mathcal W}\Phi $. For each $j = 1,\ldots , {\bar{l}}$, define a sequence of natural numbers $\mathbb {N}^j$ as follows: If $k \in \mathbb {N}^j$, then there exist a word $\mathrm{w}$ of length k over the alphabet $\{1,\ldots , m\}$, a function $\phi ^{j'}\in \Phi $, and a real number $\lambda $ such that ${\bar{\phi }}^j = \lambda f_{\mathrm{w}}\phi ^{j'}$. We call every such sequence $\mathbb {N}^j$ an indicator sequence for ${\bar{\phi }}^j$.

Note that if F and $\Phi $ are (weakly) jointly distinguished, then $\mathbb {N}_i = \mathbb {N}^j = \mathbb {N}$ for all $i = 1,\ldots , m (={\bar{m}})$ and for all $j = 1,\ldots , l (={\bar{l}})$.

Example 4

Consider the subsets $F_1 = F - \{L_{X_1}\}$ and $\Phi ^1 =\{\phi ^{i1}\}^3_{i = 1}$ introduced in Example 3. We have that ${\mathcal L}_{F_1} \equiv F$ and $F_{1_{\mathcal W}}\Phi ^1 \equiv \Phi $. By computation (with details omitted), the indicator sequences $\mathbb {N}_i$ for $L_{X_i}$ are given by $\mathbb {N}_1 = \{2k + 1\}_{k\ge 0}$ and $\mathbb {N}_2 = \mathbb {N}_3 = \{2k\}_{k \ge 0}$. The indicator sequences $\mathbb {N}^{ij}$ for $\phi ^{ij}$ are given by $\mathbb {N}^{i1} = \{2k\}_{k\ge 0}$ and $\mathbb {N}^{i2} = \mathbb {N}^{i3} = \{2k + 1\}_{k\ge 0}$ for all $i = 1,2,3$. $\square $

A sequence $\{n_k\}^\infty _{k = 0}$ is said to be an arithmetic sequence if there is a $\delta $ such that $n_{k + 1} - n_k = \delta $ for all $k \ge 0$. We now establish the following fact:

Proposition 1

Every indicator sequence $\mathbb {N}_i$ for ${\bar{f}}_i$ (or $\mathbb {N}^j$ for ${\bar{\phi }}^j$) contains an infinite arithmetic sequence as a subsequence.

Proof

We establish the proposition for $\mathbb {N}_i$ and $\mathbb {N}^j$ subsequently.

Proof for $\mathbb {N}^i$. We fix an $i = 1,\ldots , {\bar{m}}$ and prove that $\mathbb {N}_i$ contains an arithmetic sequence. Because F is pre-distinguished, there exists a Lie product $\xi _1\in {\mathcal L}_F$, with ${\text {dep}}(\xi _1) \ge 1$, and a real number $\lambda _1$ such that $\lambda _1 \xi _1 = {\bar{f}}_i$. Denote by $f_{i_1}\in F$ the first element that shows up in $\xi _1$ (e.g., $\xi _1 = [f_{i_1}, [f_{i'_1},f_{i''_1}]]$). Applying the same argument, but with ${\bar{f}}_{i}$ replaced by $f_{i_1}$, we obtain that $\lambda _2 \xi _2 = f_{i_1}$ for some $\xi _2\in {\mathcal L}_F$ with ${\text {dep}}(\xi _2) \ge 1 $ and some $\lambda _2\in \mathbb {R}$.

Next, we let ${\xi _1} \lhd \xi _2$ be a Lie product in ${\mathcal L}_F$ defined by replacing the first element $f_{i_1}$ in $\xi _1$ with the Lie product $\xi _2$. For example, if $\xi _1 = [f_{i_1}, [f_{i'_1},f_{i''_1}]]$, then ${\xi _1} \lhd \xi _2 = [\xi _2, [f_{i'_1},f_{i''_1}]]$. It should be clear that

$$\begin{aligned} \lambda _1\lambda _2 {\xi _1} \lhd \xi _2 = f_i, \quad \text{ with } {\text {dep}}({\xi _1} \lhd \xi _2) = {\text {dep}}({\xi _1}) + {\text {dep}}({\xi _2}). \end{aligned}$$

By repeating the above procedure, we obtain (1) a sequence of Lie products $\{\xi _k\}_{k\ge 1}$, (2) a sequence of vector fields $\{f_{i_k}\}_{k\ge 1}$ with $f_{i_k}\in F$, and (3) a sequence of real numbers $\{\lambda _k\}_{k\ge 1}$ such that the first element in $\xi _k$ is $f_{i_k}$ and $\lambda _k \xi _k = f_{i_{k-1}}$. It then follows that

$$\begin{aligned} \alpha _k {\xi _1} \lhd \cdots \lhd \xi _k = f_i\,\, \text{ where } \alpha _k:=\prod ^k_{l = 1} \lambda _l, \quad \forall k \ge 1. \end{aligned}$$

Note that ${\xi _1} \lhd \cdots \lhd \xi _k$ is well defined because the operator “$\lhd $” is associative.

Since each $f_{i_k}$ belongs to the finite set F, there is a repetition in the sequence. Without loss of generality, we assume that $f_{i_k} = f_{i_{k'}}$ for some $k' > k \ge 1$. We then define a Lie product $\xi $ as follows:

$$\begin{aligned} \xi := \xi _{k + 1}\lhd \cdots \lhd \xi _{k'} \quad \text{ and } \quad \delta : = {\text {dep}}(\xi ) = \sum ^{k'}_{l = k + 1}{\text {dep}}(\xi _l). \end{aligned}$$

Note that the first element in $\xi $ is $f_{i_k}$ and $\nicefrac {\alpha _{k'}}{\alpha _k} \xi = f_{i_k}$. In fact, the statement can be strengthened: For any given $N \ge 0$, we define

$$\begin{aligned} \xi ^N:= \xi \lhd \cdots \lhd \xi \end{aligned}$$

where the number of copies of $\xi $ in the expression is N. If $N = 0$, then we let $\xi ^0:=f_{i_k}$. It should be clear that for any $N \ge 0$, the first element in $\xi ^N $ is $f_{i_k}$ and, moreover, $\nicefrac {\alpha _{k'}^N}{\alpha _k^N} \xi ^N = f_{i_k}$. We further define a Lie product $\xi _0$ as follows:

$$\begin{aligned} \xi _0:= \xi _{1}\lhd \cdots \lhd \xi _k \quad \text{ and } \quad \delta _0:= {\text {dep}}(\xi _0) = \sum ^k_{l = 1}{\text {dep}}(\xi _l). \end{aligned}$$

It then follows that for any $N\ge 0$.

$$\begin{aligned} \left( \nicefrac {\alpha _{k'}^N}{\alpha _k^{N - 1}} \right) \xi _0 \lhd \xi ^N = {\bar{f}}_{i}, \end{aligned}$$

which implies that $\mathbb {N}_i$ contains $\{\delta _0 + N\delta \}_{N\ge 0}$ as a subsequence.

Proof for $\mathbb {N}^j$. The arguments will be similar to the ones used above. We fix a $j = 1,\ldots , {\bar{l}}$, and prove that $\mathbb {N}^j$ contains an arithmetic sequence. Since $\Phi $ is pre-codistinguished to F, there exist a word $\mathrm{w}_1$ of positive length, a function $\phi ^{j_1}$ out of $\Phi $, and a real number $\mu _1$ such that $\mu _1 f_{\mathrm{w}_1}\phi ^{j_1} = {\bar{\phi }}^j$. Applying the same argument, but with ${\bar{\phi }}^j$ replaced by $\phi ^{j_1}$, we obtain that $\mu _2 f_{\mathrm{w}_2}\phi ^{j_2} = \phi ^{j_1}$ for some word $\mathrm{w}_2$ of positive length, some function $\phi ^{j_2}$ out of $\Phi $, and some real number $\mu _2$. Note, in particular, that

$$\begin{aligned}\mu _1\mu _2f_{\mathrm{w}_1}f_{\mathrm{w}_2}\phi ^{j_2} = {\bar{\phi }}^{j}.\end{aligned}$$

By repeating the procedure, we obtain (1) a sequence of functions $\{\phi ^{j_k}\}_{k\ge 1}$ where each $\phi ^{j_k}$ belongs to $\Phi $, (2) a sequence of words $\{\mathrm{w}_k\}_{k \ge 1}$ of positive lengths, and (3) a sequence of real numbers $\{\mu _k\}_{k\ge 1}$ such that $\mu _k f_{\mathrm{w}_k} \phi ^{j_k} = \phi ^{j_{k-1}}$. It then follows that

$$\begin{aligned} \beta _k f_{\mathrm{w}_1} \cdots f_{\mathrm{w}_k}\phi ^{j_k} = {\bar{\phi }}^j \, \, \text{ where } \beta _k:=\prod ^k_{l = 1} \mu _l, \quad \forall k \ge 1. \end{aligned}$$

Since each $\phi ^{j_k}$ belongs to the finite set $\Phi $, there is a repetition in the sequence, say $\phi ^{j_{k}} = \phi ^{j_{k'}}$ for some $k' > k \ge 1$. It then implies that $ \nicefrac {\beta _{k'}}{\beta _k} f_{\mathrm{w}}\phi ^{j_{k}} = \phi ^{j_k}$ where $\mathrm{w} := \mathrm{w}_{k + 1} \cdots \mathrm{w}_{k'}$ is obtained by concatenation. Denote by $\delta $ the length $\mathrm{w}$. For a nonnegative integer N, we let $\mathrm{w}^N$ be a word obtained by concatenating N copies of $\mathrm{w}$. If $N = 0$, then $\mathrm{w}^N = \varnothing $. We further let $\mathrm{w}_0:= \mathrm{w}_{1} \cdots \mathrm{w}_{k}$ and $\delta _0$ be the length of $\mathrm{w}_0$. It then follows that for any $N \ge 0$,

$$\begin{aligned} \left( \nicefrac {\beta ^{N}_{k'}}{\beta ^{N - 1}_k} \right) f_{\mathrm{w}_0} f_{\mathrm{w}^N}\phi ^{j_k} = {\bar{\phi }}^j, \end{aligned}$$

which implies that $\mathbb {N}^j$ contains $\{\delta _0 + N\delta \}_{N\ge 0}$ as a subsequence. $\square $

3.6.2 Proof of Theorem 2

The arguments we will use for proving the theorem will be similar to those for Theorem 1. We elaborate below only on the difference.

We first establish item 1 of Theorem 2. By repeatedly applying Lie extensions of system (9), we obtain the following formal expression:

$$\begin{aligned} \dot{x}_\sigma (t) = f_0(x_\sigma (t),\sigma ) + \sum _{l \ge 0}\sum _{\xi \in {\mathcal L}_F(l)}\sum _{\mathrm{p}\in {\mathcal P(l + 1)}} u_{\xi , \mathrm{p}}(t) \mathrm{p}(\sigma ) \xi (x_\sigma (t)), \quad \forall \sigma \in \Sigma . \end{aligned}$$

One obtains a kth-order Lie extended system by truncating the infinite summation over l and keeping only the terms with $l \le k$. Because $F = \{f_i\}^m_{i = 1}$ is pre-distinguished, we let ${\bar{F}} =\{{\bar{f}}_i\}^{{\bar{m}}}_{i =1}$ be such that ${\bar{F}} \equiv {\mathcal L}_F$. Then, by the definition of indicator sequence $\mathbb {N}_i$ for ${\bar{f}}_i$, the above equation can be simplified as follows:

$$\begin{aligned} \dot{x}_\sigma (t) = f_0(x_\sigma (t),\sigma ) + \sum ^{{\bar{m}}}_{i = 1}\sum _{l\in \mathbb {N}_i}\sum _{\mathrm{p}\in {\mathcal P(l + 1)}} u_{i, \mathrm{p}}(t) \mathrm{p}(\sigma ) {\bar{f}}_i(x_\sigma (t)), \quad \forall \sigma \in \Sigma . \end{aligned}$$

To establish ensemble controllability of the above system (or more precisely, a truncated version after a certain order), it suffices to show that for any $i = 1,\ldots , {\bar{m}}$, the $\mathbb {R}$-span of monomials in $\sqcup _{l\in \mathbb {N}_i}{\mathcal P(l+1)}$ is dense in $\mathrm{C}^0(\Sigma )$. We prove this fact below.

We fix an $i = 1,\ldots , {\bar{m}}$. By Proposition 1, the indicator sequence $\mathbb {N}_i$ contains an infinite arithmetic sequence, which we denote by $\{n_k\}_{k \ge 0}$ with $\delta := n_{k + 1} - n_k > 0$ for all $k \ge 0$. We next define functions on $\Sigma $ as follows:

$$\begin{aligned}\bar{\rho }_s:=\rho ^\delta _s, \quad \forall s = 1,\ldots , r.\end{aligned}$$

By the assumption of Theorem 2, the set $\{\rho ^2_s\}^r_{s = 1}$ is a separating set and contains an everywhere nonzero function, say $\rho _1$. It follows that $\{\bar{\rho }_s\}^r_{s = 1}$ is also a separating set with $\bar{\rho }_1$ an everywhere nonzero function. Thus, the subalgebra generated by $\{\bar{\rho }_s\}^r_{s = 1}$ is dense in $\mathrm{C}^0(\Sigma )$. Denote the subalgebra by $\bar{\mathcal S}$. Since $\rho _1$ is everywhere nonzero, the following set:

$$\begin{aligned} \rho ^{n_0 + 1}_1 \bar{\mathcal S}:= \left\{ \rho ^{n_0 + 1}_1\mathrm{p}\mid \mathrm{p}\in \bar{\mathcal S}\right\} \end{aligned}$$

is dense in $\mathrm{C}^0(\Sigma )$ as well. On the other hand, the $\mathbb {R}$-span of $\sqcup _{l\in \mathbb {N}_i}{\mathcal P}(l + 1)$ contains $\rho _1^{n_0+1}\bar{\mathcal S}$ as a subset; indeed, if $\mathrm{p}$ is a monomial that can be expressed as

$$\begin{aligned} \mathrm{p}= \rho ^{n_0 + 1}_1 \prod ^r_{s = 1}\bar{\rho }^{k_s}_s \end{aligned}$$

with $k_s \ge 0$, then $\mathrm{p}\in {\mathcal P}(n_k + 1)$ where $k := \sum ^r_{s = 1} k_s$. We have thus shown that the $\mathbb {R}$-span of $\sqcup _{l\in \mathbb {N}_i}{\mathcal P(l+1)}$ is dense in $\mathrm{C}^0(\Sigma )$.

We now establish item 2 of Theorem 2. Let ${\bar{x}}_\Sigma (0)$ and $x_\Sigma (0)$ two initial profiles that are output equivalent. The same arguments in Sect. 3.4 can be used here to obtain the following fact: Let $k\ge 0$ be an arbitrary integer. Let $\mathrm{w}$ be any word of length k and $\mathrm{p}$ be any monomial of degree k. Then, for any $j = 1,\ldots , l$, we have

$$\begin{aligned} \int _\Sigma (f_{\mathrm{w}} \phi ^j)(x_\sigma (0)) \mathrm{p}(\sigma ) \mathrm{d}\sigma = \int _\Sigma ( f_{\mathrm{w}} \phi ^j)({\bar{x}}_\sigma (0))\mathrm{p}(\sigma ) \mathrm{d}\sigma . \end{aligned}$$

(26)

Because $\Phi =\{\phi ^j\}^l_{j = 1}$ is (weakly) pre-codistinguished to F, we let ${\bar{\Phi }} = \{{\bar{\phi }}^j\}^{{\bar{l}}}_{j = 1}$ be such that $F_{\mathcal {W}}\Phi = {\bar{\Phi }}$. For each $j = 1,\ldots , {\bar{l}}$, we define a function $\mathrm{q}^j$ on $\Sigma $ as follows:

$$\begin{aligned} \mathrm{q}^j(\sigma ):= {\bar{\phi }}^j(x_\sigma (0)) - {\bar{\phi }}^j({\bar{x}}_\sigma (0)). \end{aligned}$$

By the definition of indicator sequence $\mathbb {N}^j$ for ${\bar{\phi }}^j$, we can simplify (26) as follows:

$$\begin{aligned} \langle \mathrm{q}^j, \mathrm{p}\rangle _{\mathrm{L}^2} = 0, \quad \forall \mathrm{p}\in \sqcup _{l\in \mathbb {N}^j}{\mathcal P}(l). \end{aligned}$$

Note that the above expression holds for all $j = 1,\ldots , {\bar{l}}$. It now suffices to show that the $\mathbb {R}$-span of $\sqcup _{l\in \mathbb {N}^j}{\mathcal P}(l)$ is dense in $\mathrm{L}^2(\Sigma )$. This, again, follows from Proposition 1; indeed, since $\mathbb {N}^j$ contains an infinite arithmetic sequence, it follows by the same arguments (for $\mathbb {N}_i$) that the $\mathbb {R}$-span of $\sqcup _{l\in \mathbb {N}^j}{\mathcal P}(l)$ is dense in $\mathrm{C}^0(\Sigma )$. Because $\Sigma $ is compact, $\mathrm{C}^0(\Sigma )$ is dense in $\mathrm{L}^2(\Sigma )$. This completes the proof. $\square $

4 Existence of distinguished ensemble systems

We have shown in the previous section that (weakly) jointly distinguished vector fields $\{f_i\}^m_{i = 1}$ and functions $\{\phi ^j\}^l_{j = 1}$ are key ingredients for an ensemble system to be approximately ensemble path-controllable and (weakly) ensemble observable. We address in the section the issue about the existence of these finely structured vector fields and functions for a given manifold M. Among other things, we provide an affirmative answer for the case where M is a connected, semi-simple Lie group:

Theorem 3

For any connected semi-simple Lie group G, there exist weakly jointly distinguished vector fields $\{f_i\}^m_{i = 1}$ and functions $\{\phi ^j\}^l_{j = 1}$ on G. Moreover, if G has a trivial center, then $\{f_i\}^m_{i = 1}$ and $\{\phi ^j\}^l_{j = 1}$ are jointly distinguished.

4.1 Distinguished sets of semi-simple real Lie algebras

Let G be a semi-simple Lie group and $\mathfrak {g}$ be its Lie algebra. We address in the subsection the existence of distinguished vector fields over G. These vector fields will be certain left- (or right-) invariant vector fields. We can thus address the existence issue on the Lie algebra level. To proceed, we first have the following definition [6]:

Definition 12

Let $\mathfrak {g}$ be a semi-simple real Lie algebra. A spanning set $\{{X}_i\}^m_{i = 1}$ of $\mathfrak {g}$ is distinguished if for any ${X}_i$ and ${X}_j$, there exist an ${X}_k$ and a real number $\lambda $ such that

$$\begin{aligned}{}[X_i, X_j] = \lambda {X}_k. \end{aligned}$$

(27)

Conversely, for any ${X}_k$, there exist ${X}_i$, ${X}_j$, and a nonzero $\lambda $ such that (27) holds.

Note that the cardinality of a distinguished set $\{X_i\}^m_{i = 1}$ is, in general, greater than the dimension of $\mathfrak {g}$, i.e., the spanning set $\{X_i\}^m_{i =1}$ may contain a basis of $\mathfrak {g}$ as its proper subset. We have established in [6] the following result:

Proposition 2

Every semi-simple real Lie algebra admits a distinguished set.

The proposition then implies that every semi-simple Lie group admits a set of distinguished left- (or right-) invariant vector fields. Since the proposition will be of great use in the paper, we outline below a constructive approach for generating a desired distinguished set. A complete proof can be found in [6]. The proof leverages the structure theory of semi-simple real Lie algebras. A reader not interested in the proof can skip the remainder of the subsection.

Sketch of proof

Recall that ${\text {ad}}(X)(\cdot ) := [X,\cdot ]$ is the adjoint representation. Denote by $B(X, Y) := {\text {tr}}({\text {ad}}_X{\text {ad}}_Y)$ the Killing form. Let $\mathfrak {h}$ be a Cartan subalgebra of $\mathfrak {g}$, and $\mathfrak {g}^\mathbb {C}$ (resp. $\mathfrak {h}^\mathbb {C}$) be the complexification of $\mathfrak {g}$ (resp. $\mathfrak {h}$). We let $\Delta $ be the set of roots. For each $\alpha \in \Delta $, we let $h_\alpha \in \mathfrak {h}^\mathbb {C}$ be such that $\alpha (H) = B(h_\alpha , H)$ for all $H\in \mathfrak {h}^\mathbb {C}$. Denote by $\langle \alpha , \beta \rangle := B(h_\alpha , h_\beta )$, which is an inner product defined over the $\mathbb {R}$-span of $\Delta $. We denote by $|\alpha |:= \sqrt{\langle \alpha , \alpha \rangle }$ the length of $\alpha $. Let $H_\alpha := \nicefrac {2h_\alpha }{ |\alpha |^2 }$. For a root $\alpha \in \Delta $, let $\mathfrak {g}_\alpha $ be the corresponding root space (as a one-dimensional subspace of $\mathfrak {g}^\mathbb {C}$ over $\mathbb {C}$). $\square $

Suppose, for the moment, that one aims to obtain a distinguished set for the semi-simple complex Lie algebra $\mathfrak {g}^\mathbb {C}$; then, with slight modification, such a set can be obtained via the Chevalley basis [21, Chapter VII], which we recall below:

Lemma 6

There are ${X}_\alpha \in \mathfrak {g}^\mathbb {C}_\alpha $, for $\alpha \in \Delta $, such that the following hold:

(1)
For any $\alpha \in \Delta $, we have $[{X}_\alpha , {X}_{-\alpha }] = H_\alpha $.
(2)
For any two non-proportional roots $\alpha , \beta $, we let $\beta + n\alpha $, with $-q \le n \le p$, be the $\alpha $-string that contains $\beta $. Then,
$$\begin{aligned} {[}{X}_\alpha , {X}_\beta ] = \left\{ \begin{array}{ll} c_{\alpha , \beta } {X}_{\alpha + \beta }, &{} \text{ if } \alpha + \beta \in \Delta , \\ 0, &{} \text{ otherwise }, \end{array} \right. \end{aligned}$$
where $c_{\alpha , \beta }\in \mathbb {Z}$ with $c^2_{\alpha ,\beta } = (q + 1)^2$.

We also note that for any $\alpha , \beta \in \Delta $, $[H_\alpha , {X}_\beta ] = \nicefrac {2\langle \alpha , \beta \rangle }{|\alpha |^2} {X}_\beta $ and, moreover, $\nicefrac {2\langle \alpha , \beta \rangle }{|\alpha |^2}\in \mathbb {Z}$. It thus follows from Lemma 6 that

$$\begin{aligned} A:= \{ H_\alpha , {X}_\alpha , {X}_{-\alpha } \mid \alpha \in \Delta \} \end{aligned}$$

is a distinguished set of $\mathfrak {g}^\mathbb {C}$. The above arguments have the following implications:

(1):: A semi-simple complex Lie algebra can also be viewed as a Lie algebra over $\mathbb {R}$. We call any such real Lie algebra complex [22, Chapter VI]. In particular, if the real Lie algebra $\mathfrak {g}$ is complex, then the $\mathbb {R}$-span of $A \cup \mathrm {i}A$, with A defined above, is $\mathfrak {g}$. Moreover, since the coefficients $\nicefrac {2\langle \alpha , \beta \rangle }{|\alpha |^2}$ and $c_{\alpha , \beta }$ are all integers (and hence real), the set $A\cup \mathrm {i} A$ is a distinguished set of $\mathfrak {g}$.
(2):: If the Lie algebra $\mathfrak {g}$ is obtained as the $\mathbb {R}$-span of A (i.e., $\mathfrak {g}$ is a split real form of $\mathfrak {g}^\mathbb {C}$), then A is a distinguished set of $\mathfrak {g}$.

Thus, the technical difficulty for establishing Proposition 2 lies in the case where $\mathfrak {g}$ is neither complex nor a split real form of $\mathfrak {g}^\mathbb {C}$. We have dealt with such a case in [6]. We reproduce below a key result established in that paper.

First, recall that a Cartan involution $\theta : \mathfrak {g}\rightarrow \mathfrak {g}$ is a Lie algebra automorphism, with $\theta ^2 = {\text {id}}$. Moreover, the symmetric bilinear form $B_\theta $, defined as

$$\begin{aligned} B_\theta (X, Y):= -B(X, \theta Y), \end{aligned}$$

is positive definite on $\mathfrak {g}$. One can extend $\theta $ to $\mathfrak {g}^\mathbb {C}$ by $\theta (X + \mathrm {i} Y) = \theta X + \mathrm {i}\theta Y$.

Next, for a subset $S\subset \mathfrak {g}$, we let ${\mathcal L}_S$ be the collection of Lie products generated by S. Similarly, we say that ${\mathcal L}_S$ is projectively finite if there exists a finite subset ${\bar{S}}$ of $\mathfrak {g}$ such that ${\mathcal L}_S \equiv {\bar{S}}$. Further, we say that the set S is pre-distinguished if ${\bar{S}}$ is a distinguished set of $\mathfrak {g}$ (compared with Definition 8). We now have the following fact:

Proposition 3

Let $\mathfrak {g}$ be a simple real Lie algebra, which is neither complex nor a split real form of $\mathfrak {g}^{\mathbb {C}}$. Then, there exist a Cartan involution $\theta $ and elements ${X}_\alpha \in \mathfrak {g}^{\mathbb {C}}_\alpha $, for $\alpha \in \Delta $, such that the items of Lemma 6 are satisfied and the following set belongs to $\mathfrak {g}$:

$$\begin{aligned} S:=\left\{ Y_\alpha :={X}_{\alpha } - \theta {X}_{-\alpha }, \quad Z_\alpha : =\mathrm{i} ({X}_\alpha + \theta {X}_{- \alpha }) \mid \alpha \in \Delta \right\} . \end{aligned}$$

Furthermore, the following hold:

(1)
If the underlying root system of $\mathfrak {g}$ is not $G_2$, then the set S is pre-distinguished.
(2)
If the underlying root system of $\mathfrak {g}$ is $G_2$, then $\mathfrak {g}$ is the compact real form of $\mathfrak {g}^{\mathbb {C}}$. Decompose $\Delta = \Delta _\mathrm{short} \cup \Delta _\mathrm{long}$ where $\Delta _\mathrm{short}$ (resp. $\Delta _\mathrm{long}$) is comprised of short (resp. long) roots. Then, the following set is pre-distinguished:
$$\begin{aligned} \bigcup _{\gamma \in \Delta _\mathrm{long}} \{Y_\gamma , Z_\gamma \} \cup \bigcup _{\tiny \begin{array}{l} \alpha , \beta \in \Delta _\mathrm{short} \\ \text{ and } \alpha \ne \pm \beta \end{array} }\left\{ [Y_\alpha , Y_\beta ], \, [Y_\alpha , Z_\beta ], \, [Z_\alpha , Y_\beta ], \, [Z_\alpha , Z_\beta ]\right\} . \end{aligned}$$

We refer the reader to [6] for a complete proof. It follows from Proposition 3 that every semi-simple real Lie algebra admits a distinguished set. This establishes Proposition 2.

4.2 Matrix coefficients as codistinguished functions

Let $\{X_i\}^m_{i = 1}$ be a distinguished set of $\mathfrak {g}$. We address in the subsection the existence of (weakly) codistinguished functions on G to the set of left- (resp., right-) invariant vector fields $\{L_{X_i}\}^m_{i = 1}$ (resp. $\{R_{X_i}\}^m_{i = 1}$). Because of the symmetry, the focus will be mostly on the functions codistinguished to the left-invariant vector fields. We provide a remark at the end of the subsection to address the existence of codistinguished functions to the right-invariant vector fields.

To proceed, we first recall that the so-called right-regular representation of G on $\mathrm{C}^\omega (G)$, denoted by $r: G\times \mathrm{C}^\omega (G)\rightarrow \mathrm{C}^\omega (G)$, is defined by

$$\begin{aligned} (x, \phi ) \in G\times \mathrm{C}^\omega (G) \mapsto (r(x)\phi )(g) := \phi (gx). \end{aligned}$$

Correspondingly, the induced Lie algebra representation $r_*$ is the Lie derivative along a left-invariant vector field, i.e., $ r_*(X)\phi = L_X \phi $. Note, in particular, that if $\Phi = \{\phi ^j\}^l_{j = 1}$ is codistinguished to $\{L_{X_i}\}^m_{i = 1}$, then $r_*|_{\mathbb {L}_\Phi }$ is a finite-dimensional representation of $\mathfrak {g}$ on $\mathbb {L}_\Phi $; indeed, we have that

$$\begin{aligned} r_*([X_i, X_{i'}]) \phi ^j= & {} r_*(X_{i})r_*(X_{i'})\phi ^j - r_*(X_{i'})r_*(X_i)\phi ^j \\= & {} L_{X_{i}} L_{X_{i'}}\phi ^j - L_{X_{i'}} L_{X_i} \phi ^j = L_{[X_i,X_{i'}]}\phi ^j \in \mathbb {L}_\Phi . \end{aligned}$$

Thus, in order to find a set of codistinguished functions to $\{L_{X_i}\}^m_{i = 1}$, our strategy is comprised of two steps as outlined below:

(1):: Construct a finite-dimensional subspace $\mathbb {L}$ of $\mathrm{C}^\omega (G)$ such that it is closed under r so that $r_* |_{\mathbb {L}}$ will be a Lie algebra representation of $\mathfrak {g}$ on $\mathbb {L}$;
(2):: Find a finite subset $\Phi = \{\phi ^j\}^l_{j = 1}$ out of the space $\mathbb {L}$ such that it is codistinguished to a certain set of left-invariant vector fields $\{L_{X_i}\}^m_{i = 1}$.

We now address, one by one, the above two steps.

Our approach for the first step about constructing a finite-dimensional subspace $\mathbb {L}$ of $\mathrm{C}^\omega (G)$ is to use matrix coefficients associated with a Lie group representation. Specifically, we consider an arbitrary analytic representation $\pi $ of G on a finite-dimensional inner-product space $(V, \langle \cdot , \cdot \rangle )$. Let $\{v_i\}^p_{i = 1}$ be any spanning subset of V. We next define a set of matrix coefficients as follows:

$$\begin{aligned} \pi ^{ij}(g):= \langle v_i, \pi (g) v_j\rangle \in \mathrm{C}^\omega (G), \quad 1\le i, j\le p. \end{aligned}$$

(28)

Then, we let $\mathbb {L}_\pi $ be a finite-dimensional subspace of $\mathrm{C}^\omega (G)$ spanned by $\pi ^{ij}$:

$$\begin{aligned} \mathbb {L}_\pi := \left\{ \sum ^p_{i,j = 1} c_{ij} \pi ^{ij} \mid c_{ij} \in \mathbb {R}\right\} . \end{aligned}$$

The following fact is certainly known in the literature. But, for completeness of presentation, we provide a proof after the statement:

Lemma 7

The vector space $\mathbb {L}_\pi $ is closed under r(x) for all $x\in G$, i.e., for any $\phi \in \mathbb {L}_\pi $, $r(x) \phi \in \mathbb {L}_\pi $. Thus, $r|_{\mathbb {L}_\pi }$ (resp. $r_*|_{\mathbb {L}_\pi }$) is a representation of G (resp. $\mathfrak {g}$) on $\mathbb {L}_\pi $.

Proof

The lemma follows directly from computation. For any $x\in G$ and any $g\in G$,

$$\begin{aligned} (r(x)\pi ^{ij})(g) = \pi ^{ij}(gx) = \langle v_i, \pi (gx) v_j \rangle = \langle v_i, \pi (g)\pi (x) v_j \rangle . \end{aligned}$$

Since $\{v_1,\ldots , v_p\}$ spans V, there exist real coefficients $c_{lk}$’s such that

$$\begin{aligned} \pi (x) v_j = \sum ^p_{l,k= 1} c_{lk}\langle v_l, \pi (x)v_j \rangle v_k = \sum ^p_{l,k= 1} c_{lk} \pi ^{lj}(x) v_k. \end{aligned}$$

It then follows that

$$\begin{aligned} (r(x)\pi ^{ij})(g) = \sum ^p_{l,k =1}\left( c_{lk} \pi ^{lj}(x) \right) \pi ^{ik}(g), \end{aligned}$$

which implies that $r(x)\pi ^{ij}$ is a linear combination of $\pi ^{ik}$ for $k = 1,\ldots , p$. $\square $

We now address the second step of our strategy about finding a finite subset $\{\phi ^j\}^l_{j = 1}$ out of $\mathbb {L}_\pi $ so that it is codistinguished to a given set of left-invariant vector fields $\{L_{X_i}\}^m_{i = 1}$. To proceed, we first have the following definition as a dual to Definition 12:

Definition 13

Let $\pi $ be a finite-dimensional representation of G on V, and $\pi _*$ be the corresponding Lie algebra representation. A spanning set $\{v_j\}^p_{j = 1}$ of V is codistinguished to a subset $\{X_i\}^m_{i =1}$ of $\mathfrak {g}$ if it satisfies the following properties:

(1):

The set of one-forms $\{\mathrm{d}\pi ^{ij}_e\}^p_{i,j = 1}$ spans $T^*_eG \approx \mathfrak {g}^*$.

(2):

For any $X_i$ and $v_j$, there exist a $v_k$ and a real number $\lambda $ such that

$$\begin{aligned} \pi _*(X_i) v_j = \lambda v_k; \end{aligned}$$

(29)

conversely, for any $v_k$, there exist $X_i$, $v_j$, and a nonzero $\lambda $ such that (29) holds.

(3):

For any $g, g'\in G$, if $\pi ^{ij}(g) = \pi ^{ij}(g')$ for all $1\le i, j \le p$, then $g = g'$.

If only (1) and (2) hold, then $\{v_j\}^p_{j =1}$ is weakly codistinguished to $\{X_i\}^m_{i = 1}$.

With the above definition, we now have the following fact:

Lemma 8

If $\{v_j\}^p_{j = 1}$ is codistinguished to $\{X_i\}^m_{i = 1}$, then the set of matrix coefficients $\{\pi ^{ij}\}^p_{i,j = 1}$ is codistinguished to the set of left-invariant vector fields $\{L_{X_i}\}^m_{i = 1}$.

Proof

We show below that if $\{v_j\}^p_{j = 1}$ is codistinguished to $\{X_i\}^m_{i = 1}$, then the three items of Definition 3 are satisfied.

(1):

For the first item of Definition 3, we show that for any $g\in G$, the one-forms $\{\mathrm{d}\pi ^{ij}_g\}^p_{i, j = 1}$ span $T^*_g G$. With slight abuse of notation, we write

$$\begin{aligned} \mathrm{d}\pi ^{ij}_g(X):= \mathrm{d}\pi ^{ij}_g(gX) = \langle v_i, \pi (g) \pi _*(X) v_j\rangle , \quad \forall X\in \mathfrak {g}. \end{aligned}$$

In this way, each one-forms $\mathrm{d}\pi ^{ij}_g$ can be viewed as an element in $\mathfrak {g}^*$. But then, the two subspaces of $\mathfrak {g}^*$: ${\text {span}}\{\mathrm{d}\pi ^{ij}_e\}^p_{i, j = 1}$ and ${\text {span}}\{\mathrm{d}\pi ^{ij}_g\}^p_{i, j = 1}$ are isomorphic:

The first item of Definition 3 then follows from the first item of Definition 13.

(2):

For the second item of Definition 3, it suffices to show that if $\pi _*(X_i) v_j = \lambda v_k$, then $ L_{X_i}\pi ^{qj} = \lambda \pi ^{qk} $ for any $q = 1,\ldots , p$. This holds because

$$\begin{aligned} (L_{X_i}\pi ^{qj} )(g) = \langle v_q, \pi (g)\pi _*(X_i) v_j \rangle = \lambda \langle v_q, \pi (g) v_k\rangle = \lambda \pi ^{qk}(g). \end{aligned}$$

(3):

The third item of Definition 3 directly follows from the third item of Definition 13.

$\square $

We have so far provided an approach for generating a set of matrix coefficients that is (weakly) codistinguished to a given set of left-invariant vector fields. The same approach can be slightly modified to generate a set of functions codistinguished to a set of right-invariant vector fields. We provide details in the following remark:

Remark 5

We first recall that the left-regular representation of G is given by

$$\begin{aligned} (x, \phi ) \in G\times \mathrm{C}^\omega (G) \mapsto (l(x)\phi )(g) := \phi (x^{-1}g), \end{aligned}$$

The corresponding Lie algebra representation is given by $l_*(X)\phi = -R_X\phi $. We again let $\pi $ be a representation of G on a finite-dimensional inner-product space $(V, \langle \cdot , \cdot \rangle )$, and $\{v_i\}^p_{i = 1}$ be a spanning set of V. We next define functions on G as follows:

$$\begin{aligned} \tilde{\pi }^{ij}(g):= \langle v_i, \pi (g^{-1}) v_j\rangle , \quad \forall 1\le i, j \le p. \end{aligned}$$

(30)

Let $ \mathbb {L}_{\tilde{\pi }}$ be the $\mathbb {R}$-span of these $\tilde{\pi }^{ij}$. The same arguments in the proof of Lemma 7 can be used here to show that $\mathbb {L}_{\tilde{\pi }}$ is closed under l(x) for all $x\in G$. Furthermore, if the set $\{v_j\}^p_{j = 1}$ is chosen to be codistinguished to $\{X_i\}^m_{i = 1}$, then similar arguments in the proof of Lemma 8 can be used to show that the set of functions $\{\tilde{\pi }^{ij}\}^p_{i,j = 1}$ is codistinguished to the set of right-invariant vector fields $\{R_{X_i}\}^m_{i = 1}$. $\square $

In summary, we have shown in the subsection that a finite-dimensional representation $\pi $ of G on an inner-product space V can be used to generate a set of matrix coefficients codistinguished to a given set of left- (or right-) invariant vector fields provided that the assumption of Lemma 8 is satisfied.

4.3 On the adjoint representation

We follow the discussions in the previous subsection, and consider here the adjoint representation of G on $\mathfrak {g}$, i.e., $\pi = {\text {Ad}}$ and $V = \mathfrak {g}$. We show that in this special case, there indeed exists a set of matrix coefficients (weakly) codistinguished to a distinguished set of left- (or right-) invariant vector fields.

To proceed, we first recall that $B(X, Y) = {\text {tr}}({\text {ad}}_X{\text {ad}}_Y)$ is the Killing form, $\theta $ is a Cartan involution of $\mathfrak {g}$, and $B_\theta (X, Y) = -B(X, \theta Y)$ is an inner product on $\mathfrak {g}$. We also recall that by Proposition 2, there exists a distinguished set $\{X_i\}^m_{i = 1}$ out of $\mathfrak {g}$. We fix such a set in the sequel. Note, in particular, that by Definition 12, the distinguished set $\{X_i\}^m_{i = 1}$ spans $\mathfrak {g}$. Now, we follow the two-step strategy proposed in the previous section and define a set of matrix coefficients $\{\phi ^{ij}\}^m_{i,j = 1}$ as follows:

$$\begin{aligned} \phi ^{ij} (g):= {\text {Ad}}^{ij}(g) = B_\theta ({\text {Ad}}(g)X_j, X_i ), \quad 1\le i, j \le m. \end{aligned}$$

(31)

This is nothing but specializing (28) to the case of adjoint representation. To further illustrate (31), we take advantage of the following fact [22, Prop. 6.28]:

Lemma 9

Every semi-simple real Lie algebra $\mathfrak {g}$ is isomorphic to a Lie algebra of real matrices that is closed under transpose, with the Cartan involution $\theta $ carried to negative transpose, i.e., $\theta X = - X^\top $ for all $X\in \mathfrak {g}$.

We note that for a given semi-simple Lie algebra $\mathfrak {g}$ of real matrices, the Killing form B(X, Y) is linearly proportional to ${\text {tr}}(XY)$, i.e., $B(X, Y) = c {\text {tr}}(X Y)$ for a real positive constant c. Now, suppose that G is isomorphic to a matrix Lie group; then, it follows from Lemma 9 that one can rewrite (31) as follows:

$$\begin{aligned} \phi ^{ij} (g) = c{\text {tr}}(g X_j g^{-1} X_i^\top ). \end{aligned}$$

(32)

In particular, it generalizes the functions $\{\phi ^{ij}\}_{1\le i, j \le 3}$ on ${\text {SO}}(3)$ introduced in Example 1 to functions on an arbitrary semi-simple matrix Lie group. However, we shall note that not every semi-simple Lie group is isomorphic to a matrix Lie group. Nevertheless, expression (31) is always valid.

Recall that a center Z(G) of a group G is defined such that if $z\in G$, then z commutes with every group element g of G, i.e.,

$$\begin{aligned} Z(G):= \{z\in G \mid zg = gz, \quad \forall g\in G\}. \end{aligned}$$

Let $\phi :=(\ldots ,\phi ^{ij},\ldots )$ be the collective of $\phi ^{ij}$. For any group element $g\in G$, we let $[g]_\phi $ be the pre-image of $\phi (g)$. We now have the following result:

Theorem 4

Let $\{X_i\}^m_{i = 1}$ be a distinguished set of $\mathfrak {g}$. Then, the set of matrix coefficients $\{\phi ^{ij}\}^m_{i,j = 1}$ defined in (31) is weakly codistinguished to $\{L_{X_i}\}^m_{i = 1}$. Moreover,

$$\begin{aligned} {[}g]_\phi =\{gz \mid z\in Z(G)\}, \quad \forall g\in G. \end{aligned}$$

(33)

In particular, $\{\phi ^{ij}\}^m_{i,j = 1}$ is codistinguished to $\{L_{X_i}\}^m_{i = 1}$ if and only if Z(G) is trivial.

Theorem 3 then follows from Proposition 2 and Theorem 4. We establish Theorem 4 in the next subsection.

Remark 6

Note that if one aims to construct a set of functions codistinguished to the right-invariant vector fields $\{R_{X_i}\}^m_{i =1}$; then, by Remark 5, one can simply define functions as follows:

$$\begin{aligned} \tilde{\phi }^{ij}(g) := B_\theta ({\text {Ad}}(g^{-1})X_j, X_i), \quad \forall 1\le i, j \le m. \end{aligned}$$

(34)

If one replaces in the statement $L_{X_i}$ with $R_{X_i}$ and correspondingly, $\phi ^{ij}$ with $\tilde{\phi }^{ij}$, then Theorem 4 will still hold.

Since $\mathfrak {g}$ is semi-simple, the center Z(G) is discrete. If, further, G is compact, then Z(G) is finite. The centers of a few commonly seen matrix Lie groups are given below:

(1):: If $G = {\text {SU}}(n)$ is the special unitary group, then $Z(G) = \left\{ zI \mid z^n = 1, z\in \mathbb {C}\right\} $.
(2):: If $G = {\text {SL}}(n, \mathbb {R})$ is the special linear group or if $G = {\text {SO}}(n)$ is the special orthogonal group, then
$$\begin{aligned} Z(G) = \left\{ \begin{array}{ll} \{I\} &{} \text{ if } n \text{ is } \text{ odd }, \\ \{\pm I\} &{} \text{ if } n \text{ is } \text{ even }. \end{array} \right. \end{aligned}$$
(3):: Similarly, if $G = {\text {SO}}^+(p,q)$ is the identity component of indefinite orthogonal group ${\text {O}}(p,q)$ (e.g., the Lorentz group ${\text {O}}(1,3)$), then
$$\begin{aligned} Z(G) = \left\{ \begin{array}{ll} \{I\} &{} \quad \text{ if } p + q \text{ is } \text{ odd }, \\ \{\pm I\} &{} \quad \text{ if } p + q \text{ is } \text{ even }. \end{array} \right. \end{aligned}$$
(4):: If $G = {\text {Sp}}(2n, \mathbb {R})$ is the symplectic group, then $Z(G) = \{\pm I_{2n}\}$.

4.4 Analysis and proof of Theorem 4

We establish in subsection Theorem 4. By Lemma 8, it suffices to show that the subset $\{X_i\}^m_{i = 1}$ of $\mathfrak {g}$ is codistinguished to itself with respect to the adjoint representation. This fact will be established after a sequence of lemmas. For convenience, we reproduce below the set of functions $\{\phi ^{ij}\}^m_{i,j = 1}$:

$$\begin{aligned} \phi ^{ij} (g):= {\text {Ad}}^{ij}(g) = B_\theta ({\text {Ad}}(g)X_j, X_i ), \quad 1\le i, j \le m. \end{aligned}$$

We show below that the set $\{\phi ^{ij}\}^m_{i,j = 1}$ satisfies the three items of Definition 13 under the assumption of Theorem 4. The arguments we will use below generalize the ones used in Example 1. For the first item of Definition 13, we have the following fact:

Lemma 10

The set of one-forms $\{\mathrm{d}\phi ^{ij}_e\}^m_{i, j = 1}$ spans the cotangent space $T^*_eG \approx \mathfrak {g}^*$.

Proof

First, note that for any $X\in \mathfrak {g}$, we have

$$\begin{aligned} \mathrm{d}\phi ^{ij}_e (X)= B_\theta ([X,X_j], X_i) = - B([X,X_j], \theta X_i). \end{aligned}$$

Because the Killing form is adjoint-invariant, i.e., $ B([X, Y], Z) = B(X, [Y, Z]) $ for any $X, Y, Z\in \mathfrak {g}$, it follows that

$$\begin{aligned} \mathrm{d}\phi ^{ij}_e (X) = -B([X,X_j], \theta X_i) = - B(X, [ X_j, \theta X_i]) = B_\theta (X, [\theta X_j, X_i]), \end{aligned}$$

where the last equality holds because $\theta $ is a Lie algebra automorphism with $\theta ^2 = \mathrm{id}$ and, hence, $\theta [ X_j, \theta X_i] = [\theta X_j, X_i]$.

For convenience, let $Y_j := \theta X_j$ for all $j = 1,\ldots , m$. Since $\theta $ is a Lie algebra automorphism and $\{X_i\}^m_{i = 1}$ spans $\mathfrak {g}$, the set $\{Y_j\}^m_{j = 1}$ spans $\mathfrak {g}$ as well. Next, note that $\mathfrak {g}$ is semi-simple and, hence, $[\mathfrak {g}, \mathfrak {g}] = \mathfrak {g}$. Thus, $\{{\hat{X}}_{ij}:=[Y_j,X_i]\}^m_{i, j = 1}$ is a spanning set of $\mathfrak {g}$. It now remains to show that the set of one-forms $\{B_\theta (\cdot , {\hat{X}}_{ij})\}^m_{i, j = 1}$ spans $\mathfrak {g}^*$. But, this follows from the fact that $B_\theta $ is positive definite on $\mathfrak {g}$; indeed, any nondegenerate bilinear form induces a linear isomorphism between $\mathfrak {g}$ and $\mathfrak {g}^*$. Since the set $\{{\hat{X}}_{ij}\}^m_{i,j = 1}$ spans $\mathfrak {g}$, the set of one-forms $\{B_\theta (\cdot , {\hat{X}}_{ij})\}^m_{i, j = 1}$ spans $\mathfrak {g}^*$. $\square $

For the second item of Definition 13, we have the following fact:

Lemma 11

If $[X_i, X_j] = \lambda X_k$, then $L_{X_i} \phi ^{i'j} = \lambda \phi ^{i'k}$ for all $i' =1,\ldots , m$.

Proof

The lemma directly follows from computation:

$$\begin{aligned} (L_{X_i} \phi ^{i'j})(g) = B_\theta ({\text {Ad}}(g)[X_i, X_j], X_{i'}) = \lambda B_\theta ({\text {Ad}}(g)X_k, X_{i'}) = \lambda \phi ^{i'k}(g), \end{aligned}$$

which holds for any $g\in G$. $\square $

Combining Lemmas 10 and 11, we have that the set of functions $\{\phi ^{ij}\}^m_{i,j = 1}$ is weakly codistinguished to the set of left-invariant vector fields $\{L_{X_i}\}^m_{i = 1}$. Finally, for (33), we have the following fact:

Lemma 12

If $\phi ^{ij}(g) = \phi ^{ij}(g')$ for all $1\le i,j \le m$, then $g^{-1}g' \in Z(G)$ and vice versa.

Proof

We fix a $j = 1,\ldots , m$ and have the following:

$$\begin{aligned} \phi ^{ij}(g) - \phi ^{ij}(g') = B_\theta ({\text {Ad}}(g) X_j - {\text {Ad}}(g') X_j, X_i) = 0, \quad \forall i = 1,\ldots , m. \end{aligned}$$

Since $B_\theta $ is positive definite and $\{X_i\}^m_{i = 1}$ spans $\mathfrak {g}$, ${\text {Ad}}(g) X_j = {\text {Ad}}(g') X_j$. This holds for all $j = 1,\ldots , m$. Using again the fact that $\{X_j\}^m_{j = 1}$ spans $\mathfrak {g}$, we obtain that ${\text {Ad}}(g^{-1}g') X = X$ for all $X\in \mathfrak {g}$. Thus, $g^{-1}g'$ belongs to the centralizer of the identity component of G. Since G is connected, this holds if and only if $g^{-1}g'\in Z(G)$. $\square $

4.5 On homogeneous spaces

Let a group G act on a manifold M. We say that the group action is transitive if for any $x, y\in M$, there exists a group element $g\in G$ such that $g x = y$. Correspondingly, the manifold M is said to be a homogeneous space of G. Note that any homogeneous space can be identified with the space G / H of left cosets gH for H a closed Lie subgroup of G. More specifically, we pick an arbitrary point $x\in M$, and let H be the subgroup of G which leaves x fixed (i.e., H is the stabilizer of x). Then, M is diffeomorphic to G / H, and we write $M \approx G/H$. The group action can thus be viewed as a map by sending a pair $(g, g'H)$ to $gg'H$. We also note that the homogeneous space M can be equipped with a unique analytic structure (see [18, Thm. 4.2]).

We address in the subsection the existence of distinguished vector fields and codistinguished functions on homogeneous spaces of a semi-simple Lie group. We provide at the end of the subsection a simple example in which the unit sphere $S^2 \approx {\text {SO}}(3)/{\text {SO}}(2)$ is considered.

4.5.1 On distinguished vector fields

There is a canonical way of translating a distinguished set $\{X_i\}^m_{i = 1}$ of the Lie algebra $\mathfrak {g}$ to a distinguished set of vector fields over a homogeneous space of G. Precisely, we define a map $\tau : \mathfrak {g}\rightarrow \mathfrak {X}(M)$ as follows: Let $\exp : \mathfrak {g} \rightarrow G$ be the exponential map. For a given $X\in \mathfrak {g}$, we define a vector field $\tau (X)\in \mathfrak {X}(M)$ such that for any $\phi \in \mathrm{C}^\omega (M)$, the following hold:

$$\begin{aligned} (\tau (X) \phi )(x) := \lim _{t\rightarrow 0} \frac{\phi (\exp (tX) x) - \phi (x)}{t}, \quad \forall x\in M. \end{aligned}$$

(35)

Let $X_i$ and $X_j$ be any two elements in $\mathfrak {g}$. It is known [18, Chapter 2.3]) that

$$\begin{aligned}{}[\tau (X_i), \tau (X_j)] = -\tau ([X_i, X_j]), \end{aligned}$$

(36)

which then leads to the following result:

Proposition 4

Let G be a semi-simple Lie group with $\mathfrak {g}$ the Lie algebra, and M be a homogeneous space of G. If $\{X_i\}^m_{i = 1}$ is a distinguished set of $\mathfrak {g}$, then $\{\tau (X_i)\}^m_{i = 1}$ is a distinguished set of vector fields over M.

Proof

It suffices to show that $\{\tau (X_i)(x)\}^m_{i = 1}$ spans the tangent space $T_x M$ for all $x\in M$. Let H be the stabilizer of x, and $\mathfrak {h}$ be the corresponding Lie algebra of H. Since $\{X_i\}^m_{i = 1}$ spans $\mathfrak {g}$, there must exist a subset of $\{X_i\}^m_{i = 1}$, say $\{X_i\}^{m'}_{i = 1}$, such that if we let $\mathfrak {m}:= {\text {span}}\{X_i\}^{m'}_{i = 1}$, then $\mathfrak {g}= \mathfrak {m} \oplus \mathfrak {h}$. Moreover, the following map:

$$\begin{aligned} (a_1, \ldots , a_{m'}) \in \mathbb {R}^{m'} \mapsto \exp \left( \sum ^{m'}_{i = 1} a_i X_i \right) x \in M \end{aligned}$$

is locally a diffeomorphism around $0\in \mathbb {R}^{m'}$ to an open neighborhood of $x\in M$. This, in particular, implies that $\{\tau (X_i)(x)\}^{m'}_{i = 1}$ is a basis of the tangent space $T_x M$. $\square $

4.5.2 On codistinguished functions

We now discuss how to translate a set of codistinguished functions defined on a Lie group G to a set of codistinguished functions on its homogeneous space $M\approx G/H$. We consider below for the case where the closed subgroup H is compact.

We say that a function $\phi \in \mathrm{C}^\omega (G)$ is H-invariant if for any $g\in G$ and $h\in H$, we have $\phi (gh) = \phi (g)$. In particular, if $\phi $ is H-invariant, then one can simply define a function $ \psi $ on M by $ \psi (g H) := \phi (g)$. This is well defined because if $gH =g'H$, then $g^{-1}g'$ belongs to H and, hence, $\phi (g) = \phi (g g^{-1}g') = \phi (g')$. Thus, without any ambiguity, we can treat an H-invariant $\phi $ as a function defined on M as well.

If a function $\phi $ is not H-invariant, then one can construct an H-invariant function by averaging $\phi $ over the subgroup H. Since H is compact, we equip H with the normalized Haar measure [22, Ch. VIII], i.e., $\int _H \mathbf{1}_H dh = 1$. We then define a function on G (and on M) by averaging the given function $\phi $ over H as follows:

$$\begin{aligned} {\bar{\phi }}(g) := \int _H \phi (g h) dh. \end{aligned}$$

(37)

It should be clear that ${\bar{\phi }}$ is H-invariant; indeed, for any $h'\in H$, we have

$$\begin{aligned} {\bar{\phi }}(gh') :=\int _H \phi (gh' h) dh = \int _H\phi (g h) d(h'^{-1} h) = \int _H\phi (g h) d h = {\bar{\phi }}(g). \end{aligned}$$

Note that if $\phi $ itself is H-invariant, then ${\bar{\phi }} = \phi $. We now have the following fact:

Lemma 13

Let $\{\phi ^j\}^l_{j = 1}$ be a set of functions on G codistinguished to a set of right-invariant vector fields $\{R_{X_i}\}^m_{i = 1}$. If $R_{X_i} \phi ^j =\lambda \phi ^k$, then $\tau (X_i) {\bar{\phi }}^j = \lambda \bar{\phi }^k$.

Proof

The lemma directly follows from computation:

$$\begin{aligned} (\tau (X_i) {\bar{\phi }}^j) (gH) = \int _H (R_{X_i}\phi ^j)(gh)d h = \lambda \int _H \phi ^k(gh) dh = \lambda {\bar{\phi }}^k(gH), \end{aligned}$$

which holds for all $gH \in M\approx G/H$. $\square $

Thus, if the set of one-forms $\{d{\bar{\phi }}^{j}_x\}^l_{j = 1}$ spans $T^*_xM$, then, by Lemma 13, $\{{\bar{\phi }}^{j}\}^l_{j = 1}$ is (weakly) codistinguished to $\{\tau (X_i)\}^m_{i = 1}$. We provide below an example for illustration.

4.5.3 Example on $S^2\approx {\text {SO}}(3)/{\text {SO}}(2)$

Let ${\text {SO}}(3)$ act $S^2$ by sending $(g, x) \in {\text {SO}}(3)\times S^2$ to $gx\in S^2$. Let H be a subgroup of G defined as follows:

$$\begin{aligned} H = \left\{ h(\theta ): = \begin{bmatrix} 1&0&0\\ 0&\cos (\theta )&\sin (\theta ) \\ 0&-\sin (\theta )&\cos (\theta ) \end{bmatrix} \mid \theta \in [0,2\pi )\right\} \approx {\text {SO}}(2). \end{aligned}$$

It follows that H is the stabilizer of the vector $e_1\in S^{2}$. Let $\{X_i\}^3_{i = 1}$ and $\{\phi ^{ij}\}^3_{i, j = 1}$ be given in Example 1, i.e.,

$$\begin{aligned} \left\{ \begin{array}{lr} X_i = e_je_k^\top - e_k e_j^\top , &{} \quad \text{ where } \det (e_i, e_j, e_k) = 1, \\ \phi ^{ij}(g) = {\text {tr}}(g X_j g^\top X_i^\top ), &{} \quad 1\le i, j \le 3. \end{array} \right. \end{aligned}$$

Because the set $\{X_i\}^3_{i = 1}$ is distinguished in $\mathfrak {so}(3)$, by Proposition 4, it induces a distinguished set of vector fields $\{\tau (X_i)\}^3_{i = 1}$ over $S^2$ as follows:

$$\begin{aligned} \tau (X_1)(x) = \begin{bmatrix} 0 \\ x_3 \\ - x_2 \end{bmatrix}, \quad \tau (X_2)(x) = \begin{bmatrix} -x_3 \\ 0 \\ x_1 \end{bmatrix}, \quad \tau (X_3)(x)= \begin{bmatrix} x_2 \\ - x_1 \\ 0 \end{bmatrix}. \end{aligned}$$

These vector fields satisfy the following relationship:

$$\begin{aligned} {[}\tau (X_i), \tau (X_j)] = \det (e_i,e_j,e_k)\tau (X_k). \end{aligned}$$

We next compute the averaged H-invariant functions $\{{\bar{\phi }}^{ij}\}^3_{ i, j = 1}$. The normalized Haar measure on H, in this case, is simply given by $dh =\nicefrac {\mathrm{d}\theta }{2\pi }$. It follows that

$$\begin{aligned} {\bar{\phi }}^{ij}(gH) = \frac{1}{2\pi }\int ^{2\pi }_0 {\text {tr}}(gh(\theta ) X_j h(\theta )^\top g^\top X_i^\top )\mathrm{d}\theta . \end{aligned}$$

To evaluate the above integral, we first have the following computational result:

$$\begin{aligned} \frac{1}{2\pi }\int ^{2\pi }_{0} h(\theta ) X_j h(\theta )^\top \mathrm{d}\theta = \left\{ \begin{array}{ll} X_1 &{} \text{ if } j = 1, \\ 0 &{} \text{ otherwise. } \end{array} \right. \end{aligned}$$

Thus, the nonzero ${\bar{\phi }}^{ij}$’s are given by

$$\begin{aligned} \bar{\phi }^{i}(gH):= \bar{\phi }^{i 1}(gH) = {\text {tr}}(gX_1g^\top X_i^\top ), \quad \forall i = 1, 2, 3. \end{aligned}$$

(38)

Each left coset gH corresponds to the point $x = g e_1\in S^2$. Note that $ge_1$ is simply the first column of g. We now compute each function ${\bar{\phi }}^i(x)$ and express the results using only the coordinates $x_i$ of x. First, by computation, we obtain

$$\begin{aligned} gX_1 g^\top = \begin{bmatrix} 0&c_{31}&c_{21} \\ -c_{31}&0&c_{11} \\ -c_{21}&-c_{11}&0 \end{bmatrix}, \end{aligned}$$

where each $c_{ij}$ is the ijth entry of the cofactor matrix $[c_{ij}]$ of $g\in {\text {SO}}(3)$. Since g is a special orthogonal matrix, $g = [c_{ij}]$. In particular, $(c_{11},c_{21}, c_{31})$ is the first column of g, i.e., $(c_{11},c_{21}, c_{31}) = ge_1 = (x_1,x_2, x_3)$ and, hence,

$$\begin{aligned} gX_1 g^\top = \begin{bmatrix} 0&x_3&x_2 \\ -x_3&0&x_1 \\ -x_2&-x_1&0 \end{bmatrix}. \end{aligned}$$

Thus, the functions ${\bar{\phi }}^i$ in (38), for $i = 1, 2,3$, are nothing but twice the coordinate functions, i.e.,

$$\begin{aligned} {\bar{\phi }}^i(x) = 2 x_i. \end{aligned}$$

It should be clear that $\{\bar{\phi }^j\}^3_{j = 1}$ satisfies items 1 and 3 of Definition 3. For item 2, we have that $\tau (X_i){\bar{\phi }}^j = \det (e_i,e_j,e_k) {\bar{\phi }}^k$. Thus, $\{{\bar{\phi }}^j\}^3_{j = 1}$ is codistinguished to $\{\tau (X_i)\}^3_{i = 1}$.

5 Conclusions

We introduced in the paper a novel class of ensemble systems, which we call distinguished ensemble systems. Every such system is comprised of two key components: A set of distinguished vector fields and a set of (weakly) codistinguished functions. We established in Sect. 3 that a distinguished ensemble system is approximately ensemble path controllability and (weakly) ensemble observable. We further extended in Sect. 3.5 the result to a pre-distinguished ensemble system.

We proposed and addressed in Sect. 4 the problem about existence of distinguished vector fields and codistinguished functions on a given manifold M. We provided an affirmative answer for the case where M is a connected, semi-simple Lie group G. Specifically, we showed that every such Lie group G admits a set of distinguished left- (or right-) invariant vector fields, together with a set of matrix coefficients that is (weakly) codistinguished to the set of vector fields. Finally, we discussed in Sect. 4.5 how to translate distinguished vector fields and codistinguished functions from the Lie group G to its homogeneous spaces, yet the problem has not been solved completely and will be addressed in our future work.

References

Agrachev A, Baryshnikov Y, Sarychev A (2016) Ensemble controllability by Lie algebraic methods. ESAIM Control Optim Calc Var 22(4):921–938
Article MathSciNet Google Scholar
Beauchard K, Coron JM, Rouchon P (2010) Controllability issues for continuous-spectrum systems and ensemble controllability of Bloch equations. Commun Math Phys 296(2):525–557
Article MathSciNet Google Scholar
Becker A, Bretl T (2012) Approximate steering of a unicycle under bounded model perturbation using ensemble control. IEEE Trans Robot 28(3):580–591
Article Google Scholar
Brockett RW (2007) Optimal control of the Liouville equation. AMS IP Stud Adv Math 39:23
Article MathSciNet Google Scholar
Chen X (2018) Controllability of ensemble formation systems over digraphs. Automatica arXiv:1805.11196
Chen X, Gharesifard B (2017) Distinguished sets of semi-simple Lie algebras. arXiv:1711.01719
Chen Y, Georgiou TT, Pavon M (2016) Optimal steering of a linear stochastic system to a final probability distribution, Part I. IEEE Trans Autom Control 61(5):1158–1169
Article MathSciNet Google Scholar
Chen Y, Georgiou TT, Pavon M (2016) Optimal steering of a linear stochastic system to a final probability distribution, Part II. IEEE Trans Autom Control 61(5):1170–1180
Article MathSciNet Google Scholar
Curtain RF, Zwart H (2012) An introduction to infinite-dimensional linear systems theory, vol 21. Springer, Berlin
MATH Google Scholar
De Persis C, Isidori A (2000) On the observability codistributions of a nonlinear system. Syst Control Lett 40(5):297–304
Article MathSciNet Google Scholar
Fuhrmann PA, Helmke U (2015) The mathematics of networks of linear systems. Springer, Berlin
Book Google Scholar
Gauthier J, Bornard G (1981) Observability for any $u(t)$ of a class of nonlinear systems. IEEE Trans Autom Control 26(4):922–926
Article MathSciNet Google Scholar
Gauthier JP, Kupka IA (1994) Observability and observers for nonlinear systems. SIAM J Control Optim 32(4):975–994
Article MathSciNet Google Scholar
Glaser SJ, Schulte-Herbrüggen T, Sieveking M, Schedletzky O, Nielsen NC, Sørensen OW, Griesinger C (1998) Unitary control in quantum ensembles: maximizing signal intensity in coherent spectroscopy. Science 280(5362):421–424
Article Google Scholar
Greene RE, Jacobowitz H (1971) Analytic isometric embeddings. Ann Math 93(1):189–204
Google Scholar
Gronwall TH (1919) Note on the derivatives with respect to a parameter of the solutions of a system of differential equations. Ann Math 20(4):292–296
Article MathSciNet Google Scholar
Hatcher A (2002) Algebraic topology. Cambridge University Press, Cambridge
MATH Google Scholar
Helgason S (2001) Differential geometry, Lie groups, and symmetric spaces, vol 34. American Mathematical Society, Providence
MATH Google Scholar
Helmke U, Schönlein M (2014) Uniform ensemble controllability for one-parameter families of time-invariant linear systems. Syst Control Lett 71:69–77
Article MathSciNet Google Scholar
Hermann R, Krener A (1977) Nonlinear controllability and observability. IEEE Trans Autom Control 22(5):728–740
Article MathSciNet Google Scholar
Humphreys J (1972) Introduction to Lie algebras and representation theory, vol 9. Springer, Berlin
Book Google Scholar
Knapp AW (2013) Lie groups beyond an introduction, vol 140. Springer, Berlin
MATH Google Scholar
Leonard NE, Krishnaprasad PS (1995) Motion control of drift-free, left-invariant systems on Lie groups. IEEE Trans Autom control 40(9):1539–1554
Article MathSciNet Google Scholar
Li JS (2011) Ensemble control of finite-dimensional time-varying linear systems. IEEE Trans Autom Control 56(2):345–357
Article MathSciNet Google Scholar
Li JS, Khaneja N (2006) Control of inhomogeneous quantum ensembles. Phys Rev A 73(3):030302
Article Google Scholar
Li JS, Khaneja N (2009) Ensemble control of Bloch equations. IEEE Trans Autom Control 54(3):528–536
Article MathSciNet Google Scholar
Liu W (1997) An approximation algorithm for nonholonomic systems. SIAM J Control Optim 35(4):1328–1365
Article MathSciNet Google Scholar
Murray RM, Sastry SS (1993) Steering nonholonomic control systems using sinusoids. In: Li Z, Canny JF (eds) Nonholonomic motion planning. Springer, Berlin, pp 23–51
Chapter Google Scholar
Nash J (1966) Analyticity of the solutions of implicit function problems with analytic data. Ann Math 84(3):345–355
Article MathSciNet Google Scholar
Rudin W (1976) Principles of mathematical analysis. McGraw-Hill, New York
MATH Google Scholar
Rudin W (2006) Real and complex analysis. Tata McGraw-Hill Education, New York
MATH Google Scholar
Sussmann HJ, Liu W (1993) Lie bracket extensions and averaging: the single-bracket case. In: Li Z, Canny JF (eds) Nonholonomic motion planning. Springer, Berlin, pp 109–147
Chapter Google Scholar
Taniguchi T, Sugiyama H, Uekusa H, Shiro M, Asahi T, Koshima H (2018) Walking and rolling of crystals induced thermally by phase transition. Nat Commun 9(1):538
Article Google Scholar
Van der Schaft A (1982) Observability and controllability for smooth nonlinear systems. SIAM J Control Optim 20(3):338–354
Article MathSciNet Google Scholar
Yu Y, Nakano M, Ikeda T (2003) Photomechanics: directed bending of a polymer film by light. Nature 425(6954):145
Article Google Scholar
Zeng S, Waldherr S, Ebenbauer C, Allgöwer F (2016) Ensemble observability of linear systems. IEEE Trans Autom Control 61(6):1452–1465
Article MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

College of Engineering and Applied Science, University of Colorado Boulder, 425 UCB #1B55, Boulder, CO, 80309, USA
Xudong Chen

Authors

Xudong Chen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xudong Chen.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chen, X. Structure theory for ensemble controllability, observability, and duality. Math. Control Signals Syst. 31, 1–40 (2019). https://doi.org/10.1007/s00498-019-0237-5

Download citation

Received: 08 May 2018
Accepted: 10 June 2019
Published: 18 June 2019
Issue Date: June 2019
DOI: https://doi.org/10.1007/s00498-019-0237-5

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Structure theory for ensemble controllability, observability, and duality

Abstract

Similar content being viewed by others

A theoretical investigation of Brockett’s ensemble optimal control problems

Coherence of Quantum Ensemble as a Dual to Uncertainty for a Single Observable

Quantifying the quantumness of ensembles via generalized α-z-relative rényi entropy

1 Introduction

1.1 Mathematical models for ensemble control and estimation

1.2 Distinguished structure and examples

1.3 Literature review

1.4 Outline of contribution and organization of the paper

2 Definitions and notations

Definition 1

3 Distinguished ensemble systems

3.1 Distinguished vector fields and codistinguished functions

Definition 2

Lemma 1

Proof

Definition 3

Lemma 2

Proof

Example 1

3.2 Controllability and observability of distinguished ensemble system

Definition 4

Remark 1

Definition 5

Definition 6

Lemma 3

Proof

Theorem 1

Definition 7

Example 2

Remark 2

3.3 Proof of approximate ensemble path controllability

3.3.1 On the use of Lie extension and distinguished vector fields

Lemma 4

Remark 3

Lemma 5

Proof

3.3.2 On the use of Stone–Weierstrass theorem

3.4 Proof of ensemble observability

3.4.1 On the use of piecewise constant control inputs

3.4.2 On the use of codistinguished functions

3.5 Pre-distinguished ensemble system

Definition 8

Example 3

Theorem 2

Definition 9

Remark 4

3.6 Analysis and proof of Theorem 2

3.6.1 Indicator sequences

Definition 10

Definition 11

Example 4

Proposition 1

Proof

3.6.2 Proof of Theorem 2

4 Existence of distinguished ensemble systems

Theorem 3

4.1 Distinguished sets of semi-simple real Lie algebras

Definition 12

Proposition 2

Sketch of proof

Lemma 6

Proposition 3

4.2 Matrix coefficients as codistinguished functions

Lemma 7

Proof

Definition 13

Lemma 8

Proof

Remark 5

4.3 On the adjoint representation

Lemma 9

Theorem 4

Remark 6

4.4 Analysis and proof of Theorem 4

Lemma 10

Proof

Lemma 11