1 Introduction

Probability density estimation on Riemannian manifolds is the subject of several recent studies. The different approaches can be separated into two categories: parametric and non-parametric. The context of Riemannian manifolds brings difficulties of two kinds. Firstly, the theoretical results on distributions and on the convergence of estimators known for random variables valued in \(\mathbb {R}^n\) have to be adapted to random variables valued in Riemannian manifolds, see [1,2,3, 8, 9, 12,13,14,15]. Secondly, the construction of probability distributions and of density estimators should have a reasonable computational complexity, see [8, 12, 13, 16,17,18]. A generalization of the Gaussian distribution to manifolds was proposed in [8]. Although the expression of the proposed law is hard to compute on general manifolds, expressions of radial Gaussians on symmetric spaces can be found in [12,13,14]. On isotropic spaces, an isotropic density is simply a radial density. The anisotropy of a density can be evaluated with the notion of covariance proposed in [8].

In this paper, we are interested in the construction of anisotropic distributions on the hyperbolic space. The problem of anisotropic normal distributions on manifolds has been addressed in [19] through anisotropic diffusion. That construction is valid on arbitrary manifolds but requires substantial computation. The hyperbolic space is a very particular Riemannian manifold: it is at the same time isotropic and diffeomorphic to a vector space. These two specificities significantly ease the construction of probability distributions and of probability density estimators. It is generally difficult to control the covariance of a distribution on a Riemannian manifold, e.g. the covariance of the Gaussian law proposed in [8]. We propose a simple way of constructing distributions whose covariance is fully controlled. The method is derived from the density kernel proposed in [1]. These distributions can be used in the non-parametric kernel density estimator, but also to design mixture models for parametric density estimation.

The paper is organised as follows. Section 2 is a very brief introduction to the hyperbolic plane. Section 3 reviews some general facts about probabilities on Riemannian manifolds. Section 4 describes how to build anisotropic density functions on the hyperbolic space. Section 5 discusses the estimation of their parameters.

2 The Hyperbolic Space

Hyperbolic geometry results from a modification of Euclid's fifth postulate on parallel lines. In two dimensions, given a line D and a point \(p\notin D\), hyperbolic geometry is an example of a geometry in which at least two lines going through p do not intersect D. Let us consider the open unit disk of the Euclidean plane endowed with the Riemannian metric:

$$\begin{aligned} ds_{\mathbb {D}}^2=4\frac{dx^2+dy^2}{(1-x^2-y^2)^2} \end{aligned}$$
(1)

where x and y are the Cartesian coordinates. The unit disk \(\mathbb {D}\) endowed with \(ds_{\mathbb {D}}\) is called the Poincaré disk and is a model of the two-dimensional hyperbolic geometry. The construction is generalized to higher dimensions. Let ISO be the isometry group of \(\mathbb {D}\). It can be shown that:

  • \(\mathbb {D}\) is homogeneous: \(\forall p,q \in \mathbb {D},\exists \phi \in ISO, \phi (p)=q\), points are indistinguishable.

  • \(\mathbb {D}\) is isotropic: for any couple of geodesics \(\gamma _1\) and \(\gamma _2\) going through a point \(p\in \mathbb {D}\), there exists \(\phi \in ISO\) such that \(\phi (p)=p\) and \(\phi (\gamma _1)=\gamma _2\). In other words, directions are indistinguishable.

  • the Riemannian exponential maps are bijective.

  • \(\mathbb {D}\) has a constant negative curvature.
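As a concrete illustration of the model, the distance associated with the metric of Eq. 1 admits a closed form. A minimal numpy sketch, under a convention of ours representing points of \(\mathbb {D}\) as complex numbers (the function name is ours as well):

```python
import numpy as np

def poincare_distance(p, q):
    """Hyperbolic distance on the Poincare disk (curvature -1).

    p, q: points of the open unit disk, given as complex numbers.
    Uses cosh(d) = 1 + 2 |p - q|^2 / ((1 - |p|^2)(1 - |q|^2)).
    """
    num = 2.0 * abs(p - q) ** 2
    den = (1.0 - abs(p) ** 2) * (1.0 - abs(q) ** 2)
    return np.arccosh(1.0 + num / den)

# Example: the distance from the origin to 0.5 is 2 artanh(0.5) ~ 1.0986
print(poincare_distance(0j, 0.5 + 0j))
```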

Let x denote the coordinates of elements of \(T_p\mathbb {D}\) in an orthonormal basis. x is mapped to a point of \(\mathbb {D}\) by the Riemannian exponential map, noted \(exp_p\), and thus forms a chart of \(\mathbb {D}\). This chart is called an exponential chart at the point p.

Given a reference point p, the point of polar coordinates \((r,\alpha )\) of the hyperbolic space is defined as the point at distance r from p on the geodesic with initial direction \(\alpha \in \mathbb {S}^1\). Since the hyperbolic space is isotropic, the expression of the metric in polar coordinates only depends on r,

$$\begin{aligned} ds^2=dr^2+\sinh (r)^2d\alpha ^2, \end{aligned}$$
(2)

see [10, 11].
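Both the exponential chart and the polar coordinates can be computed in closed form. The following sketch, under the same complex-number convention as above, implements the exponential map at p and its inverse by moving p to the origin with a Möbius isometry (the helper names are ours):

```python
import numpy as np

def exp_p(p, v):
    """Riemannian exponential at p; v in an orthonormal basis of the tangent space."""
    r = abs(v)
    w = np.tanh(r / 2.0) * v / r if r > 0 else 0j   # exponential at the origin
    return (w + p) / (1.0 + np.conj(p) * w)          # Mobius translation 0 -> p

def log_p(p, q):
    """Inverse of exp_p: tangent coordinates of q in the exponential chart at p."""
    z = (q - p) / (1.0 - np.conj(p) * q)             # Mobius translation p -> 0
    r = abs(z)
    return 2.0 * np.arctanh(r) * z / r if r > 0 else 0j

# Sanity check: |log_p(p, q)| is the geodesic distance between p and q
p, q = 0.2 + 0.1j, -0.3 + 0.4j
print(abs(log_p(p, q)))
```

The orthonormal basis implicitly used at p is the pullback of the canonical basis at the origin; any other orthonormal basis differs from it by a rotation.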

3 Distributions on \(\mathbb {D}\)

3.1 Densities

The metric of a Riemannian manifold induces a volume measure vol. In a chart, if G is the matrix of the metric, the density of vol with respect to the Lebesgue measure of the chart is

$$\begin{aligned} \frac{dvol}{dLeb}=|det(\sqrt{G})| \end{aligned}$$

where \(\sqrt{G}\) is the matrix square root of G. Let \(\mu \) be a measure on a manifold \(\mathcal {M}\). If \(\mu \) has a density f with respect to the Lebesgue measure of a chart, then its density with respect to the Riemannian volume measure is given by

$$\begin{aligned} \frac{d\mu }{dvol}=\frac{d\mu }{dLeb}\frac{dLeb}{dvol}=\frac{1}{|det(\sqrt{G})|}f. \end{aligned}$$
(3)
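As an example, in the Cartesian chart of the Poincaré disk, Eq. 1 gives \(G=\frac{4}{(1-x^2-y^2)^2}I_2\), so that

$$\begin{aligned} \frac{dvol}{dLeb}=|det(\sqrt{G})|=\frac{4}{(1-x^2-y^2)^2}, \end{aligned}$$

and a density f with respect to the Lebesgue measure of this chart corresponds to the density \(\frac{(1-x^2-y^2)^2}{4}f\) with respect to vol.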

3.2 Intrinsic Means

Given a distribution \(\mu \), the variance at p is defined by

$$\begin{aligned} \sigma ^2(p)= \int _{\mathbb {D}}d(p,.)^2 d\mu . \end{aligned}$$

When the variance is finite everywhere, its minima are called mean points. The hyperbolic space is a Cartan-Hadamard manifold, that is to say it is complete, simply connected and of non-positive curvature. On Cartan-Hadamard manifolds, when the variance is finite everywhere, the mean exists and is unique, see [8], Corollary 2. It is achieved at the point p such that

$$\begin{aligned} \int _{T_p\mathbb {D}} x d\tilde{\mu }=0, \end{aligned}$$

where \(\tilde{\mu }\) is the image of the measure \(\mu \) by the inverse of the exponential map at p.
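On the hyperbolic space, this characterization suggests the usual fixed-point iteration for computing the mean of an empirical distribution: lift the samples to the tangent space at the current estimate, average, and follow the exponential map. A sketch of ours, redefining the exp_p/log_p helpers of the Sect. 2 sketch so that it runs on its own:

```python
import numpy as np

def exp_p(p, v):                                     # as in the Sect. 2 sketch
    r = abs(v)
    w = np.tanh(r / 2.0) * v / r if r > 0 else 0j
    return (w + p) / (1.0 + np.conj(p) * w)

def log_p(p, q):
    z = (q - p) / (1.0 - np.conj(p) * q)
    r = abs(z)
    return 2.0 * np.arctanh(r) * z / r if r > 0 else 0j

def karcher_mean(samples, iters=100, tol=1e-12):
    """Fixed-point iteration p <- exp_p(mean of the lifted samples)."""
    p = samples[0]
    for _ in range(iters):
        v = np.mean([log_p(p, q) for q in samples])
        p = exp_p(p, v)
        if abs(v) < tol:                             # mean condition above
            break
    return p
```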

3.3 Covariance on Manifold

The covariance of a random vector is the matrix formed by the covariances of its coordinates. In a vector space, the coordinates of a vector are given by projections on the corresponding axes. On a Riemannian manifold, the notion of projection on a geodesic usually does not lead to explicit expressions. Even though it does not retain all the properties of the covariance of vectors, the simplest generalisation to manifolds, when possible, is to take the Euclidean covariance after lifting the distribution to a tangent space by the inverse of the exponential map, see [8]. Since on the hyperbolic space the exponential map is a bijection, it is always possible to lift distributions to tangent spaces. Given a distribution \(\mu \) and an orthonormal basis of \(T_p\mathbb {D}\), the covariance at \(p\in \mathbb {D}\) is thus defined as

$$\begin{aligned} \varSigma _p(\mu )=\int _{T_p\mathbb {D}} xx^t d\tilde{\mu } \end{aligned}$$

This definition of covariance was used to define a notion of principal geodesic analysis on manifolds in [20]. It can be noted that the covariance at the point p can be seen as a point of the vector bundle \(T\mathbb {D} \otimes T\mathbb {D}\).
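For an empirical distribution, this covariance can be computed directly once the samples are lifted to \(T_p\mathbb {D}\); a short sketch, with log_p as in the Sect. 2 sketch:

```python
import numpy as np

def log_p(p, q):                                     # as in the Sect. 2 sketch
    z = (q - p) / (1.0 - np.conj(p) * q)
    r = abs(z)
    return 2.0 * np.arctanh(r) * z / r if r > 0 else 0j

def tangent_covariance(p, samples):
    """Average of x x^t over the samples lifted to T_p D."""
    vs = [log_p(p, q) for q in samples]
    X = np.array([[v.real, v.imag] for v in vs])     # tangent coordinates
    return X.T @ X / len(X)
```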

4 Constructing Anisotropic Distributions

The author of [8] proposes a generalization of Gaussian distributions to manifolds as the distribution that maximizes the entropy given its barycenter and covariance. This generalization leads to a density of the form

$$\begin{aligned} N_{(p,\varGamma )}(exp_{p}(x))= k \exp \left( -\frac{x^t \varGamma x}{2} \right) \end{aligned}$$

Given p and the covariance matrix \(\varSigma _p\), the main difficulties are to obtain expressions of the normalizing factor k and of the concentration matrix \(\varGamma \). Since the hyperbolic space is homogeneous, k and \(\varGamma \) only depend on the matrix \(\varSigma _p\). The expressions of k and \(\varGamma \) when \(\varSigma _p\) is a (positive) multiple of the identity matrix can be found in [12]. However, it is difficult to obtain these relations when \(\varSigma _p\) is not diagonal.

It might be interesting to define parametric families of distributions whose means and covariances can easily be controlled, even if they do not share all the statistical properties of the Gaussian distributions. Let \(K:\mathbb {R}_+\rightarrow \mathbb {R}_+\) be a function such that

  i. \(\int _{\mathbb {R}^2} K(\Vert y\Vert )\,dy=1\),

  ii. \(\int _{\mathbb {R}^2} \Vert y \Vert ^2 K(\Vert y\Vert )\,dy=2\).

Given a symmetric positive definite matrix \(\varGamma \), we then have

$$\begin{aligned} \int _{\mathbb {R}^2} \frac{1}{\sqrt{det(\varGamma )}} K(\sqrt{x^t\varGamma ^{-1} x})dx=1. \end{aligned}$$

Let \(\overline{p}\) be a point in \(\mathbb {D}\). Fix an orthonormal basis of the tangent space \(T_{\overline{p}}\mathbb {D}\) and consider the distribution \(\nu _{\overline{p},\varGamma }\) on \(T_{\overline{p}}\mathbb {D}\) whose density with respect to the Lebesgue measure of \(T_{\overline{p}}\mathbb {D}\) is given by \(\frac{1}{\sqrt{det(\varGamma )}} K(\sqrt{x^t\varGamma ^{-1} x})\), where x and \(\varGamma \) are expressed in the reference basis. Let \(\mu _{\overline{p},\varGamma }=exp_{\overline{p}*}(\nu _{\overline{p},\varGamma })\) be the pushforward of \(\nu _{\overline{p},\varGamma }\) by the Riemannian exponential map at \(\overline{p}\).
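As a concrete instance, the kernel \(K(t)=\frac{1}{2\pi }e^{-t^2/2}\) satisfies i and ii, and \(\nu _{\overline{p},\varGamma }\) is then the centered Gaussian of covariance \(\varGamma \) on \(T_{\overline{p}}\mathbb {D}\). Sampling from \(\mu _{\overline{p},\varGamma }\) then amounts to drawing a Gaussian vector in the tangent space and pushing it through the exponential map; a sketch under the conventions of the Sect. 2 sketch:

```python
import numpy as np

def exp_p(p, v):                                     # as in the Sect. 2 sketch
    r = abs(v)
    w = np.tanh(r / 2.0) * v / r if r > 0 else 0j
    return (w + p) / (1.0 + np.conj(p) * w)

def sample_mu(p_bar, Gamma, n, seed=0):
    """n samples of mu_{p_bar, Gamma} for K(t) = exp(-t^2/2) / (2 pi)."""
    rng = np.random.default_rng(seed)
    L = np.linalg.cholesky(Gamma)                    # Gamma = L L^t
    Y = rng.standard_normal((n, 2)) @ L.T            # rows ~ N(0, Gamma)
    return [exp_p(p_bar, complex(a, b)) for a, b in Y]
```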

Theorem 1

\(\overline{p}\) is the unique mean of \(\mu _{\overline{p},\varGamma }\).

Proof

It can be checked that \(\mu _{\overline{p},\varGamma }\) has a finite variance everywhere. Moreover,

$$\begin{aligned} \int _{T_{\overline{p}}\mathbb {D}} \sqrt{\varGamma ^{-1}}x\frac{1}{\sqrt{\det \varGamma }}K(\sqrt{x^t\varGamma ^{-1} x})\,dx=0. \end{aligned}$$

The integrability of the integrand follows from i and ii, and its vanishing from symmetry. Therefore, according to Sect. 3.2, \(\overline{p}\) is the unique mean of \(\mu _{\overline{p},\varGamma }\).

Theorem 2

The covariance \(\varSigma _{\overline{p}}\) of \(\mu _{\overline{p},\varGamma }\) at \(\overline{p}\) and the concentration matrix \(\varGamma \) are equal.

Proof

In the reference basis, making use of ii with the change of variables \(y=\sqrt{\varGamma ^{-1}}x\),

$$\begin{aligned} \varSigma _{\overline{p}}= & {} \int _{\mathbb {R}^2} xx^t \frac{1}{\sqrt{\det (\varGamma )}} K(\sqrt{x^t\varGamma ^{-1} x})\,dx \\= & {} \varGamma ^{1/2} \left( \int _{\mathbb {R}^2}yy^t K(\sqrt{y^t y})\,dy\right) \varGamma ^{1/2} \\= & {} \varGamma ^{1/2}\left( \int _0^{+\infty }\int _0^{2\pi } r^2\begin{pmatrix} \cos (\theta )\\ \sin (\theta ) \end{pmatrix} \begin{pmatrix} \cos (\theta )\\ \sin (\theta ) \end{pmatrix}^t K(r)\,r\,d\theta \,dr \right) \varGamma ^{1/2} \\= & {} \varGamma ^{1/2} \left( \frac{1}{2}\int _0^{+\infty }r^2 I\, K(r)\, 2\pi r\,dr \right) \varGamma ^{1/2} \\= & {} \varGamma ^{1/2} I\left( \frac{1}{2}\int _{\mathbb {R}^2} \Vert y \Vert ^2 K(\Vert y\Vert )\,dy \right) \varGamma ^{1/2} \\= & {} \varGamma . \end{aligned}$$
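A quick Monte Carlo check of this identity, with the Gaussian kernel \(K(t)=\frac{1}{2\pi }e^{-t^2/2}\) as an example of a kernel satisfying i and ii:

```python
import numpy as np

# With this K, nu is N(0, Gamma); its second moment matrix should be ~ Gamma.
rng = np.random.default_rng(0)
Gamma = np.array([[1.0, 0.3], [0.3, 0.25]])
X = rng.standard_normal((500_000, 2)) @ np.linalg.cholesky(Gamma).T
print(X.T @ X / len(X))   # close to Gamma, up to Monte Carlo error
```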

The tangent space \(T_{\overline{p}}\mathbb {D}\) endowed with the reference basis provides a parametrization of the hyperbolic space. By definition, the density of \(\mu _{\overline{p},\varGamma }\) in this parametrization is given by \(\frac{1}{\sqrt{det(\varSigma )}} K(\sqrt{x^t\varSigma ^{-1} x})\), where \(\varSigma =\varGamma \) by Theorem 2. In order to obtain the density with respect to the Riemannian measure, this term should be multiplied by the density of the Lebesgue measure of the parametrization with respect to the Riemannian measure, see Eq. 3. In an adapted orthonormal basis of \(T_{\overline{p}}\mathbb {D}\), Eq. 2 leads to the following expression of the matrix of the metric,

$$ G=\begin{pmatrix} 1 & 0\\ 0 & \frac{\sinh (r)^2}{r^2} \end{pmatrix}. $$

Thus,

$$\begin{aligned} det(\sqrt{G})=\frac{\sinh (r)}{r}. \end{aligned}$$

Equation 3 leads to the density ratio,

$$\begin{aligned} \frac{dx}{dvol}(x)=\frac{||x||}{\sinh (||x||)}, \end{aligned}$$

where dx is the Lebesgue measure induced by the reference basis. Recall that in this parametrization, the Euclidean norm of x is the distance between \(exp_{\overline{p}}(x)\) and \(\overline{p}\). The density of \(\mu _{\overline{p},\varGamma }\) with respect to the Riemannian measure is given by

$$\begin{aligned} f(exp_{\overline{p}}(x))=\frac{||x||}{\sinh (||x||)\sqrt{det(\varSigma )}}K\left( \sqrt{x^t\varSigma ^{-1} x}\right) . \end{aligned}$$
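A sketch evaluating this density in the exponential chart at \(\overline{p}\), again with the Gaussian choice of K used in the sketches above:

```python
import numpy as np

def density_f(x, Sigma):
    """Density of mu_{p_bar, Sigma} at exp_{p_bar}(x), w.r.t. the Riemannian measure.

    x: tangent coordinates in the reference orthonormal basis, shape (2,).
    Assumes K(t) = exp(-t^2/2) / (2 pi), which satisfies i and ii.
    """
    x = np.asarray(x, dtype=float)
    r = np.linalg.norm(x)
    jac = r / np.sinh(r) if r > 0 else 1.0           # factor ||x|| / sinh(||x||)
    quad = x @ np.linalg.solve(Sigma, x)             # x^t Sigma^{-1} x
    return jac * np.exp(-quad / 2.0) / (2.0 * np.pi * np.sqrt(np.linalg.det(Sigma)))
```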

Figure 1 shows the level lines when K is Gaussian.

Fig. 1. In this example \(K(x)=\frac{1}{\sqrt{2\pi }}e^{-x^2}\) and \(\varSigma \) has 1 and \(\frac{1}{4}\) as eigenvalues. The level lines of the corresponding density f are flattened circles but are not ellipses.

5 Estimating the Mean and the Covariance

Let the function K and the distribution \(\mu _{\overline{p},\varGamma }\) be as defined in Sect. 4. Given a set of samples drawn from this distribution, it is important to have estimators of the two parameters: the mean and the covariance. In order to estimate the unknown parameters \((\overline{p},\varSigma _{\overline{p}})\) given a set of independent samples \((p_1,\ldots,p_n)\), it is usual to maximize the likelihood function. The \(\log \)-likelihood of a set of samples is defined as

$$\begin{aligned} \mathcal {L}(p_1,\ldots,p_n; (\hat{p},\hat{\varSigma }))= & {} \sum _i \log \left( \frac{||x_i||}{\sinh (||x_i||)\sqrt{det(\hat{\varSigma })}} K\left( \sqrt{x_i^t\hat{\varSigma }^{-1} x_i}\right) \right) \\= & {} \sum _i \log \left( \frac{||x_i||}{\sinh (||x_i||)\sqrt{det(\hat{\varSigma })}}\right) + \log \left( K\left( \sqrt{x_i^t\hat{\varSigma }^{-1} x_i}\right) \right) , \end{aligned}$$

where \(x_i\) denotes the coordinates of \(exp_{\hat{p}}^{-1}(p_i)\) in the reference basis of \(T_{\hat{p}}\mathbb {D}\).
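For the Gaussian choice of K used in the sketches above, this log-likelihood can be evaluated directly (log_p as in the Sect. 2 sketch):

```python
import numpy as np

def log_p(p, q):                                     # as in the Sect. 2 sketch
    z = (q - p) / (1.0 - np.conj(p) * q)
    r = abs(z)
    return 2.0 * np.arctanh(r) * z / r if r > 0 else 0j

def log_likelihood(samples, p_hat, Sigma_hat):
    """log-likelihood under mu_{p_hat, Sigma_hat} with K(t) = exp(-t^2/2)/(2 pi)."""
    total = 0.0
    for q in samples:
        v = log_p(p_hat, q)
        x = np.array([v.real, v.imag])
        r = np.linalg.norm(x)
        total += np.log(r / np.sinh(r)) if r > 0 else 0.0
        total -= x @ np.linalg.solve(Sigma_hat, x) / 2.0
        total -= np.log(2.0 * np.pi) + 0.5 * np.log(np.linalg.det(Sigma_hat))
    return total
```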

The major difficulty is that the mean and the covariance cannot be optimized separately. Thus there might not be an explicit expression of the maximum likelihood estimator. However, the mean and the covariance have natural estimators. It is already known that the empirical barycenter is a strongly consistent estimator of the barycenter, see [21], Theorem 2.3.

Given an estimate \(\hat{p}\) of the barycenter, it is possible to compute the empirical covariance in the corresponding tangent space,

$$\begin{aligned} \hat{\varSigma }_{\hat{p}}=\frac{1}{n}\sum _{i=1}^n x_i x_i^t \end{aligned}$$
(4)
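Combining the two estimators yields a complete estimation procedure; a self-contained sketch under the conventions of the previous sections:

```python
import numpy as np

def exp_p(p, v):                                     # as in the Sect. 2 sketch
    r = abs(v)
    w = np.tanh(r / 2.0) * v / r if r > 0 else 0j
    return (w + p) / (1.0 + np.conj(p) * w)

def log_p(p, q):
    z = (q - p) / (1.0 - np.conj(p) * q)
    r = abs(z)
    return 2.0 * np.arctanh(r) * z / r if r > 0 else 0j

def estimate_mean_cov(samples, iters=100):
    """Empirical barycenter by fixed-point iteration, then the covariance (4) at it."""
    p = samples[0]
    for _ in range(iters):
        p = exp_p(p, np.mean([log_p(p, q) for q in samples]))
    vs = [log_p(p, q) for q in samples]
    X = np.array([[v.real, v.imag] for v in vs])
    return p, X.T @ X / len(X)
```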

Using a construction similar to that of the Sasaki metric, see [22], the vector bundle \(T\mathbb {D} \otimes T\mathbb {D}\) can be endowed with a Riemannian metric. Although we do not prove it in this paper, we are convinced that almost surely

$$\begin{aligned} d((\hat{p},\hat{\varSigma }_{\hat{p}}),(\overline{p},\varSigma )) \underset{n\rightarrow +\infty }{\longrightarrow } 0, \end{aligned}$$

where d is the Riemannian distance on \(T\mathbb {D} \otimes T\mathbb {D}\).

6 Conclusion

In this paper we proposed a set of parametric families of anisotropic distributions on the hyperbolic plane. The main interest of these distributions is that the covariance matrix and the concentration matrix are equal. The empirical mean and covariance thus provide simple estimators of the parameters of the distribution. Working with anisotropic distributions is expected to reduce the number of components needed in mixture models and thus to reduce the computational complexity of the parameter estimation of mixture models. On the one hand, our future work will focus on deriving convergence rates for the estimation of the covariance. On the other hand, we will study the use of these distributions in problems of radar signal classification.