Abstract
Heavily tailed probability distributions are important objects in anomalous statistical physics. For such probability distributions, expectations do not exist in general. Therefore, an escort distribution and an escort expectation have been introduced. In this paper, by generalizing such escort distributions, a sequence of escort distributions is introduced. For a deformed exponential family, we study the fundamental properties of statistical manifold structures derived from the sequence of escort expectations.
H. Matsuzoe—This research was partially supported by JSPS (Japan Society for the Promotion of Science), KAKENHI (Grants-in-Aid for Scientific Research) Grant Numbers JP26108003, JP15K04842 and JP16KT0132.
Access provided by CONRICYT-eBooks. Download conference paper PDF
Similar content being viewed by others
Keywords
- Statistical manifold
- Escort distribution
- Escort expectation
- Deformed exponential family
- Information geometry
1 Introduction
Heavily tailed probability distributions are important objects in anomalous statistical physics (cf. [11, 15]). Such probability distributions do not have expectations in general. Therefore the notion of escort distribution has been introduced [4] in order to give a suitable down weight for heavy tail probability. Consequently, there exists a modified expectation for such a probability distributions.
For a deformed exponential family, an escort distribution is given by the differential of a deformed exponential function. Therefore, the first named author considered further generalizations of escort distributions In q-exponential case, he introduced a sequential structure of escort distributions [7].
In this paper, we consider a sequential structure of escort distributions on a deformed exponential family. It is known that a deformed exponential family naturally has at least three kinds of different statistical manifold structures [8]. We elucidate relations between these statistical manifold structures and the structures derived from the sequence of escort expectations. Consequently, we find that dually flat structures and generalized conformal structures for statistical manifolds naturally arise in this framework.
2 Deformed Exponential Families
Throughout this paper, we assume that all the objects are smooth. In this section, we summarize foundations of deformed exponential functions and deformed exponential families. For further details, see [11].
Let \(\chi \) be a strictly increasing function from \(\mathbf {R}_{++}\) to \(\mathbf {R}_{++}\). We call this function \(\chi \) a deformation function. By use of a deformation function, we define a \(\chi \) -exponential function \(\exp _{\chi }t\) (or a deformed exponential function) by the eigenfunction of the following non-linear differential equation
The inverse of a \(\chi \)-exponential function is called a \(\chi \) -logarithm function or a deformed logarithm function, and it is given by
If the deformation function is a power function \(\chi (t) = t^q \ (q>0, q\ne 1)\), the deformed exponential and the deformed logarithm are given by
and they are called a q-exponential and a q-logarithm, respectively.
We suppose that a statistical model \(S_{\chi }\) has the following expression
where \(F_1(x), \dots , F_n(x)\) are functions on the sample space \(\varOmega \), \(\theta = {}^t(\theta ^1, \dots , \theta ^n)\) is a parameter, and \(\psi (\theta )\) is the normalization defined by \(\int _{\varOmega }p(x;\theta )dx = 1\). We call the statistical model \(S_{\chi }\) a \(\chi \) -exponential family or a deformed exponential family. Under suitable conditions, \(S_{\chi }\) is regarded as a manifold with coordinate system \(\theta = (\theta ^1, \dots , \theta ^n)\). When the deformed exponential function is a q-exponential, we denote the statistical model by \(S_q\) and call it a q-exponential family.
We remark that the regularity conditions for \(S_{\chi }\) is very difficult. To elucidate such conditions is quite an open problem. For example, regularity conditions for a statistical model (see Chap. 2 in [1]) and the well-definedness of a deformed exponential function should be satisfied simultaneously. A few arguments of this problem is given in the first and the third named author’s previous work [9].
3 A Sequential Structure of Expectations
In this section we consider a sequential structure of expectations. As we will see later, statistical manifold structures are defined from this sequence.
Let \(S_{\chi } = \{p_{\theta }\} = \{p(x;\theta )\}\) be a \(\chi \)-exponential family. We say that \(P_{\chi }(x;\theta )\) is an escort distribution of \(p_\theta \in S_{\chi }\) if
We say that \(P_{\chi }^{esc}(x;\theta )\) is a normalized escort distribution of \(p_{\theta }\) if
We generalize the escort distribution by use of higher-order differentials.
Definition 1
Let \(S_{\chi }\) be a \(\chi \)-exponential family. Denote by \(\exp _{\chi }^{(n)}x\) the n-th differential of the \(\chi \)-exponential function. For \(p_{\theta } \in S_{\chi }\), we define the n-th escort distribution \(P_{\chi , (n)}(x;\theta )\) by
and the normalized n-th escort distribution \(P_{\chi , (n)}^{esc}(x;\theta )\) by
For a given function f(x) on \(\varOmega \), we define the n-th escort expectation of f(x) and the normalized n-th escort expectation of f(x) by
respectively.
For example, in the case of q-exponential family \(S_q\), the n-th escort distribution of \(p_q(x;\theta )\) is given by
When we consider geometric structure determined from the unbiasedness of generalized score function, that is,
a sequential structure of expectations naturally arises. This is one of our motivations to study sequential expectations. When we consider correlations of random variables, another kinds of sequence of expectations will be required.
4 Geometry of Statistical Models
Let (M, g) be a Riemannian manifold, and C be a totally symmetric (0, 3)-tensor field on M. We call the triplet (M, g, C) a statistical manifold [6]. In this case, the tensor field C is called a cubic form. For a given statistical manifold (M, g, C), we can define one parameter family of affine connections by
where \(\alpha \in \mathbf {R}\) and \(\nabla ^{(0)}\) is the Levi-Civita connection with respect to g. It is easy to check that \(\nabla ^{(\alpha )}\) and \(\nabla ^{(-\alpha )}\) are mutually dual with respect to g, that is,
We say that S is a statistical model if S is a set of probability density functions on \(\varOmega \) with parameter \(\xi \in \varXi \) such that
Under suitable conditions, we can define a Fisher metric \(g^F\) on S by
where \(\partial _i = \partial /\partial \xi ^i\), \(l_{\xi } = l(x;\xi ) = \ln p(x;\xi )\), and \(E_{p}[f]\) is the standard expectation of f(x) with respect to \(p(x;\xi )\).
Next, we define a totally symmetric (0, 3)-tensor field \(C^F\) by
From Eq. (1), we can define one parameter family of affine connections. In particular, the connection \(\nabla ^{(e)} = \nabla ^{(1)}\) is called theexponential connection and \(\nabla ^{(m)} = \nabla ^{(-1)}\) is called the mixture connection. These connections are given by
It is known that \(g^F\) and \(C^F\) are independent of the choice of reference measure on \(\varOmega \). Therefore, the triplet \((S, g^F, C^F)\) is called an invariant statistical manifold. If a statistical model S is an exponential family, then the invariant statistical manifold \((S, g^F, C^F)\) determines a dually flat structure on S. (See [1, 13].) However, this fact may not be held for a deformed exponential family \(S_{\chi }\) and an invariant structure may not be important for \(S_{\chi }\). Therefore, we consider another statistical manifold structures.
We summarize statistical manifold structures for \(S_{\chi }\) based on [8].
Let \(S_{\chi }\) be a \(\chi \)-exponential family. We define a Riemannian metric \(g^M\) by
where \(\partial _i = \partial /\partial \theta ^i\). The Riemannian metric \(g^M\) is a generalization of the representation of Fisher metric (3). A pair of dual affine connections are given by
The difference of two affine connections \(C_{ijk}^M = \varGamma ^{M(m)}_{ij,k} - \varGamma ^{M(e)}_{ij,k}\) determines a cubic form. In addition, from the definition of the deformed exponential family \(S_{\chi }\), \(\varGamma ^{M(e)}_{ij,k}(\theta )\) always vanishes. Therefore, we have the following proposition.
Proposition 1
For a \(\chi \)-exponential family \(S_{\chi }\), the triplet \((S_{\chi }, g^{M}, C^M)\) is a statistical manifold. In particular, \((S_{\chi }, g^{M}, \nabla ^{M(e)}, \nabla ^{M(m)})\) is a dually flat space.
By setting
we define a U-divergence [10] by
It is known that the U-divergence \(D_{\chi }(p||r)\) on \(S_{\chi }\) coincides with the canonical divergence for \((S_{\chi }, g^{M}, \nabla ^{M(m)}, \nabla ^{M(e)})\) (See [8, 10]).
Next, we define another statistical manifold structure from the viewpoint of Hessian geometry.
For a \(\chi \)-exponential family \(S_{\chi }\), suppose that the normalization \(\psi \) is strictly convex. Then we can define a \(\chi \) -Fisher metric \(g^{\chi }\) and a \(\chi \) -cubic form \(C^{\chi }\) [3] by
Obviously, the triplet \((S_{\chi }, g^{\chi }, C^{\chi })\) is a statistical manifold. From Eq. (1), we can define a torsion-free affine connection \(\nabla ^{\chi (\alpha )}\) by
where \(\nabla ^{\chi (0)}\) is the Levi-Civita connection with respect to \(g^{\chi }\). By standard arguments in Hessian geometry [13], \((S_{\chi }, g^{\chi }, \nabla ^{\chi (1)}, \nabla ^{\chi (-1)})\) is a dually flat space. The canonical divergence for \((S_{\chi }, g^{\chi }, \nabla ^{\chi (-1)}, \nabla ^{\chi (1)})\) is given by
5 Statistical Manifolds Determined from Sequential Escort Expectations
In this section, we consider statistical manifold structures determined from sequential escort expectations.
For a \(\chi \)-exponential family \(S_{\chi }\), we define \(g^{(n)}\) and \(C^{(n)}\) by
We suppose that \(g^{(n)}\) is a Riemannian metric on \(S_{\chi }\). Then we obtain a sequence of statistical manifolds:
The limit of this sequence is not clear at this moment. In the q-Gaussian case, the sequence of normalized escort distributions \(\{P^{esc}_{q,(n)}(x;\theta )\}\) converges to the Dirac’s delta function \(\delta (x-\mu )\) (cf. [14]).
Theorem 1
Let \(S_q = \{p(x;\theta )\}\) be a \(\chi \)-exponential family. Then \((S_{\chi }, g^{(1)}, C^{(1)})\) coincides with \((S_{\chi }, g^{M}, C^{M})\).
Proof
From the definition of \(\chi \)-logarithm and \(P_{\chi }(x;\theta ) = P_{\chi , (1)}(x;\theta ) = \chi (p_{\theta })\), we obtain
Therefore, we obtain
Recall that \(\{\theta ^i\}\) is a \(\nabla ^{M(e)}\)-affine coordinate system [8]. In addition, the generalized score function \(\partial _i\ln _{\chi }p_{\theta }\) is unbiased with respect to the escort expectation, that is,
Therefore we obtain
From the second escort expectation, we have the following theorem.
Theorem 2
Let \(S_q = \{p(x;\theta )\}\) be a \(\chi \)-exponential family. Then \((S_{\chi }, g^{(2)}, C^{(2)})\) and \((S_{\chi }, g^{\chi }, C^{\chi })\) have the following relations:
Proof
Set \(u(x) = (\exp _q x)'\). Then we have
Since \(\int _{\varOmega }\partial _ip(x;\theta )dx = \int _{\varOmega }\partial _i\partial _jp(x;\theta )dx =0\) and \(Z_{\chi }(p) = \int _{\varOmega }\chi (p(x;\theta ))dx = \int _{\varOmega }P_{\chi , (1)}(x;\theta )dx\), we obtain
From a straight forward calculation, we have
By integrating (4), we obtain the relation \(C^{(2)}\) and \(C^{\chi }\).
We remark that the statistical manifold \((S_{\chi }, g^{(2)}, C^{(2)})\) cannot determine a dually flat structure in general whereas \((S_{\chi }, g^{\chi }, C^{\chi })\) determines a dually flat structure. The relations in Theorem 2 imply that two statistical manifolds have a generalized conformal equivalence relation in the sense of Kurose [5].
6 Concluding Remarks
In this paper, we considered a sequential structure of escort expectations and statistical manifold structures that are defined from the sequence of escort expectations. Further geometric properties of the sequence \(\{(S_{\chi }, g^{(n)}, C^{(n)})\}_{n \in \mathbf {N}}\) are not clear at this moment. However. the sequential structure will be important in the geometric theory of non-exponential type statistical models. Actually, in the case of q-exponential family, \((S_q, g^{(1)}, C^{(1)})\) is induced from a \(\beta \)-divergence. In addition, \((S_q, g^{(2)}, C^{(2)})\) are essentially equivalent to the invariant statistical manifold structure \((S_q, g^F. C^F)\), which are induced from an \(\alpha \)-divergence [7].
The authors would like to express their sincere gratitude to the referees for giving helpful comments to improve this paper.
References
Amari, S., Nagaoka, H.: Method of Information Geometry. Amer. Math. Soc., Providence, Oxford University Press, Oxford (2000)
Amari, S.: Information Geometry and Its Applications. AMS, vol. 194. Springer, Tokyo (2016). doi:10.1007/978-4-431-55978-8
Amari, S., Ohara, A., Matsuzoe, H.: Geometry of deformed exponential families: invariant, dually-flat and conformal geometry. Phys. A 391, 4308–4319 (2012)
Beck, C., Schlögl, F.: Thermodynamics of Chaotic Systems: An Introduction. Cambridge University Press, Cambridge (1993)
Kurose, T.: On the divergences of \(1\)-conformally flat statistical manifolds. Tôhoku Math. J. 46, 427–433 (1994)
Lauritzen, S.L.: Statistical manifolds. In: Differential Geometry in Statistical Inferences. IMS Lecture Notes Monograph Series, vol. 10, pp. 96–163. Hayward California, Institute of Mathematical Statistics (1987)
Matsuzoe, H.: A sequence of escort distributions and generalizations of expectations on \(q\)-exponential family. Entropy 19(1), 7 (2017)
Matsuzoe, H., Henmi, M.: Hessian structures and divergence functions on deformed exponential families. In: Nielsen, F. (ed.) Geometric Theory of Information. SCT, pp. 57–80. Springer, Cham (2014). doi:10.1007/978-3-319-05317-2_3
Matsuzoe, H., Wada, T.: Deformed algebras and generalizations of independence on deformed exponential families. Entropy 17(8), 5729–5751 (2015)
Murata, N., Takenouchi, T., Kanamori, T., Eguchi, S.: Information geometry of U-boost and Bregman divergence. Neural Comput. 16, 1437–1481 (2004)
Naudts, J.: Generalised Thermostatistics. Springer, London (2011). doi:10.1007/978-0-85729-355-8
Sakamoto, M., Matsuzoe, H.: A generalization of independence and multivariate student’s t-distributions. In: Nielsen, F., Barbaresco, F. (eds.) GSI 2015. LNCS, vol. 9389, pp. 740–749. Springer, Cham (2015). doi:10.1007/978-3-319-25040-3_79
Shima, H.: The Geometry of Hessian Structures. World Scientific, Singapore (2007)
Tanaka, M.: Meaning of an escort distribution and \(\tau \)-transformation. J. Phys: Conf. Ser. 201, 012007 (2010)
Tsallis, C.: Introduction to Nonextensive Statistical Mechanics: Approaching a Complex World. Springer, New York (2009). doi:10.1007/978-0-387-85359-8
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Matsuzoe, H., Scarfone, A.M., Wada, T. (2017). A Sequential Structure of Statistical Manifolds on Deformed Exponential Family. In: Nielsen, F., Barbaresco, F. (eds) Geometric Science of Information. GSI 2017. Lecture Notes in Computer Science(), vol 10589. Springer, Cham. https://doi.org/10.1007/978-3-319-68445-1_26
Download citation
DOI: https://doi.org/10.1007/978-3-319-68445-1_26
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-68444-4
Online ISBN: 978-3-319-68445-1
eBook Packages: Computer ScienceComputer Science (R0)