1 Introduction

A large number of papers have reported micrographic imaging of graphene membranes, revealing the underlying lattice structure with superimposed local variations in topography and induced charges. For example, Hamilton [9] showed typical structural features at several scales in transmission electron micrographs. Costamagna and Dobry [3] studied 2D to 3D transitions in graphene sheets and obtained a correlation between the standard deviation of the out of plane distance and the mean lattice dimension. Meyer et al. [16] showed a transmission electron micrograph of a few-layer graphene membrane and Stolyarova et al. [18] used scanning tunneling microscopy to reveal vertical variations of 0.8 nm over regions of 20 nm. Couto et al. [4] found a linear correlation between reciprocal mobility and the carrier density in all samples tested: thus devices with smaller density fluctuations had greater mobility and this is consistent with the effect of random strain variations in graphene. They were able to confirm the role of random strain fluctuations as an important source of disorder and so account for the above mentioned correlation. Whether this relationship can be linked to a linear correlation between strain standard deviation and mean is an interesting question.

Ishigami et al. [11] described a novel heat cleaning method in argon/hydrogen which at the 600 nm scale reduced the standard deviation of height of graphene on \(SiO_2\) from 8 to 3 Å. The nominal monolayer thickness is 3.4 Å in bulk graphite and on \(SiO_2\) the layer thickness was measured with an AFM as 4.2 Å; in air the thickness was 9 Å because of the presence of impurity ambient species at the interface or on the graphene monolayer. From this it suggests that the standard deviation of local thickness of a graphene stratum is of the same order as its mean thickness at the 600 nm scale. We note also the report by Xu et al. [19] of thermal effects on height fluctuations of freestanding graphene during scanning tunnelling microscopy experiments.

We consider first the simplest situation of tiny Gaussian variations applied to the locations of the carbon atoms in the nominally planar hexagonal graphene structure, which has lattice constants 2.461 and 6.708 Å. Such, sparse, distributed structural fluctuations would result in behavioural variations, as would departures from planarity caused by substrate carried impurities or other ambient species trapped in between. We believe that these type of fluctuations may be treated as spatially distributed variations and so modelled by probability density functions, which we may formulate as a geometrical space on which transitions or evolution of fluctuation patterns may be represented and linked to observable physical behaviour.

The degeneration of 2D crystal order was simulated by Lucarini [13] through perturbations of the three regular tessellations of the plane: square, hexagonal and triangular, by an increasing spatial Gaussian noise applied to vertices. Physically, there the perturbing spatial noise intensity corresponds to a lattice temperature in the structural symmetry degeneration. The statistical parameters of the evolving changes were analyzed through those of the (convex) cells in the Voronoi tessellations, which are optimal partitions of the space from the given set of generating vertices of the structure. In all cases the gamma distribution was an excellent model for the observed probability density functions of all metric properties: inter-vertex distance, perimeters of polygons, areas of polygons and with the same result for 3D cubic crystal lattices (SC, BCC, FCC) [15], where also the volumes of polyhedra followed gamma distributions. With the onset of noise, quite quickly the three 2D tessellations became indistinguishable. Similar results were found also for perturbations of the three 3D cubic crystal lattices [5, 15]. Lucarini [14] suggested that such an approach could be made to model the structure of graphene.

The results of Lucarini’s simulations of 2D and 3D crystal disordering were put into an information geometric framework [5] by means of which the space of perturbations was represented by a Riemannian manifold of gamma probability density functions provided with the Fisher information metric [2]. That allowed the representation of the tessellation-constrained degeneration down to the Poisson Voronoi limit.

Remark

Here we are not concerned with such large scale degeneration as in [13] but with very small variations in the hexagonal structure of graphene, which might resemble the early stages of the Gaussian perturbations of a hexagonal crystal lattice. The small scale of such perturbations mitigates the concern that applying the same Gaussian variation to the location of carbon atoms for graphene and to the ‘mother’ point for Voronoi tessellation could result in different structures. We believe that at the scale envisaged this difference would be small, but certainly worth investigating through simulations, and if possible experimentally. A further concern in the case of graphene was that the dynamical behaviour of the network of carbon atoms could possibly bring about different distributional properties from the case of ‘static’ Voronoi tessellations. Again this is worthy of investigation but at small scales the dynamical variations might be expected to yield oscillations that centred on the mean static case.

The application of the information geometric methodology to real data depends on estimating the mean and variance of the distributions of e.g. hexagon perimeters in an ideal planar model but additionally obtaining also distributions of out of plane distances for vertices if non-planar fluctuations are present. Such data could then be used to represent the distributions of fluctuations from a regular planar hexagonal crystal, either using individual lengths or combinations thereof. In any case we expect that the distributions involved will be gamma and that variations in the influences causing the fluctuations will move the distribution about in the space of gamma distributions. We outline how an information geometric approach may help model the behaviour through the provision of a geometrically formulated natural state space on which to represent physical influences. It may be that the bivariate gamma distribution discussed with applications in [2] could be used as the model if correlated horizontal and vertical fluctuations are observed in graphene.

The gamma distribution, which seems to be encountered in many naturally occurring processes [2], can be characterised by the following uniqueness theorem:

Theorem 1.1

([10, 12]) For independent positive random variables with a common probability density function f,  having independence of the sample mean and the sample coefficient of variation is equivalent to f being the gamma distribution.

A proof was given by Hwang and Hu [10] but the result seems to have been known earlier and in [6] we gave a proof partly based on the 1954 article by Laha [12].

The family of gamma distributions with random variable x in event space \(\Omega ={\mathbb {R}}^+\) has a collection of probability density functions given by

$$\begin{aligned} \left\{ f(x;\mu ,\kappa )=\left( \frac{\kappa }{\mu }\right) ^\kappa \, \frac{x^{\kappa -1}}{\Gamma (\kappa )}\, e^{-x\kappa /\mu }\ | \ \mu ,\kappa \in {\mathbb {R}}^+\right\} \equiv {\mathbb {R}}^+\times {\mathbb {R}}^+ , \end{aligned}$$
(1)

and \(\Gamma \) is the gamma function. The gamma probability density functions (1) depend smoothly on parameters \(\mu ,\kappa \in {\mathbb {R}}^+.\) The mean is \(E[x]=\mu ,\) the variance is \(E[x^2]-E[x]^2=\sigma ^2= \mu ^2/\kappa \) and so we see that \(\frac{1}{\sqrt{\kappa }}\) is the constant of proportionality between the standard deviation and the mean, which reflects the result in Theorem 1.1.

Given a set of identically distributed, independent data values \(X_1,X_2,\ldots ,X_n,\) the ‘maximum likelihood’ or ‘maximum entropy’ parameter values \(\hat{\mu }, \hat{\kappa }\) for fitting the gamma distribution (1) are computed in terms of the mean and mean logarithm of the \(X_i\) by maximizing the likelihood function

$$\begin{aligned} Lik_f(\mu ,\kappa ) = \prod _{i=1}^n f(X_i;\mu ,\kappa ). \end{aligned}$$

Taking the logarithm and setting the gradient to zero we obtain

$$\begin{aligned} \hat{\mu }= & {} \bar{X}=\frac{1}{n}\sum ^n_{i=1}X_i\end{aligned}$$
(2)
$$\begin{aligned} \log \hat{\kappa } -\frac{\Gamma '(\hat{\kappa })}{\Gamma (\hat{\kappa })}= & {} \log \bar{X} - \frac{1}{n}\sum ^n_{i=1}\log X_i \nonumber \\= & {} \log \bar{X} - \overline{\log X} . \end{aligned}$$
(3)

We note that the distribution of a sum of independent gamma random variables is itself gamma distributed. The special case \(\kappa =1\) in (1) corresponds to the situation of the random or Poisson process with mean \(\mu \) (and so also \(\sigma =\mu \)) then the distribution of size of inter-event spaces is exponential, the unique distribution with unit mean having maximum entropy–that is least constraints.

In fact, the gamma distribution has an essential generalising property of the exponential distribution since in particular it represents inter-event distances for generalisations of the Poisson process to a ‘censored’ Poisson process. Indeed, for integer \(\kappa =1,2,\ldots ,\) (1) models a process that is Poisson but with intermediate events removed to leave only every \(\kappa ^{th}.\) Formally, the gamma distribution is the \(\kappa \)-fold convolution of the exponential distribution, called also the Pearson Type III distribution. The Chi-square distribution with integer \(n=2\kappa \) degrees of freedom models the distribution of a sum of squares of n independent random variables all having the Gaussian distribution with zero mean and standard deviation \(\sigma ;\) this is a gamma distribution with mean \(\mu =n\sigma ^2\) for integer \(\kappa =1,2,\ldots .\)

We might expect that the horizontal structural fluctuations in graphene are small-scale rare events in a random process, such as a perturbation of a Poisson spatial process. A wide range of near-Poisson processes is discussed in terms of the information geometry of the gamma family in [2]. In the context of small structural fluctuations in graphene there may be an interpretation of the anticipated very high values of \(\kappa \) compared with unit mean in terms of the censored Poisson process corresponding to rare events, for example, small and rare variations in hexagon perimeter or area. Accordingly, here we investigate the properties of the gamma distribution for possible modeling of graphene, if observational data becomes available then other choices may arise, including bivariate models for vertical and horizontal variability.

Whereas the nominal monolayer thickness of graphene is 3.4 Å in bulk graphite, on \(SiO_2\) it is 4.2 Å and in air the impurities increased the thickness to 9 Å. At the 600 nm scale the standard deviation of height was of similar size to the mean thickness [11], which suggests that the coefficient of variation is unity and the vertical process is approximately exponential.

2 Information Geometry of the Gamma Distribution

For any smooth family of probability density functions

$$\begin{aligned} \{p_\theta |\theta \in {\Theta }\subseteq {\mathbb {R}}^n\} \end{aligned}$$
(4)

defined on some fixed event space \(\Omega \) (typically \({\mathbb {R}}, \ {\mathbb {R}}^+\) or products thereof) the covariance matrix \(\left[ g_{ij}\right] \) is the expectation of the matrix of derivatives of the log-likelihood function \(l=\log p_\theta ,\) with respect to parameters \((\theta ^i).\) This is positive definite and hence defines a Riemannian metric on the smooth n-manifold of probability density functions with coordinates \((\theta ^i).\)

The components of the Riemannian metric are given by the expectation of the covariance matrix of gradients of the log-likelihood function l with respect to the n parameters \((\theta ^i)\) [1, 2]:

$$\begin{aligned} \left[ g_{ij}\right] = E\left( {\partial {l}\over {\partial \theta ^i}}{\partial { l}\over {\partial \theta ^j}}\right) = \left[ {\int _\Omega p_\theta }\left( {\partial {l}\over {\partial \theta ^i}}{\partial { l}\over {\partial \theta ^j}}\right) \right] , \end{aligned}$$
(5)

or equivalently, under mild regularity conditions [17],

$$\begin{aligned} \left[ g_{ij}\right] = E\left( \partial ^2{l}\over {\partial \theta ^i \partial \theta ^j}\right) = \left[ {\int _\Omega p_\theta }\left( \partial ^2{l}\over {\partial \theta ^i \partial \theta ^j}\right) \right] , \end{aligned}$$
(6)

with arc length function

$$\begin{aligned} ds^2 = \sum _{i,j} g_{ij} \, d\theta ^i \, d\theta ^j . \end{aligned}$$

The family (4) is called an exponential family if the \(p_\theta \) admit expression in terms of functions \(\{C,F_1,\ldots ,F_n\}\) on \(\Omega \) and a function \(\varphi \) on \(\Theta \) as:

$$\begin{aligned} p_\theta (x)= e^{\{C(x)+ \sum _i\theta _i\, F_i(x)-\varphi (\theta )\}}\, , \end{aligned}$$
(7)

then these \( (\theta _i)\) are its natural parameters, and \(\varphi \) is the potential function. By integrating (7) over \(\Omega ,\) then using

$$\begin{aligned} \int p(x;\theta ) \, dx =1, \ \ \mathrm{on \ the \ left } \end{aligned}$$

we take logs and obtain:

$$\begin{aligned} \varphi (\theta )= \log \int e^{\{C(x)+ \sum _i\theta _i\, F_i(x)\}} \,dx\,. \end{aligned}$$
(8)

For the gamma distribution, it can be seen that \((\nu =\kappa /\mu ,\kappa )\) are natural parameters and its potential function is

$$\begin{aligned} \varphi (\nu ,\kappa )=\log \Gamma (\kappa ) - \kappa \, \log \nu . \end{aligned}$$
(9)

In these coordinates the components of the metric are given by

$$\begin{aligned} \left[ g_{ij}\right] (\nu ,\kappa )= \left[ \begin{array}{cc} \frac{\kappa }{{\nu }^2} &{} -\frac{1}{\nu } \\ -\frac{1}{\nu } &{} \frac{d^2}{d\kappa ^2 }\log (\Gamma ) \end{array} \right] \ \end{aligned}$$
(10)

In such cases the n-manifold of probability density functions can be represented by a natural affine immersion in \({\mathbb {R}}^{n+1}\) via

$$\begin{aligned} h: (\theta )\in {\mathbb {R}}^n \mapsto \left( (\theta ),\varphi (\theta )\right) \in {\mathbb {R}}^{n+1}. \end{aligned}$$
(11)

So the natural representation of the manifold of gamma distributions M in \({\mathbb {R}}^3\) is

$$\begin{aligned} h: M \rightarrow {\mathbb {R}}^3: (\nu ,\kappa ) \mapsto (\nu ,\kappa , \log \Gamma (\kappa ) - \kappa \log \nu ) \end{aligned}$$
(12)

This is convenient for visualising and metrising curves that depict trajectories in the space of gamma distributions [2]; observational data for graphene structures under varying external conditions could then be visualised and correlated with the physics.

For an exponential family (7) there is a simpler method to compute the information metric (5) from the log-likelihood function \(l(\theta ,x)=\log p_\theta (x):\)

$$\begin{aligned} \partial _{i} l (\theta ,x) = F_i (x)- \partial _{i} \varphi (\theta ) \end{aligned}$$
(13)

and

$$\begin{aligned} \partial _{i}\partial _{j} l (\theta ,x) = - \partial _{i}\partial _{j} \varphi (\theta ) \ , \ \mathrm{\ which \ is \ independent \ of } \ x. \end{aligned}$$
(14)

Then the information metric g on the n-dimensional space of parameters \(\Theta \subset {\mathbb {R}}^n,\) equivalently on the set \(\{p_\theta | \theta \in \Theta \subset {\mathbb {R}}^n \},\) has components:

$$\begin{aligned}{}[g_{ij}](\theta )=-\int _\Omega [\partial _{i}\partial _{j} l (\theta ,x)] \ p_\theta (x) \ dx=\partial _{i}\partial _{j}\varphi (\theta )=[\varphi _{ij}] (\theta ) \ . \end{aligned}$$
(15)

The gamma distribution has a surprisingly tractable information geometry [1, 2], and the Riemannian metric in the 2-dimensional manifold (Mg) of gamma distributions (1) is easily computed from the definition (5) in \((\mu , \kappa )\) coordinates:

$$\begin{aligned} \left[ g_{ij}\right] (\mu ,\kappa )= & {} = \left[ \begin{array}{cc} \frac{\kappa }{{\mu }^2} &{} 0 \\ 0 &{} \frac{d^2}{d\kappa ^2}\log (\Gamma )-\frac{1}{\kappa } \end{array} \right] . \end{aligned}$$
(16)

So the coordinates \((\mu ,\kappa )\) yield an orthogonal basis of tangent vectors, which is useful in calculations because then the arc length function in M is simply

$$\begin{aligned} ds^2=\frac{\kappa }{\mu ^2} \, d\kappa ^2 + \left( \left( \frac{\Gamma '(\kappa )}{\Gamma (\kappa )}\right) ' - \frac{1}{\kappa }\right) \, d\kappa ^2 = \frac{\kappa }{\mu ^2} \, d\kappa ^2 + \left( \psi '(\kappa ) - \frac{1}{\kappa }\right) \, d\kappa ^2 \end{aligned}$$

where \(\psi ^\prime (\kappa )=\left( \frac{\Gamma '(\kappa )}{\Gamma (\kappa )}\right) '.\)

Such a manifold with the Levi-Civita metric connection (cf. eg [7]) is a pseudosphere, with its negative scalar curvature given by [2]:

$$\begin{aligned} R{(\kappa )}=\frac{ \psi '(\kappa ) + {\kappa }\,\psi ''(\kappa ) }{4\, {\left( \kappa \,\psi '(\kappa ) -1\right) }^2} \ \ \ \mathrm{so}\ -\frac{1}{2}<R{(\kappa )}<- \frac{1}{4} \end{aligned}$$
(17)

from which it follows that \(R{(\kappa )}\rightarrow - \frac{ 1}{2}\) as \(\kappa \rightarrow \infty .\)

In the context of horizontal structural fluctuations of inter-vertex distances or hexagon perimeter length in graphene with a unit mean, we expect the standard deviation \(\sigma =\frac{\mu }{\sqrt{\kappa }}<<1\) hence it follows that \(\kappa >>1\) and the scalar curvature (17) will be close to \(-\frac{1}{2}.\)

3 A State Space for Graphene Structural Fluctuations

For the standard deviation of, say, hexagon perimeters, \(\sigma (P)=\mu (P)/\kappa \), to be between 1 and 10 % of the mean, for example, we have a range of \(100 < \kappa < 10000\) for the relevant gamma distributions. A propos the characterisation of the gamma distribution in Theorem 1.1, Couto et al. [4] confirmed the role of random strain fluctuations as an important source of disorder and so accounted for the observed correlation between reciprocal mobility and the carrier density in all samples tested. Whether this relationship can be linked to a linear correlation between strain standard deviation and mean strain is an interesting question from the point of view of a model using a gamma distribution.

Formulating the family of gamma probability density functions as a Riemannian manifold in natural coordinates we can use geometric methods to describe the progress of fluctuation changes on property behaviour and provide a graphical representation through a natural affine immersion in \({\mathbb {R}}^3.\)

Given a differential equation on the manifold to represent an evolution of fluctuations, its integral curves would yield trajectories for the structural state. On a space of probability density functions there is always one natural vector field, arising from the gradient field of the entropy function. This differential equation and its integral curves in the manifold of gamma probability density functions represent progress of random disordering (maximising entropy) as intensity of disturbance increases. So this field may represent well enough the initial development of structural changes with increasing fluctuation intensity in graphene. Then the physical manifestation in properties or associated behaviour might be linked to the effect of fluctuation intensity.

The Shannon entropy of the gamma family (1) is given in both sets of coordinates by:

$$\begin{aligned} S_f (\nu ,\kappa )= & {} -\int _0^\infty f \, \log f \ dx :{\mathbb {R}}^{2+} \rightarrow {\mathbb {R}} \nonumber \\ (\nu ,\kappa )\mapsto & {} \kappa -(\kappa -1) \psi (\kappa )+\log (\Gamma (\kappa ))-\log (\nu ) \end{aligned}$$
(18)
$$\begin{aligned} (\mu ,\kappa )\mapsto & {} \kappa -(\kappa -1) \psi (\kappa )+\log (\Gamma (\kappa ))-\log (\mu /\kappa ) \end{aligned}$$
(19)

where \(\psi = \frac{\Gamma '}{\Gamma }\) is the digamma function. The entropy gradient vector field is

$$\begin{aligned} \nabla S_f(\nu ,\kappa )= & {} \left\{ -\frac{1}{\nu },1-(\kappa -1) \psi ^{(1)}(\kappa )\right\} \end{aligned}$$
(20)
$$\begin{aligned} \nabla S_f(\mu ,\kappa )= & {} \left\{ -\frac{\kappa }{\mu },1-(\kappa -1) \psi ^{(1)}(\kappa )\right\} \end{aligned}$$
(21)

and an unconstrained degeneration of order would follow its integral curves down to \(\kappa =1.\) In fact of course we do not have unconstrained disordering for graphene because we must retain the hexagonal structure, so we would expect a slightly less steep descent with increasing fluctuation intensity, ie with increasing noise level a in the terms of the hexagonal crystal lattice simulations. For the hexagonal simulation [15] the distribution of perimeters of the hexagons followed a gamma distribution; the mean perimeter was \(\bar{P}\approx 0.65\) and the standard deviation of the perimeter was given approximately by

$$\begin{aligned} \sigma (P) \approx \frac{a}{\sqrt{6}} \ \ \mathrm {for} \ 0<a<0.5. \end{aligned}$$

In the 2D crystal cases simulated by Lucarini [13] the hexagonal structure was stable under small perturbations but the square and triangular structures were not stable. However, with the onset of noise, quite quickly all three possible tessellations became indistinguishable above noise \(a\approx 0.5.\) With intense noise they converged to the 2D Poisson-Voronoi tessellations, for which exact analytic results are known [8]. The limiting values were \(\kappa \approx 16\) for the perimeter of polygons and \(\kappa \approx 3.7\) for areas. Hence in our context, for fluctuations of order 1% of the mean, say, \(\sigma (P)\approx 0.01 \mu (P)\approx 0.0065,\) which corresponds to the noise amplitude \(a\approx 0.016\) in [13].

4 Discussion

Ideally, we would like experimental data on vertical and horizontal spatial variations in graphene. Certainly there is little in the published literature on the distribution and scale of structural fluctuations that have been observed. Until more details become available we take the view that a representation of spatially distributed fluctuations could provide a stimulus to generate suitable data; when such data is available the model here is easily adapted, and could incorporate a bivariate distribution for vertical and horizontal variability. In due course it may be that a product distribution of vertical and horizontal variations may seem more appropriate, then, for example the bivariate gamma distribution described with applications in [2] may be appropriate. Such a bivariate model will be considered elsewhere.

If variations other than Gaussian perturbations better represent the fluctuations in graphene, then suitable information geometric models could use appropriate other models. For the present, we have some quantitative clues to the possible parameters of a model for fluctuations in graphene structure. These come from computer simulations of Gaussian perturbations of hexagonal crystal lattices [13, 15], from limited experimental data on spatial variability of vertical deviations [11, 16, 18], and from observed correlation of vertical standard deviation with mean lattice dimension [3] which confirmed the role of random strain variations in graphene.

Whereas the nominal monolayer thickness of graphene is 3.4 Å in bulk graphite, on \(SiO_2\) it is 4.2 Å and in air the impurities increased the thickness to 9 Å. At the 600 nm scale the standard deviation of height was of similar size to the mean thickness, which suggests that the coefficient of variation is unity and the vertical process is approximately exponential and hence arises from a Poisson spatial process. The spatial variations in the horizontal structure are, however, expected to have a standard deviation much smaller than the mean for local hexagon perimeter.

The information geometry of the gamma manifold turns out to be that of a Riemannian pseudosphere [2] with scalar curvature \(R{(\kappa )}\) given in (17), from which it follows that \(R{(\kappa )}\rightarrow - \frac{ 1}{2}\) as \(\kappa \rightarrow \infty .\) In the context of structural fluctuations of inter-vertex distances or hexagon perimeters in graphene with a unit mean, we expect the standard deviation \(\sigma =\frac{\mu }{\sqrt{\kappa }}<<1\) hence it follows that \(\kappa >>1\) and the scalar curvature (17) will be close to \(-\frac{1}{2}.\) There is a natural embedding of this space as a curved surface in \({\mathbb {R}}^3\) [2], and trajectories of structural changes can be represented on this surface as curves to which any observed physical features could be attached.