7.1 Introduction

This chapter describes and compares some methods available in R for selecting and working with high-dimensional graphical models. By ‘high-dimensional’ we are thinking of models with hundreds to tens of thousands of variables. Modelling such data has become of central importance in molecular biology and other fields, but is challenging. Many graph-theoretic operations scale poorly: for example, finding the cliques of a general undirected graph is known to be NP-hard. Model selection algorithms that work well in low-dimensional applications may be quite infeasible for high-dimensional ones. There can be statistical as well as algorithmic limitations: for example, for high-dimensional Gaussian data with modest numbers of observations, maximum likelihood estimates will not exist for complex models. Generally it is necessary to assume that relatively simple models are adequate to model high-dimensional data.

In Sect. 7.2 two example datasets are described. In Sect. 7.3 some model selection algorithms available in R are compared in respect to their scalability. Sections 7.4, 7.5 and 7.6 describe the use of some of the more scalable methods in more detail. Finally, in Sect. 7.7 we describe a Bayesian approach, showing how to identify the MAP (maximum a posteriori) forest for high-dimensional discrete data.

7.2 Two Datasets

We illustrate the methods in this chapter using two datasets. The first is supplied along with gRbase and is taken from a study comparing gene expression profiles in tumours taken from two groups of breast cancer patients, namely those with and those without a mutation in the p53 tumour suppressor gene. See Miller et al. (2005) for a further description of the study.

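The data can be loaded and inspected along the following lines (a sketch; the commented values are those reported below):

    ## load the breast cancer data supplied with gRbase
    data(breastcancer, package = "gRbase")
    dim(breastcancer)          # 250 observations, 1001 variables
    table(breastcancer$code)   # 58 cases, 192 controls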

There are N=250 observations and 1001 variables, comprising 1000 continuous variables (the log-transformed gene expression values) and one binary factor, code. There are 58 cases (with a p53 mutation) and 192 controls (without the mutation).

The second dataset comes from a large multinational project to study human genetic variation, the HapMap project (http://www.hapmap.org/). The dataset concerns a sample of 90 Utah residents with northern and western European ancestry, the so-called CEU population, and contains information on genetic variants and gene expression values for this sample. The genetic variants are SNPs (single nucleotide polymorphisms), that is to say, individual bases at particular loci on the genome that show variation in the population. In statistical terms SNPs are categorical variables with three levels—two homozygotes and a heterozygote. Around 10 million SNPs have been identified in humans. Datasets containing both SNP and gene expression data enable study of the genetic basis for differences in gene expression. The data from this sample are supplied along with the package GGtools in the BioConductor repository.

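A sketch of the extraction is given below. The accessors MAFfilter() and smList(), and the male phenotype variable used for sex, are assumptions about the GGtools/GGBase interface, so the exact calls may differ:

    ## load the HapMap CEU data from the GGtools package (BioConductor)
    library(GGtools)
    data(hmceuB36.2021)
    ## discard SNPs with a minor allele frequency below 10%
    hm <- MAFfilter(hmceuB36.2021, lower = 0.1)
    ## first 200 SNPs from chromosome 21, as characters; missing calls
    ## become a distinct value, giving factors with up to four levels
    snps <- as.data.frame(as(smList(hm)[["21"]][, 1:200], "character"))
    ## first 200 log-transformed gene expression values
    expr200 <- t(Biobase::exprs(hm))[, 1:200]
    ggdata <- data.frame(sex = hm$male, snps, expr200)
    ## convert the discrete variables (sex and the SNPs) into factors
    for (j in 1:201) ggdata[, j] <- factor(ggdata[, j])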

The code loads an object hmceuB36.2021 containing SNP data from chromosomes 20 and 21, gene expression data and other phenotypic information recorded for individuals in the sample. In all it contains data on 199921 SNPs on chromosome 20, 50165 on chromosome 21, and expression values for 47293 genes. The code fragment creates a dataframe ggdata by extracting the individuals' sex, the first 200 SNPs from chromosome 21 and the first 200 log-transformed gene expression values from hmceuB36.2021. Prior to extraction the SNPs are filtered so that SNPs with a minor allele frequency of less than 10% are discarded. Some values of the SNPs are missing, but here the missing values are coded as a distinct character value, so the SNPs are factors with up to four levels. The last line converts the discrete variables into factors.

7.3 Computational Efficiency

A thorough study of the computational efficiency of the algorithms described in this book would be a huge and complex task. In this section, we report on some timings of the algorithms when applied to a specific dataset, in a specific computing environment. It is hoped that this will give at least a rough impression of their relative efficiency on similar datasets in other computing environments.

The algorithms were applied to data from the HapMap project described in Sect. 7.2. For various values of the dimension p, the first p gene expression values were used. So there were 90 cases and p Gaussian variables, where p ranges from 50 to 50000. Three algorithms to select undirected graphical models, and three to select (equivalence classes of) graphical models based on DAGs, were compared. The computations were run under Redhat Fedora 10 Linux on an Intel i7 four-core 2.93 GHz machine with 48 GB RAM. The timings are shown in Fig. 7.1.

Fig. 7.1 Timing comparisons

The undirected model selection methods were:

  (i) The extended Chow–Liu algorithm, implemented in the minForest() function in the gRapHD package, which finds the minimum BIC forest. This is further described in Sect. 7.4.

  (ii) A greedy decomposable search algorithm, implemented in the stepw() function in the gRapHD package, which seeks (but is not guaranteed to find) the minimum BIC decomposable model. See Sect. 7.5 below. Here the minimum BIC forest is used as the initial model.

  (iii) The glasso() function in the glasso package, described in Sect. 4.4.2. Here the tuning parameter ρ=0.2 is used.

The DAG selection methods were:

  (iv) The PC algorithm implemented in the pc() function in the pcalg package, as described in Sect. 4.6.1. Here α=0.05 is used.

  (v) The hill-climbing algorithm implemented in the hc() function in the bnlearn package, as described in Sect. 4.6.2.1.

  (vi) The max-min hill-climbing algorithm implemented in the mmhc() function in the bnlearn package, a hybrid constraint- and score-based algorithm, described in Sect. 4.6.2.2. Here α=0.05 is used.

Note that (i) and (ii) return decomposable models, which represent equivalence classes of DAGs (see Sect. 4.5.1), so these can also be regarded as DAG selection methods.

We note that the algorithms for undirected models are more efficient than those for directed models. The most efficient of the latter is the PC algorithm: however, when p=5000, this takes approximately 24 hours on these data, whereas the extended Chow–Liu algorithm takes about 1 minute.

7.4 The Extended Chow–Liu Algorithm

In a paper predating much of the theoretical development of graphical models, Chow and Liu (1968) described an algorithm to find the maximum likelihood tree model for multivariate discrete data. In modern terminology, tree models are discrete graphical models whose graphs are trees. Trees and forests are special cases of undirected graphs. A forest is an acyclic undirected graph, that is, an undirected graph with no cycles; a tree is a connected acyclic undirected graph. So a forest may have several connected components, each of which is a tree. Chow and Liu showed that finding the maximum likelihood tree can be formulated as finding a maximum weight spanning tree—a task for which highly efficient algorithms exist. Their approach requires first that all edge weights be calculated; a maximum weight spanning tree algorithm is then applied. (The resulting tree may be non-unique if there are ties in the edge weights.)

Usually the algorithm due to Kruskal (1956) is used to find the maximum weight spanning tree. This starts with the null graph and successively selects edges \(e_1, \dots, e_r\). If edges \(e_1, \dots, e_k\) have been selected, the algorithm selects an edge e such that (a code sketch follows the list):

  (a) \(e \notin \{e_{1}, \dots, e_{k}\}\) and \(\{e_{1}, \dots, e_{k}, e\}\) is a forest, and

  (b) e has maximum weight among all edges satisfying (a).
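To make (a) and (b) concrete, here is a minimal self-contained sketch of Kruskal's procedure (the function and variable names are ours, not from any package), using per-vertex component labels to check that a candidate edge keeps the graph acyclic:

    ## greedy maximum weight spanning forest (Kruskal's algorithm);
    ## 'edges' is a data frame with endpoint columns u, v and weight w,
    ## on vertices 1..n
    max_weight_forest <- function(edges, n) {
      comp <- seq_len(n)                    # component label of each vertex
      find <- function(x) { while (comp[x] != x) x <- comp[x]; x }
      edges <- edges[order(-edges$w), ]     # heaviest edges first: ensures (b)
      keep <- logical(nrow(edges))
      for (k in seq_len(nrow(edges))) {
        ru <- find(edges$u[k]); rv <- find(edges$v[k])
        if (ru != rv) {                     # (a): adding e creates no cycle
          comp[ru] <- rv                    # merge the two trees
          keep[k] <- TRUE
        }
      }
      edges[keep, ]
    }

    ## e.g. on a weighted triangle the two heaviest edges are selected:
    max_weight_forest(data.frame(u = c(1, 1, 2), v = c(2, 3, 3),
                                 w = c(3, 2, 1)), n = 3)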

Chow and Liu’s approach may be extended in various ways (Edwards et al. 2010):

  • It can be applied to Gaussian data using appropriate weights.

  • By modifying the weights appropriately it can be adapted to find the minimal AIC or BIC forest. If this has several connected components we can analyze these separately—a dimension reduction that can be very useful with high-dimensional problems.

  • It can be applied to mixed discrete and Gaussian data by modifying the weights appropriately and limiting the search space in (a) to strongly decomposable forests, that is, forests containing no forbidden paths. Recall that a forbidden path is a path between non-adjacent discrete nodes passing through continuous nodes. This restriction implies that in each tree of the forest, the discrete nodes induce a connected subgraph.

  • In the conditional Chow–Liu algorithm (Kirshner et al. 2004) the search space is extended to graphs that include a given set of edges, \(E_0\) say. Formally, the search space becomes

    $$\{ \mathcal{G} = (V,E): E_0 \subseteq E \wedge \mbox{any cycle in}\ \mathcal{G}\ \mbox{has all edges in}\ E_0 \}$$

    To do this, the algorithm starts off from \(\mathcal{G}_{0}=(V,E_{0})\) and (a) is modified to restrict candidate edges to those that do not create new cycles.

The extended algorithm is implemented in the minForest() function in the gRapHD package. It requires the data to be supplied as a dataframe with discrete and/or continuous variables. The discrete variables must be represented as factors. For example, we can apply it to the breastcancer dataframe as follows:

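In outline (a sketch, calling minForest() with its defaults):

    ## minimal BIC forest for the breast cancer data
    library(gRapHD)
    bF <- minForest(breastcancer)
    bF   # a gRapHD object with 1001 nodes and 1000 edges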

By default, the minForest() function returns the minimal BIC forest, in the form of a gRapHD object. Note that bF has 1001 nodes and 1000 edges. Since a forest with n nodes and k connected components has n−k edges, we see that bF is a tree: all its nodes are interconnected.

These gRapHD objects are essentially undirected graphs represented in node and edge list form, in which nodes are identified by their column numbers in the input dataframe. They also contain information on variable types (discrete or continuous) and names (which are used to label the nodes in plots). They may be displayed using the plot() function.

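In principle this is just:

    plot(bF)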

However, plotting a high-dimensional graph like bF would not be a good idea: no structure would be visible. Instead, since we are primarily interested in the effect of the mutation on gene expression, let us look at the neighbourhood of the discrete variable, code. This is the last column of breastcancer, column number 1001. The following two lines of code extract the nodes of bF whose path length from code is less than or equal to 4, and then display the subgraph of bF induced by these nodes.

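A sketch of these two lines; neighbourhood() is the gRapHD helper for path-distance neighbourhoods, and the layout of its return value (vertex numbers in the first column of $v) is an assumption:

    ## vertices of bF within path length 4 of 'code' (vertex 1001)
    nby <- neighbourhood(bF, orig = 1001, rad = 4)$v[, 1]
    ## plot the induced subgraph, using 1000 layout iterations
    plot(bF, vert = nby, numIter = 1000)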

The plot() function, when applied to gRapHD objects, shows discrete variables as dots and continuous variables as circles. It uses the iterative layout algorithm of Fruchterman and Reingold (1991): here we specify 1000 iterations to get a clear layout. We see that the effect of the mutation on gene expression appears to be mediated by its effect on the expression of gene A.202870_s_at. To find out more about this gene we can google this string, from which we learn that under the alias CDC20 the gene “…appears to act as a regulatory protein interacting with several other proteins at multiple points in the cell cycle. It is required for two microtubule-dependent processes, nuclear movement prior to anaphase and chromosome separation.” In other words, it is involved in cell division. Below, using strongly decomposable models, we re-examine the hypothesis that the effect of p53 mutation on gene expression is mediated by its effect on the expression of this gene.

The following code illustrates the extended Chow–Liu approach applied to the ggdata dataset. The minimal BIC forest is obtained using the minForest() function:

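In outline:

    ## minimal BIC forest for ggdata
    ggF <- minForest(ggdata)
    ggF                  # 401 nodes, 392 edges: 9 connected components
    table(Degree(ggF))   # degree distribution; 5 nodes have degree 0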

We see that ggF is a forest with 401−392=9 connected components. The Degree() function returns a vector containing the degree (number of adjacent nodes) of each node: we see that there are 5 isolated nodes. We can identify the components by converting the gRapHD object to a graphNEL object using the as() function, and then applying the connComp() function, which returns a list of components.

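For example:

    ## connected components, via a graphNEL representation
    cc <- connComp(as(ggF, "graphNEL"))
    sapply(cc, length)   # component sizes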

The 9 connected components consist of two large components (with 172 and 218 nodes), two components of 3 nodes each, and 5 isolated nodes. Let us look at the two largest components.

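A sketch; connComp() returns node names, so these are mapped back to column numbers of ggdata for the vert argument:

    ## plot the two largest components separately
    ord <- order(sapply(cc, length), decreasing = TRUE)
    plot(ggF, vert = match(cc[[ord[1]]], colnames(ggdata)), numIter = 1000)
    plot(ggF, vert = match(cc[[ord[2]]], colnames(ggdata)), numIter = 1000)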

We see that the first contains only one discrete variable (sex), but the second contains 189 SNPs and 29 gene expression variables.

7.5 Decomposable Stepwise Search

In a celebrated paper, Chickering (1996) showed that identifying the Bayesian network that maximizes a score function is in general an NP-hard problem, and it is reasonable to suppose that this is also true of undirected graphical models (Markov networks). However, there are ways to improve computational efficiency. A useful approach is to restrict the search space to models with explicit estimates, the decomposable models. The following key result is exploited: if \(\mathcal{M}_{0} \subset \mathcal{M}_{1}\) are decomposable models differing by one edge e={u,v} only, then e is contained in precisely one clique C of \(\mathcal{M}_{1}\), and the likelihood ratio test for \(\mathcal{M}_{0}\) versus \(\mathcal{M}_{1}\) can be performed as a test of \(u \perp\!\!\!\perp v \mid C \setminus \{u,v\}\). These computations involve only the variables in C. It follows that for likelihood-based scores such as AIC or BIC, score differences can be calculated locally—which is far more efficient than fitting both \(\mathcal{M}_{0}\) and \(\mathcal{M}_{1}\)—and then stored, indexed by u, v and C, so that they can be reused if needed in the course of the search. This can lead to considerable efficiency gains.

The stepw() function in the gRapHD package implements forward search through decomposable models to minimize the AIC or BIC. At each step, the edge giving the greatest reduction in AIC or BIC is added. A convenient choice of start model is the minimal AIC/BIC forest, but an arbitrary decomposable start model may be used. We illustrate use of this function by resuming the analysis of the breast cancer dataset. The minimal BIC forest for the neighbourhood of the code variable is obtained as follows.

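A sketch, reusing the neighbourhood nby of code found in Sect. 7.4:

    ## minimal BIC forest for the neighbourhood of 'code'
    bc.nb <- breastcancer[, nby]
    mbF <- minForest(bc.nb)
    plot(mbF, numIter = 1000)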

The gene adjacent to code is A.202870_s_at (CDC20) as we saw before. This suggests that the effect of p53 mutation on gene expression is mediated by its effect on CDC20. However, this might be a consequence of adopting this restrictive—and sparse—model class. It is interesting to expand the search space to decomposable models. The minimal BIC decomposable model is obtained using the stepw() function:

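In outline:

    ## forward search through decomposable models, starting from the
    ## minimal BIC forest (BIC is the default criterion assumed here)
    mbG <- stepw(mbF, bc.nb)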

To plot mbG using the same layout as in the previous plot, we store the node coordinates from the previous plot and reuse them when plotting mbG, as follows:

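A sketch; the disp and coord arguments of the gRapHD plot method are assumptions about its interface:

    ## compute (but do not display) the forest layout, then reuse it
    posn <- plot(mbF, numIter = 1000, disp = FALSE)
    plot(mbG, numIter = 0, coord = posn)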

Although the minimal BIC decomposable model is considerably less sparse, the interpretation is unaltered: it still appears that the effect of p53 mutation on gene expression is mediated by its effect on the expression of CDC20.

An interesting aspect of this example is the presence of so-called hub genes—nodes of high degree—that may play a key role in the regulatory network. It is instructive to compare the degree distributions of the two graphs.

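For example, by cross-tabulating the two degree sequences:

    ## rows: degrees in the forest mbF; columns: degrees in mbG
    table(Degree(mbF), Degree(mbG))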

We see that the hub genes in the—presumably more realistic—graph mbG are reliably identified using the forest mbF.

7.6 Selection by Approximation

Here we illustrate use of the graphical lasso algorithm of Friedman et al. (2008) described in Sect. 4.4.2. We apply it to the breastcancer dataset (omitting the discrete class variable since the algorithm is only applicable to Gaussian data).

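A sketch of the selection step; the coercion of a named logical adjacency matrix to a graphNEL object is assumed to be available (gRbase registers such coercions):

    library(glasso)
    ## Gaussian variables in the neighbourhood of 'code' (drop the
    ## discrete variable itself, column 1001)
    bc <- breastcancer[, setdiff(nby, 1001)]
    S <- cor(bc)                   # correlations: standardizes the variables
    res <- glasso(S, rho = 0.8)    # rho is the L1 penalty parameter
    ## adjacency matrix from the estimated inverse covariance res$wi
    AM <- res$wi != 0
    diag(AM) <- FALSE              # no self-edges
    dimnames(AM) <- list(colnames(bc), colnames(bc))
    g.lasso <- as(AM, "graphNEL")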

The example selects a model for the variables in the neighbourhood of the code variable in the breast cancer dataset (omitting the code variable itself since it is discrete). We apply the glasso() function to the empirical correlation matrix of the variables, in effect standardizing them to unit variance. Since the glasso procedure is not scale invariant, this is normally a sensible step. As penalty parameter we use ρ=0.8: this choice was made so as to obtain a graph of comparable density to those obtained previously. The glasso() function returns a list containing the estimated inverse covariance matrix wi; note that the method combines model selection with parameter estimation. In the code fragment shown we derive the adjacency matrix from the inverse covariance matrix and use this to construct a graphNEL graph of the model. The diagonal elements of the adjacency matrix are set to FALSE to omit self-edges from the graph.

The graph selected by the algorithm contains a module of interconnected variables and a large number of isolated ones. To see the former more closely, we can use the following code:

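For example:

    ## extract and plot the non-trivial module
    library(Rgraphviz)
    cc.l <- connComp(g.lasso)
    module <- cc.l[[which.max(sapply(cc.l, length))]]
    plot(subGraph(module, g.lasso))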

7.7 Finding MAP Forests

In this section we describe a Bayesian equivalent to the minimal AIC/BIC forest approach described in Sect. 7.4. This builds on the framework of Dawid and Lauritzen (1993). We write a collection of d discrete random variables as \(X = (X_v)_{v \in V}\), a generic observation as \(i = (i_1, \dots, i_d)\), and the observed data as \(\mathcal{X} = (i^{(1)}, \dots, i^{(N)})\). We are interested in a collection of graphs Γ with vertex set V. For each graph \(\mathcal{G} \in \varGamma\), let \(\varTheta_{\mathcal{G}}\) be the associated parameter space, and \(\mathcal{L}_{\mathcal{G}}\) be a prior distribution (or law) on \(\varTheta_{\mathcal{G}}\). Then the marginal likelihood of \(\mathcal{G}\) is

$$f(\mathcal{X} | \mathcal{G}) = \int_{\varTheta_{\mathcal{G}}} f(\mathcal{X} | \mathcal{G}, \theta)\mathcal{L}_{\mathcal{G}}(d\theta).$$

If \(\pi(\mathcal{G})\) is the prior probability of \(\mathcal{G}\), then the posterior probability is given as

$$\pi^*(\mathcal{G}) = \pi(\mathcal{G}| \mathcal{X}) \propto f(\mathcal{X} | \mathcal{G}) \pi(\mathcal{G}). $$
(7.1)

The maximum a posteriori (MAP) estimate is the graph in Γ that maximizes \(\pi^{*}(\mathcal{G})\).

We now sketch how a prior distribution \(\mathcal{L}_{\mathcal{G}}\) on \(\varTheta_{\mathcal{G}}\) may be chosen. Consider first an unconstrained multinomial distribution on an array with parameters \(p=(p(i))_{i \in \mathcal{I}}\), and let \(\lambda=(\lambda(i))_{i \in \mathcal{I}}\) be an array of positive numbers. The Dirichlet distribution D(λ) has density

$$\pi(p| \lambda) \propto \prod p(i)^{\lambda(i)-1}.$$

If the prior distribution of p is D(λ) and counts \(n=(n(i))_{i \in \mathcal{I}}\) are observed, then the posterior distribution of p is D(λ+n): in other words, the Dirichlet distribution is the conjugate prior of the multinomial. The numbers λ are called the equivalent sample size, or smoothing parameter.

Dawid and Lauritzen (1993) generalize this to construct the conjugate prior for a decomposable graphical model \(\mathcal{G}\), which they term the hyper-Dirichlet distribution. Essentially this involves specifying a Dirichlet prior for each clique of \(\mathcal{G}\). Let \(\mathcal{C}=(C_{1}, \dots, C_{k})\) be these cliques. Thus a hyper-Dirichlet prior is specified through the collection of arrays \((\lambda_{C})_{C \in \mathcal{C}}\). These must satisfy a consistency criterion, namely that for all cliques \(C, D \in \mathcal{C}\), \(\lambda_C(i_{C \cap D}) = \lambda_D(i_{C \cap D})\) for all cells \(i_{C \cap D}\). Without loss of generality we can specify the \(\lambda_C\)'s by specifying a \(\lambda = (\lambda(i))_{i \in \mathcal{I}}\) for the whole array and setting \(\lambda_C\) to the marginal totals \(\lambda_{C}(i_{C})=\sum_{j \in \mathcal{I}: j_{C}=i_{C}}\lambda(j)\). This construction automatically fulfills the consistency criterion. It also allows the array λ to function as a ‘master prior’ specifying the smoothing parameters of the hyper-Dirichlet prior for any decomposable model.

Dawid and Lauritzen (1993) also show that for a hyper-Dirichlet prior, the marginal likelihood factorizes in a fashion similar to the likelihood:

$$f(\mathcal{X} | \mathcal{G}) = \prod_{i = 1 \dots k} \frac{f(\mathcal{X}_{C_i} | \mathcal{G})}{f(\mathcal{X}_{S_i} | \mathcal{G})} $$
(7.2)

where \(\mathcal{S}=(S_{1}, \dots, S_{k})\) are the separators corresponding to \(\mathcal{C}\). Moreover the factors \(f(\mathcal{X}_{C_i} | \mathcal{G})\) are constant for all \(\mathcal{G}\) in which \(C_i\) is (or is contained in) a clique, so in that sense the conditioning on \(\mathcal{G}\) is unnecessary.

Let now Γ be the set of forests with vertex set V. These models are decomposable, and so using (7.2) we obtain for \(\mathcal{G} \in \varGamma\)

$$f(\mathcal{X} | \mathcal{G}) = \frac{\prod_{e \in E(\mathcal{G})} f(\mathcal{X}_e)}{\prod_{v \in V} f(\mathcal{X}_v)^{d_{\mathcal{G}}(v)-1}} $$
(7.3)

where \(d_{\mathcal{G}}(v)\) is the degree of v in \(\mathcal{G}\). Let BF(e) be the Bayes factor for independence along edge e=(u,v), so that

$$\mbox{BF}(e) = \frac{f(\mathcal{X}_e)}{f(\mathcal{X}_u)f(\mathcal{X}_v)}.$$

Then (7.3) can be written as

$$f(\mathcal{X} | \mathcal{G}) = \prod_{v \in V} f(\mathcal{X}_v) \prod_{e \in E(\mathcal{G})} \mathrm{BF}(e) $$
(7.4)

It follows from (7.1) and (7.4) that, assuming a uniform prior on Γ, we can find the MAP estimate by applying a maximum weight spanning tree algorithm with the logarithms of the BF(e) as edge weights.

From (41) in Dawid and Lauritzen (1993) we can derive an expression for BF(e) in terms of ratios of gamma functions:

$$\mbox{BF}(e) = \frac{\varGamma(\lambda_{\cdot\cdot}+n_{\cdot\cdot})/\varGamma(\lambda_{\cdot\cdot})\prod_{ij} \varGamma(\lambda_{ij}+n_{ij})/\varGamma(\lambda_{ij})}{\prod_{i} \varGamma(\lambda_{i\cdot}+n_{i\cdot})/\varGamma(\lambda_{i\cdot})\prod_{j} \varGamma(\lambda_{\cdot j}+n_{\cdot j})/\varGamma(\lambda_{\cdot j})}$$

where i and j range over the levels of \(X_u\) and \(X_v\) respectively, \(\{n_{ij}\}\) is the corresponding table of counts, \(\{\lambda_{ij}\}\) the corresponding array of smoothing parameters, and the ⋅ notation indicates marginal totals.

The following example illustrates application of this approach to find the MAP forest for a dataset with 400 discrete variables (SNPs). A convenient choice is \(\lambda(i)=\alpha/|\mathcal{I}|\ \forall i\), where α is a scalar. This implies that \(\lambda_{ij} = \alpha/(|X_u||X_v|)\), where \(|X_u|\) and \(|X_v|\) are the numbers of levels of \(X_u\) and \(X_v\). In the following fragment we extract the dataset from the GGtools package, define a function to calculate the logarithms of the Bayes factors, and call the minForest() function specifying that logBF be used to calculate the edge weights. For comparison purposes we also find the minimum BIC forest:

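A sketch follows. The pairwise logBF() implements the gamma-function expression above with \(\lambda_{ij} = \alpha/(|X_u||X_v|)\) in base R; the data extraction reuses the GGtools accessors assumed in Sect. 7.2 (the choice of chromosome 21 is also an assumption), and the final two calls assume that minForest() accepts a user-defined stat function as edge weight, with the exact callback signature as documented in ?minForest:

    ## 400 SNPs, coded as factors
    hm <- MAFfilter(hmceuB36.2021, lower = 0.1)
    snpdata <- as.data.frame(as(smList(hm)[["21"]][, 1:400], "character"))
    for (j in seq_along(snpdata)) snpdata[, j] <- factor(snpdata[, j])

    ## log Bayes factor for independence along the edge (u,v), from the
    ## ratio-of-gamma-functions expression above
    logBF <- function(x, y, alpha = 1) {
      n   <- table(x, y)                       # counts n_ij
      lam <- array(alpha / length(n), dim(n))  # lambda_ij = alpha/(|X_u||X_v|)
      (lgamma(sum(lam) + sum(n)) - lgamma(sum(lam))) +
        sum(lgamma(lam + n) - lgamma(lam)) -
        sum(lgamma(rowSums(lam) + rowSums(n)) - lgamma(rowSums(lam))) -
        sum(lgamma(colSums(lam) + colSums(n)) - lgamma(colSums(lam)))
    }

    mapF <- minForest(snpdata, stat = logBF)   # MAP forest (logBF weights)
    bicF <- minForest(snpdata)                 # minimum BIC forest (default)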

Then we display the two graphs:

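In outline:

    ## display the MAP forest and the minimal BIC forest
    plot(mapF, numIter = 1000)
    plot(bicF, numIter = 1000)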

We see that the MAP estimate has many isolated vertices, indicating that for weakly associated variables the logBF edge weights become negative more readily than the BIC-based weights.