Adaptive AMG with coarsening based on compatible weighted matching

D’Ambra, Pasqua; Vassilevski, Panayot S.

doi:10.1007/s00791-014-0224-9

Adaptive AMG with coarsening based on compatible weighted matching

Published: 17 September 2014

Volume 16, pages 59–76, (2013)
Cite this article

Download PDF

Access provided by CONRICYT – Journals CONACYT

Computing and Visualization in Science

Adaptive AMG with coarsening based on compatible weighted matching

Download PDF

Pasqua D’Ambra¹ &
Panayot S. Vassilevski²

534 Accesses
27 Citations
Explore all metrics

Abstract

We introduce a new composite adaptive Algebraic Multigrid (composite $\alpha $AMG) method to solve systems of linear equations without a-priori knowledge or assumption on characteristics of near-null components of the AMG preconditioned problem referred to as algebraic smoothness. Our version of $\alpha $AMG is a composite solver built through a bootstrap strategy aimed to obtain a desired convergence rate. The coarsening process employed to build each new solver component relies on a pairwise aggregation scheme based on weighted matching in a graph, successfully exploited for reordering algorithms in sparse direct methods to enhance diagonal dominance, and compatible relaxation. The proposed compatible matching process replaces the commonly used characterization of strength of connection in both the coarse space selection and in the interpolation scheme. The goal is to design a method leading to scalable AMG for a wide class of problems that go beyond the standard elliptic Partial Differential Equations (PDEs). In the present work, we introduce the method and demonstrate its potential when applied to symmetric positive definite linear systems arising from finite element discretization of highly anisotropic elliptic PDEs on structured and unstructured meshes. We also report on some preliminary tests for 2D and 3D elasticity problems as well as on problems from the University of Florida Sparse Matrix Collection.

αAMG Based on Weighted Matching for Systems of Elliptic PDEs Arising from Displacement and Mixed Methods

Application of the AmgX Library to the Discontinuous Galerkin Methods for Elliptic Problems

Robust Algebraic Multilevel Preconditioners for Anisotropic Problems

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

We are interested in solving large and sparse linear systems of equations

$$\begin{aligned} A \mathbf{x}= \mathbf{b}, \end{aligned}$$

where $A \in \mathbb {R}^{n \times n}$ is assumed symmetric positive definite (s.p.d.), by algebraic multigrid (AMG) and more specifically by aggregation based AMG. The AMG methods, originated in [5], together with the smoothed aggregation AMG (or SA AMG) [29], have become a powerful tool for solving problems of linear algebraic equations that typically arise from discretization of elliptic PDEs. In recent years substantial progress has been made to extend the applicability of AMG to more general sparse linear systems by developing methods that use appropriate adaptive strategies (cf., [3, 4, 8, 9, 20, 23], etc.) aimed at capturing the near-null components of the error (sometimes referred to as algebraically smooth components) that the current solver cannot efficiently handle so that they are then used to improve the solver by modifying its hierarchy of coarse spaces.

The approach that we utilize builds upon the adaptive AMG ideas however presents several new features. It is also fairly general in the sense that we do not assume any specific knowledge of the near-nullspace of $A$ (or of a preconditioned version of $A$, such as $B^{-1}A$). The main philosophy is the same as in the original adaptive AMG papers (cited above); namely we test the current method (represented by an operator $B$) applied to the trivial system $A {\mathbf x}= 0$ starting with a nonzero random initial iterate ${\mathbf x}$, by computing ${\mathbf x}:= (I-B^{-1}A) {\mathbf x}$, which effectively provides an approximation to the eigenvector of $B^{-1}A$ corresponding to the minimal eigenvalue of $B^{-1}A$. If during this process a slow convergence is encountered, we use the most recent iterate to build a new coarse hierarchy. This is the first main difference with the previously studied adaptive AMG methods. As a result, we end up with a composite AMG solver $B$, given by the product formula

$$\begin{aligned} I-B^{-1}A = \prod _j (I-B^{-1}_j A), \end{aligned}$$

where each $B_j$ corresponds to a separate hierarchy constructed driven by a particular algebraically smooth vector.

Another difference in our approach is in the coarsening process employed to obtain a multilevel hierarchy. We consider coarsening by pairwise aggregation based on a weighted matching (for definitions, see Sect. 2) applied to the matrix adjacency graph. At each level of the hierarchy, starting from a maximum product matching of the graph associated with the current matrix, we generate two complementary coarser vector spaces by simple piecewise constant interpolation of a given algebraically smooth vector. We select the coarse space based on the principles of compatible relaxation (originated in [2]), i.e., we test the convergence of a pointwise smoother on homogeneous systems associated to the two available coarser matrices and choose as new coarse matrix and new algebraically smooth vector those for which slower convergence is observed. In fact, if we use matching so that the aggregates gather together pairs of fine degrees of freedom (or dofs) that are “strongly connected” the complementary space gives rise to a hierarchically complement matrix that is well-conditioned (when preconditioned by the smoother). In general, the procedure can end up building a binary tree of multiple coarse spaces by matching-based aggregation where, at each level, selection of coarsening branch is based on compatible relaxation of a given vector. We use both optimal solution for maximum product matching and an approximation algorithm and demonstrate the performance of our adaptive AMG on the difficult (for multigrid) s.p.d. linear systems arising from discretization of anisotropic PDEs on structured and unstructured meshes. In particular, we demonstrate that our coarsening strategy clearly detects the direction of anisotropy in both structured and unstructured mesh cases. We also include some preliminary tests of the method on (2D and 3D) elasticity problems as well as on some matrices from the University of Florida Sparse Matrix Collection.

The remainder of the paper is organized as follows. In Sect. 2, we recall the notion of graph associated to a sparse matrix and remind the relation between maximum product bipartite matching and linear algebra applications. Then we describe the algorithm for pairwise aggregation based on weighted matching. In Sect. 3, we introduce two algebraic coarsening processes based on the pairwise aggregation, depending on the weights we used for matching. The actual coarse vector space is chosen based on compatible relaxation principles. In Sect. 4, we outline the bootstrap strategy employed to build a composite $\alpha $AMG with a prescribed convergence rate, whereas in Sect. 5 we present an extensive set of numerical results illustrating our approach. Finally, some remarks and future work are included in Sect. 6.

2 Pairwise aggregation based on weighted matching

To find matching in a graph is a classical problem in combinatorial optimization which has wide range of applications in Sparse Linear Algebra [13]. The starting point is the representation of the sparse matrices in terms of graphs [27]. Let $A=(a_{ij})_{i,j=1, \ldots , n}$ be a sparse matrix, the graph associated with $A$ is the pair $G_U=(V, E)$, where the vertex set $V$ correspond to the row/column indices of $A$ and the edge set $E$ corresponds to the set of nonzeros of the matrix $A$ so that $(i,j) \in E$ iff $a_{ij} \ne 0$. For matrices with symmetric sparsity pattern, the edges $(i,j)$ are undirected pairs of vertices, i.e. $(i,j)=(j,i) \in E$ iff $a_{ij} \ne 0$ and $a_{ji} \ne 0$, and $G_U$ is called undirected graph. In the case of a graph $G_P=\{ V_r \cup V_c, E \}$, where the vertex set has a partition to two subsets $V_r$ and $V_c$ (for example the rows and the columns of $A$), such that $(i,j) \in E$ connects $i \in V_r$ and $j \in V_c$, the graph is called bipartite [10]. A matching ${\mathcal M}\subseteq E$ in a graph ($G_U$ or $G_P$) is a set of edges such that no two edges share the same vertex. The number of edges in ${\mathcal M}$ is called the cardinality of the matching and a matching for $G_U$ or $G_P$ is referred to as perfect one if its edges touch all vertices. We refer to [13] and the reference herein for conditions which guarantee the existence of perfect matching. A perfect matching ${\mathcal M}$ for $G_U$ or $G_P$ corresponds to $n$ nonzeros no two of which are in the same row or column and can be represented in terms of a column permutation

$$\begin{aligned} \pi _{ji}= \left\{ \begin{array}{ll} 1, &{} \quad \text{ if } (i,j) \in \mathcal {M}\\ 0, &{} \quad \text{ otherwise } \end{array} \right. \end{aligned}$$

such that the matrix $A\pi $ has a zero-free diagonal. Generally, in linear algebra applications, we are interested in finding matching that controls the size of the diagonal elements of $A\pi $, and such a requirement is formulated in terms of a maximum weighted matching problem, i.e. in finding a matching ${\mathcal M}\subseteq E$ such that $C({\mathcal M})= \sum _{ (i,j) \in {\mathcal M}} c_{ij} = \max _{{\mathcal M}^{'}} C(\mathcal {M}^{'})$, with ${\mathcal M}^{'}$ matching of $G_U/G_P$ and $c_{ij} \ge 0$ edge weights. In particular, matrices with larger entries on the diagonal can be obtained by solving the following optimization problem [12, 13].

Maximum Product Bipartite Matching Problem: Given a graph $G_P$ corresponding to a sparse matrix $A$, find a matching ${\mathcal M}$ that maximizes the product of the matched entries, i.e., find a permutation matrix $\pi $ such that $\prod abs((A\pi )_{ii})$ is maximum among all permutations.

Therefore, if row $i$ is matched to column $j$ in a maximum product bipartite matching problem, we can reasonably assume that $|a_{ij}| \approx max_{k \ne i} |a_{ik}|$, which in terms of the classical AMG characterization of the strength of matrix connections is equivalent to say that index $i$ is strongly connected to index $j$. The difference is that the maximum product bipartite matching problem optimizes a global measure, whereas in classical AMG the strength of connection is a local notion. We demonstrate in the present paper that this global matching is able to capture very accurately the direction of strong anisotropy for difficult AMG test problems with anisotropy that is not grid-aligned. We note however that the maximum product bipartite matching problem if implemented exactly can become too costly, on the other hand a similar matching problem can be described for undirected graphs, so in practice we use an approximation of the maximum product matching problem in undirected graph to end up with setup cost of order ${\mathcal O}(n)$ and still be able to capture the direction of strong anisotropy as in the more expensive accurate solution of the maximum product bipartite matching problem.

Starting from the above considerations, we propose a coarsening process based on the pairwise aggregation described in Algorithm 1. It builds a partition $\mathfrak {a}_k, \; k=1, \ldots , n_c$ of the index set $\{1, \ldots , n\}$, where each aggregate $\mathfrak {a}_k$ is generally a pair of matched indices. In the general case of possible unmatched indices, i.e., in the case of non-perfect matchings (structurally rank-deficient matrices) or sub-optimal solutions, we can obtain a partition with possible singletons.

We observe that Algorithm 1 is an automatic aggregation procedure once the matching is being constructed; it only uses information on the matrix entries and no additional information is needed. We note that to use pairwise aggregation for coarsening is not a new concept; it has been used previously, e.g., in the widely used partitioner METIS [18] and it seems to be a common practice nowadays, cf., e.g., [7] and the references therein. Our pairwise aggregation does not depend on any user-defined threshold for strong/weakly connection or coarse-grid quality measure, as in the case of pairwise aggregations proposed in [24, 25]. A main novelty in our procedure is the connection which we recognized between the aggregation based on weighted matching and the algorithms and software developed by the sparse direct solvers community that utilizes matchings to reorder the sparse matrix with the goal to improve its diagonal dominance [13]. In the following Sect. 3, we employ the aggregation procedure within an adaptive method exploiting the relation between aggregation based on maximum product matching and the compatible relaxation methods investigated previously [2, 6, 21].

Computation of a maximum product matching in a graph is a challenging problem in terms of computational complexity, indeed classical algorithms require a running time of [11]. On the other hand, the problem can be solved for bipartite graphs with the widely used algorithm described in [12] and implemented in the HSL-MC64 subroutine [16], whose computational complexity is , where $nnz$ is the number of nonzeros of the matrix. The latter cost is a worst case estimate. At any rate, from AMG perspective the latter cost is still unacceptable since our ultimate goal is aiming at ${\mathcal O}(n)$ algorithm. For that reason, we also use an approximate version of a maximum product matching algorithm in an undirected graph that uses ${\mathcal O}(n)$ operations. We demonstrated that, although in the case of approximate matching, the coarsening ratio of our approach is reduced with respect to the coarsening by a factor of two in the exact matching, the overall performance of the adaptive process does not deteriorate substantially.

3 Coarsening based on compatible weighted matching

3.1 Main ingredients for coarsening

Given a set of aggregates $\mathfrak {a}_1, \ldots \mathfrak {a}_{n_c}$, built by Algorithm 1, and a starting (arbitrary) vector ${\mathbf w}$, per each pair $\mathfrak {a}_l=\{i,j\}, \; l=1, \ldots , n_p$, let

$$\begin{aligned} {\mathbf w}_{\mathfrak {a}_l}=\frac{1}{\sqrt{w^2_i + w^2_j}}\left[ \begin{array}{c} w_i\\ w_j \end{array} \right] , \; \; {\mathbf w}^\perp _{\mathfrak {a}_l}=\frac{1}{\sqrt{w^2_i + w^2_j}}\left[ \begin{array}{c} -w_j\\ w_i \end{array} \right] \end{aligned}$$

be the normalized restrictions of ${\mathbf w}$ to the set $\mathfrak {a}_l$ and its orthonormal complement. We then define the following matrices:

$$\begin{aligned} \tilde{P}_c&= \text {blockdiag}( {\mathbf w}_{\mathfrak {a}_1}, \ldots , {\mathbf w}_{\mathfrak {a}_{n_p}} ) \in \mathbb {R}^{2n_p \times n_p},\\ \tilde{P}_f&= \text {blockdiag}({\mathbf w}^\perp _{\mathfrak {a}_1}, \ldots , {\mathbf w}^\perp _{\mathfrak {a}_{n_p}}) \in \mathbb {R}^{2n_p \times n_p}. \end{aligned}$$

For the singletons $\mathfrak {a}_l=\{k\}, \; l=1, \ldots , n_s$, ($n_c = n_p + n_s$, $n = 2n_p + n_s$), we introduce the diagonal matrix:

$$\begin{aligned} W=diag(w_k/|w_k|) \in \mathbb {R}^{n_s \times n_s}. \end{aligned}$$

From the above matrices, we obtain two prolongation matrices corresponding to two complementary coarse index sets:

$$\begin{aligned} P_c = \left( \begin{array}{cc} \tilde{P}_c &{} 0\\ 0 &{} W \end{array} \right) \in \mathbb {R}^{n \times n_c}, \; \; P_f= \left( \begin{array}{c} \tilde{P}_f \\ 0 \end{array} \right) \in \mathbb {R}^{n \times n_p}. \end{aligned}$$

(3.1)

The $n \times n_c$ matrix $P_c$, referred to as tentative prolongator, maps vectors associated with the coarse index set $\{1,\;2,\; \ldots , n_c\}$ on the original fine-grid set $\{1,\;2,\;\ldots ,\;n\}$, whereas $P_f$, referred to as complementary tentative prolongator, is an $n \times n_p$ matrix which transfers vectors associated with the complementary coarse index set $\{1, \;2,\;\ldots , n_p\}$ also on the fine-grid index set $\{1,\;2,\;\ldots ,\;n\}$. We recall that $n_c = n_p + n_s$ and $n = 2n_p + n_s$, where $n_p$ is the number of pairwise aggregates and $n_s$ is the number of singletons. Note that $\mathbb {R}^n = \mathrm {Range}(P_c) \oplus ^{\perp } \mathrm {Range}(P_f)$, where $\mathrm {Range}(P_c)\ni \mathbf{w}$ and $\mathrm {Range}(P_f) \ni \mathbf{w}^\perp $ form an orthogonal decomposition of $\mathbb {R}^n$. In other words, we have that the matrix $P = \left[ P_f,\; P_c \right] $ has orthogonal columns.

After proper reordering of $A$, the following two coarser matrices can be formed via Galerkin triple matrix product

$$\begin{aligned} A_c&= P_c^TAP_c \in \mathbb {R}^{n_c \times n_c}, \nonumber \\ A_f&= P_f^TAP_f \in \mathbb {R}^{n_p \times n_p}. \end{aligned}$$

(3.2)

These are the diagonal blocks of the transformed fine-grid matrix $P^TAP$ under the orthogonal transformation $P$, i.e., we have

$$\begin{aligned} P^TAP= \left[ \begin{array}{ll} A_f &{} A_{fc} \\ A_{cf} &{} A_c \end{array} \right] . \end{aligned}$$

The off-diagonal blocks read: $A_{fc}=P_f^TAP_c$ and $A_{cf}=P_c^TAP_f$.

The choice of the best coarse matrix $A_c$ for a multilevel hierarchy can be driven by the basic principle of compatible relaxation first introduced by Brandt in [2] and extended in [14] (see also [30]). The compatible relaxation is defined as a relaxation scheme which is able to keep coarse-level variables invariant. It gives a practical way to measure the quality of a set of coarse variables, indeed, since in an efficient multigrid method relaxation scheme has to be effective on the fine variables, the convergence rate of a compatible relaxation scheme can be used as a measure of the quality of a set of coarse variables. This basic idea was used in different approaches to select coarse grids [6, 21]. Here, we apply the principle of compatible relaxation to choose the best coarse matrix from the two available matrices in (3.2), and the corresponding coarse index set, by applying a simple point-wise relaxation scheme to the homogeneous systems associated to each of the matrices, starting from a random initial guess and then relaxing on the two complementary vector spaces separately. If the vector ${\mathbf w}$ is chosen based on a relaxation scheme applied to the original matrix $A$ so that it is in the near-null space of $A$, it is natural to expect that $A_f$ will be better conditioned than $A_c$. For a more general iterative process, we allow the option to choose between $A_f$ and $A_c$ when selecting the coarse-level variables.

3.2 The multilevel adaptive coarsening schemes

Our overall adaptive multilevel coarsening strategy can be described as follows. We propose two versions. The first one, referred to as coarsening based on compatible matching (version 1) is sketched in Algorithm 2. We start with the given system matrix and a given smooth vector, for example the unitary vector. Then, we apply Algorithm 1 for building the two complementary coarse matrices in (3.2). After that, we test the convergence of a simple smoother on homogeneous systems associated to the two available matrices and choose as new coarse matrix and new algebraically smooth vector those for which slower convergence is observed. The process can be applied in a recursive way until a desired small size of the coarse matrix is obtained. Therefore, our procedure builds a binary tree of multiple coarse spaces by matching-based aggregation, where, at each level, selection of the new coarsening branch is based on compatible relaxation of a given vector.

Note that, as shown in [26], in the case of strongly diagonally dominant or s.p.d. matrices maximum product (perfect) matching produces permutation matrices equal to the identity matrices, i.e. it produces a set of $n$ self-aggregated indices. Therefore, in order to obtain an effective pairwise aggregation, in Algorithm 2, we apply the maximum product matching to the matrix $A^k-\text {diag}(A^k)$, where $\text {diag}(A^k)$ is the diagonal matrix obtained by the diagonal elements of $A^k$. We also observe that in Algorithm 2, when we build the two complementary coarse matrices $A_c$ and $A_f$, we need to compute the normalized restriction of the smooth vector ${\mathbf w}$ on each set of the partition computed by Algorithm 1. It may happen that during the coarsening process, the smooth vector components corresponding to some set of the partition are very small, i.e. the corresponding error components are sufficiently damped by the smoother. In these cases we associate the corresponding unknowns to the vector space $\mathrm {Range}(P_f)$. More specifically, if $\mathfrak {a}_l=\{ i, j\}$ is a pair of matched indices such that $\sqrt{w^2_i + w^2_j} <{ TOL}$, we consider the corresponding indices as unpaired. Furthermore, per each index $i$ such that $|w_i|<{ TOL}$, we consider $i$ as only fine-grid index and we modify operators in (3.1) including a zero row in the diagonal matrix $W$ for $P_c$, while the complementary tentative prolongator appears as in the following:

$$\begin{aligned} P_f = \left( \begin{array}{cc} \tilde{P}_f &{} 0\\ 0 &{} I \end{array} \right) \in \mathbb {R}^{n \times (n_p+n_f)}, \end{aligned}$$

where $I \in \mathbb {R}^{n_f \times n_f}$ is the identity matrix and $n_f$ is the number of only fine-grid indices. In our experiments we choose TOL as the machine epsilon.

Convergence rates in Algorithm 2 can be estimated as the ratios of the A-norm of two successive iterates, that is $\rho _{c/f}=\Vert {\mathbf w}^k_{c/f}\Vert _{A_{c/f}}/\Vert {\mathbf w}^{k-1}_{c/f}\Vert _{A_{c/f}}$.

There is an alternative to Algorithm 2 that we consider, still using both the orthogonal decomposition of $\mathbb {R}^n$ defined by the matrices in (3.1) and the principles of compatible relaxation to build an effective coarsening process. Indeed, after we have built the matrices in (3.2), we accept $A_c$ as coarse matrix if the corresponding complementary matrix $A_f$ is as diagonally-dominant as possible, i.e., if $A_f$ has the compatible relaxation fast to converge. We observe that given the original matrix $A$, its associated graph $G_U$ or $G_P$, and a vector ${\mathbf w}$, the diagonal entries of the resulting $A_f$ are a subset of the following values:

$$\begin{aligned} {\widehat{a}}_{i,j} =\frac{1}{w^2_j+w^2_i}\; \left[ \begin{array}{c} -w_j \\ w_i \end{array} \right] ^T \left( \begin{array}{cc} a_{i,i} &{} a_{i,j}\\ a_{j,i} &{} a_{j,j} \end{array} \right) \left[ \begin{array}{c} -w_j \\ w_i \end{array} \right] , \nonumber \\ (i,j) \in E. \end{aligned}$$

(3.3)

Consider the thus modified symmetric matrix ${\widehat{A}}=\left( {\widehat{a}}_{i,j}\right) $ having a null diagonal and the same sparsity pattern as $A$. Note that building ${\widehat{A}}$ has a computational cost of ${\mathcal O}(nnz)$. Therefore, if we compute a maximum product weighted matching ${\mathcal M}\subseteq E$ from ${\widehat{A}}$ and build the corresponding aggregates, we see that the complementary tentative prolongator $P_f$ in (3.1) produces a matrix $A_f$ which has on its diagonal entries ${\widehat{a}}_{i,j},\; (i,j) \in {\mathcal M}$ with maximal product. The latter can be seen as an approximation to the notion of diagonal dominance giving rise to a fast convergent compatible relaxation. The process can be applied in a recursive way to define a new adaptive coarsening algorithm which we refer to as coarsening based on compatible matching (version 2). It is sketched in Algorithm 3. Note that also in this algorithm, at each level possible small smooth vector entries are associated to only-fine grid indices.

The above two compatible matching-based coarsening algorithms can be used to define a hierarchy of coarse vector spaces and matrices from which a multilevel method $B$ can be designed. In the following we describe an adaptive strategy to improve the efficiency of an initial multilevel method, obtained with compatible matching-based coarsening, by successively building a composite method with a prescribed convergence rate.

4 Composite AMG with prescribed convergence rate

Following the $\alpha $AMG principle, once an algebraic multilevel solver $B$ has been constructed, we test its performance by solving the homogeneous problem $A {\mathbf x}= {\mathbf 0}$, i.e. by performing the following iterations:

$$\begin{aligned} {\mathbf x}_k = (I - B^{-1} A) {\mathbf x}_{k-1}, \quad \quad k=1, 2, \ldots , \end{aligned}$$

starting with a random initial iterate ${\mathbf x}_0$ and monitoring convergence through two successive values of the $A$-norm of the error (which is equal to the respective iterate, since the exact solution is zero). The above iterates provide approximation to the lowest eigenmode of $B^{-1}A$, which is commonly referred to as algebraic smooth vectors with respect to the current AMG method. If the convergence factor of the method is close to one, we can select ${\mathbf w}= {\mathbf x}_k/\Vert {\mathbf x}_k \Vert _A$ and apply one of the coarsening algorithms described in the preceding section to generate a new method $B_1$ based on this new vector ${\mathbf w}$. Assuming that we have constructed two (or more) methods $B_r$, $r =0,1,\ldots ,\;m$ via the above bootstrap scheme aimed at improving the initial AMG, we consider the homogeneous system and monitor the convergence of the following composite method, starting with a random initial guess ${\mathbf x}_0$,

$$\begin{aligned} \mathbf{x}_k= \prod _{r=1}^{m} (I-B_r^{-1}A)\mathbf{x}_{k-1}, \quad \quad k=1, 2, \ldots , \end{aligned}$$

(4.1)

or of its symmetrized version:

$$\begin{aligned} {\mathbf x}_k = \prod ^{2m+1}_{r=0}(I- B^{-1}_r A){\mathbf x}_{k-1}, \quad \quad k=1, 2, \ldots , \end{aligned}$$

(4.2)

where $B_{m+r} = B_{m+1-r}, \; r =1,\;\dots ,\;m+1$. The process may be repeated by computing at each stage a new multilevel method until the convergence rate of the composite AMG is acceptable. The final adaptive procedure is sketched in Algorithm 4.

5 Results

In this section we illustrate the performance of our composite $\alpha $AMG in terms of the cost of the setup phase described in Algorithm 4 and the ability of the coarsening procedures based on maximum product matching to obtain effective coarse grids.

We considered the following anisotropic PDE posed in the unit square, when homogeneous Dirichlet boundary conditions are considered:

$$\begin{aligned} - \text {div}(K\; \nabla u)=f, \end{aligned}$$

where $K$ is the coefficient matrix

$$\begin{aligned} K = \left[ \begin{array}{ll} a &{} c\\ c &{} b \end{array} \right] , \quad \text { with } \quad \left\{ \begin{array}{l} a= \epsilon + \cos ^2(\theta )\\ b= \epsilon + \sin ^2(\theta )\\ c= \cos (\theta )\sin (\theta ) \end{array} \right. \end{aligned}$$

The parameter $0 < \epsilon \le 1$ defines the strength of anisotropy in the problem, while the parameter $\theta $ specifies the direction of anisotropy. In the following we discuss results related to $\epsilon =0.001$ and $\theta = 0$, $\pi /8$, $\pi /4$, $\pi /3$, $\pi /2$ for a total of $5$ test cases, which we refer to as Test Case 1 to 5, respectively. The above problem was discretized by the Matlab PDE toolbox, using bilinear finite elements on triangular and rectangular meshes.

We measure the setup cost in terms of AMG components (nstages) built by the adaptive process in Algorithm 4, both in the case of the coarsening described in Algorithm 2 and in the case of Algorithm 3. In addition to the number of the components, we also report, per each test case and per each mesh, the convergence factor ($\rho $) of the composite solver, the average number of levels (nlev) of all built solver components and the average of their operator complexity (cmpx). This last parameter is commonly defined as the ratio between the sum of nonzero entries of the matrices of all levels and the number of nonzero entries of the fine matrix; it gives an estimate of the cost of application of a cycle. Many algorithmic and parameter choices are possible to test our method; here we discuss results related to the following particular choices. The desired convergence factor required for the composite AMG was set to $\rho _{desired}=0.7$ and a symmetrized multiplicative composition of the AMG components as in (4.2) was applied. The number of iterations used to estimate solver convergence rates at each stage was set to $\nu _2=15$. Weighted Jacobi (with weight $\omega =1/3$ for triangular meshes and $\omega =1/4$ for rectangular meshes) was applied as relaxation scheme in Algorithms 2 and 3, where we have fixed the number of iterations equal to $\nu _1=20$. We stop the coarsening process when the size of the coarsest matrix was at most $\text {maxsize}=100$. Note that we did various experiments with increased values of $\nu _1$ and $\nu _2$ but estimated values of the obtained convergence rates did not differ significantly.

We developed a Matlab implementation of the composite $\alpha $AMG and we analyze its behavior when the coarsening algorithm is based on algorithm HSL-MC64 (Sect. 5.1), or based on a Matlab implementation of the half-approximation maximum weighted matching algorithm for undirected graphs described in [28] (Sect. 5.2).

5.1 Composite AMG based on exact matching

Here we discuss results obtained using the HSL-MC64 routine which, for non-singular matrices, is able to compute a perfect weighted matching for bipartite sparse matrix graphs. In this case, Algorithm 1 has a coarsening factor less but close to two, since it can produce a (small) number of singletons (unaggregated DOFs), essentially due to possible unsymmetric matching (e.g. row $i$ is matched at column $j$ and row $j$ is matched at column $k$, with $k \ne i$). Since the cost for application of exact matching is about , i.e. it is super-linear, in the following we analyze the setup cost of our bootstrap strategy for building a composite multigrid of type (4.2), when each AMG component was a W-cycle, which has a super-linear complexity for coarsening factor less than two, as in our case. Later on, we relax this cycle to a hybrid V–W one (cf., e.g., [30]) in order to ensure order ${\mathcal O}(n)$ cost of the cycle. One sweep of symmetric Gauss-Seidel was used as both pre/post smoother and as coarsest level solver.

5.1.1 Unstructured mesh

In this section we present results for matrices corresponding to discretization of our test cases on unstructured triangular meshes with a total number of nodes $n=2705, 10657, 42305$, that correspond to three different mesh sizes. We report, in Tables 1 and 2, all parameters leading to the setup cost of the composite AMG achieving convergence rate not larger than the prescribed one, $\rho _{desired}=0.7$.

Table 1 Setup cost for different mesh sizes when exact bipartite matching is used for aggregation.

Adaptive AMG with coarsening based on compatible weighted matching

Abstract

Similar content being viewed by others

αAMG Based on Weighted Matching for Systems of Elliptic PDEs Arising from Displacement and Mixed Methods

Application of the AmgX Library to the Discontinuous Galerkin Methods for Elliptic Problems

Robust Algebraic Multilevel Preconditioners for Anisotropic Problems

1 Introduction

2 Pairwise aggregation based on weighted matching

3 Coarsening based on compatible weighted matching

3.1 Main ingredients for coarsening

3.2 The multilevel adaptive coarsening schemes

4 Composite AMG with prescribed convergence rate

5 Results

5.1 Composite AMG based on exact matching

5.1.1 Unstructured mesh

5.1.2 Structured mesh

5.2 Composite AMG based on approximate matching

5.2.1 Unstructured mesh

5.2.2 Structured mesh

5.3 Further results

5.4 Results with Algorithms 2 and 3 with random initial guess replaced by the restricted smooth vector from previous level

5.5 Results on S.P.D. Matrices arising from UF Sparse Matrix Collection

6 Concluding remarks

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification

Search

Navigation