Construction of a p-Adaptive Continuous Residual Distribution Scheme

Abgrall, R.; Viville, Q.; Beaugendre, H.; Dobrzynski, C.

doi:10.1007/s10915-017-0399-6

Construction of a p-Adaptive Continuous Residual Distribution Scheme

Published: 13 March 2017

Volume 72, pages 1232–1268, (2017)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Journal of Scientific Computing Aims and scope Submit manuscript

Construction of a p-Adaptive Continuous Residual Distribution Scheme

Download PDF

R. Abgrall¹,
Q. Viville^2,3,4,
H. Beaugendre^2,3,4 &
…
C. Dobrzynski^2,3,4

277 Accesses
7 Citations
Explore all metrics

Abstract

A p-adaptive continuous residual distribution scheme is proposed in this paper. Under certain conditions, primarily the expression of the total residual on a given element K into residuals on the sub-elements of K and the use of a suitable combination of quadrature formulas, it is possible to change locally the degree of the polynomial approximation of the solution. The discrete solution can then be considered non continuous across the interface of elements of different orders, while the numerical scheme still verifies the hypothesis of the discrete Lax–Wendroff theorem which ensures its convergence to a correct weak solution. We detail the theoretical material and the construction of our p-adaptive method in the frame of a continuous residual distribution scheme. Different test cases for non-linear equations at different flow velocities demonstrate numerically the validity of the theoretical results.

Error Estimates and Adaptivity of the Space-Time Discontinuous Galerkin Method for Solving the Richards Equation

Article Open access 20 August 2024

An Introduction to Hybrid High-Order Methods

An hp finite element method for 4^th order singularly perturbed problems

Article 26 February 2016

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Because of their potential in delivering higher accuracy with lower cost than low order methods, high-order methods for computational fluid dynamics (CFD) have obtained considerable attention in the past two decades [28]. By high order, we mean third order or higher. Most industrial codes used today for CFD simulations are based upon second-order finite volume methods (FVM), finite difference methods (FDM), or finite element methods (FEM) [28]. As second-order methods are used in most CFD codes, some very complex flows simulations might remain out of their reach. Indeed, in some cases, second-order methods are still too much dissipative and as a consequence they require much finer meshes and become too expensive even on modern supercomputer clusters. In order to deal with a large and diverse range of problems, lots of researches have been conducted with the aim of designing robust and stable high order methods, see [16] and [26]. High-order methods allow the use of coarser meshes [26,27,28], high-order boundary representation [20], and improve the accuracy of the solutions [11, 26, 28]. Because of their potential, we believe that the next generation of CFD solvers will have to be based upon high order methods. Besides the use of higher order methods, a very promising approach is the use of hp-adaptation to change locally the order of accuracy and the size of the mesh according to the solution [19]. Among high-order methods, the Discontinuous Galerkin (DG) method [15, 18, 23], the residual distribution (RD) method [1, 4, 5, 12, 22] and the hp finite element method (hp-FEM) [8, 9, 14, 24] have attracted a lot of interest in the recent years. The DG method has a compact stencil regardless the order of the polynomials representing the solution [17]. This very local formulation leads to a great flexibility, especially for the parallelization of its implementation. However, DG methods suffer from the rapid increase of the number of nodes [10], and then, simulations in three-dimensional space may quickly become too expensive. A possible way to overcome this problem is to use p-adaptation [9] (which is conceptually easy with DG methods), where the local approximation order p (hence the term p-adaptation) is dictated by the flow field. The optimal solution is hp-adaptation (mesh and polynomial adaptations) to achieve the best accuracy with the minimum cost. In smooth regions, p-adaptation is preferred, whereas in discontinuous regions, h-adaptation is preferred.

Another possible approach is the class of residual distribution schemes. RD methods can be easily stabilized and show good shock capturing abilities [22]. They offer a very compact stencil like DG methods, but with a smaller number of nodes [5]. The drawback is that to achieve this low number of nodes, the continuity of the approximation is required and consequently, the use of p-adaptation with continuous finite elements in the frame of RD schemes is a priori not possible as this would violate the continuity requirement.

The aim of this work is to demonstrate that it is indeed possible to use p-adaptation with continuous finite elements in the frame of RD schemes. Such an approach, while offering the same advantages as classical residual distribution schemes (like among others the low number of nodes, the non-oscillatory behavior and the accuracy on smooth problems [5]), exhibits some interesting advantages thanks to p-adaptation, like an improved convergence and better shock capturing abilities. The practical implementation of p-adaptation for RD schemes results in a residual based solver that can use high order elements in smooth regions, and low order elements in discontinuous regions. In this sense, this approach is following the recommendations for the next generation of CFD solvers [28].

The hp finite element method can be traced back to the work of I. Babuska et al. [9]. The authors presented an evolution of the finite element method that showed super convergent properties thanks to a suitable combination of h-adaptation and p-adaptation. Of particular interest to our work, the hp finite element method applied to the simulation of turbulent and laminar flows presented in [8], is a method that couples dynamic adaptation techniques with a high order streamline/upwind Petrov–Galerkin (SUPG) finite element scheme. This method shares some traits with the work presented here, notably the authors try to achieve similar goals: the construction of a high order adaptive scheme using continuous finite elements for the simulation of fluid dynamics. But, as we will show, the construction presented by the authors is very different from the method we present here for residual distribution schemes.

The paper is organized as follows. In Sect. 2, the mathematical problem is defined and we recall briefly the general principles of the residual distribution schemes. In Sect. 3, we expose how it is theoretically possible to use p-adaptation in the frame of a continuous Residual Distribution scheme and we propose the detailed construction of the p-adaptive RD scheme. We present some numerical results in Sect. 4, along with some benefits brought by p-adaptation. In conclusion, we invoke some possible future extensions and developments to the work presented here.

2 Mathematical Problem and Residual Distribution Schemes

2.1 Basic Notions of Residual Distribution Schemes

In this section, the basic notions of RD schemes are summarized, more details can be found in [5]. We are interested in the numerical approximation of steady hyperbolic problems of the form

$$\begin{aligned} \text { div } {\mathbf {f}}({\mathbf {u}})=0\,, \end{aligned}$$

(1)

where ${\mathbf {u}}:\Omega \subset {\mathbb {R}}^{d}\rightarrow {\mathbb {R}}^{d+2}$ is the vector of conservative variables, $\Omega $ is an open set of ${\mathbb {R}}^d, \, d=2,3$ is the spatial dimension and ${\mathbf {f}}=({\mathbf {f}}_1,\ldots ,{\mathbf {f}}_d)$ is the vector of flux functions, with ${\mathbf {f}}_{i}({\mathbf {u}}):\Omega \rightarrow {\mathbb {R}}^{d+2}$, $ i=1,\ldots ,d$. For convenience, the flux vector ${\mathbf {f}}$ can be written in column:

$$\begin{aligned} {\mathbf {f}}=\begin{pmatrix}f_1 \\ \vdots \\ f_{d+2} \end{pmatrix} \end{aligned}$$

(2)

with $f_j$=$({\mathbf {f}}_{1_{j}},\ldots ,{\mathbf {f}}_{d_{j}})$. Boundary conditions, that are problem dependent, are applied, we come back to this point in Sect. 2.4.

We study in the present work the numerical solution of the compressible Euler system of equations with the vector of unknowns

$$\begin{aligned} {\mathbf {u}}=\begin{pmatrix}\rho \\ \rho {\mathsf {v}} \\ E\end{pmatrix},\qquad {\mathbf {f}}=\begin{pmatrix} \rho {\mathsf {v}}\\ \rho {\mathsf {v}} \otimes {\mathsf {v}} +p \text { Id}_{d\times d}\\ (E+p){\mathsf {v}} \end{pmatrix} \end{aligned}$$

where $\rho $ is the density, ${\mathsf {v}}$ is the fluid velocity, p is the pressure and $E=\rho e^t$ is the total energy per unit volume where $e^t$ is the specific total energy. We consider a calorically perfect gas, and the Euler equations are closed with the relations for energy

$$\begin{aligned} e^t= e+\dfrac{{\mathsf {v}}\cdot {\mathsf {v}}}{2}\quad \text{ and }\quad e=\dfrac{1}{\gamma -1}RT , \end{aligned}$$

(3)

with the thermodynamic relation of state for a perfect gas

$$\begin{aligned} p=\rho RT \end{aligned}$$

(4)

with the specific heat ratio $\gamma =1.4$, T the gas temperature and R the gas constant which is 287 N m/kg for sea-level air.

In this setting, the vector of unknowns ${\mathbf {u}} \in {\mathbb {R}}^{d+2}$ is such that $\rho >0$ and $e>0$. On solid boundaries, we impose weakly no slip boundary conditions.

In this section, the discussion will stay rather general, and we will not focus particularly on the Euler system, but we are going to work on a generic hyperbolic system. We want to find an approximate solution to Eq. (1) with boundary conditions. We assume that $\Omega $ is a polyhedric open set, more precisely we suppose that $\bar{\Omega }$ is constituted of simplices (triangles in dimension two and tetrahedrons in dimension three). We can then consider a finite decomposition of the domain

$$\begin{aligned} \bar{\Omega }=\underset{K \in {\mathscr {T}}_{h}}{\cup }K \end{aligned}$$

(5)

where K is a simplex and ${\mathscr {T}}_{h}$ is a conforming triangulation of $\bar{\Omega }$. We associate to each simplex $K \in {\mathscr {T}}_{h}$ a finite element $(K,{\mathbb {P}}^{k}(K){^{d+2}},\Sigma _K)$ where $\Sigma _K$ denotes the set of nodes of K defined by

$$\begin{aligned} \Sigma _K = \Big \{ x \in {\mathbb {R}}^{{d}}, \lambda _j(x) \in \{0,\frac{1}{k},{\frac{2}{k}},\ldots ,\frac{k-1}{k},1\}, 1\le j \le {d}+1 \Big \} \end{aligned}$$

(6)

where $\lambda _j, j=1,\ldots ,d+1$ are the barycentric coordinates with respect to the vertices of K (the finite elements thus defined are affine equivalent to each other).

For practical applications, it is important to define the basis of the vector space ${\mathbb {P}}^k(K)$. A natural choice is made by the Lagrange basis. Another choice that will be useful in this work is the Bézier basis. From the definition (6), any node $a_{\mu } \in \Sigma _K$ can be written as

$$\begin{aligned} a_{\mu } = \frac{1}{k} \sum _{j=1}^{d+1} \mu _j a_j, \ \mu =(\mu _1,\ldots ,\mu _{d+1}) \end{aligned}$$

(7)

where $(a_j)_{j=1,\ldots ,d+1}$ are the vertices of K and $\frac{1}{k}(\mu _1,\ldots ,\mu _{d+1})$ are the barycentric coordinates of $a_{\mu }$ with respect to the vertices of K.

The Lagrange basis function $\varphi _{\mu }$ associated to $a_{\mu }$ can then be written as

$$\begin{aligned} \varphi _{\mu } = \left( \prod _{j=1}^{d+1}(\mu _j!)\right) ^{-1} \prod _{j=1,\ \mu _j \ge 1}^{d+1} \prod _{i=0}^{\mu _j-1}\big (k \lambda _j - i \big ) \end{aligned}$$

(8)

and the Bézier basis function $B_{\mu }$ associated to $a_{\mu }$ can be written as

$$\begin{aligned} B_{\mu }=\dfrac{k!}{\prod \limits _{j=1}^{d+1}\mu _j!} \prod _{j=1}^{d+1}\lambda _j^{\mu _j} \ . \end{aligned}$$

(9)

We remark that the Bézier and Lagrange basis functions sum up to unity, and that the Bézier basis functions are positive on K.

We introduce now the global finite element space

$$\begin{aligned} X_{h}= \left\{ {\mathbf {u}}_{h}\in C^{0}(\bar{\Omega }){^{d+2}}; \ \forall K \in {\mathscr {T}}_{h}, \ {\mathbf {u}}_{h|K} \in {\mathbb {P}}^{k}(K){^{d+2}}, k\in {\mathbb {N}}^{*} \right\} \end{aligned}$$

(10)

where we look for the discrete solution to equations (1). We denote by $\Sigma _h$ the set of $N_h$ nodes of the finite elements

$$\begin{aligned} \Sigma _h = \underset{K \in {\mathscr {T}}_{h}}{\cup }\Sigma _K \end{aligned}$$

(11)

and from the conformity assumption of the triangulation, a Lagrange basis of $X_h$ is constituted of the Lagrange basis functions $\psi _i, 1 \le i \le N_h$ of the finite elements $(K,{\mathbb {P}}^{k}(K){^{d+2}}, \Sigma _K), K \in {\mathscr {T}}_h$ and likewise, a Bézier basis of $X_h$, denoted again $\psi _i, 1 \le i \le N_h$, is constituted of the Bézier basis functions of the finite elements. In either basis, we can now write the approximate solution ${\mathbf {u}}_h \in X_h$ as

$$\begin{aligned} {\mathbf {u}}_h=\sum _{i\in \Sigma _h} {\mathbf {u}}_i\psi _i \ . \end{aligned}$$

(12)

In the Lagrange case, we have ${\mathbf {u}}_j={\mathbf {u}}_h(a_j)$, where $a_j \in \Sigma _h$ is the node associated with the global shape function $\psi _j$ as described above.

In this paper, all the numerical applications will be made with simplices, but the work presented here can be extended to other kinds of geometric elements like hexahedrons.

We can now write the discrete equations of the residual distribution scheme. We first introduce the total residual of the element K

$$\begin{aligned} \Phi ^K({\mathbf {u}}_h) = \int _K \nabla \cdot {\mathbf {f}}({\mathbf {u}}_h) dx. \end{aligned}$$

(13)

As $X_h$ is a subspace of $H^1(\Omega )^{d+2}$, we have

$$\begin{aligned} {\mathbf {f}}({\mathbf {u}}_h) \in H^1(\Omega )^{(d+2)\times d} \end{aligned}$$

and so we can transform the volume integral (13) into a surface integral with the divergence formula

$$\begin{aligned} \Phi ^K({\mathbf {u}}_h) = \int _{\partial K} {\mathbf {f}}({\mathbf {u}}_h)\cdot {\mathbf {n}} \ dx \end{aligned}$$

(14)

where ${\mathbf {n}}$ is the exterior normal to $\partial K$. This is the form that is used in practice. We define as well the boundary total residual for a boundary element (edge in dimension two and face in dimension three) $\Gamma \in {\mathscr {T}}_h \cap \partial \bar{\Omega }$

$$\begin{aligned} \Phi ^{\Gamma }=\int _{\Gamma }\big ({\mathbf {f}}({\mathbf {u}}^\Gamma )-{\mathbf {f}}({\mathbf {u}}^{h}) \big ) \cdot {\mathbf {n}} \, dx \ . \end{aligned}$$

(15)

Once the total residual $\Phi ^K$ is computed, the next step is to compute the ”nodal residuals” $\Phi _\sigma ^K$ (also called ”residuals” or ”sub-residuals” in the literature) that write in generic form

$$\begin{aligned} \Phi _\sigma ^K=\beta _\sigma ^K({\mathbf {u}}_h)\Phi ^K({\mathbf {u}}_h), \sigma \in \Sigma _K. \end{aligned}$$

(16)

Similarly, the boundary nodal residuals $\Phi _\sigma ^K$ are defined by

$$\begin{aligned} \Phi _\sigma ^\Gamma =\beta _\sigma ^\Gamma ({\mathbf {u}}_h) \Phi ^\Gamma ({\mathbf {u}}_h), \sigma \in \Sigma _\Gamma \end{aligned}$$

(17)

where $\Sigma _\Gamma $ is the set of nodes of the face $\Gamma $ lying on the boundary of the domain. The coefficients $\beta _\sigma ^K$ et $\beta _\sigma ^\Gamma $ are the distribution coefficients and in general they depend on the solution ${\mathbf {u}}_h$. They are real numbers in the scalar case and are matrices in the case of a system of equations. The nodal residuals must satisfy the conservation constraints

$$\begin{aligned} \sum _{\sigma \in \Sigma _h} \Phi _\sigma ^K=\Phi _K , \forall \ K \in {\mathscr {T}}_h \ \end{aligned}$$

(18)

and

$$\begin{aligned} \sum _{\sigma \in \Sigma _\Gamma } \Phi _\sigma ^\Gamma =\Phi _\Gamma , \forall \ \Gamma \in {\mathscr {T}}_h \cap \partial \bar{\Omega } \ . \end{aligned}$$

(19)

The total residual is written with a slight abuse of notation as the integral of ${\mathbf {f}}({\mathbf {u}}_h)\cdot {\mathbf {n}}$ which is a vector valued function of size $d+2$. With the notation of Sect. 2.1, the total residual is then a vector of size $d+2$ where the component $j,1\le j \le d+2$, is the integral of the scalar product $f_j \cdot {\mathbf {n}}$. Like the total residual, the nodal residuals are vectors of dimension $d+2$. The boundary total and nodal residuals are of the same dimension.

Finally, the residual distribution scheme reads

$$\begin{aligned} \forall \,\sigma \in \Sigma _{h}, \sum _{K,\,\sigma \in \Sigma _K}\Phi ^{K}_{\sigma }+ \sum _{\Gamma ,\,\sigma \in \Sigma _\Gamma }\Phi ^{\Gamma }_{\sigma }=0.\ \end{aligned}$$

(20)

Before going further, let us give some examples of residual distribution schemes.

2.2 Some Particular Residual Distribution Schemes

2.2.1 The Lax–Wendroff Scheme

The Lax–Wendroff scheme is a central linear scheme, which name comes from the fact that in its scalar version, in the case of ${\mathbb {P}}^1$ interpolation and for a constant advection speed, the scheme coincides with the scheme named Lax–Wendroff presented in [21]. It is defined by

$$\begin{aligned} \Phi ^{K,LW}_{\sigma }=\frac{\Phi ^{K}}{N^{K}_{dof}}+\int _{\Omega _{K}}\big (A \cdot \nabla \varphi _{\sigma }\big ) \, \Xi \,\big ( A \cdot \nabla {\mathbf {u}}_{h}\big ) dx \ . \end{aligned}$$

(21)

The scalar $N^{K}_{dof}$ is the cardinal of $\Sigma _K$, the term A represents $A=(A_1, \ldots , A_d)$ where

$$\begin{aligned} A_i({\mathbf {u}})=\frac{\partial {\mathbf {f}}_i({\mathbf {u}})}{\partial {\mathbf {u}}} \end{aligned}$$

(22)

is the Jacobian matrix of the i-th component of the flux. The product $A \cdot \nabla \varphi _\sigma $ is the matrix defined by

$$\begin{aligned} A\cdot \nabla \varphi _\sigma =\sum _{i=1}^d A_i \dfrac{\partial \varphi _\sigma }{\partial x_i} \end{aligned}$$

(23)

where $\varphi _\sigma $ is the basis function (Lagrange or Bézier, see Sect. 2.1) associated to the node $\sigma $. The matrix $\Xi $ is a scaling matrix defined by

$$\begin{aligned} \Xi = \frac{1}{d}|K|\bigg ( \sum _{\sigma \in \Sigma _K} R_{{\mathbf {n}}_\sigma }({\bar{{\mathbf {u}}}}) \Lambda _{{\mathbf {n}}_\sigma }^+({\bar{{\mathbf {u}}}}) L_{{\mathbf {n}}_\sigma }(\bar{{\mathbf {u}}}) \bigg )^{-1}. \end{aligned}$$

(24)

In the definition above |K| is the volume of the element K, $\bar{\mathbf {u}}$ is the average value of the vector of conservative variables on the element K:

$$\begin{aligned} {\bar{\mathbf {u}}}=\frac{1}{N^{K}_{dof}}\sum _{\sigma \in \Sigma _K}\mathbf {u}_\sigma \ , \end{aligned}$$

(25)

$R_{\mathbf {n_\sigma }}$ and $L_{\mathbf {n_\sigma }}$ are respectively the matrices of the right and left eigenvectors of the matrix $A\cdot \mathbf {n_\sigma }$ defined as in (23), $\wedge _{\mathbf {n}_\sigma }^+$ is the diagonal matrix with $\text {max}(\lambda _{\mathbf {n_\sigma }},0)$ on the diagonal, where the $\lambda _{\mathbf {n_\sigma }}$ are the eigenvalues of the matrix $A\cdot \mathbf {n_\sigma }$. The vector ${\mathbf {n}}_\sigma $ is defined by

$$\begin{aligned} {\mathbf {n}}_\sigma = \frac{1}{d} \int _{{K}}\nabla \varphi _{\sigma } \ dx . \end{aligned}$$

(26)

The nodal residuals defined in (21) satisfy the conservation relation (18) as we have

$$\begin{aligned} \sum _{\sigma \in \Sigma _K} \nabla \varphi _\sigma = 0 \end{aligned}$$

(27)

for the Lagrange (8) and Bézier basis functions (9).

2.2.2 The Rusanov Scheme

The Rusanov scheme is a generalization of the one dimensional Rusanov scheme. This scheme is obtained from a central distribution of the total residual with a dissipation term added. In the case of a system of equations it is defined by

$$\begin{aligned} \Phi ^{K, Rsv}_{\sigma }=\frac{1}{N^{K}_{dof}}\bigg (\Phi ^{K} ({\mathbf {u}}_{h})+\alpha ^{K}\sum _{\sigma _j\in \Sigma _K}({\mathbf {u}}_{\sigma }-{\mathbf {u}}_{\sigma _j})\bigg ). \end{aligned}$$

(28)

The parameter $\alpha ^K$ is chosen as to be larger than the maximum of the spectral radius of the matrix $A\cdot \nabla \varphi _{\sigma }$. In the scalar case, this choice guaranties that the scheme satisfies a local maximum principle. In the system case, it is proved to be non oscillatory and known to be very dissipative [2].

2.3 Construction of a High Order Monotonicity Preserving Residual Distribution Scheme

If the flow is smooth, we can use the Lax–Wendroff scheme presented above. However, when the solution presents discontinuities, special care has to be taken to handle them. We recall now the method we follow.

2.3.1 Limitation

From the Godunov theorem [13, 25], there is no monotone and linear scheme that is more than first order of accuracy. What we call limitation is a procedure that guaranties, at least for scalar problem, that we can get at the same time a local maximum preserving scheme that is formally of maximum accuracy; the accuracy being driven by the polynomial degree plus 1. In the system case, the procedure we describe in this section is high order accurate and provides non oscillatory solutions. The wording ”limitation” is not very adequate but it is the traditional wording in this field.

To construct such procedure, in [2], the idea is to start from a first order monotone scheme and to apply a limitation procedure to the distribution coefficients $\beta _\sigma ^K$ (presented in Sect. 2.1), such that the limited distribution coefficients are uniformly bounded. In doing so, we see from relation (16) that we obtain high order nodal residuals.

We choose to apply the limitation procedure for the Rusanov scheme (28), as the resulting scheme is well suited to handle the shocks occurring in high speed flows [2].

The limitation procedure is achieved through the following sequence of operations. First, we compute the matrices $L_n$ and $R_n$ which are respectively the matrices of the left and right eigenvectors of the matrix

$$\begin{aligned} A(\bar{{\mathbf {u}}}) \cdot \frac{\bar{{\mathsf {v}}}}{||\bar{\mathsf {v}}||} \end{aligned}$$

(29)

where $A(\bar{\mathbf {u}})$ represents the Jacobian matrix (22) evaluated at the average state (25) and $\bar{\mathsf {v}}$ is the average speed vector computed the same way as $\bar{{\mathbf {u}}}$. The nodal residuals (28) are projected on the vector space generated by the left eigenvectors $L_n$, so the intermediate nodal residuals write

$$\begin{aligned} \Phi ^{K,*}_{\sigma }=L_n \Phi ^{K, Rsv}_{\sigma } \end{aligned}$$

(30)

and we have the total residual

$$\begin{aligned} \Phi ^{K,*}=\sum _{\sigma \in \Sigma _K}\Phi ^{K,*}_{\sigma } \ . \end{aligned}$$

(31)

Then, the high order distribution coefficients are computed from the original first order distribution coefficients by the non linear mapping:

$$\begin{aligned} \displaystyle {\hat{\beta }}^{K,*}_{\sigma }=\frac{ \displaystyle \left( \frac{\Phi ^{K,*}_{\sigma }}{\Phi ^{K,*}} \right) ^+ }{ \displaystyle \sum _{\sigma _j\in K} \left( \frac{\Phi ^{K,*}_{\sigma _j}}{\Phi ^{K,*}} \right) ^+ } \ \ . \end{aligned}$$

(32)

We have built the intermediate high order nodal residuals in the characteristic space:

$$\begin{aligned} {\hat{\Phi }}^{K,*}_{\sigma }={\hat{\beta }}^{K,*}_{\sigma }\Phi ^{K,*} \end{aligned}$$

(33)

and finally we project them from the characteristic space to the physical space:

$$\begin{aligned} {\hat{\Phi }}^{K}_{\sigma }=R_n {\hat{\Phi }}^{K,*}_{\sigma } \end{aligned}$$

(34)

where $R_n$ are the right eigenvectors defined above.

In the particular case where ${\bar{\mathsf {v}}}$ is close to the null vector:

$$\begin{aligned} ||\bar{\mathsf {v}}|| < tol \end{aligned}$$

where tol is a tolerance parameter (for example we used $tol=10e^{-8}$ for the simulations of Sect. 4), the limited distribution coefficients are directly obtained by the same procedure with $\frac{\bar{\mathsf {v}}}{||\bar{\mathsf {v}}||}$ in (29) replaced by an arbitrary vector of norm unity. In this paper we choose the first element of the canonical basis, we have shown in [2] that this choice has no influence on the stability of the method.

2.3.2 Stabilization

The solution obtained with a limited Rusanov scheme can exhibit spurious modes and can show poor iterative convergence [5]. These problems are solved via the addition of the filtering term presented above for the scheme (21) and so the high order filtered Rusanov scheme for the Euler equations reads

$$\begin{aligned} {\widehat{\Phi }}^{K,Rus}_{\sigma }(\mathbf {u}_{h}) ={\widehat{\Phi }}^{K}_{\sigma }({\mathbf {u}}_{h})+\theta \int _{\Omega _{K}}\big (A \cdot \nabla \varphi _{\sigma }\big ) \, \Xi \,\big ( A \cdot \nabla {\mathbf {u}}_{h}\big ) dx \end{aligned}$$

(35)

where $\theta $ is a shock capturing term. In most applications, we take $\theta =1$, and in some cases a more elaborated version must be chosen. In this paper, we take $\theta =1$.

2.4 Boundary Conditions

For the nodes lying on the boundary of the domain, we use the following formula for the nodal residuals:

$$\begin{aligned} \forall \ \Gamma \in {\mathscr {T}}_h \cap \partial \bar{\Omega }, \ \forall \sigma \in \Sigma _\Gamma , \ \Phi _{\sigma }^{\Gamma }({\mathbf {u}}_{h})=\int ^{}_{\Gamma }\varphi _{\sigma } \big ( {\mathcal {F}}^{}({\mathbf {u}}_{h},{\mathbf {u}}_{-},{\mathbf {n}}) -{\mathbf {f}}({\mathbf {u}}_{h})\cdot {\mathbf {n}}\, \big )\,dx \end{aligned}$$

(36)

where ${\mathbf {u}}_{-}$ is the state that is imposed by the Dirichlet conditions and ${\mathcal {F}}$ is a numerical flux that depends on ${\mathbf {u}}_{-}$, the outward normal ${\mathbf {n}}$ and the local state ${\mathbf {u}}_h$. In the case of a no-slip condition, the boundary condition ${\mathbf {u}}_{-}$ satisfies this condition and the numerical flux is only a pressure flux. More details can be found in [5].

2.5 A Lax–Wendroff Like Theorem and Its Consequences

The following theorem has been proved in [6]:

Theorem 2.1

Assume the family of meshes ${\mathscr {T}}=({\mathscr {T}}_h)_{h}$ is regular. For K an element or a boundary element of ${\mathscr {T}}_h$, we assume that the residuals $\{ \Phi _\sigma ^{K}\}_{\sigma \in K}$ satisfy:

For any $M\in {\mathbb {R}}^{+}$, there exists a constant C which depends only on the family of meshes ${\mathcal {T}}_h$ and M such that for any ${\mathbf {u}}_h\in X_h$ with $||{\mathbf {u}}_h||_{\infty }\le M$, then
$$\begin{aligned} \Vert \Phi ^{K}_{\sigma }(u_{h|K})\Vert \le C\sum _{\sigma ,\sigma ^{'}\in \Sigma _K}|{\mathbf {u}}_{\sigma }-{\mathbf {u}}_{\sigma ^{'}}| \end{aligned}$$
(37)
They satisfy the conservation property (18)–(19).

Then if there exists a constant $C_{max}$ such that the solutions of the scheme (20) satisfy $||{\mathbf {u}}_h||_{\infty }\le C_{max}$ and a function ${\mathbf {v}}\in L^2(\Omega )^p$ such that $({\mathbf {u}}_h)_{h}$ (or at least a sub-sequence) verifies $\text {lim}_h||{\mathbf {u}}_h-{\mathbf {v}}||_{L^2_\text {loc}(\Omega )^p}=0$, then ${\mathbf {v}}$ is a weak solution of (1).

One of the interests of this result, besides indicating automatic consistency constraints for a scheme (20), is that the important constraint is to have the conservation relations (18)–(19) at the element level. This is a weaker statement than the usual one: usually, we ask conservation at the element interface level, and not globally on the element. Note that (18)–(19) enables however to explicitly construct the numerical flux, so that the scheme (18)–(19) can be reinterpreted as an almost standard finite volume scheme, see [3] for more details. Of course there is something to pay: the flux needs to be genuinely multidimensional.

Let us take a closer look at the relation (18)–(19). We need to see how they are actually implemented. Indeed, in the numerical implementation, we do not ask for (18)–(19), but for a discrete version of these relations, namely:

For any element $K\in {\mathscr {T}}_h$,

$$\begin{aligned} \sum _{\sigma \in \Sigma _K}\Phi _\sigma ^K({\mathbf {u}}_{h})=\int _{\partial K}{\mathbf {f}}({\mathbf {u}}_{h}) \cdot {\mathbf {n}}\end{aligned}$$

(38a)

and any boundary element $\Gamma \in {\mathscr {T}}_h \cap \partial \bar{\Omega }$,

$$\begin{aligned} \sum _{\sigma \in \Sigma _\Gamma }\Phi ^{\Gamma }_{\sigma }\,=\int _{\Gamma } \displaystyle \left( {\mathcal {F}}^{}({\mathbf {u}}_{h},{\mathbf {u}}_{-},{\mathbf {n}}) -{\mathbf {f}}\,({\mathbf {u}}_{h})\cdot {\mathbf {n}}\,\right) \,dx \end{aligned}$$

(38b)

where here $\int $ signifies that the integral is computed with a quadrature formula, which reads

1.
On elements:
$$\begin{aligned} \int _{\partial K}{\mathbf {f}}({\mathbf {u}}^h)\cdot {\mathbf {n}}=\sum _{e\text { edge/face }\subset \partial K}\bigg ( |e|\sum _{q}\omega _q {\mathbf {f}}({\mathbf {u}}_q)\cdot {\mathbf {n}}\bigg ) \end{aligned}$$
2.
On boundary elements:
$$\begin{aligned} \int _{\Gamma } \displaystyle \left( {\mathcal {F}}({\mathbf {u}}_h,{\mathbf {u}}_{-},{\mathbf {n}}) -{\mathbf {f}}({\mathbf {u}}_h) \cdot {\mathbf {n}}\right) dx = |\Gamma | \sum _q \omega _q \left( {\mathcal {F}}({\mathbf {u}}_q,{\mathbf {u}}_{-},{\mathbf {n}}) - {\mathbf {f}}({\mathbf {u}}_q)\cdot \,{\mathbf {n}}\right) \end{aligned}$$

If we have a close look at the proof of Theorem 2.1, we see that besides the boundedness of the sequence of solutions in suitable norms that enable to use compactness argument, what really matters at the algebraic level is that we have the following property on any edge:

If $\Gamma =\Gamma '$ is the same face shared by respectively the two adjacent elements K and $K'$, then we have ${\mathbf {n}}_{|\Gamma }=-{\mathbf {n}}_{\Gamma '}$ and consequently

$$\begin{aligned} \int _{\Gamma }\varphi _{\sigma }{\mathbf {f}}({\mathbf {u}}_{h})\cdot {\mathbf {n}}\quad +\int _{\Gamma '}\varphi _{\sigma }{\mathbf {f}} ({\mathbf {u}}_{h})\cdot {\mathbf {n}}=0, \end{aligned}$$

(39)

where $\varphi _{\sigma }\in {\mathbb {P}}^{k}(K)$ is the basis function associated to the node $\sigma $. This is clearly true because ${\mathbf {u}}_{h}$ is continuous. Now, in the numerical implementation, the easiest way to do so is that the quadrature points on $\Gamma $ seen from K are the same as the ones on $\Gamma $ seen from $K'$. These two remarks are at the core of the present development, as we see now.

3 Residual Distribution Schemes and p-Adaptation

As we have seen in the previous section, what matters for conservation is that, for any element K, the sum of the residuals is equal to the total residual, or more precisely

$$\begin{aligned} \sum _{\sigma \in K} \Phi _\sigma ^K=\int _{\partial K}{\mathbf {f}}({\mathbf {u}}_{h})\cdot {\mathbf {n}}. \end{aligned}$$

The total residual $\Phi ^K=\int _{\partial K}{\mathbf {f}}({\mathbf {u}}_{h})\cdot {\mathbf {n}}$ is obtained thanks to quadrature formulas. We show in this section that in many cases this quantity can be rewritten as a weighted sum of total residuals on sub-elements. To make this more precise, we first look at the quadratic case on a triangle, where the Simpson formula is used on each edge. The main idea of this paper comes from this observation.

We then present the quadratic case on a tetrahedron. From these particular cases, we present two formulas that generalize to arbitrary polynomial orders the formulas given for quadratic polynomials. These general formulas make it possible to construct a p-adaptive residual distribution scheme of arbitrary orders, with the definition of a modified nodal residual that we present at the end of the section. We choose to present the particular cases of quadratic approximation along with their generalization, instead of giving only the general formulas, as we think that it is necessary for the comprehension of the reader.

As the formulas presented in this section allow to combine in the same mesh different polynomial orders for the approximation of the solution, we use the term p-adaptation. Our method of p-adaptation is proposed within the frame of a continuous residual distribution scheme and is not as general as the method described for example in [8] which is proposed in the frame of the Finite Element method (see “Appendix 2”).

3.1 Quadratic Interpolation on Triangular Elements

Let us now consider the case of a triangular element and a quadratic approximation.

We consider a triangle and for simplicity its vertices are denoted by 1,2,3 and the mid-points of the edges are denoted by 4,5,6 (see Fig. 1). We subdivide the triangle $K=(1,2,3)$ into four sub-triangles: $K_{1}=(1,4,6),$ $K_{2}=(4,2,5),$ $K_{3}=(5,3,6)$ and $K_{4}=(4,5,6)$, as shown in Fig. 1. If we denote by $\lambda _{i}, i=1,2,3$ the barycentric coordinates corresponding respectively to the vertices $i=1,2,3$ then the linear interpolant of the flux ${\mathbf {f}}$ reads

$$\begin{aligned} {\mathbf {f}}^{(1)}=\sum _{i=1}^3 {\mathbf {f}}({\mathbf {u}}_i)\lambda _i \end{aligned}$$

and the quadratic interpolant of the same flux reads

$$\begin{aligned} {\mathbf {f}}^{(2)}=\sum _{i=1}^6 {\mathbf {f}}({\mathbf {u}}_i)\varphi _i \end{aligned}$$

with, from (8):

$$\begin{aligned}\begin{array}{lll} \varphi _i=(2\lambda _i-1)\lambda _i, &{}\text {for }i=1,2,3 \\ \varphi _4=4\lambda _{2}\lambda _{1}, &{} \varphi _5=4\lambda _{3}\lambda _{2}, &{}\varphi _6=4\lambda _{1}\lambda _{3}\,. \\ \end{array} \end{aligned}$$

Let us evaluate the total residual for the quadratic interpolant, we denote by ${\mathbf {n}}_{i}$ the integral $\int _{K}\nabla \lambda _{i}dx$, $i=1,2,3$ and we have:

$$\begin{aligned} \int _K\text {div}\,{\mathbf {f}}^{(2)} dx= & {} \sum _{i=1}^6 \int _K {\mathbf {f}}_i \cdot \nabla \varphi _i dx\nonumber \\= & {} \sum _{i=1}^6 {\mathbf {f}}_i \cdot \int _K\nabla \varphi _i dx\nonumber \\= & {} \sum _{i=1}^3 {\mathbf {f}}_i \dfrac{{\mathbf {n}}_i}{3} + \dfrac{4}{3}\big (\,({\mathbf {f}}_4({\mathbf {n}}_1+{\mathbf {n}}_2) +{\mathbf {f}}_5({\mathbf {n}}_2+{\mathbf {n}}_3) +{\mathbf {f}}_6({\mathbf {n}}_1+{\mathbf {n}}_3) \ \big )\nonumber \\= & {} \dfrac{2}{3}\big ({\mathbf {f}}_1\frac{{\mathbf {n}}_1}{2}+{\mathbf {f}}_4\frac{{\mathbf {n}}_2}{2} +{\mathbf {f}}_6\frac{{\mathbf {n}}_3}{2}\big )\nonumber \\&+\,\dfrac{2}{3}\big ({\mathbf {f}}_4\frac{{\mathbf {n}}_1}{2}+{\mathbf {f}}_2\frac{{\mathbf {n}}_2}{2} +{\mathbf {f}}_5\frac{{\mathbf {n}}_3}{2}\big )\nonumber \\&+\,\dfrac{2}{3}\big ({\mathbf {f}}_6\frac{{\mathbf {n}}_1}{2}+{\mathbf {f}}_5\frac{{\mathbf {n}}_2}{2} +{\mathbf {f}}_3\frac{{\mathbf {n}}_3}{2}\big )\nonumber \\&-\,2\big ({\mathbf {f}}_5\frac{{\mathbf {n}}_1}{2}+{\mathbf {f}}_6\frac{{\mathbf {n}}_2}{2} +{\mathbf {f}}_4\frac{{\mathbf {n}}_3}{2}\big )\nonumber \\= & {} \frac{2}{3}\bigg (\int _{K_{1}}\text {div }{\mathbf {f}}^{(1)} dx +\int _{K_{2}}\text {div }{\mathbf {f}}^{(1)} dx\nonumber \\&+\int _{K_3}\text {div }{\mathbf {f}}^{(1)} dx\bigg ) +2\int _{K_4}\text {div }{\mathbf {f}}^{(1)} dx \end{aligned}$$

(40)

where ${\mathbf {f}}^{(1)}$ denotes the ${\mathbb {P}}^{1}$ interpolant of the flux in each of the sub-triangles of Fig. 1. The change in signs comes from the fact that the inward normals of the sub-triangle $K_{4}$, appearing in the expression of $\int _{K_4}\text {div }{\mathbf {f}}^{(1)} dx,$ are the opposite of the vectors ${\mathbf {n}}_{i}/2$.

The relation (40) demonstrates that a very simple relation, with positive weights, exists between the ${\mathbb {P}}^{1}$ residuals in the sub-triangles and the quadratic residual in K.

3.2 Quadratic Approximation on Tetrahedral Elements

The formula (40) has no equivalent in dimension three if we use Lagrange finite elements, but this problem can be solved by a change of basis as explained in the following. For better clarity, we detail the reference quadratic tetrahedron K in Fig. 2. The ten nodes of K are numbered 1 to 10 and their coordinates are given by:

$$\begin{aligned} 1= & {} (0,0,0) ; \quad 2=(1,0,0) ; \quad 3=(0,0,1) ; \quad 4=(0,1,0) ;\\ 5= & {} (1/2,0,0); \quad 6=(1/2,0,1/2) ; \quad 7=(0,0,1/2) ;\\ 8= & {} (1/2,1/2,0) ; \quad 9=(0,1/2,1/2) ; \quad 10=(0,1/2,0) . \end{aligned}$$

We subdivide the tetrahedron into eight sub-tetrahedrons as shown in Fig. 2. The central octahedron $(5,6,7,8,9,10)$ can be split with the diagonal $(7,8)$, $(6,10)$ or $(5,9),$ as shown in Fig. 3, and the sub-tetrahedrons obtained are then:

Exterior tetrahedrons:

$K_{1}=(5,7,10,1)$; $K_{2}=(2,6,5,8)$; $K_{3}=(3,9,6,7)$; $K_{4}=(8,9,10,4)$.
With diagonal (7, 8):

$K_{5}=(6,9,8,7)$; $K_{6}=(8,9,10,7)$; $K_{7}=(8,10,5,7)$; $K_{8}=(8,6,5,7)$.
With diagonal (6, 10):

$K_{5}=(6,9,8,7)$; $K_{6}=(8,9,10,7)$; $K_{7}=(8,10,5,7)$; $K_{8}=(8,6,5,7)$.
With diagonal (5, 9):

$K_{5}=(5,9,6,7)$; $K_{6}=(5,9,10,7)$; $K_{7}=(5,9,6,8)$; $K_{8}=(5,9,10,8)$.

Now if we try to compute the equivalent of formula (40) for the three dimensional case using a Lagrange interpolation, we find that the coefficients corresponding to the sub-tetrahedrons $K_{1}$, $K_{2}$, $K_{3}$, $K_{4}$ are equal to $0$. This is due to a property of the ${\mathbb {P}}^{2}$ Lagrange basis functions in dimension three:

$$\begin{aligned} \int _{K}\frac{\partial \varphi _{j}}{\partial x_{d}}=0,\quad j=1,\ldots ,4 , \quad d=1,2,3. \end{aligned}$$

This implies that the nodal residuals corresponding to the vertices of the tetrahedron $K$ will not contribute to Eq. (20), which may give a problematic residual scheme. To avoid this problem, while still using quadratic elements, we use the Bézier basis functions (9). More precisely, instead of using a Lagrange interpolation of the flux, we approximate the flux with a Bézier expansion. From the linear algebra point of view, this is a change of basis, but with this basis we obtain positive weights.

If we denote by $\Phi $ a quadratic polynomial, with the notation of definition (9), the expansion of $\Phi $ in the Bézier basis reads

$$\begin{aligned} \Phi =\sum _{\sigma =1}^{10 }\Phi _{\sigma } B_{\sigma } \end{aligned}$$

and the difference with Lagrange interpolation is that $\Phi _{\sigma }=\Phi (\sigma )$ only for $\sigma =1, 2, 3, 4$. For the other nodes, we have

$$\begin{aligned} \Phi (\sigma )=\frac{\Phi _{j_1}+\Phi _{j_2}}{4}+\frac{\Phi _{\sigma }}{2} \end{aligned}$$

(41)

where ${j_1}$ and ${j_2}$ are the two vertices of the tetrahedron on the edge where $\sigma $ lies. It is clear that $\Phi _{\sigma }=\Phi (\sigma )+O(h^2)$. We also notice that

$$\begin{aligned}\nabla B_{\sigma } = \sum _{j=1}^{4}P_j(\lambda _1,\lambda _2,\lambda _3,\lambda _4) \nabla \lambda _j \end{aligned}$$

where $P_j(\lambda _1,\lambda _2,\lambda _3,\lambda _4)$ is a polynomial in $\lambda _1$, $\lambda _2$, $\lambda _3$, and $\lambda _4$ with positive coefficients.

Now we need to find an equivalent formula of (40) for the three dimensional case with the Bézier basis functions. In the same spirit as for (40), we expand the flux in term of Bézier polynomials:

$$\begin{aligned} {\mathbf {f}}^B:=\sum ^{10}_{\sigma =1}{\mathbf {f}}_{\sigma }B_{\sigma } \end{aligned}$$

where for $\sigma =1,\ldots ,4$, ${\mathbf {f}}_\sigma $ is the value of the flux at the vertices of the tetrahedron and when $\sigma >4$, ${\mathbf {f}}_\sigma $ is defined according to (41).

With the notation $N_i=\nabla \lambda _{i}$, the gradients of the Bézier basis functions write:

$$\begin{aligned} \begin{aligned} \nabla B_1&=2\lambda _1 N_1; \quad \nabla B_2=2\lambda _2 N_2; \quad \nabla B_3=2\lambda _3 N_3; \quad \nabla B_4=2\lambda _4 N_4; \\ \nabla B_5&=2\big ( \lambda _1N_2+\lambda _2N_1\big ); \quad \nabla B_6=2\big ( \lambda _2N_3+\lambda _3N_2\big );\\ \nabla B_7&=2\big ( \lambda _1N_3+\lambda _3N_1\big ); \quad \nabla B_8=2\big ( \lambda _2N_4+\lambda _4N_2\big ); \\ \nabla B_9&=2\big ( \lambda _3N_4+\lambda _4N_3\big ); \quad \nabla B_{10}=2\big ( \lambda _1N_4+\lambda _4N_1\big ) \ . \end{aligned} \end{aligned}$$

(42)

With ${\mathbf {n}}_j=\int _K N_j$, we find the following formula for the three dimensional case:

$$\begin{aligned} 2\int _K \text { div }{\mathbf {f}}^B dx= & {} 2\sum _{\sigma =1}^{10} \big (\int _K {\mathbf {f}}_{\sigma }\text {div } B_{\sigma } dx\big )\nonumber \\= & {} ( {\mathbf {f}}_1 {\mathbf {n}}_1 + {\mathbf {f}}_2 {\mathbf {n}}_2+ {\mathbf {f}}_3 {\mathbf {n}}_3+{\mathbf {f}}_4{\mathbf {n}}_4)\nonumber \\&+\,({\mathbf {f}}_5 {\mathbf {n}}_2+{\mathbf {f}}_5 {\mathbf {n}}_1) +( {\mathbf {f}}_6 {\mathbf {n}}_3+{\mathbf {f}}_6 {\mathbf {n}}_2)\nonumber \\&+\, ({\mathbf {f}}_7 {\mathbf {n}}_1 + {\mathbf {f}}_7{\mathbf {n}}_3)+({\mathbf {f}}_8{\mathbf {n}}_4+{\mathbf {f}}_8{\mathbf {n}}_2)\nonumber \\&+\,({\mathbf {f}}_9{\mathbf {n}}_3 +{\mathbf {f}}_9{\mathbf {n}}_4)+({\mathbf {f}}_{10}{\mathbf {n}}_4+{\mathbf {f}}_{10}{\mathbf {n}}_1)\nonumber \\= & {} ({\mathbf {f}}_1 {\mathbf {n}}_1+{\mathbf {f}}_5 {\mathbf {n}}_2+{\mathbf {f}}_{10} {\mathbf {n}}_4+{\mathbf {f}}_7 {\mathbf {n}}_3)\qquad (I)\nonumber \\&+\, ({\mathbf {f}}_2 {\mathbf {n}}_2+{\mathbf {f}}_6 {\mathbf {n}}_3+{\mathbf {f}}_5 {\mathbf {n}}_1+{\mathbf {f}}_8 {\mathbf {n}}_4)\quad (II)\nonumber \\&+\,({\mathbf {f}}_3 {\mathbf {n}}_3+{\mathbf {f}}_9 {\mathbf {n}}_4+{\mathbf {f}}_6 {\mathbf {n}}_2+{\mathbf {f}}_7 {\mathbf {n}}_1)\quad (III)\nonumber \\&+\,({\mathbf {f}}_8 {\mathbf {n}}_2+{\mathbf {f}}_9 {\mathbf {n}}_3+{\mathbf {f}}_{10} {\mathbf {n}}_1+{\mathbf {f}}_4 {\mathbf {n}}_4)\quad (IV) \end{aligned}$$

(43)

The quantities (I), (II), (III), (IV) are interpreted as the integrals of the divergence of the following one degree polynomial functions:

$$\begin{aligned} (I)= & {} \int _{K_1} \text {div }\tilde{{\mathbf {f}}}^{(1)} dx\\ (II)= & {} \int _{K_2} \text {div }\tilde{{\mathbf {f}}}^{(1)} dx\\ (III)= & {} \int _{K_3} \text {div }\tilde{{\mathbf {f}}}^{(1)} dx\\ (IV)= & {} \int _{K_4} \text {div }\tilde{{\mathbf {f}}}^{(1)} dx \end{aligned}$$

where

On $K_1$, $\tilde{{\mathbf {f}}}^{(1)}={\mathbf {f}}_1 \lambda _1+{\mathbf {f}}_5 \lambda _2+{\mathbf {f}}_{10} \lambda _4+{\mathbf {f}}_7 \lambda _3$ ,
on $K_2$, $\tilde{{\mathbf {f}}}^{(1)}={\mathbf {f}}_2 \lambda _2+{\mathbf {f}}_6 \lambda _3+{\mathbf {f}}_5 \lambda _1+{\mathbf {f}}_8 \lambda _4$ ,
on $K_3$, $\tilde{{\mathbf {f}}}^{(1)}={\mathbf {f}}_3 \lambda _3+{\mathbf {f}}_9 \lambda _4+{\mathbf {f}}_6 \lambda _2+{\mathbf {f}}_7 \lambda _1$ ,
on $K_4$, $\tilde{{\mathbf {f}}}^{(1)}={\mathbf {f}}_8 \lambda _2+{\mathbf {f}}_9 \lambda _3+{\mathbf {f}}_{10} \lambda _1+{\mathbf {f}}_4 \lambda _4$.

Strictly speaking, ${\mathbf {f}}_{\sigma }$ is not the value of the flux at the vertices of $K_j$, but this is not a problem, following [7].

The equality (43) shows like equality (40) a relation between the quadratic residual in K and the affine residuals in the sub-tetrahedrons, with positive weights independent of the splitting of the central octahedron.

3.3 General Subdivision Formulas for all Orders of Approximation

The formulas presented above for the specific case of quadratic approximation can be generalized to any degree of approximation as we show now. For dimension two and dimension three, we use a Bézier basis to obtain consistent formulas for both cases. The derivation of these formulas is quite technical, so in order to eliminate any ambiguity, we detail exactly the construction of the formulas with figures, for dimension two and for dimension three, at the risk of being repetitive. A lot of indices are required for the presentation of the formulas and we notify the reader that the notation used here is local to this subsection. With a slight abuse of notation, in this section the term ${\mathbb {P}}^k$ triangle (respectively ${\mathbb {P}}^k$ tetrahedron) represents the finite element $(K,{\mathbb {P}}^k,\Sigma _K)$ where K is a triangle (respectively a tetrahedron). We denote ${\mathbf {f}}^B$ the $k^{\text {th}}$-order Bézier expansion of the flux ${\mathbf {f}}$:

$$\begin{aligned} {\mathbf {f}}^B=\sum _{\sigma \in \Sigma _K}{\mathbf {f}}_\sigma B_\sigma . \end{aligned}$$

(44)

3.3.1 The Two Dimensional Case

For the ${\mathbb {P}}^k$ triangle, the sub-triangles are numbered from the top to the bottom of the triangle as shown in Fig. 1. This numbering is consistent with the iterative construction of the ${\mathbb {P}}^k$ triangle from the ${\mathbb {P}}^{k-1}$ triangle: a set of ${\mathbb {P}}^1$ sub-triangles is added at the bottom of the ${\mathbb {P}}^{k-1}$ triangle, and the whole set constitutes the ${\mathbb {P}}^k$ triangle. With this construction, the ${\mathbb {P}}^k$ triangle is constituted of k layers of ${\mathbb {P}}^1$ sub-triangles: the ${\mathbb {P}}^1$ triangle is constituted of the layer 1, the ${\mathbb {P}}^2$ triangle is constituted of layers 1 and 2, and so on until the ${\mathbb {P}}^k$ triangle. Each layer $i, i=1,\ldots ,k$ is of length $2i-1$.

Thanks to this numbering, we can now derive the general subdivision formula in the two dimensional case. We first need the following preliminary results:

$$\begin{aligned} \begin{aligned} \int _K \nabla B_{i} =&\frac{2}{k+1} \int _K \nabla \lambda _{i} ,\ i=1,2,3 \\ \int _K \nabla B_{i} =&\frac{2}{k+1} \big ( \int _K \nabla \lambda _{i_1} + \int _K \nabla \lambda _{i_2} \big ) ,\ i=4,\ldots ,3k \\ \int _K \nabla B_{i} =&\frac{2}{k+1} \big ( \int _K \nabla \lambda _{1} + \int _K \nabla \lambda _{2} + \int _K \nabla \lambda _{3}\big )=0 ,\ i=3k+1,\ldots ,\frac{(k+1)(k+2)}{2} \end{aligned} \end{aligned}$$

where $i_1$ and $i_2$ are the vertices of the segment that contains the node i. The results above are proved with the following formula:

$$\begin{aligned} \int _K \lambda _1^{i_1} \ \lambda _{2}^{i_{2}} \ \lambda _{3}^{i_{3}} = 2\ \text {mes}(K) \frac{i_1! \ i_2! \ i_3!}{(i_1+i_2+i_3+2)!}. \end{aligned}$$

(45)

We can now present the subdivision formula for the k-th order in dimension two.

Proposition 3.1

In dimension two, we have the following formula, for $k \ge 1$:

$$\begin{aligned} \begin{aligned} \int _K \text { div }\, {\mathbf {f}}^B dx&= \frac{2}{k+1} \sum \limits ^{k}_{i=1} \bigg ( \int _{K_{(i-1)^2+1}} \text {div }\tilde{\mathbf{f }}^{(1)} dx + \int _{K_{(i-1)^2+3}} \text {div }\tilde{\mathbf{f }}^{(1)} dx + \cdots \\&\quad + \int _{K_{i^2-2}}\text {div }\tilde{\mathbf{f }}^{(1)} dx +\int _{K_{i^2}} \text {div }\tilde{\mathbf{f }}^{(1)} dx \bigg ). \\ \end{aligned} \end{aligned}$$

(46)

Proof

First, let us precise the notation used in the proof. The indices in $\mathbf{n }_{j_{1}}$ and $\mathbf{n }_{j_{2}}$ represent the vertices of the segment containing the node j. The triangles $K_l, \ l=1,\ldots ,k^2$ are the ${\mathbb {P}}^{1}$ sub-triangles of the triangle K (see Fig. 1). For a given l, the vertices of $K_l$ are denoted by $l_1,l_2,l_3$ and the function $\tilde{\mathbf{f }}^{(1)}$ is defined by:

$$\begin{aligned} \tilde{\mathbf{f }}^{(1)} = \mathbf{f }_{l_1} \lambda _1 + \mathbf{f }_{l_2} \lambda _2 + \mathbf{f }_{l_3} \lambda _3 \ . \end{aligned}$$

(47)

This definition depends on the ordering of the vertices of $K_l$. They are ordered in the reference triangle as in Fig. 4. The quantities $\mathbf{f }_{j}$ are not the values of $\mathbf{f }$ at the nodes j, they are considered as the values of $\mathbf{f }$ at the Bézier control points. Practically, they are computed from the definition of the Bézier basis functions (9) and by solving a linear system (Fig. 5).

We denote by ${\mathbf {n}}_{i}$ the integral $\int _{K}\nabla \lambda _{i}\,dx$, $i=1,2,3$ and we have:

$$\begin{aligned} \int _K\text {div}\, {\mathbf {f}}^B dx= & {} \sum ^{\frac{(k+1)(k+2)}{2}}_{j=1} \mathbf{f }_{j} \cdot \int _K \nabla B_{j} \nonumber \\= & {} \frac{2}{k+1} \bigg ( \sum ^{3}_{j=1} \mathbf{f }_{j} \cdot \int _K \nabla \lambda _{j} \nonumber \\&+\,\sum ^{3k}_{j=4} \mathbf{f }_{j} \cdot \big ( \int _K \nabla \lambda _{j_1} + \int _K \nabla \lambda _{j_2} \big ) \nonumber \\&+\, \sum ^{\frac{(k+1)(k+2)}{2}}_{j=3k+1} \mathbf{f }_{j} \cdot \big ( \int _K \nabla \lambda _{1} + \int _K \nabla \lambda _{2} + \int _K \nabla \lambda _{3}\big ) \bigg ) \nonumber \\= & {} \frac{2}{k+1} \bigg ( \mathbf{f }_{1} \cdot \mathbf{n }_{1} + \mathbf{f }_{2} \cdot \mathbf{n }_{2} + \mathbf{f }_{3} \cdot \mathbf{n }_{3} \nonumber \\&+ \,\mathbf{f }_{4} \cdot \big (\mathbf{n }_{4_{1}} + \mathbf{n }_{4_{2}} \big ) + \cdots + \mathbf{f }_{3k} \cdot \big (\mathbf{n }_{3k_{1}} + \mathbf{n }_{3k_{2}} \big ) \nonumber \\&+\, \mathbf{f }_{3k+1} \cdot \big (\mathbf{n }_1 + \mathbf{n }_2 + \mathbf{n }_3 \big ) + \cdots + \mathbf{f }_{\frac{(k+1)(k+2)}{2}} \cdot \big (\mathbf{n }_1 + \mathbf{n }_2 + \mathbf{n }_3 \big ) \bigg ) \nonumber \\= & {} \frac{2}{k+1} \sum ^{k}_{i=1} \bigg ( \int _{K_{(i-1)^2+1}} \text {div }\tilde{\mathbf{f }}^{(1)} dx + \int _{K_{(i-1)^2+3}} \text {div }\tilde{\mathbf{f }}^{(1)} dx + \cdots \nonumber \\&+ \int _{K_{i^2-2}}\text {div}\tilde{\mathbf{f }}^{(1)} dx +\int _{K_{i^2}} \text {div}\tilde{\mathbf{f }}^{(1)} dx \bigg ). \end{aligned}$$

(48)

$\square $

3.3.2 The Three Dimensional Case

The construction is similar to the two dimensional case, but slightly more complicated. The ${\mathbb {P}}^k$ tetrahedron is iteratively constructed by layers constituted of ${\mathbb {P}}^1$ tetrahedrons. Each i-layer, $i=1,\ldots ,k$ has a bottom face constituted of $i^2$ ${\mathbb {P}}^1$ triangles. The iterative layers and their bottom faces are described in Fig. 6. Let us now consider the bottom face of the layer i. Its triangles are numbered from the top to the bottom, like in the two dimensional case, as shown in Fig. 7. Each triangle is a face of a ${\mathbb {P}}^1$ tetrahedron of the layer i, and we number this tetrahedron with the number of such triangles. With this numbering, we can obtain a general three dimensional subdivision formula. We use again the Bézier basis functions (9) like in the two dimensional case, and we have the following preliminary results:

$$\begin{aligned} \begin{aligned} \int _K \nabla B_{i} =&\frac{6}{{(k+1)(k+2)}} \int _K \nabla \lambda _{i} ,\ i \in S_v \\ \int _K \nabla B_{i} =&\frac{6}{{(k+1)(k+2)}} \big ( \int _K \nabla \lambda _{i_1} + \int _K \nabla \lambda _{i_2} \big ) ,\ i \in S_e \\ \int _K \nabla B_{i} =&\frac{6}{{(k+1)(k+2)}} \big ( \int _K \nabla \lambda _{i_1} + \int _K \nabla \lambda _{i_2} + \int _K \nabla \lambda _{i_3}\big ) ,\ i \in S_f \\ \int _K \nabla B_{i} =&\frac{6}{{(k+1)(k+2)}} \big ( \int _K \nabla \lambda _{1} + \int _K \nabla \lambda _{2} + \int _K \nabla \lambda _{3} + \int _K \nabla \lambda _{4} \big )=0 ,\ i \in S_t \end{aligned} \end{aligned}$$

where

$S_{v}$ is the set of vertices of K,
$S_{e}$ is the set nodes situated on the edges of K, excepted the vertices ($i_{1}$ and $i_{2}$ are the vertices of the edge that contains the node i),
$S_{f}$ is the set nodes situated on the faces of K, excepted those on the edges ($i_1$, $i_{2}$ and $i_{3}$ are the vertices of the face containing the node i),
$S_{t}$ is the set of nodes situated strictly inside K,
$B_{i}$ is the $k^{\text {th}}$-order Bézier basis function associated to the node i.

The results above are proved with the 3D version of (45):

$$\begin{aligned} \int _K \lambda _1^{i_1} \ \lambda _{2}^{i_{2}} \ \lambda _{3}^{i_{3}} \ \lambda _{4}^{i_{4}} = 6\ \text {mes}(K) \frac{i_1! \ i_2! \ i_3! \ i_4! }{(i_1+i_2+i_3+i_4 + 3)!}. \end{aligned}$$

(49)

We can now present the subdivision formula for the k-th order in dimension three.

Proposition 3.2

In dimension three, we have the following formula, for $k \ge 1$:

$$\begin{aligned} \begin{aligned} \int _K\text {div}\, {\mathbf {f}}^B dx =\,&\frac{6}{{(k+1)(k+2)}} \sum ^{k}_{i=1}\sum ^{i}_{i'=1} \bigg ( \ \int _{K_{(i'-1)^2+1}} \text {div }\tilde{\mathbf{f }}^{(1)} dx + \int _{K_{(i'-1)^2+3}} \text {div }\tilde{\mathbf{f }}^{(1)} dx + \cdots \\&+\int _{K_{i'^2-2}}\text {div }\tilde{\mathbf{f }}^{(1)} dx +\int _{K_{i'^2}} \text {div }\tilde{\mathbf{f }}^{(1)} dx \ \ \bigg ). \end{aligned} \end{aligned}$$

(50)

Proof

Let us precise the notation used in the proof. We denote by $\mathbf f _{j}$ the value of the flux at the global node j, and by $\mathbf f _{i_{\alpha _{\beta }}}$ the flux at the node $\beta $ of the tetrahedron $\alpha $ of the layer i. We precise the sublayer, it indicates in which layer of the bottom face of triangles the tetrahedron is taken (see Fig. 7). It corresponds to the index $i'$. The indices in $\mathbf n _{j_{1}}$, $\mathbf n _{j_{2}}$ (for $j \in S_e$) represent the vertices of the edge containing the node j and in $\mathbf n _{j_{1}}$, $\mathbf n _{j_{2}}$, $\mathbf n _{j_{3}}$ (for j $\in S_f$) they represent the vertices of the face containing the node j. For a given layer i, $i=1,\ldots ,k$, the tetrahedrons $K_l$ are ${\mathbb {P}}^{1}$ sub-tetrahedrons of the tetrahedron K in the layer i. The number l, with $l=(i-1)^2,\ldots ,i^2$, is local to the layer i and coincides with the number of the triangle l of the bottom face of the layer i which is a face of $K_l$ (see Figs. 6, 7). With this numbering, we have a general formula in the three dimensional case that is similar to the two dimensional case. For a given l, the vertices of $K_l$ are denoted by $l_1,l_2,l_3,l_4$ and the function $\tilde{\mathbf{f }}^{(1)}$ is defined by:

$$\begin{aligned} \tilde{\mathbf{f }}^{(1)} = \mathbf f _{l_1} \lambda _1 + \mathbf f _{l_2} \lambda _2 + \mathbf f _{l_3} \lambda _3 + \mathbf f _{l_4} \lambda _4 \ . \end{aligned}$$

(51)

This definition depends on the ordering of the vertices of $K_l$. They are ordered in the reference tetrahedron as in Fig. 8. Like in the two dimensional case, the quantities $\mathbf f _{j}$ are not the values of $\mathbf f $ at the nodes j, they are considered as the values of $\mathbf f $ at the Bézier control points and in practice are computed from the definition of the Bézier basis functions (9) and by solving a linear system.

We denote by ${\mathbf {n}}_{i}$ the integral $\int _{K}\nabla \lambda _{i}\,dx$, $i=1,2,3$ and we have:

$$\begin{aligned} \int _K\text {div}\, {\mathbf {f}}^B dx= & {} \sum ^{\frac{(k+1)(k+2)(k+3)}{6}}_{j=1} \mathbf f _{j} \cdot \int _K \nabla B_{j} \nonumber \\= & {} \frac{6}{{(k+1)(k+2)}} \bigg ( \sum _{j \in S_v} \mathbf f _{j} \cdot \int _K \nabla \lambda _{j} \nonumber \\&+ \,\sum _{j \in S_e} \mathbf f _{j} \cdot \big ( \int _K \nabla \lambda _{j_1} + \int _K \nabla \lambda _{j_2} \big ) \nonumber \\&+ \,\sum _{j \in S_f} \mathbf f _{j} \cdot \big ( \int _K \nabla \lambda _{j_1} + \int _K \nabla \lambda _{j_2} + \int _K \nabla \lambda _{j_3}\big ) \nonumber \\&+\, \sum _{j \in S_t} \mathbf f _{j} \cdot \big ( \int _K \nabla \lambda _{1} + \int _K \nabla \lambda _{2} + \int _K \nabla \lambda _{3} + \int _K \nabla \lambda _{4} \big ) \bigg ) \nonumber \\= & {} \frac{6}{{(k+1)(k+2)}} \bigg ( \sum _{j \in S_v} \mathbf f _{j} \cdot \mathbf n _{j} \nonumber \\&+\, \sum _{j \in S_e} \mathbf f _{j} \cdot \big (\mathbf n _{j_{1}} + \mathbf n _{j_{2}} \big ) \nonumber \\&+\, \sum _{j \in S_f} \mathbf f _{j} \cdot \big (\mathbf n _{j_{1}} + \mathbf n _{j_{2}} + \mathbf n _{j_{3}} \big ) \nonumber \\&+\, \sum _{j \in S_t} \mathbf f _{j} \cdot \big (\mathbf n _{1} + \mathbf n _{{2}} + \mathbf n _{{3}} + \mathbf n _{{4}} \big ) \ \bigg ) \nonumber \\= & {} \frac{6}{{(k+1)(k+2)}} \bigg ( \mathbf f _{1_{1_{1}}}\cdot \mathbf n _{1} + \mathbf f _{1_{1_{2}}}\cdot \mathbf n _{2} + \mathbf f _{1_{1_{3}}}\cdot \mathbf n _{3} + \mathbf f _{1_{1_{4}}}\cdot \mathbf n _{4} \ \ (\text {lay. 1 })\nonumber \\&+\, \mathbf f _{2_{1_{1}}}\cdot \mathbf n _{1} + \mathbf f _{2_{1_{2}}}\cdot \mathbf n _{2} + \mathbf f _{2_{1_{3}}}\cdot \mathbf n _{3} + \mathbf f _{2_{1_{4}}}\cdot \mathbf n _{4} \ \ (\text {lay. 2, tet. 1, sublay. 1})\nonumber \\&+\, \mathbf f _{2_{2_{1}}}\cdot \mathbf n _{1} + \mathbf f _{2_{2_{2}}}\cdot \mathbf n _{2} + \mathbf f _{2_{2_{3}}}\cdot \mathbf n _{3} + \mathbf f _{2_{2_{4}}}\cdot \mathbf n _{4} \ \ (\text {lay}. \,2, \text {tet}. \,2, \text {sublay}.\, 2)\nonumber \\&+\, \mathbf f _{2_{4_{1}}}\cdot \mathbf n _{1} + \mathbf f _{2_{4_{2}}}\cdot \mathbf n _{2} + \mathbf f _{2_{4_{3}}}\cdot \mathbf n _{3} + \mathbf f _{2_{4_{4}}}\cdot \mathbf n _{4} \ \ (\text {lay}. \,2, \text {tet}. \,4, \text {sublay}. \,2)\nonumber \\&+ \cdots \nonumber \\&+\, \mathbf f _{k_{1_{1}}}\cdot \mathbf n _{1} + \mathbf f _{k_{1_{2}}}\cdot \mathbf n _{2} + \mathbf f _{k_{1_{3}}}\cdot \mathbf n _{3} + \mathbf f _{k_{1_{4}}}\cdot \mathbf n _{4} \ \ (\text {lay}. \,k, \text {tet}. \,1, \text {sublay}.\, 1)\nonumber \\&+\, \mathbf f _{k_{2_{1}}}\cdot \mathbf n _{1} + \mathbf f _{k_{2_{2}}}\cdot \mathbf n _{2} + \mathbf f _{k_{2_{3}}}\cdot \mathbf n _{3} + \mathbf f _{k_{2_{4}}}\cdot \mathbf n _{4} \ \ (\text {lay}. \,k, \text {tet}. \,2, \text {sublay}. \,2)\nonumber \\&+\, \mathbf f _{k_{4_{1}}}\cdot \mathbf n _{1} + \mathbf f _{k_{4_{2}}}\cdot \mathbf n _{2} + \mathbf f _{k_{4_{3}}}\cdot \mathbf n _{3} + \mathbf f _{k_{4_{4}}}\cdot \mathbf n _{4} \ \ \left( \text {lay}. \,k, \text {tet}. \,4, \text {sublay}. \,2\right) \nonumber \\&+ \cdots \nonumber \\&+\, \mathbf f _{k_{{(k-1)^2+1}_{1}}}\cdot \mathbf n _{1}+\mathbf f _{k_{{(k-1)^2+1}_{2}}}\cdot \mathbf n _{2} \nonumber \\&+\, \mathbf f _{k_{{(k-1)^2+1}_{3}}}\cdot \mathbf n _{3} + \mathbf f _{k_{{(k-1)^2+1}_{4}}}\cdot \mathbf n _{4} \ \ (\text {lay}. \,k, \text {tet}.\, (k-1)^2+1, \text {sublay}.\, k)\nonumber \\&+ \cdots \nonumber \\&+\, \mathbf f _{k_{{k^2}_{1}}}\cdot \mathbf n _{1} + \mathbf f _{k_{{k^2}_{2}}}\cdot \mathbf n _{2} + \mathbf f _{k_{{k^2}_{3}}}\cdot \mathbf n _{3} + \mathbf f _{k_{{k^2}_{4}}}\cdot \mathbf n _{4} \ \ (\text {lay}. \,\,k, \text {tet}. \,\,k^2, \text {sublay}. \,k) \bigg ) \ \ \nonumber \\= & {} \frac{6}{{(k+1)(k+2)}} \sum ^{k}_{i=1}\sum ^{i}_{i'=1} \bigg ( \ \int _{K_{(i'-1)^2+1}} \text {div }\tilde{\mathbf{f }}^{(1)} dx + \int _{K_{(i'-1)^2+3}} \text {div }\tilde{\mathbf{f }}^{(1)} dx + \cdots \nonumber \\&+ \int _{K_{i'^2-2}}\text {div }\tilde{\mathbf{f }}^{(1)} dx +\int _{K_{i'^2}} \text {div }\tilde{\mathbf{f }}^{(1)} dx \ \ \bigg ). \end{aligned}$$

(52)

$\square $

3.4 Definition of the Nodal Residuals in a Divided Element

We denote here by S the set of ${\mathbb {P}}^1$ sub-elements of a given element K. The term element represents a triangle in dimension two or a tetrahedron in dimension three as above. For a given element K subdivided into smaller elements indexed by $\xi , \xi \in S$ (for example $\xi =K_1,\ldots ,K_4$ in the case of quadratic interpolation in dimension two), we have, as we have proved, a relation between the residual $\Phi ^K$ and the residuals $\Phi ^\xi $ of the sub-elements of K. Once the residuals $\Phi ^\xi $ are computed, the next step is to compute the nodal residuals $\Phi _\sigma ^\xi $, as described in Sect. 2.2. Then, the nodal residuals $\Phi _\sigma ^\xi $ need to be modified in order to satisfy the conservation relation (18), as we show now. We denote by $\gamma _\xi , \xi \in S$ the coefficients given by the formulas of Sect. 3.3.

Proposition 3.3

If the nodal residuals $\Phi _\sigma ^\xi , \sigma \in \Sigma _\xi , \xi \in S$ satisfy the conservation relation (18), then the nodal residuals defined by

$$\begin{aligned} \Phi ^K_\sigma :=\sum _{\xi \in S, \sigma \in \Sigma _\xi }\gamma _\xi \Phi ^{\xi }_\sigma \end{aligned}$$

(53)

satisfy the conservation relation (18).

Proof

With the notation of Sect. 3, ${\mathbf {f}}^{(k)}$ represents the k-th order interpolant of the flux ${\mathbf {f}}$. We assume that

$$\begin{aligned} \forall \xi \in S, \sum _{\sigma \in \Sigma _\xi }\Phi ^{\xi }_{\sigma }=\int _{\partial \xi }{\mathbf {f}}^{(1)}\cdot {\mathbf {n}}\end{aligned}$$

and that

$$\begin{aligned} \Phi ^K_\sigma =\sum _{\xi \in S, \sigma \in \Sigma _\xi }\gamma _\xi \Phi ^{\xi }_\sigma . \end{aligned}$$

We then have

$$\begin{aligned} \begin{aligned} \sum _{\sigma \in \Sigma _K} \Phi _{\sigma }^K=&\sum _{\sigma \in \Sigma _K} \bigg ( \sum _{\xi \in S, \sigma \in \Sigma _\xi }\gamma _\xi \Phi ^{\xi }_\sigma \bigg )\\ =&\sum _{\xi \in S}\gamma _\xi \bigg (\sum _{\sigma \in \Sigma _\xi }\Phi ^{\xi }_{\sigma }\bigg )\\ =&\sum _{\xi \in S}\gamma _\xi \int _{\partial \xi }{\mathbf {f}}^{(1)}\cdot {\mathbf {n}}\\ =&\int _{\partial K} {\mathbf {f}}^{(k)}\cdot {\mathbf {n}}\ dx . \end{aligned} \end{aligned}$$

$\square $

We remark that it is possible to use any scheme inside the elements $\xi \in S$ as long as they satisfy the conservation relation (18) and the nodal residuals in K are defined by relation (53). Under the assumption of the Lax–Wendroff theorem, the new scheme with arbitrary mixed ${\mathbb {P}}^{1}$ and ${\mathbb {P}}^{k}$ elements will also be convergent to a weak solution of the problem because the conservation property (18) is still satisfied (see [4]). We follow the same reasoning for the boundary residuals.

3.5 Practical Implementation

The nodal residuals are modified according to the definition given in Sect. 3.4. We describe now the consequences on the practical implementation of the residual distribution scheme.

3.5.1 Computation of the Nodal Residuals

As the computation of the nodal residuals is made inside each sub-element $\xi $ of a divided element K, it is convenient to introduce $\Phi ^{K,\xi }_{\sigma }$ the contribution of the node $\sigma $ in triangle K brought by the sub-element $\xi $, as it is the quantity that is actually computed. It is defined by

$$\begin{aligned} \Phi ^{K,\xi }_{\sigma }:=\gamma _{\xi }\,\Phi ^{\xi }_{\sigma }. \end{aligned}$$

(54)

3.5.2 Computation of the Jacobian Matrix of the Implicit Scheme

The system of equations (20) is solved with an implicit scheme detailed in Appendix 1. To implement the subdivision formulas, the implicit scheme requires only a slight modification in the definition of the Jacobian matrices. Indeed, to compute the Jacobian matrix, for example in the case of the Lax–Wendroff scheme (21), we need to differentiate the two terms of (21):

$$\begin{aligned} \frac{\partial \Phi ^{\xi }_{\sigma }({\mathbf {u}}_{h})\ }{\partial {\mathbf {u}}_{j}}=\frac{1}{N^{\xi }_{dof}}\frac{\partial }{\partial {\mathbf {u}}_{j}}\big (\,\Phi ^{\xi }({\mathbf {u}}_{h})+\int _{\xi }A \cdot \nabla \varphi _{\sigma } \, \Xi \, A \cdot \nabla {\mathbf {u}}_{h} dx \, \big ) \end{aligned}$$

(55)

and so, according to (53), we have, with the same notation as in Sect. 3.5.1:

$$\begin{aligned} \frac{\partial \Phi ^{K,\xi }_{\sigma }({\mathbf {u}}_{h})\ }{\partial {\mathbf {u}}_{j}}{:=}\gamma _{\xi } \ \frac{\partial \Phi ^{\xi }_{\sigma }({\mathbf {u}}_{h})\ }{\partial {\mathbf {u}}_{j}}. \end{aligned}$$

(56)

3.5.3 Boundary Conditions

The nodal residuals of boundary faces (here we use the term face for either the edge of a triangle in dimension two or the face of a tetrahedron in dimension three) are computed by using the formula (36). If we denote by $\Gamma $ the face of a subdivided element K lying on the boundary of $\Omega $, $\Gamma $ is therefore subdivided into sub-faces. Let $\varsigma $ be such a sub-face. From the relation (53), we see that in order to be consistent with relation (36) in the case of subdivision, we need to multiply the nodal residuals of the subdivided boundary by the sub-division coefficient of the element containing this sub-divided boundary face, and so the contribution of the node $\sigma $ in the face $\Gamma $ brought by the sub-face $\xi $ is

$$\begin{aligned} \Phi _{\sigma } ^{\Gamma ,\varsigma \ }:=\gamma _{\xi } \ \Phi _{\sigma } ^{\varsigma \ } \end{aligned}$$

(57)

and thus the contribution to the global Jacobian is

$$\begin{aligned} \frac{\partial \Phi _{\sigma }^{\Gamma ,\varsigma \ }({\mathbf {u}}_{h})\ }{\partial {\mathbf {u}}_{j}}:=\gamma _{\xi } \ \frac{\partial \Phi _{\sigma } ^{\varsigma \ }({\mathbf {u}}_{h})\ }{\partial {\mathbf {u}}_{j}} \ . \end{aligned}$$

(58)

3.5.4 Choice of Quadrature Formulas

As explained in Sect. 2.5, the relation (39) is verified if we use the same quadrature points at the interfaces between elements. In the case of an interface between two elements of the same degree, we simply use the same quadrature formula, and so the requirements of relation (39) are automatically satisfied. For an interface between a subdivided element and a non-subdivided one, we use quadrature formulas such that all the quadrature points at the interface physically coincide.

4 Numerical Results

We present now some numerical results for different speed flows in dimension two and three that illustrate the theoretical results exposed above. Even if the formulas presented in Sect. 3.3 make it possible to use any polynomial order, for simplicity the test cases presented in this paper use ${\mathbb {P}}^{1}$ and ${\mathbb {P}}^{2}$ elements. As explained in Sect. 2.3, for subsonic flows we use the Lax–Wendroff scheme (21) and for transonic and faster flows we use the Rusanov scheme (35).

Our approach can be described as follows. In all test cases except the subsonic test case, the mesh is mostly made of ${\mathbb {P}}^{2}$ elements except in the shock zone where ${\mathbb {P}}^{1}$ elements are used. The shock zone is located by a shock detector (see [5]) based here on the variation of pressure inside each element of the mesh:

$$\begin{aligned} {\varphi _{K}}=\max _{\sigma \in \Sigma _K}\Bigg (\max _{\ K',\,\sigma \in \Sigma _{K'}}\frac{\displaystyle |\max _{\nu \in \Sigma _{K'}}p_{\nu }-\displaystyle \min _{\nu \in \Sigma _{K'}}p_{\nu }|}{\mid \displaystyle \max _{\nu \in \Sigma _{K'}}p_{\nu } \mid +\mid \displaystyle \min _{\nu \in \Sigma _{K'}}p_{\nu }\mid +\,\epsilon }\Bigg ) \end{aligned}$$

(59)

where $p_\nu $ is the pressure at the node $\nu $, $K,K'$ are elements of the mesh ${\mathscr {T}}_h$ and $\epsilon $ is the machine epsilon (in our implementation, $\epsilon =1e-16$). This shock detector is used as follows. We set a threshold $\varphi $ which depends on the test case, we start with a mesh that contains only subdivided ${\mathbb {P}}^{1}$ elements, then after a short number of iterations (which depends on the test case too), the values $\varphi _K$ are computed for each element K. If the value $\varphi _K$ is above the threshold $\varphi $, then the element remains subdivided with ${{\mathbb {P}}}^{1}$ elements, otherwise, the element becomes a ${{\mathbb {P}}}^{2}$ element and is not subdivided. As a result, we use ${{\mathbb {P}}}^{1}$ elements in the shock zone, and ${{\mathbb {P}}}^{2}$ elements everywhere else. The threshold $\varphi $ is determined empirically for each test case by initially running the same simulation with different values of $\varphi $ for a short number of iterations. The optimal value of $\varphi $ is the one for which the shock zone seems to be visually the closest to the zone where there are strong variations of the isolines of pressure.

We precise now what happens at the interface between two elements. To illustrate this, we have represented in Fig. 9 the possible transformations of two adjacent elements in the quadratic two-dimensional case. As described above, the two adjacent elements are initially subdivided into ${\mathbb {P}}^1$ elements. Then, they can be transformed into either one element subdivided into ${{\mathbb {P}}}^1$ elements and one ${{\mathbb {P}}}^2$ element (case a and c), or they can be transformed into two ${{\mathbb {P}}}^2$ elements (case b). It is important to remark that the method does not generate hanging nodes, in dimension two or three and for any degree k of approximation.

4.1 Subsonic Flow

We show here, just for theoretical purposes, how our method behaves with the mesh of a NACA0012 wing profile constituted of randomly subdivided elements. The inflow condition is Mach=0.5, the pressure is $P_{inlet}=0.7$ and half of the elements of the mesh are randomly selected and subdivided into ${\mathbb {P}}^{1}$ elements. As the flow is subsonic, we use the Lax–Wendroff scheme (21). The mesh is shown in Fig. 10.

We can see, in Fig. 11, a comparison of the convergence curves when ${\mathbb {P}}^{1}$, ${\mathbb {P}}^{2}$ and mixed elements (${\mathbb {P}}^{2}$ elements and subdivided ${\mathbb {P}}^{1}$ elements) are used. We remark that with the same level of convergence for the three types of finite elements, the convergence speed with subdivided elements is as expected between those obtained with ${\mathbb {P}}^{1}$ and ${\mathbb {P}}^{2}$ elements.

As stated above, for this test case the elements are arbitrarily subdivided and we do not take advantage of p-adaptation to improve the accuracy of the solution. In the next test cases, the elements will be subdivided according to the properties of the solution.

4.2 Transonic Flow

We test our method on a Naca0012 wing profile with Mach=0.8, a pressure of 0.71 and an angle of attack of 1.25 degrees. For this problem, we use the limited Rusanov scheme (35), more adapted for transonic flows than the Lax–Wendroff scheme, as explained in Sect. 2.3. As small size, low-order elements capture irregular solutions better than high-order elements, which to the contrary approximate smooth solutions better than low-order elements [9], we use the shock detector (59) that allows to use ${\mathbb {P}}^{1}$ elements only where the shock is detected and ${\mathbb {P}}^{2}$ elements otherwise. The mesh is shown in Fig. 12, the subdivided elements are dark-colored and correspond to the elements containing a strong variation of pressure. The convergence curve obtained with p-adaptation, Fig. 13, shows a jump of the residual due to the switch from ${\mathbb {P}}^{1}$ elements to ${\mathbb {P}}^{2}$ elements everywhere except in the shock zone. After the jump, we observe the convergence of the residual. For this test case ${\varphi }$ was set to 1.5, and the switch was set after 300 iterations. We remark from Fig. 14, that the position of the shock obtained with mixed elements is in very good agreement with the position predicted using classical ${\mathbb {P}}^{2}$ or ${\mathbb {P}}^{1}$ elements. This is important for the validation of our method and proves that we are here consistent with the results obtained with classical ${\mathbb {P}}^{1}$ and ${\mathbb {P}}^{2}$ elements.

In addition, we make the following interesting observation. As we use smaller ${\mathbb {P}}^{1}$ elements in the discontinuous zone where they are better suited to capture a discontinuity than coarser ${\mathbb {P}}^{2}$ elements, we obtain a better representation of the shock, as with the ${\mathbb {P}}^{1}$ mesh. This observation is confirmed if we compare, in Fig. 15, the solutions obtained with p-adaptation to the solution obtained with a ${\mathbb {P}}^{2}$ mesh. We can see that the shock is better represented with the p-adaptive solution.

4.3 Supersonic Flow

Now we present a numerical test with a higher speed, Mach=3. Because of the higher speed used, such test cases can be more difficult to run. The application of p-adaptation, even to a little number of elements (only in the shock zone), allows convergence of the residual and proves to be an efficient approach to simulate such phenomenon. Since the number of sub-divided elements is small the method remains mostly ${\mathbb {P}}^{2}$ based.

The following parameters are set: the mesh is made of 3749 vertices and contains a sphere of diameter 1, centered in 0, which is moving at Mach=3.0.

We divide the boundary conditions into four sub-boundaries as shown in Fig. 16, and we detail in the following the conditions applied to each of these boundaries:

on the sphere (boundary 1) inside the domain, we impose a slipping wall boundary condition,
in front of the sphere (boundary 2, the half circle on the left of the domain), we impose Dirichlet boundary conditions with $(\rho ,u,v,p)=(1.4,\;3,\;0,\;1)$,
on the upper and lower horizontal lines (boundary 3), we impose a slipping wall boundary condition,
behind the sphere (boundary 4, the vertical line at the right of the domain), we use a Steger-Warming exit boundary condition [5] with $(\rho ,u,v,p)=(1,\;0.8,\;0,\;0.3)$.

As initial conditions we set a discontinuity line at $x=0.435$, with $(\rho ,u,v,p)=(1.4,\;0,\;0,\;1)$ on the left of the discontinuity and $(\rho ,u,v,p)=(1.4,\;3,\;0,\;1.4)$ on the right.

We use the same method as before with the shock detector (59) and obtain the mesh shown in Fig. 17. The threshold is set at ${\varphi }=3$ and the switch is set at 300 iterations. Again, we notice in Fig. 18 a jump in the residual due to the change from a ${\mathbb {P}}^{1}$-only scheme to a mixed ${\mathbb {P}}^{1}$-${\mathbb {P}}^{2}$ scheme after the activation of the shock detector.

The isolines of the Mach number, the pressure and the density are shown in respectively Figs. 19, 20, and 21. As of today, we have not been able to make the classical ${\mathbb {P}}^{2}$ scheme converge for this test case. With p-adaptation we obtain a high-order solution that physically agrees with the solution obtained with a classical ${\mathbb {P}}^{1}$ scheme.

4.4 Hypersonic Three Dimensional Flow

We present now a numerical test case with a hypersonic speed (Mach=8) in dimension three. Like in dimension two, the application of p-adaptation to a little number of elements in the shock zone (the method remaining mostly ${\mathbb {P}}^{2}$ based), proves practically to be efficient as it allows the convergence of the residual and gives a solution that looks physically admissible.

The following parameters are set: a sphere of diameter two is centered in 0 and is moving at the speed of Mach=8. The boundary conditions are divided into four sub-boundaries, as shown in Fig. 22. On the sphere inside the domain (boundary 1), we impose a slipping wall boundary condition, in the left face of the domain (boundary 2), we impose a Steger-Warming entry boundary condition, with $(\rho ,u,v,w,p)=(8.0\;,\,8.25\;,\,0.0\;,\,0.0\;,\,116.5)$, in the right face of the domain (boundary 3), we impose a Steger-Warming exit boundary condition, with $(\rho ,u,v,w,p)=(1.4\;,\,0.0\;,\,0.0\;,\,0.0\;,\,1.0)$, and on the other faces of the boundary (boundary 4), we impose a slipping wall boundary condition. As an initial condition we set a vertical plan of discontinuity at $x=0.09$, with $(\rho ,u,v,w,p)=(8.0\;,\,8.25\;,\,0.0\;,\,0.0\;,\,116.5)$ at the left of the discontinuity, and $(\rho ,u,v,w,p)=(1.4\;,\,0.0\;,\,0.0\;,\,0.0\;,\,1.0)$ at the right of the discontinuity.

As shown in Fig. 23, the residual converges well, even with only a very small number of ${\mathbb {P}}^{1}$ elements in the shock zone (see Fig. 24). Like in test case 4.3, with here a threshold ${\varphi }=3$, we start with subdivided ${\mathbb {P}}^{1}$ elements everywhere. After 300 iterations, only elements in the shock zone remain ${\mathbb {P}}^{1}$, all the others become ${\mathbb {P}}^{2}$ elements. The converged solution (Mach number, pressure and density), is shown in respectively Figs. 25, 26, and 27.

5 Conclusion

We have described a way to use p-adaptation with continuous finite elements within the frame of residual distribution schemes. We have showed with general formulas that the method can be theoretically extended to arbitrary polynomial orders, and we have shown for complex problems modeled by the Euler equations that in practice, in the case of quadratic approximation in dimension two and three, the method is robust for subsonic, transonic, supersonic and hypersonic flows. Other applications can be envisaged and in particular, the extension of our method to the equations of Navier–Stokes coupled with the Penalization method will be the subject of a forthcoming paper.

References

Abgrall, R.: Toward the ultimate conservative scheme: following the quest. J. Comput. Phys. 167(2), 277–315 (2001)
Article MathSciNet MATH Google Scholar
Abgrall, R.: Essentially non-oscillatory residual distribution schemes for hyperbolic problems. J. Comput. Phys. 214(2), 773–808 (2006)
Article MathSciNet MATH Google Scholar
Abgrall, R.: Two remarks on the approximation of hyperbolic problems by finite element methods. In: Jang S.Y., Kim Y.R., Lee D.-W., and Yie I. (eds.) Proceeding of the International Congress of Mathematicians, vol. IV, pp. 699–726. KM Kyong Moon SA (2014)
Abgrall, R., Barth, T.: Residual Distribution schemes for conservation laws via adaptive quadrature. SIAM J. Sci. Comput. 24(3), 732–769 (2003)
Article MathSciNet MATH Google Scholar
Abgrall, R., Larat, A., Ricchiuto, M.: Construction of very high order Residual Distribution schemes for steady inviscid flow problems on hybrid unstructured meshes. J. Comput. Phys. 230(11), 4103–4136 (2011)
Article MathSciNet MATH Google Scholar
Abgrall, R., Roe, P.L.: High order fluctuation schemes on triangular meshes. J. Sci. Comput. 19(1–3), 3–36 (2003)
Article MathSciNet MATH Google Scholar
Abgrall, R., Trefilik, J.: An example of high order residual distribution scheme using non-lagrange elements. J. Sci. Comput. 45(1–3), 3–25 (2010)
Article MathSciNet MATH Google Scholar
Ahrabi, B.R., Anderson, W.K., Newman, J.C.: High-order finite-element method and dynamic adaptation for two-dimensional laminar and turbulent Navier–Stokes. In: AIAA paper 2014-2983. 32nd AIAA Applied Aerodynamics Conference (2014)
Babuška, I., Guo, B.Q.: The $h, p$ and $h-p$ version of the finite element method; basis theory and applications. Adv. Eng. Softw. 15(3–4), 159–174 (1992)
Article MATH Google Scholar
Cangiani, A., Chapman, J., Georgoulis, E.H., Jensen, M.: Implementation of the continuous-discontinuous Galerkin finite element method. In: Numerical mathematics and advanced applications 2011, Proceedings of ENUMATH 2011, the 9th European conference on numerical mathematics and advanced applications, Leicester, UK, September 5–9, 2011. pp. 315–322. Springer, Berlin (2013)
Cheng, J., Shu, C.W.: High order schemes for cfd: a review. Chin. J. Comput. Phys. 5, 002 (2009)
Google Scholar
Deconinck, H., Paillere, H., Struijs, R., Roe, P.L.: Multidimensional upwind schemes based on fluctuation-splitting for systems of conservation laws. Comput. Mech. 11(5–6), 323–340 (1993)
Article MATH Google Scholar
Deconinck, H., Struijs, R., Bourgois, G., Roe, P.L.: Compact advection schemes on unstructured grids. In: Computational Fluid Dynamics, VKI LS 1993-04. SEE N94-18557 04-34 (1993)
Demkowicz, L.: Computing with hp-ADAPTIVE FINITE ELEMENTS: Volume 1 One and Two Dimensional Elliptic and Maxwell Problems. CRC Press, Boca Raton (2006)
Book Google Scholar
Di Pietro, D.A., Ern, A.: Mathematical Aspects of Discontinuous Galerkin Methods, vol. 69. Springer Science & Business Media, New York (2011)
MATH Google Scholar
Ekaterinaris, J.A.: High-order accurate, low numerical diffusion methods for aerodynamics. Prog. Aerosp. Sci. 41(3), 192–300 (2005)
Article Google Scholar
Hartmann, R.: Discontinuous Galerkin methods for compressible flows: higher order accuracy, error estimation and adaptivity. Lecture Series—Von Karman Institute for Fluid Dynamics 1, 5 (2006)
Karniadakis, G.E., Shu, C.W., Cockburn, B.: Discontinuous Galerkin Methods: Theory, Computation and Applications. Springer, New York (2000)
MATH Google Scholar
Mitchell, W.F., McClain, M.A.: A comparison of hp-adaptive strategies for elliptic partial differential equations. ACM Trans. Math. Softw. (TOMS) 41(1), 2 (2014)
Article MathSciNet MATH Google Scholar
Moxey, D., Green, M.D., Sherwin, S.J., Peiró, J.: An isoparametric approach to high-order curvilinear boundary-layer meshing. Comput. Methods Appl. Mech. Eng. 283, 636–650 (2015)
Article MathSciNet Google Scholar
Ni, Ron-Ho: A multiple-grid scheme for solving the euler equations. AIAA J. 20(11), 1565–1571 (1982)
Article MATH Google Scholar
Ricchiuto, M.: Contributions to the development of residual discretizations for hyperbolic conservation laws with application to shallow water flows. Université Sciences et Technologies - Bordeaux I, Habilitation à diriger des recherches (2011)
Shahbazi, K., Fischer, P.F., Ethier, C.R.: A high-order discontinuous Galerkin method for the unsteady incompressible Navier–Stokes equations. J. Comput. Phys. 222(1), 391–407 (2007)
Article MathSciNet MATH Google Scholar
Solin, P., Segeth, K., Dolezel, I.: Higher-Order Finite Element Methods. CRC Press, Boca Raton (2003)
Google Scholar
Toro., E.F.: Riemann Solvers and Numerical Methods for Fluid Dynamics. A Practical Introduction. 2nd ed., 2nd edn. Springer, Berlin (1999)
Book MATH Google Scholar
Wang, Z.J.: High-order methods for the Euler and Navier–Stokes equations on unstructured grids. Prog. Aerosp. Sci. 43(1), 1–41 (2007)
Article MathSciNet Google Scholar
Wang, Z.J.: High-order computational fluid dynamics tools for aircraft design. Phil. Trans. R. Soc. A. 372, 20130318 (2014). doi:10.1098/rsta.2013.0318
Wang, Z.J., Fidkowski, K., Abgrall, R., Bassi, F., Caraeni, D., Cary, A., Deconinck, H., Hartmann, R., Hillewaert, K., Huynh, H.T., et al.: High-order cfd methods: current status and perspective. Int. J. Numer. Methods Fluids 72(8), 811–845 (2013)
Article MathSciNet Google Scholar

Download references

Acknowledgements

Q. V. has been funded by the French State via the French National Research Agency (ANR) in the frame of the “Investments for the future” Programme IdEx Bordeaux - CPU (ANR-10-IDEX-03-02). All the numerical experiments presented in this paper were carried out using the cluster Avakas and the PLAFRIM experimental testbed. PLAFRIM is developed under the Inria PlaFRIM development action with support from LABRI, IMB and other entities: Conseil Régional d’Aquitaine, FeDER, Université de Bordeaux and CNRS. R. A. has been partialy funded with the SNF Grant # 200021_153604.

Author information

Authors and Affiliations

Institüt für Mathematik, Universität Zürich, Winterthurerstrasse 190, 8057, Zurich, Switzerland
R. Abgrall
Institut de Mathématiques de Bordeaux, 33 405, Talence, France
Q. Viville, H. Beaugendre & C. Dobrzynski
Inria Bordeaux Sud Ouest, 33 405, Talence, France
Q. Viville, H. Beaugendre & C. Dobrzynski
Institut Polytechnique de Bordeaux, 33 405, Talence, France
Q. Viville, H. Beaugendre & C. Dobrzynski

Authors

R. Abgrall
View author publications
You can also search for this author in PubMed Google Scholar
Q. Viville
View author publications
You can also search for this author in PubMed Google Scholar
H. Beaugendre
View author publications
You can also search for this author in PubMed Google Scholar
C. Dobrzynski
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to R. Abgrall.

Appendices

Appendix 1: Implicit Numerical Solver

We need to solve the system of equations (20), written in a compact way as:

$$\begin{aligned} R({\mathbf {u}}_{h})=0. \end{aligned}$$

(60)

This problem is first relaxed as:

$$\begin{aligned} \frac{d{\mathbf {u}}_{h}}{dt}=-R({\mathbf {u}}_{h}). \end{aligned}$$

(61)

To approximate this time derivative, we use the backward Euler formula, and the problem becomes:

$$\begin{aligned} \frac{{\mathbf {u}}^{n+1}_{h}-{\mathbf {u}}^{n}_{h}\mathbf {{}}_{}}{\Delta t^{n}}=-R({\mathbf {u}}^{n+1}_{h}), \ n=0,1,2,\ldots \end{aligned}$$

(62)

Thus, we have to solve a non linear problem at each time step $n$. We use for that a Newton method, that when applied to (60) reads:

$$\begin{aligned} {\mathbf {u}}^{k+1}_{h}={\mathbf {u}}^{k}_{h} -J^{-1}R({\mathbf {u}}^{k}_{h}),\;\;k=0,1,2, \end{aligned}$$

(63)

where

$$\begin{aligned} J=\frac{\partial R({\mathbf {u}}^{k}_{h})}{\partial {\mathbf {u}}} \end{aligned}$$

(64)

is the Jacobian of R. In practice, the Jacobian J is computed from a first order scheme with the stabilization term. And so, with our Newton method applied to (62) , the linear system we have to solve at each time step $n$ reads:

$$\begin{aligned}&[\frac{1}{\Delta t^{n}}+J({\mathbf {u}}^{n}_{h_{k}})]\Delta {\mathbf {u}}^{n}_{h_{k}}=-R({\mathbf {u}}^{n}_{h_{k}}) \end{aligned}$$

(65)

$$\begin{aligned}&{\mathbf {u}}^{n}_{h_{k+1}}={\mathbf {u}}^{n}_{h_{k}}+\Delta {\mathbf {u}}^{n}_{h_{k}},\;\;k=0,1,2\ldots \end{aligned}$$

(66)

In practice, for each time step $n$, we use only one iteration on $k$, which seems enough to reach convergence of the nodal residuals to the zero machine.

Appendix 2: Comparison with Constrained Approximation for the Finite Element Method

In [8] the authors present a mesh adaptation method called “constrained approximation“ in the frame of a Petrov–Galerkin finite-element method. The idea behind the method is to use a non regular mesh (non conforming and of various order) and to constrain the value of the discrete solution at the interface shared by elements of different size (h-adaptation) and different order (p-adaptation) to keep the solution continuous across the interface.

In Fig. 28 inspired from [8], the authors give an example of h-refinement and p-refinement. In the h-refinement case, the two triangles share a common edge with two different sizes, which generates a hanging node in the middle of the edge. In the p-refinement case, the triangles are of two different polynomial orders (${\mathbb {P}}1$ and ${\mathbb {P}}2$) and so a hanging node is generated in the middle of the edge. In both cases, the solution is constrained at the hanging node so that it remains continuous.

The authors use a hierarchical basis of shape functions [24]. For example, in dimension two this basis is constituted of functions associated with the vertices, the edges and the element and are called accordingly vertex, edge and bubble functions. They are built incrementally with respect to the desired order of the approximation, by using kernel functions in the definition of the edge and bubble functions. The main advantages of this basis over the classical Lagrange basis functions, is that the polynomial order of the approximation can be changed easily (by adding or removing kernel functions) and in the case of p-adaptation, like in Fig. 28, the hanging node is naturally suppressed, by changing to one the polynomial order of the shape function associated with the edge at the interface in the ${\mathbb {P}}_2$ element.

Once the constrains are established, the next step is to apply these constrains. The authors present a complete set of rules to impose the continuity of the solution in the case of h-adaptation and p-adaptation. The authors suggest to follow a rule called ”1-irregularity“ rule (only one hanging node between two elements) in order to avoid too complex situations, at least for h-adaptation.

We believe that this method is very different from the method we have proposed in this paper. The first difference is that the generation of hanging nodes makes the mesh non conforming. In a RD scheme, the question of how this hanging node should be treated does not seem evident. The distribution schemes (like for example (21) and (35)) would probably have to be redefined to take into account the hanging nodes, so that the scheme remains conservative (which is an essential ingredient for the convergence) and still exhibit the same behavior (for example (non) diffusive or (non) oscillatory). Even if it is still possible to use a classical Lagrange basis, it is advised in [8] to use a hierarchical basis of functions, as such a basis is more suited for p-adaptation. So, the compatibility of a hierarchical basis function with RD schemes should be analyzed. Finally, there is the question as to whether the hanging nodes should be taken into account in the system of equations (20), and for the implementation of an implicit RD scheme, what the consequences would be for the computation of the Jacobian matrix. We are not saying that the constrained approximation method is not compatible with RD schemes (indeed, we have obtained some preliminary numerical results with a prototype of a RD scheme based upon the constrained approximation method, in the case of scalar equations with Lagrange elements and an explicit scheme). But the questions we have raised should first be carefully assessed before trying to make an RD scheme compatible with the adaptive method of [8]. To the contrary, the p-adaptive method for RD schemes we have proposed here is designed for RD schemes by construction and as such avoids all the problems evoked above.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Abgrall, R., Viville, Q., Beaugendre, H. et al. Construction of a p-Adaptive Continuous Residual Distribution Scheme. J Sci Comput 72, 1232–1268 (2017). https://doi.org/10.1007/s10915-017-0399-6

Download citation

Received: 06 November 2015
Revised: 12 February 2017
Accepted: 21 February 2017
Published: 13 March 2017
Issue Date: September 2017
DOI: https://doi.org/10.1007/s10915-017-0399-6

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Construction of a p-Adaptive Continuous Residual Distribution Scheme

Abstract

Similar content being viewed by others

Error Estimates and Adaptivity of the Space-Time Discontinuous Galerkin Method for Solving the Richards Equation

An Introduction to Hybrid High-Order Methods

An hp finite element method for 4th order singularly perturbed problems

1 Introduction

2 Mathematical Problem and Residual Distribution Schemes

2.1 Basic Notions of Residual Distribution Schemes

2.2 Some Particular Residual Distribution Schemes

2.2.1 The Lax–Wendroff Scheme

2.2.2 The Rusanov Scheme

2.3 Construction of a High Order Monotonicity Preserving Residual Distribution Scheme

2.3.1 Limitation

2.3.2 Stabilization

2.4 Boundary Conditions

2.5 A Lax–Wendroff Like Theorem and Its Consequences

Theorem 2.1

3 Residual Distribution Schemes and p-Adaptation

3.1 Quadratic Interpolation on Triangular Elements

3.2 Quadratic Approximation on Tetrahedral Elements

3.3 General Subdivision Formulas for all Orders of Approximation

3.3.1 The Two Dimensional Case

Proposition 3.1

Proof

3.3.2 The Three Dimensional Case

Proposition 3.2

Proof

3.4 Definition of the Nodal Residuals in a Divided Element

Proposition 3.3

Proof

3.5 Practical Implementation

3.5.1 Computation of the Nodal Residuals

3.5.2 Computation of the Jacobian Matrix of the Implicit Scheme

3.5.3 Boundary Conditions

3.5.4 Choice of Quadrature Formulas

4 Numerical Results

4.1 Subsonic Flow

4.2 Transonic Flow

4.3 Supersonic Flow

4.4 Hypersonic Three Dimensional Flow

5 Conclusion

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Appendices

Appendix 1: Implicit Numerical Solver

Appendix 2: Comparison with Constrained Approximation for the Finite Element Method

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation

An hp finite element method for 4^th order singularly perturbed problems