1 Introduction

Texts on numerical methods abound with formulas for numerical integration, sometimes called quadrature or mechanical quadrature [1, 2]. This abundance is not surprising, since there are so many possibilities for selecting the base-point spacing, the degree of the interpolating polynomial, and the location of the base points with respect to the interval of integration. The function f(x) to be integrated may be a known function or a set of discrete data. Many known functions, however, do not have an exact integral, and an approximate numerical procedure is required to compute it. In other cases, f(x) is known only as a set of discrete points, and again an approximate numerical procedure is required [1, 2]. Numerical integration formulas can be developed by fitting an approximating function to the discrete data and integrating that approximating function:

$$I = \int\limits_{{x_{1} }}^{{x_{n} }} {f(x)dx \cong \int\limits_{{x_{1} }}^{{x_{1} + (n - 1)h}} {P(x)dx} }$$
(1)

When the function to be integrated has known values at equally spaced points (Δx = h = constant) and n is the number of points, with x ranging as x1, x2 = x1 + h, x3 = x1 + 2h, …, xn−1 = x1 + (n − 2)h, xn = x1 + (n − 1)h, a polynomial P(x) can be fitted to the discrete data [2, 3]. The resulting formulas, which employ functional values at equally spaced base points, are called Newton–Cotes formulas.

The distance between the lower and upper limits of an integral is called the range of integration, and the distance between any two adjacent data points is called an increment or step (Δx = h) [1, 3,4,5].
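For concreteness, the following minimal sketch (Python; the sample data and step size are illustrative choices, not taken from the paper) evaluates the composite trapezoid rule, the simplest Newton–Cotes formula, on equally spaced samples of the kind assumed in Eq. (1).

```python
import numpy as np

def trapezoid(y, h):
    """Composite trapezoid rule: h*(y_1/2 + y_2 + ... + y_{n-1} + y_n/2)."""
    y = np.asarray(y, dtype=float)
    return h * (0.5 * y[0] + y[1:-1].sum() + 0.5 * y[-1])

x = np.linspace(0.0, 1.0, 11)            # n = 11 equally spaced points, h = 0.1
print(trapezoid(x**2, x[1] - x[0]))      # ~0.3350; exact integral of x^2 on [0, 1] is 1/3
```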

2 State of the Art on Numerical Quadrature

There is a large literature on numerical integration, also called quadrature. Of special importance are the midpoint rule and Simpson’s rule: they are simple to use and bring enormous improvements for smooth functions in low dimensions [6,7,8]. The advantage of classical quadrature methods, however, decays rapidly with increasing dimension. This phenomenon is a manifestation of Bellman’s ‘curse of dimensionality’, made precise, in both its deterministic and Monte Carlo versions, in two classic theorems of Bakhvalov.

The trapezoid rule is based on a piecewise linear approximation: it integrates exactly any function f that is linear on each segment [xi−1, xi], using two evaluation points at the ends of each segment [9,10,11]. The midpoint rule also integrates such a function exactly, using just one point in the middle of each segment; it benefits from an error cancellation. This kind of cancellation plays a big role in the development of classical quadrature methods.

The midpoint rule has a big practical advantage over the trapezoid rule: it does not evaluate f at either endpoint a or b. Many of the integrals to which we apply Monte Carlo methods diverge to infinity at one or both endpoints. In such cases, the midpoint rule avoids the singularity. There are numerous mathematical techniques for removing singularities [12,13,14]. When we have no such analysis of our integrand, perhaps because it has a complicated problem-dependent formulation, or because we have hundreds of integrands to consider simultaneously, then avoiding the singularity is attractive. By contrast, the trapezoid rule does not avoid the endpoints x = a and x = b. For such methods a second, less attractive principle is to ignore the singularity, perhaps by using f(xi) = 0 at any sample point xi where f is singular.
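As an illustration of this point, a minimal compound-midpoint sketch follows; the integrand \(1/\sqrt{x}\), which is singular at the lower endpoint, and the panel count are my own choices, not from the text.

```python
import numpy as np

def midpoint(f, a, b, m):
    """Compound midpoint rule with m panels on [a, b]."""
    h = (b - a) / m
    x = a + h * (np.arange(m) + 0.5)     # panel midpoints; excludes a and b
    return h * f(x).sum()

# f diverges at x = 0, but the rule never evaluates it there.
print(midpoint(lambda x: 1.0 / np.sqrt(x), 0.0, 1.0, 10_000))   # ~1.994, exact value 2
```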

The midpoint and trapezoid rules are based on correctly integrating piecewise constant and linear approximations to the integrand. That idea extends naturally to methods that locally integrate higher order polynomials [15,16,17]. The result is much more accurate integration, at least when the integrand is smooth. The idea behind Simpson’s rule generalizes easily to higher orders. We split the interval [a, b] into panels, find a rule that integrates a polynomial correctly within a panel, and then apply it within every panel to get a compound rule.
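A minimal sketch of such a compound rule, applying Simpson’s rule within each panel (the test integrand is an illustrative choice):

```python
import numpy as np

def simpson(y, h):
    """Compound Simpson rule: quadratics are integrated exactly in each panel."""
    y = np.asarray(y, dtype=float)
    if len(y) % 2 == 0:
        raise ValueError("compound Simpson needs an odd number of points")
    return (h / 3) * (y[0] + y[-1] + 4 * y[1:-1:2].sum() + 2 * y[2:-2:2].sum())

x = np.linspace(0.0, np.pi, 21)
print(simpson(np.sin(x), x[1] - x[0]))   # ~2.0000, exact value 2; error O(h^4)
```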

There are two main varieties of compound quadrature rule. For open rules we do not evaluate f at the end-points of the panel. The midpoint rule is open. For closed rules we do evaluate f at the end-points of the panel [18, 19]. The trapezoid rule and Simpson’s rule are both closed. Closed rules have the advantage that some function evaluations get reused when we increase n. Open rules have a perhaps greater advantage that they avoid the ends of the interval where singularities often appear.

The trapezoid rule and Simpson’s rule use n = 2 and n = 3 points respectively within each panel. In general, one can use n points to integrate polynomials of degree n − 1, yielding the Newton–Cotes formulas, of which the trapezoid rule and Simpson’s rule are special cases [12, 20, 21]. The Newton–Cotes rule for n = 4 is another of Simpson’s rules, called Simpson’s 3/8 rule. Newton–Cotes rules with an odd number of points have the advantage that, by symmetry, they also integrate polynomials of degree n exactly, as we saw already in the case of Simpson’s rule.

High order rules should be used with caution [22,23,24]. They exploit high order smoothness in the integrand, but can give poor outcomes when the integrand is not as smooth as they require. In particular if a genuinely smooth quantity has some mild nonsmoothness in its numerical implementation f, then high order integration rules can behave very badly, magnifying this numerical noise.

As a further caution, note that taking f fixed and letting the order n in a Newton–Cotes formula increase does not always converge to the right answer, even for f with infinitely many derivatives. Lower-order rules applied in panels are more robust [23,24,25]. The Newton–Cotes rules can be made into compound rules in the same way that Simpson’s rule was compounded. When the basic method integrates polynomials of degree r exactly within panels, the compound method has error \(O(m^{-r})\), assuming that \(f^{(r)}\) is continuous on [a, b].

The rules considered above evaluate f at equispaced points. The basic panel for a Gauss rule is conventionally [− 1, 1], or sometimes ℝ, and not [0, h] as we used for the Newton–Cotes rules. Also, the target integration problem is generally weighted. The widely used weight functions are multiples of standard probability density functions, such as the uniform, gamma, Gaussian and beta distributions [12, 24, 26]. The idea is that having f be nearly a polynomial can be much more appropriate than requiring the whole integrand f(x)w(x) to be nearly a polynomial. Choosing the weights \(w_i\) and nodes \(x_i\) together yields 2n free parameters, making it possible to integrate polynomials of degree up to 2n − 1 without error.

Unlike Newton–Cotes rules, Gauss rules of high order have non-negative weights, so we could in principle use a very large n. For the uniform weighting w(x) = 1, though, we could also break the region into panels; then for m function evaluations the error will be \(O(m^{-2n})\), assuming as usual that \(f^{(2n)}\) is continuous on [a, b]. Gauss rules for uniform weights on [− 1, 1] have the advantage that they can be used within panels.
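The sketch below uses NumPy’s Gauss–Legendre routine to illustrate the degree 2n − 1 exactness property; the specific n and test functions are arbitrary choices.

```python
import numpy as np

n = 5
x, w = np.polynomial.legendre.leggauss(n)   # nodes x_i and weights w_i on [-1, 1]

print(np.dot(w, x**8))                      # 0.2222... = 2/9, exact (degree 8 < 2n)
print(np.dot(w, np.exp(x)))                 # ~2.3504 = e - 1/e, very accurate for smooth f
```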

Quadrature rules offer an elegant and efficient way to numerically evaluate integrals of functions from a linear space under consideration [24,25,26,27]. These rules typically require function evaluation at specific points, called nodes, and these values are multiplied by constants, called weights, to give the final value of the integral as a weighted sum.

There is an extensive variety of quadrature rules, depending on the dimension (f univariate, bivariate, multivariate), the domain shape (disc, hypercube, simplex), and the type of linear space (polynomials, splines, rational functions, smooth non-polynomial functions) [12, 13, 28, 29]. For polynomial multivariate integration, the field is well studied. In the univariate case, much research has been devoted to piecewise polynomials, to address integration over the spline spaces arising in isogeometric analysis. A so-called half-point rule for splines, which needs the minimum number of quadrature points, has been introduced. However, this rule is in general exact only over the whole real line (an infinite domain).

For finite domains, one may introduce additional quadrature points, which make the rule non-Gaussian (slightly suboptimal in the number of quadrature points); more importantly, this yields quadrature weights that can be negative, unlike in Gaussian quadratures. When computing Galerkin approximations, assembling the mass and stiffness matrices is the bottleneck of the whole computation, and efficient quadrature rules for specific spline spaces are needed to evaluate the matrix entries efficiently [12, 13, 26, 29,30,31].

In the multivariate case, where spline spaces possess a tensor-product structure, univariate quadrature rules are typically used in each direction, resulting in tensor-product rules. Recently, the paradigm of mass and stiffness matrix computation has changed from the traditional element-wise assembly to a row-wise concept [32,33,34]. When building the mass matrix, one B-spline basis function of the scalar product involved is treated as a positive measure (i.e., a weight function), and a weighted quadrature with respect to that weight is computed for each matrix row. Such an approach brings significant computational savings because the number of quadrature points in each parameter dimension is independent of the polynomial degree and requires asymptotically (for a large number of elements) only two points per element. For unstructured multivariate cases, however, such as triangular meshes in 2D, constructing efficient quadrature rules from tensor-product counterparts is unnatural, resulting in rules that are often not symmetric even though they act on a symmetric domain [35,36,37].

Classical quadrature methods are very well tuned to one-dimensional problems with smooth integrands. A natural way to extend them to multi-dimensional problems is to write these as iterated one-dimensional integrals, via Fubini’s theorem [12, 13, 38, 39]. When we estimate each of those one-dimensional integrals by a quadrature rule, we end up with a set of sample points on a multi-dimensional grid. Unfortunately, there is a curse of dimensionality that severely limits the accuracy of this approach. The curse is not confined to sampling on grids formed as products of one-dimensional rules: any quadrature rule in high dimensions will suffer from the same problem. Two important theorems of Bakhvalov make the point.

Bakhvalov’s theorem makes high-dimensional quadrature seem intractable: there is no way to beat the rate \(O(n^{-r/d})\), no matter where you put your sampling points \(x_i\) or how cleverly you weight them. At first, this result looks surprising, because we have been using Monte Carlo methods, which get a root mean square error of \(O(n^{-1/2})\) in any dimension. The explanation is that in Monte Carlo sampling we pick one single function f(·) with finite variance \(\sigma^{2}\) and then, sampling n uniform random points, get a root mean square error of \(\sigma n^{-1/2}\) for the estimate of that function’s integral. Bakhvalov’s theorem works in the opposite order [40]: we pick our points x1, …, xn and their weights wi, and then Bakhvalov finds a function f with r derivatives on which our rule makes a large error. When we take a Monte Carlo sample, there is always some smooth function for which we would have got a very bad answer. Such worst-case analysis is very pessimistic, because the worst-case functions could behave very oddly right near our sampled points x1, …, xn, and might look nothing like the functions we are trying to integrate. We can hybridize quadrature and Monte Carlo methods by using each of them on some of the variables. Hybrids of Monte Carlo and quasi-Monte Carlo methods are often used [5,6,7, 41,42,43].
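A small simulation, with an arbitrarily chosen integrand, illustrating the \(\sigma n^{-1/2}\) root mean square error quoted above:

```python
import numpy as np

rng = np.random.default_rng(0)
f = lambda x: x**2                       # fixed integrand; exact integral on [0, 1] is 1/3

for n in (10**2, 10**4, 10**6):
    estimates = [f(rng.random(n)).mean() for _ in range(100)]
    rmse = np.sqrt(np.mean((np.array(estimates) - 1/3) ** 2))
    print(n, rmse)                       # shrinks roughly like n**-0.5
```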

The Laplace approximation is a classical device for approximate integration. It is very accurate when log(f(x)) is smooth and its quadratic approximation is good wherever f is not negligible. This often happens when x is a statistical parameter, f(x) is its posterior distribution, and the sample size is large enough for the central limit theorem to apply [20, 21, 44,45,46].

The Laplace approximation is now overshadowed by Markov Chain Monte Carlo. One reason is that the Laplace approximation is designed for unimodal functions. When the target distribution has two or more important modes, the space can perhaps be cut into pieces containing one mode each, with Laplace approximations applied separately and combined, but such a process can be cumbersome. Markov Chain Monte Carlo, by contrast, is designed to find and sample from multiple modes, although on some problems it will have difficulty doing so. The Laplace approximation also requires finding the optimum of a d-dimensional function and working with the Hessian at the mode. In some settings that optimization may be difficult, and when d is extremely large, finding the determinant of the Hessian can be a challenge. Finally, posterior distributions that are discrete, or are mixtures of continuous and discrete parts, can be handled by Markov Chain Monte Carlo but are not suitable for the Laplace approximation. Still, the Laplace approximation is not completely superseded by Markov Chain Monte Carlo. In particular, the fully exponential version is very accurate for problems with modest dimension d and large sample size n, and when the optimization problem is tractable it may provide a much more automatic and fast answer than Markov Chain Monte Carlo does [12, 17, 23, 31, 34, 47].

There is some mild controversy about the use of adaptive methods. There are theoretical results showing that adaptive methods cannot improve significantly over non-adaptive ones. There are also theoretical and empirical results showing that adaptive methods may do much better than non-adaptive ones. These results are not contradictory, because they make different assumptions about the problem. For a high-level survey of when adaptation helps, see [4, 47,48,49].

Sparse grids were originally developed for the quadrature of high-dimensional functions. The method is always based on a one-dimensional quadrature rule but performs a more sophisticated combination of the univariate results. However, whereas the tensor-product rule guarantees that the weights of all the cubature points will be positive if the weights of the quadrature points are positive, Smolyak’s rule does not guarantee that the weights will all be positive [39,40,41, 48].

Bayesian Quadrature is a statistical approach to the numerical problem of computing integrals and falls under the field of probabilistic numerics [42,43,44]. It can provide a full handling of the uncertainty over the solution of the integral expressed as a Gaussian Process posterior variance. It is also known to provide very fast convergence rates which can be up to exponential in the number of quadrature points n.

The problem of evaluating an integral can be reduced to an initial value problem for an ordinary differential equation by applying the fundamental theorem of calculus. For instance, the standard fourth-order Runge–Kutta method applied to the differential equation y′(x) = f(x) yields Simpson’s rule. In our view, however, numerical quadrature is too often studied as merely the numerical solution of a differential equation; numerical integration is a more general subject, distinct from the study of differential equations [50,51,52,53].
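This reduction can be checked directly: for y′(x) = f(x) the two middle Runge–Kutta stages coincide, and each step reproduces Simpson’s rule. A minimal sketch, with an illustrative test integral:

```python
import numpy as np

def rk4_integral(f, a, b, steps):
    """Classical RK4 applied to y'(x) = f(x), y(a) = 0; returns y(b)."""
    h = (b - a) / steps
    y, x = 0.0, a
    for _ in range(steps):
        k1 = f(x)
        k2 = f(x + h / 2)                 # equals k3, since f does not depend on y
        k4 = f(x + h)
        y += (h / 6) * (k1 + 4 * k2 + k4) # exactly Simpson's rule on [x, x + h]
        x += h
    return y

print(rk4_integral(np.sin, 0.0, np.pi, 10))   # ~2.0000, exact value 2
```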

3 Newton–Cotes Numerical Integration Formulas

The Newton–Cotes formulas are shown for comparison with the new formulas obtained using splines. The closed integration formulas use information about f(x) at both limits of integration, that is, they have base points there. The trapezoid rule for a single interval is obtained by fitting a first-degree polynomial to two discrete points [1, 2, 4, 26]. Simpson’s 1/3 rule is obtained by fitting a second-degree polynomial to three equally spaced discrete points. Simpson’s 3/8 rule is obtained by fitting a third-degree polynomial to four equally spaced discrete points. Boole’s rule is obtained by fitting a fourth-degree polynomial to five equally spaced discrete points. Table 1 shows the Newton–Cotes closed integration formulas.

Table 1 Newton–Cotes closed integration formulas

Generally, for the closed formulas, where n is the number of points, ci are the integer coefficients, num is the integer numerator multiplying the step h = Δx, and den is the integer denominator, the closed integration formula is:

$$I = \frac{num}{den}h(ci_{1} y_{1} + ci_{2} y_{2} + ci_{3} y_{3} + ci_{4} y_{4} + \cdots + ci_{n - 2} y_{n - 2} + ci_{n - 1} y_{n - 1} + ci_{n} y_{n} )$$
(2)
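A minimal sketch of Eq. (2) in code, using the standard closed Newton–Cotes coefficient sets (trapezoid, Simpson’s 1/3, Simpson’s 3/8, Boole); the helper name and test integrand are illustrative choices:

```python
import numpy as np

CLOSED_RULES = {   # n: (num, den, [ci_1, ..., ci_n]) as in Eq. (2)
    2: (1, 2, [1, 1]),                   # trapezoid
    3: (1, 3, [1, 4, 1]),                # Simpson's 1/3
    4: (3, 8, [1, 3, 3, 1]),             # Simpson's 3/8
    5: (2, 45, [7, 32, 12, 32, 7]),      # Boole
}

def closed_newton_cotes(y, h):
    num, den, ci = CLOSED_RULES[len(y)]
    return (num / den) * h * np.dot(ci, y)

x = np.linspace(0.0, 1.0, 5)             # 5 points -> Boole's rule
print(closed_newton_cotes(np.exp(x), x[1] - x[0]))   # ~1.718282 = e - 1
```

Open and semi-open formulas such as Eqs. (3), (6) and (9) can be evaluated the same way, with the coefficients of the omitted ordinates set to zero.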

In the open integration formulas, the first (y1) and last (yn) points do not appear. The open formulas therefore do not require information about f(x) at the limits of integration [1, 2, 4, 5, 27]. The midpoint rule for a double interval is obtained by fitting a zero-degree polynomial to three discrete points; the upper limit of integration is x3 = x1 + 2h. Table 2 shows the Newton–Cotes open integration formulas.

Table 2 Newton–Cotes open integration formulas

Generally, for the open formulas, where n is the number of points, ci are the integer coefficients, num is the integer numerator that multiplies step h = Δx and den is the integer denominator, the open integration formula is:

$$I = \frac{num}{den}h(ci_{2} y_{2} + ci_{3} y_{3} + ci_{4} y_{4} + \cdots + ci_{n - 2} y_{n - 2} + ci_{n - 1} y_{n - 1} )$$
(3)

In the semi-open integration formulas, the last point (yn) does not appear in the formula. In the semi-closed integration formulas, the first point (y1) does not appear in the formula [1, 2, 28,29,30]. The formula for a double interval is obtained by fitting a zero-degree polynomial to three discrete points. When n is odd, the semi-closed and semi-open integration formulas coincide with the open integration formulas. The upper limit of integration is x3 = x1 + 2h, and the integral I has the following formula:

$$I = 2h(y_{2}) \quad \text{or} \quad I = 2hy_{2} \qquad \text{Error} \approx O(h^{2})$$
(4)

For three intervals, the semi-open formula is obtained by fitting a first-degree polynomial to four discrete points. The upper limit of the integral is x4 = x1 + 3h, and the integral I has the following formula:

$$I = \frac{3h}{4}(y_{1} + 3y_{3}) \quad \text{or} \quad I = 0.75hy_{1} + 2.25hy_{3} \qquad \text{Error} \approx O(h^{3})$$
(5)

Table 3 shows Newton–Cotes semi-open integration formulas [1,2,3]. Generally, for semi-open formulas, where n is the number of points, the semi-open integration formula is:

$$I = \frac{num}{den}h(ci_{1} y_{1} + ci_{2} y_{2} + ci_{3} y_{3} + ci_{4} y_{4} + \cdots + ci_{n - 2} y_{n - 2} + ci_{n - 1} y_{n - 1} )$$
(6)
Table 3 Newton–Cotes semi-open integration formulas

The semi-closed integration formulas have the same coefficients as the semi-open formulas, with only the indices of the ordinates changed. For example, in the case of n = 3:

$$I = 2h(y_{2}) \quad \text{or} \quad I = 2hy_{2} \qquad \text{Error} \approx O(h^{2})$$
(7)

For n = 4:

$$I = \frac{3h}{4}(y_{2} + 3y_{4}) \quad \text{or} \quad I = 0.75hy_{2} + 2.25hy_{4} \qquad \text{Error} \approx O(h^{3})$$
(8)

Table 4 shows Newton–Cotes semi-closed integration formulas. Generally, for semi-closed formulas, where n is the number of points, ci are integer coefficients, the semi-closed integration formula is:

$$I = \frac{num}{den}h(ci_{2} y_{2} + ci_{3} y_{3} + ci_{4} y_{4} + \cdots + ci_{n - 2} y_{n - 2} + ci_{n - 1} y_{n - 1} + ci_{n} y_{n} )$$
(9)
Table 4 Newton–Cotes semi-closed integration formulas

The semi-open or semi-closed formulas can be used on a type of improper integral, namely one with a lower limit of − ∞ or an upper limit of + ∞. Such integrals can usually be evaluated by a change of variable that transforms the infinite limit into a finite one [1,2,3, 29]. The following identity serves this purpose and works for any function that decreases toward zero at least as fast as \(1/x^{2}\) as x approaches infinity:

$$\int\limits_{a}^{b} {f(x)dx = \int\limits_{1/b}^{1/a} {\frac{1}{{w^{2} }}f\left( {\frac{1}{w}} \right)dw} }$$
(10)

where ab > 0. Therefore, it can be used only when a is positive and b is ∞, or when a is − ∞ and b is negative. For cases where the limits run from − ∞ to ∞, the integral can be split into three parts [1,2,3,4]. For example:

$$\int\limits_{ - \infty }^{\infty } {f(x)dx = } \int\limits_{ - \infty }^{ - A} {f(x)dx} + \int\limits_{ - A}^{A} {f(x)dx} + \int\limits_{A}^{\infty } {f(x)dx}$$
(11)

where A is a positive number. One problem with using Eq. (10) to evaluate an integral is that the transformed function may be singular at one of the limits [1, 3, 30, 31]. The semi-open or semi-closed integration formulas circumvent this difficulty because they evaluate the integral without employing the data at the corresponding end point of the integration range.
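A minimal sketch combining Eq. (10) with an open (midpoint) rule, so the transformed integrand is never evaluated at the troublesome endpoint; the particular integral is an illustrative choice:

```python
import numpy as np

def midpoint(f, a, b, m):
    """Compound midpoint rule with m panels; never touches a or b."""
    h = (b - a) / m
    w = a + h * (np.arange(m) + 0.5)
    return h * f(w).sum()

# integral_1^inf exp(-x)/x dx  ->  substitute x = 1/w per Eq. (10):
# integral_0^1 exp(-1/w)/w dw, whose integrand cannot be evaluated at w = 0.
g = lambda w: np.exp(-1.0 / w) / w
print(midpoint(g, 0.0, 1.0, 1000))       # ~0.2194, the exponential integral E1(1)
```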

4 Spline Interpolation

In applying the Newton–Cotes method, an (n − 1)th-order polynomial is used to interpolate between n data points. This curve captures all the meandering suggested by the points. There are cases, however, where such a function leads to erroneous results because of round-off errors and overshooting. An alternative approach is to apply lower-order polynomials to subsets of the data points. These connecting polynomials are called spline functions [1, 2, 4, 31,32,33].

For example, third-order curves employed to connect each pair of data points are called cubic splines. These functions can be constructed so that the connections between adjacent cubic equations are visually smooth. On the surface, it would seem that the third-order approximation of the splines would be inferior to higher-order expressions, and you might wonder why a spline would ever be preferable. There are, however, situations in which a spline performs better than a higher-order polynomial: the case where a function is generally smooth but undergoes an abrupt change somewhere in the region of interest. A higher-order polynomial then tends to oscillate erratically in the vicinity of the abrupt change, whereas the spline, because it is limited to lower-order changes, keeps the oscillations to a minimum while still connecting the points. Thus the spline usually provides a superior approximation of the behavior of functions that have local, abrupt changes [1, 2, 4, 34,35,36,37].

The concept of the spline originated from the drafting technique of using a thin, flexible strip (called a spline) to draw smooth curves through a set of points. The process is depicted in Fig. 1 for a series of five pins (data points). In this technique, the draftsman places paper over a wooden board and hammers nails or pins into the paper (and board) at the location of the data points [38,39,40,41]. A smooth cubic curve results from interweaving the strip between the pins. Hence, the name “cubic spline” has been adopted for polynomials of this type [1, 3, 4, 12].

Fig. 1

The practical experimental technique of using a spline to draw smooth curves through a series of points. Notice how, at the end points, the spline straightens out. This is called a “natural” spline

Using this practical historical interpolation device, one could also calculate the area under the curve, by using the weight of the sand beneath the spline, as shown in Fig. 2.

Fig. 2

Primitive integration, using the weight of the sand to calculate the area under the curve generated by a spline interpolation

In spline interpolation, between every two points we have a polynomial of a certain degree [1, 2, 4, 41,42,43,44]. Therefore, the interpolation is made not by a single polynomial but by many polynomials. Below, an example with 5 points and 4 third-degree polynomials forming a cubic spline is given. We can see from the equations that there are 16 unknown coefficients (a0,…,a3, b0,…,b3, c0,…,c3, d0,…,d3).

$$f(x) = \begin{cases} P_{a}(x) = a_{0} + a_{1}x + a_{2}x^{2} + a_{3}x^{3}, & \text{if } x_{1} \le x \le x_{2} \\ P_{b}(x) = b_{0} + b_{1}x + b_{2}x^{2} + b_{3}x^{3}, & \text{if } x_{2} \le x \le x_{3} \\ P_{c}(x) = c_{0} + c_{1}x + c_{2}x^{2} + c_{3}x^{3}, & \text{if } x_{3} \le x \le x_{4} \\ P_{d}(x) = d_{0} + d_{1}x + d_{2}x^{2} + d_{3}x^{3}, & \text{if } x_{4} \le x \le x_{5} \end{cases}$$
(12)

As shown in Fig. 3, the objective in using cubic splines is to derive a third-order polynomial for each interval between the knots (between two data points). Thus, for n data points (i = 1, 2,…, n), there are (n − 2) interior points (excluding the first and last points), (n − 1) intervals, and (n − 1) third-order polynomials. Consequently, there are 4(n − 1) unknown constants to evaluate, and 4n − 4 conditions are required to evaluate them. These are as follows:

Fig. 3

Cubic spline with a third-order polynomial (g = 3) for each interval between knots, for 5 points (n = 5) and 4 polynomials (Pa, Pb, Pc and Pd)

1. The function values must be equal at the interior knots (2 conditions for each interior point = 2n − 4 conditions).
2. The first and last polynomials must pass through the end points (2 conditions).
3. The first derivatives at the interior knots must be equal (n − 2 conditions).
4. The second derivatives at the interior knots must be equal (n − 2 conditions).
5. Two derivatives at the first or last knots are zero (2 conditions), chosen from the first to third derivatives of the first and last polynomials: \(P_{1}^{\prime}(x_{1}) = 0\), \(P_{1}^{\prime\prime}(x_{1}) = 0\), \(P_{1}^{\prime\prime\prime}(x_{1}) = 0\), \(P_{n-1}^{\prime}(x_{n}) = 0\), \(P_{n-1}^{\prime\prime}(x_{n}) = 0\) and \(P_{n-1}^{\prime\prime\prime}(x_{n}) = 0\).

Therefore, (2n − 4) + 2 + (n − 2) + (n − 2) + 2 = 4n − 4 conditions, which equals the number of unknown polynomial coefficients.

The visual interpretation of condition 5 is that the function becomes a straight line at the end knots [1, 4, 43,44,45,46,47]. Specifying such an end condition leads to what is termed a “natural” spline, so named because the drafting spline naturally behaves in this fashion. If the value of the second derivative at an end knot is nonzero (that is, there is some curvature), this information can alternatively be used to supply the two final conditions [1, 2, 4, 12].
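For comparison, the sketch below uses SciPy’s CubicSpline (an external implementation, not the construction described in this paper) with the “natural” end condition, then integrates the fitted piecewise cubics directly:

```python
import numpy as np
from scipy.interpolate import CubicSpline

x = np.linspace(0.0, np.pi, 5)
y = np.sin(x)

spline = CubicSpline(x, y, bc_type='natural')   # S''(x_1) = S''(x_n) = 0
print(spline.integrate(0.0, np.pi))             # ~2.000, exact value 2
```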

Generalizing to polynomials of order g, the objective in gth-order splines is to derive a gth-order polynomial for each interval between knots (between two data points), as in

$$P_{j} (x) = a_{0} + a_{1} x + a_{2} x^{2} + a_{3} x^{3} + a_{4} x^{4} + \cdots + a_{g} x^{g}$$
(13)

Thus, for n data points (i = 1, 2,…, n), there are (n − 2) interior points (excluding the first and last points), (n − 1) intervals, and consequently (n − 1) gth-order polynomials, with (g + 1)(n − 1) unknown constants to be evaluated. Therefore, gn + n − g − 1 conditions are required to evaluate the unknown constants. These are as follows:

1. The function values must be equal at the interior knots (2 conditions for each interior point = 2n − 4 conditions).
2. The first and last polynomials must pass through the end points (2 conditions).
3. The first- through (g − 1)th-order derivatives at the interior knots must be equal ((g − 1)(n − 2) conditions).
4. (g − 1) derivatives at the first or last knots are zero (g − 1 conditions), chosen from the first- through gth-order derivatives of the first and last polynomials: \(P_{1}^{\prime}(x_{1}) = 0\), \(P_{1}^{\prime\prime}(x_{1}) = 0\), \(P_{1}^{\prime\prime\prime}(x_{1}) = 0\), \(P_{1}^{(4)}(x_{1}) = 0\), …, \(P_{1}^{(g)}(x_{1}) = 0\) and \(P_{n-1}^{\prime}(x_{n}) = 0\), \(P_{n-1}^{\prime\prime}(x_{n}) = 0\), \(P_{n-1}^{\prime\prime\prime}(x_{n}) = 0\), \(P_{n-1}^{(4)}(x_{n}) = 0\), …, \(P_{n-1}^{(g)}(x_{n}) = 0\).

Therefore, (2n − 4) + 2 + (g − 1)(n − 2) + (g − 1) = gn + n − g − 1 conditions, which equals the number of unknown polynomial coefficients.

5 New Type of Spline Interpolation

In this new type of spline interpolation, instead of each polynomial passing through only two points, each polynomial passes through m points, as shown in Fig. 4. At each change point, the interpolating polynomial is exchanged for the next one, and the derivatives of the two polynomials are equated there to give a degree of continuity to the overall curve.

Fig. 4

New type of spline interpolation where the polynomial order for each interval between knots is g = 7, for 13 points (n = 13), where each polynomial passes through 4 points (m = 4) and there are 4 polynomials (Pa, Pb, Pc and Pd)

Thus, at the points where the polynomial changes, the value and the successive derivatives of the interpolating polynomials are matched. The number of points (n), the number of polynomials (np), the degree of the polynomials (g), the number of points through which each polynomial passes (m), and the order of the derivatives (d) equalized at the polynomial exchange points are then varied so that the number of equations always equals the number of unknown polynomial coefficients. To complete the equations, natural conditions are used, in which successive derivatives of the first and last polynomials, at the first and last points respectively, are set to zero. In this way, different interpolations and different polynomials are obtained for the same data points.

The coefficients of the interpolating polynomials are obtained by solving a linear system. The equations of the linear system come from the interpolation conditions, where the polynomials pass through given points, \(P_{j}(x_{i}) = y_{i}\); from the derivative conditions, where the derivatives of successive orders are equalized, \(P_{j}^{(g)}(x_{i}) = P_{j+1}^{(g)}(x_{i})\); and from the natural conditions, where derivatives at the first and last points are set to zero, \(P_{1}^{(g)}(x_{1}) = 0\) or \(P_{np}^{(g)}(x_{n}) = 0\). The number of equations in the linear system must equal the number of polynomials (np) multiplied by the degree of the polynomials plus one (g + 1).

Final adjustments can still be made before solving the linear system: equations that contain a certain point can be excluded. For example, we can remove equations that contain the first point (x1, y1), the last point (xn, yn), or the middle point (xn/2, yn/2). This generates integration formulas that do not contain these points.
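A minimal sketch of how such a linear system can be assembled and solved, for one illustrative configuration (n = 5, np = 2, g = 4, m = 3, d = 2, two natural conditions); the indexing, sample data and helper names are my own, not the paper’s code:

```python
import numpy as np

n, g = 5, 4                                  # 5 points, two quartics P1, P2
x = np.arange(n, dtype=float)                # equally spaced, h = 1
y = np.array([0.0, 1.0, 4.0, 9.0, 16.0])     # sample data (y = x**2)

def row(xi, deriv, which):
    """Row of the system: coefficients of P_which^(deriv)(xi) in the 10 unknowns."""
    r = np.zeros(2 * (g + 1))
    for k in range(deriv, g + 1):
        c = np.prod(np.arange(k, k - deriv, -1.0))   # falling factorial k!/(k-deriv)!
        r[which * (g + 1) + k] = c * xi ** (k - deriv)
    return r

A, b = [], []
for i in (0, 1, 2):                          # interpolation: P1 through points 1..3
    A.append(row(x[i], 0, 0)); b.append(y[i])
for i in (2, 3, 4):                          # interpolation: P2 through points 3..5
    A.append(row(x[i], 0, 1)); b.append(y[i])
for d in (1, 2):                             # derivative matching at the change point x_3
    A.append(row(x[2], d, 0) - row(x[2], d, 1)); b.append(0.0)
A.append(row(x[0], 2, 0)); b.append(0.0)     # natural condition P1''(x_1) = 0
A.append(row(x[4], 2, 1)); b.append(0.0)     # natural condition P2''(x_5) = 0

coef = np.linalg.solve(np.array(A), np.array(b))
P1 = coef[:g + 1]                            # coefficients a_0..a_4 of P1
print(np.polynomial.polynomial.polyval(x[2], P1))   # 4.0: P1 reproduces y_3 exactly
```

Integrating P1 and P2 over their sub-ranges and collecting the coefficients of the ordinates then yields an integration formula of the form of Eq. (14).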

6 Integration of Polynomials Obtained by Spline Interpolation

Once the interpolating polynomials are obtained by splines, they must be integrated, each polynomial over its own x-range [1,2,3, 12]. Because the spacing of the abscissas is constant (Δx = h) and the starting value x1 has no influence, the integration formulas obtained are all functions of the ordinates y (y1, y2, y3, y4,…, yn). The formula below shows this procedure:

$$I = \int\limits_{{x_{1} }}^{{x_{n} }} {f(x)dx \cong \sum\limits_{j = 1}^{np} {\left[ {\int\limits_{{x_{k} }}^{{x_{k + m} }} {P_{j} (x)dx} } \right]} } = h\left( {cr_{1} \, y_{1} + cr_{2} \, y_{2} + \cdots + cr_{n - 1} \, y_{n - 1} + cr_{n} \, y_{n} } \right)$$
(14)

Once an integration formula is obtained, it is tested on many examples, and the truncation error is estimated as a power of the step Δx = h. The stability and convergence of the formula are also tested on several examples where the exact values of the integrals are known. This allows verification and validation of the new numerical integration formulas.
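A minimal sketch of this verification step: the empirical order is read off the ratio of errors as h is halved. Since the derived coefficients live in the tables, Simpson’s weights stand in here for a spline-derived formula.

```python
import numpy as np

def rule(y, h):
    """Compound 3-point rule h*(1, 4, 1)/3 per panel (Simpson), on an odd sample."""
    return (h / 3) * (y[0] + y[-1] + 4 * y[1:-1:2].sum() + 2 * y[2:-2:2].sum())

exact = 1.0 - np.cos(1.0)                # integral of sin(x) on [0, 1]
errors = []
for n in (5, 9, 17):                     # h halves at each refinement
    x = np.linspace(0.0, 1.0, n)
    errors.append(abs(rule(np.sin(x), x[1] - x[0]) - exact))
print(np.log2(errors[0] / errors[1]),
      np.log2(errors[1] / errors[2]))    # both ~4, i.e. truncation error O(h^4)
```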

7 Algorithm for Obtaining Different Integration Formulas by Spline Interpolation

An algorithm is proposed to obtain thousands of integration formulas from different spline interpolations. A maximum of 25 points (mn = 25), considered a large number of data points, was used to obtain integration formulas by following the steps below (a sketch of the enumeration loop is given after the list):

1. Set the maximum number of points to 25 (mn = 25).
2. Vary the number of points (n) from 2 to mn (for n = 2 to mn).
3. Vary the number of polynomials (np) from 1 to mn (for np = 1 to mn).
4. Vary the degree of the polynomials (g) from 0 to mn (for g = 0 to mn).
5. Vary the highest order of the derivative to be matched between polynomials (d) from 0 to mn (for d = 0 to mn).
6. Define the data points (x1,y1), (x2,y2), (x3,y3), …, (xn−1,yn−1), (xn,yn).
7. Set the equally spaced abscissa value (Δx = h) so that x2 = x1 + h, x3 = x1 + 2h, …, xn = x1 + (n − 1)h.
8. Define the interpolating polynomials P1, P2, P3, …, Pnp.
9. Calculate the number of polynomial coefficients to be obtained, np(g + 1).
10. Obtain the np(g + 1) linear equations.
11. Determine the number of data points (m) through which each polynomial will pass (for m = 2 to mn).
12. Obtain the linear equations from the interpolation conditions \(P_{j}(x_{i}) = y_{i}\).
13. Obtain the linear equations from the derivative conditions \(P_{j}^{(g)}(x_{i}) = P_{j+1}^{(g)}(x_{i})\).
14. Obtain the linear equations from the natural conditions \(P_{1}^{(g)}(x_{1}) = 0\) or \(P_{np}^{(g)}(x_{n}) = 0\).
15. Optionally, eliminate the equations that contain a certain point: the first (x1,y1), the last (xn,yn), or the middle point (xn/2,yn/2).
16. Continue only if the number of equations equals the number of unknown polynomial coefficients.
17. Solve the linear system to calculate the coefficients of the interpolating polynomials.
18. Integrate the obtained polynomials.
19. Obtain the formulas for numerical integration.
20. Test the integration formulas obtained against known integrals.
21. Estimate the truncation error of the formulas.
22. Test the convergence, applicability and accuracy of the formulas.
23. Select the best verified and validated integration formulas.
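A minimal sketch of the enumeration loop in steps 2–16; the consistency rules used below (change points shared between consecutive polynomials, and at most g natural conditions per end) are my reading of the text, not the paper’s code:

```python
mn = 25
valid = []
for n_poly in range(1, mn + 1):          # number of polynomials (np)
    for m in range(2, mn + 1):           # points each polynomial passes through
        n = n_poly * (m - 1) + 1         # total points when change points are shared
        if n > mn:
            continue
        for g in range(0, mn + 1):       # polynomial degree
            unknowns = n_poly * (g + 1)
            for d in range(0, g + 1):    # highest derivative matched at change points
                eqs = n_poly * m + (n_poly - 1) * d
                natural = unknowns - eqs # remainder supplied by natural end conditions
                if 0 <= natural <= 2 * g:
                    valid.append((n, n_poly, g, m, d, natural))
print(len(valid), "candidate configurations, e.g.:", valid[:3])
```

Each surviving configuration then feeds steps 17 through 23: solve the system, integrate, and test the resulting formula.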

8 Results Obtained and the Best Integration Formulas

The following tables show some of the integration formulas obtained. Tables 5, 6, 7, 8, 9 and 10 show integration formulas similar to the Newton–Cotes closed formulas, in increasing order of truncation error. Tables 11, 12, 13, 14, 15 and 16 show integration formulas similar to the Newton–Cotes open formulas, in increasing order of truncation error. Tables 17, 18, 19, 20, 21, 22, 23, 24 and 25 show integration formulas similar to the Newton–Cotes semi-closed formulas, in increasing order of truncation error. Tables 26 and 27 show integration formulas similar to the Newton–Cotes semi-open formulas, in increasing order of truncation error; note the similarity with the preceding semi-closed tables, which have the same coefficients with only the indices of the y-ordinates changed. Tables 28, 29, 30, 31 and 32 show integration formulas similar to the Newton–Cotes closed formulas with no middle point, in increasing order of truncation error; note that there are no y-values for indexes n/2 or (n + 1)/2, or both. Tables 33, 34, 35, 36 and 37 show integration formulas similar to the Newton–Cotes open formulas with no middle point, in increasing order of truncation error; again there are no y-values for indexes n/2 or (n + 1)/2, or both. Tables 38, 39, 40, 41 and 42 show integration formulas similar to the Newton–Cotes open formulas without two points, in increasing order of truncation error; note that there are no y-values for indexes 1, 2, n − 1 and n.

Table 5 Integration formulas similar to Newton–Cotes closed formulas with truncation error \(O(h^{2})\) and polynomial degree g = 1
Table 6 Integration formulas similar to Newton–Cotes closed formulas with truncation error \(O(h^{4})\) and polynomial degree g = 2
Table 7 Integration formulas similar to Newton–Cotes closed formulas with truncation error \(O(h^{6})\) and polynomial degree g = 4
Table 8 Integration formulas similar to Newton–Cotes closed formulas with truncation error \(O(h^{8})\) and polynomial degree g = 6
Table 9 Integration formulas similar to Newton–Cotes closed formulas with truncation error \(O(h^{10})\) and polynomial degree g = 8
Table 10 Integration formulas similar to Newton–Cotes closed formulas with truncation error \(O(h^{12})\) and polynomial degree g = 10
Table 11 Integration formulas similar to Newton–Cotes open formulas with truncation error \(O(h^{2})\) and polynomial degree g = 1
Table 12 Integration formulas similar to Newton–Cotes open formulas with truncation error \(O(h^{4})\) and polynomial degree g = 2
Table 13 Integration formulas similar to Newton–Cotes open formulas with truncation error \(O(h^{6})\) and polynomial degree g = 4
Table 14 Integration formulas similar to Newton–Cotes open formulas with truncation error \(O(h^{8})\) and polynomial degree g = 6
Table 15 Integration formulas similar to Newton–Cotes open formulas with truncation error \(O(h^{10})\) and polynomial degree g = 8
Table 16 Integration formulas similar to Newton–Cotes open formulas with truncation error \(O(h^{12})\) and polynomial degree g = 10
Table 17 Integration formulas similar to Newton–Cotes semi-closed formulas with truncation error \(O(h^{1})\) and polynomial degree g = 0
Table 18 Integration formulas similar to Newton–Cotes semi-closed formulas with truncation error \(O(h^{2})\) and polynomial degree g = 1
Table 19 Integration formulas similar to Newton–Cotes semi-closed formulas with truncation error \(O(h^{3})\) and polynomial degree g = 2
Table 20 Integration formulas similar to Newton–Cotes semi-closed formulas with truncation error \(O(h^{4})\) and polynomial degree g = 3
Table 21 Integration formulas similar to Newton–Cotes semi-closed formulas with truncation error \(O(h^{5})\) and polynomial degree g = 4
Table 22 Integration formulas similar to Newton–Cotes semi-closed formulas with truncation error \(O(h^{6})\) and polynomial degree g = 5
Table 23 Integration formulas similar to Newton–Cotes semi-closed formulas with truncation error \(O(h^{7})\) and polynomial degree g = 6
Table 24 Integration formulas similar to Newton–Cotes semi-closed formulas with truncation error \(O(h^{8})\) and polynomial degree g = 7
Table 25 Integration formulas similar to Newton–Cotes semi-closed formulas with truncation error \(O(h^{9})\) and polynomial degree g = 8
Table 26 Integration formulas similar to Newton–Cotes semi-open formulas with truncation error \(O(h^{1})\) and polynomial degree g = 0
Table 27 Integration formulas similar to Newton–Cotes semi-open formulas with truncation error \(O(h^{2})\) and polynomial degree g = 1
Table 28 Integration formulas similar to Newton–Cotes closed formulas with no middle point, truncation error \(O(h^{2})\) and polynomial degree g = 1
Table 29 Integration formulas similar to Newton–Cotes closed formulas with no middle point, truncation error \(O(h^{4})\) and polynomial degree g = 2
Table 30 Integration formulas similar to Newton–Cotes closed formulas with no middle point, truncation error \(O(h^{6})\) and polynomial degree g = 4
Table 31 Integration formulas similar to Newton–Cotes closed formulas with no middle point, truncation error \(O(h^{8})\) and polynomial degree g = 6
Table 32 Integration formulas similar to Newton–Cotes closed formulas with no middle point, truncation error \(O(h^{10})\) and polynomial degree g = 8
Table 33 Integration formulas similar to Newton–Cotes open formulas with no middle point, truncation error \(O(h^{2})\) and polynomial degree g = 1
Table 34 Integration formulas similar to Newton–Cotes open formulas with no middle point, truncation error \(O(h^{4})\) and polynomial degree g = 2
Table 35 Integration formulas similar to Newton–Cotes open formulas with no middle point, truncation error \(O(h^{6})\) and polynomial degree g = 4
Table 36 Integration formulas similar to Newton–Cotes open formulas with no middle point, truncation error \(O(h^{8})\) and polynomial degree g = 6
Table 37 Integration formulas similar to Newton–Cotes open formulas with no middle point, truncation error \(O(h^{10})\) and polynomial degree g = 8
Table 38 Integration formulas similar to Newton–Cotes open formulas without two points, with truncation error \(O(h^{2})\) and polynomial degree g = 1
Table 39 Integration formulas similar to Newton–Cotes open formulas without two points, with truncation error \(O(h^{4})\) and polynomial degree g = 2
Table 40 Integration formulas similar to Newton–Cotes open formulas without two points, with truncation error \(O(h^{6})\) and polynomial degree g = 4
Table 41 Integration formulas similar to Newton–Cotes open formulas without two points, with truncation error \(O(h^{8})\) and polynomial degree g = 6
Table 42 Integration formulas similar to Newton–Cotes open formulas without two points, with truncation error \(O(h^{10})\) and polynomial degree g = 8

9 Conclusion

The integration of the polynomials obtained by spline interpolation allowed us to obtain new and previously unknown integration formulas with high orders of truncation error. Many different integration formulas, similar to the Newton–Cotes formulas, were obtained in this study. These new integration formulas can be used in many engineering, mathematics, and physics research applications, and may be compared with the Newton–Cotes formulas in different fields of science. There appears to be a strong relationship between the degree of the polynomials and the order of the truncation error. The present article opens a new field of research into numerical methods for differentiation, integration, and the solution of differential equations. Another interesting observation is that the spline interpolation approach produces a smooth adjustment between intervals and a minimal variation in curve fitting, which can help the stability of the integration. It is believed that more new and important research on this subject can be done, given the tremendous evolution of numerical methods in engineering.