Abstract
The paper deals with the continuous and discrete higher-degree fuzzy transforms (F-transforms with polynomial components) with respect to a generalized fuzzy partition given by B-splines. We investigate properties of the direct and inverse F-transforms in these cases and prove that using B-splines allows us to improve the quality of approximation of smooth functions.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
1 Introduction
We assume that the reader is familiar with the concept of fuzzy transform (F-transform or \(F^0\)-transform) introduced in Perfilieva and Haldeeva (2001) (see also the key paper Perfilieva 2006) and generalized to the case of higher degree by Perfilieva et al. (2011). As it was shown in Perfilieva et al. (2011), the ordinary F-transform with constant components can be extended to the F-transform with m-degree polynomial components, \( m \ge 0 \) (the \(F^{m}\)-transform). Theory and applications of \(F^{m}\)-transforms in image processing, data analysis and time series analysis have been rapidly developed during last years. The technique of the F-transform of a higher degree has been generalized in many directions: discrete \(F^{m}\)-transforms, \(F^{m}\)-transforms of functions of many variables, as well as \(F^{m}\)-transforms with respect to a generalized fuzzy partition.
The key elements of the F-transform are basic functions which form a fuzzy partition. A generalized fuzzy partition for the ordinary F-transform has been introduced by Stefanini (see Stefanini 2009, 2011). It was fuzzy k-partition with basic functions of bandwidth k (i.e., basic functions, whose support consists of 2k intervals). The result of such generalization is a denser fuzzy partition that may improve the approximation properties and the smoothness of the inverse F-transform. The concept of fuzzy k-partitions has been extended by I. Perfilieva with co-authors (see, e.g., Perfilieva 2015; Holčapek et al. 2015) and applied to the higher degree case of F-transform.
There are a number of papers dealing with fuzzy transforms with respect to a fuzzy partition with specially designed basic functions. In this paper we investigate the \(F^{m}\)-transform, \(m\ge 0\), with respect to a generalized fuzzy partition based on central, odd-degree B-splines. The idea of using splines for forming a fuzzy partition is not new (see, e.g., Bede and Rudas 2011; Kodorane and Asmuss 2013). Bede and Rudas (2011) had considered the case when a fuzzy partition is given by means of B-splines, Kodorane and Asmuss (2013) had suggested an alternative approach and introduced a new type of fuzzy partitions based on splines. In both cases the ordinary F-transform has been considered.
Our research focuses on approximation properties of the \(F^m\)-transform with respect to a generalized uniform fuzzy partition given by B-splines of degree \(2k-1\). The first result on approximation properties of the continuous \(F^m\)-transform in this case was proposed by the authors at the FUZZ-IEEE 2015 conference (Kokainis and Asmuss 2015). A discrete version of this result was announced at the FSTA 2016 conference. In this paper we prove that the continuous and discrete \(F^m\)-transforms based on B-splines of degree \(2k-1\) are precise for polynomials of degree \(r\le \min \left\{ 2m+1,2k-1 \right\} \). On the basis of this result we obtain error estimations for approximation of a transformed function and its derivatives by the continuous and discrete \(F^m\)-transform and prove that using B-splines may improve the approximation properties of the technique of higher-degree F-transform.
The paper is organized as follows. Section 2 contains preliminaries on \(F^m\)-transforms (direct and inverse, continuous and discrete) and B-splines. Section 3 introduces odd-degree B-splines as a special tool of fuzzy partitions. Sections 4 and 5 are devoted to properties of the continuous and discrete \(F^m\)-transforms based on B-splines. Section 7 identifies the degree of polynomials for which the continuous and discrete \(F^m\)-transforms based on B-splines are precise. Section 8 presents approximation properties of the B-spline based \(F^m\)-transforms. Section 9 contains proofs of the technical claims of the previous sections. Finally, Sect. 10 concludes the obtained results.
2 Preliminaries
Here we remind the concept of higher-degree direct F-transform and the corresponding inverse transform. In Sect. 2.6, key facts about B-splines are recalled.
Let \( \mathbb {N} \) be the set of positive integers and \( \mathbb {N}_0 = \mathbb {N} \cup \left\{ 0 \right\} \) be the set of nonnegative integers. The set \( \left\{ n,n+1,\ldots , m \right\} \), for integers n, m with \( n \le m \), is denoted by \( [n \, ..\, m] \). The space of univariate polynomials of degree at most l is denoted by \( \mathbb {P}_l \).
2.1 Generalized fuzzy partition
Suppose that \( [a,b] \subset \mathbb {R} \), \( a<b \), and \( N \in \mathbb {N} \) is chosen. Let \( t_0, \ldots , t_N\) be fixed nodes s.t. \( a \le t_0< \ldots < t_N \le b \). For each \( i \in [0 \, ..\, N]\) nonnegative reals \( h_i' \), \( h_i'' \) are chosen which fulfill the following requirements:
-
\( h_i' + h_i'' > 0 \);
-
\( \bar{E}_i \subset [a,b] \), where we denote
$$\begin{aligned} E_i := (t_i-h_i', t_i + h_i''), \quad \bar{E}_i := [t_i-h_i', t_i + h_i'']; \end{aligned}$$ -
\( \bigcup _{i=0}^N \bar{E}_i = [ a, b] \).
Definition 1
Fuzzy sets \( A_0, \ldots , A_N : [a,b] \rightarrow [0,1]\) are said to constitute a generalized fuzzy partition \( \mathbb {A}\) (with nodes \( t_i \) and margins \( h_i', h_i'' \), \( i \in [0 \, ..\, N] \)) of [a, b] if the following conditions are met for all \( i\in [0 \, ..\, N] \):
-
(locality): \( A_i(t) > 0 \) if \( t \in E_i \) and \( A_i(t) = 0 \) if \( t \in [ a, b] \setminus E_i\);
-
(continuity): \( A_i \) is continuous on \( \bar{E}_i\);
-
(covering): \( \sum _{j=0}^{N} A_j(t) > 0 \) for all \( t \in (a,b)\).
Remark: note a minor difference of the definition from Perfilieva and Kreinovich (2013)—originally covering condition required the strict inequality for \( t \in \left[ a,b \right] \).
We work with a uniform partition in the following sense:
Definition 2
Let \( h >0 \) and \( h' > h/2 \). Then the partition \( \mathbb {A}\) is said to be \( (h,h')\)-uniform, if the following properties are satisfied:
-
\( t_{i+1} = t_i + h \ \) for all \( i \in [0 \, ..\, N-1] \), and \( h_i' = h_i'' = h' \) for each \( i \in [0 \, ..\, N] \);
-
\( A_i(t_i - t) = A_i(t_i + t) \) for all \( t \in [0,h'] \), \( {i\in [0 \, ..\, N]} \);
-
\(A_{i}(t) = A_{i+1}(t+h) \) for all \( t \in \bar{E}_i\) and \( i\in [0 \, ..\, N-1]\).
In this case \( E_i = (t_i - h', t_i + h') \).
It is easy to see that then there is an even function \( {A : [- H,H] \rightarrow \mathbb {R} }\), \( H:=h'/h \), s.t. for all \( i\in [0 \, ..\, N]\) and \( t \in \bar{E}_i \) it holds that \( A_i(t) = A\left( \frac{t-t_i}{h} \right) \). The function A is said to be the generating function of the partition \( \mathbb {A}\).
Throughout the rest of this section, let an interval \( [a,b] \subset \mathbb {R}\), \( a<b \), and its generalized fuzzy partition \( \mathbb {A}= \left\{ A_0, \ldots , A_N \right\} \) be fixed. Fix also an integer \( m \ge 0 \).
2.2 Integral direct \( F ^m\)-transforms
For each \( i \in [0 \, ..\, N] \) denote by \( L_2^{}(A_{i}) \) the Hilbert space of all square integrable functions \( {f : \bar{E}_i \rightarrow \mathbb {R}} \) with the inner product \( \left\langle \cdot ,\cdot \right\rangle _{i} \) (and the induced norm \( \left\| f \right\| _i := \left\langle f,f \right\rangle _{i}^{0.5} \)) given by
Remark: in fact, this way a semi-norm \( \left\| \cdot \right\| _i \) in the space \( L_2^{}(A_{i}) \) is defined. Identifying functions f and g, which are almost everywhere equal in the sense \( \left\| f-g \right\| _i = 0 \), turns \( \left\| \cdot \right\| _i \) from a semi-norm into a norm and the semi-definite bilinear form \( \left\langle \cdot ,\cdot \right\rangle _{i} \) into an inner product. Further on, this technical detail will be ignored.
The set of all functions \( f : [ a, b] \rightarrow \mathbb {R} \), s.t. for all \( i \in [0 \, ..\, N] \), \( f |_{\bar{E}_i} \in L_2^{}(A_{i})\), is denoted by \( L_2(\mathbb {A}) \).
Fix \( i \in [0 \, ..\, N] \). The space \( L_2^m(A_i) \) (of dimension \( m+1 \)) is defined as the closed linear subspace of \( L_2^{}(A_{i}) \), spanned by the set of linearly independent system of polynomials \( \left\{ 1, t, t^2, \ldots , t^m \right\} \) (restricted to \( \bar{E}_i \)).
As in Perfilieva et al. (2011), an orthogonal basis \( \{\mathtt {P}_{i,0}, \ldots , \mathtt {P}_{i,m}\} \) of \( L_2^m(A_i) \) can be constructed so that \( \deg \mathtt {P}_{i,l} = l\), \( l \in [0 \, ..\, m] \), and the leading coefficient of \( \mathtt {P}_{i,l} \) is equal to 1.
Definition 3
Let \( f \in L_2( \mathbb {A}) \). By \( F_{m,i}^\rightarrow [f] \) the orthogonal projection of \( f |_{\bar{E}_i} \) on \( L_2^m(A_i) \) is denoted, \( i \in [0 \, ..\, N] \). The vector \( {F}_m^\rightarrow [f]= ( F_{m,0}^\rightarrow , \ldots , F_{m,N}^\rightarrow ) \) is said to be the direct \( F^m \)-transform of f w.r.t. the fuzzy partition \( \mathbb {A}\).
One can see that the components of the direct \( F^m \)-transform of f can be computed as
where
If no confusion can appear, this notation is simplified to \( c_{i,l} \).
Lemma 1
Suppose that \( f \in \mathbb {P}_{r} \), for some \( r \in \mathbb {N}_0 \), is expressed as
for some \( i \in [0 \, ..\, N] \) and real coefficients \( \alpha _{l}\). Denote \( \alpha _l = 0 \) for all \( l > r \).
Then \( c_{i,l} = \alpha _l \) for all \( l \in [0 \, ..\, m]\) and, consequently, the ith component of the direct \( F^m \)-transform equals
Proof
From the orthogonality of \( \mathtt {P}_{i,l} \), \( l \in [0 \, ..\, m]\), it follows that
Since \( \left\| \mathtt {P}_{i,l} \right\| _{i} \ne 0 \), this implies \( \alpha _l = c_{i,l}\). \(\square \)
2.3 Discrete direct \( F ^m\)-transform
Let \( \varDelta = \left\{ z_{1},\ldots ,z_{L} \right\} \subset [a,b]\) be a discrete set of the interval [a, b] and \( f : \varDelta \rightarrow \mathbb {R}\). Denote \( y_j = f(z_{j}) \), for each \( j \in [1\, ..\, L]\), and let \( \mathbf {Y} = \left( y_1, y_2,\ldots ,y_{L} \right) ^T \) be a column vector containing the values of the function f.
For each \( i \in [0 \, ..\, N] \) construct matrices
Definition 4
We say that the set \( \varDelta \), i.e., the domain of f, is sufficiently dense in the fuzzy partition \( \mathbb {A}\) w.r.t. m if the matrix \( \mathbf {X}_i^T \mathbf {A}_i \mathbf {X}_i \) is invertible for each \( i \in [0 \, ..\, N] \).
Notice that this definition slightly differs from Holčapek and Tichý (2014, Def. 3).
Finally, if the set \( \varDelta \) is sufficiently dense (in the fuzzy partition \( \mathbb {A}\) w.r.t. m), then the discrete direct \( F^m \)-transform can be defined:
Definition 5
(Holčapek and Tichý 2014) The vector \( {F}_m^\rightarrow [f] \) is the discrete direct \( F^m \)-transform of function f w.r.t. the fuzzy partition \( \mathbb {A}\), if the ith component \( F_{m,i}^\rightarrow [f] \), \( i\in [0 \, ..\, N] \), of this vector is the polynomial
where
2.4 Inverse and composite \( F ^m\)-transforms
Definition 6
Let \( \mathbf {p} = \left( p_0,\ldots , p_N \right) \in \mathbb {P}^{N+1}_m \) be a tuple of \( N+1 \) polynomials of degree at most m. Then the function
is called the inverse \( F^m \)-transform of \(\mathbf {p}\) w.r.t. the fuzzy partition \( \mathbb {A}\).
One can see that the inverse \(F^m\)-transform is applied to a tuple of polynomials and is defined as the linear combination of these polynomials with weight functions. In such framework the inverse fuzzy transform can be considered irrespective of the direct one. If our goal is to obtain an approximation of a function f, we need to apply the inverse \(F^m\)-transform \({F}_m^\leftarrow \) to the result \({F}_m^\rightarrow [f]\) of the direct \(F^m\)-transform, i.e., we need to take the composition of two transforms.
Definition 7
Let \( f : D_f \rightarrow \mathbb {R}\), where the set \( D_f \subset [a,b] \) is
-
either the whole interval [a, b] (in the case of the integral \( F^m \)-transform; then we also require f to be from the space \( L_2( \mathbb {A}) \)),
-
or the discrete grid \( \varDelta \), described in Sect. 2.3 (in the case of the discrete \( F^m \)-transform; then we also require that its discrete direct \( F^m \)-transform \( {F}_m^\rightarrow [f] \) exists).
Suppose that the direct \( F^m \)-transform (either the integral or the discrete version) of f w.r.t. the fuzzy partition \( \mathbb {A}\) is
Then the inverse \( F^m \)-transform of this vector, i.e., the function
is called the (composite) \( F^m \) -transform of f w.r.t. the fuzzy partition \( \mathbb {A}\).
It can be seen that the direct \( F^m \)-transform is a mapping from a space of functions to the space \( \mathbb {P}^{N+1}_m \), whereas the inverse \( F^m \)-transform maps a vector of polynomials from \( \mathbb {P}^{N+1}_m \) back to a function. Hence, the composite transform is a composition of these two mappings. We will often omit “composite” and simply say “\( F^m \)-transform of f”.
Remark: in the original definition of \( F^m \)-transform this composition is considered to be the inverse \( F^m \)-transform, i.e., originally the inverse \( F^m \)-transform is applied to a transformed function f, not to the result of its direct \( F^m \)-transform.
2.5 Ruspini condition
Suppose that \( [\hat{a}, \hat{b}] \subset [a,b] \), \(\hat{a} < \hat{b} \), is such an interval that
In this case we say that the generalized fuzzy partition \( \mathbb {A}\) fulfills the Ruspini condition on \( [\hat{a}, \hat{b}] \). It is obvious that in this interval the expression of the inverse \( F^m \)-transform of a tuple \( \mathbf {p} = \left( p_0,\ldots , p_N \right) \in \mathbb {P}^{N+1}_m \) can be simplified to
Also the expression of the composite \( F^m \)-transform can be simplified accordingly:
The following statement can be proven analogously to Lemma 8 of Perfilieva et al. (2011):
Lemma 2
For all \( p \in \mathbb {P}_m \) the \( F^m \)-transform of p coincides with p on \( [\hat{a}, \hat{b}] \) (the interval where the generalized fuzzy partition fulfills the Ruspini condition), i.e., \( p(t) = {F}_m[p](t)\) for all \( t \in [\hat{a}, \hat{b}] \).
2.6 B-splines
The way we define B-splines here follows the definition from Schoenberg (1973) (see also Schoenberg 1964).
2.6.1 General definition and properties
The truncated power function \( t^n_+ \) with integer \( n \ge 0 \) is defined as follows:
Let
Suppose that \( -\infty< t_i< t_{i+1}< \ldots< t_{i+n+1} < + \infty \). Then the function \( B^n_i \), defined as the divided difference of order \( n+1\) of \( \psi _n(t,\tau ) \) with respect to the variable \( \tau \) and based on \( n+2 \) points \( t_i, t_{i+1}, \ldots , t_{i+n+1} \), is called a B-spline (of degree n with knots \( t_i \), \( t_{i+1}, \ldots , t_{i+n+1} \)).
By \( \phi _n \) we denote the central B-spline (Schoenberg 1973), which is obtained when \( t_i = -\frac{n+1}{2} \) and \( t_{i+j} = t_i + j , j \in [0 \, ..\, n+1] \).
The functions \( B^n_i \) are piecewise polynomial functions satisfying
-
the restriction of \( B^n_i \) to \( (t_{i+j},t_{i+j+1} ) \), \( j\in [0 \, ..\, N]\), is a polynomial;
-
\( B^n_i \in C^{n-1}(\mathbb {R}) \);
-
\( B^n_i(t) > 0 \) if \( t \in \left( t_i, t_{i+n+1} \right) \);
-
\( B^n_i(t) = 0 \) if \( t \notin \left( t_i, t_{i+n+1} \right) \).
B-splines can be constructed through the following recurrence relation (Boor 2001):
for \( n \in \mathbb {N} \) and \( t \in \mathbb {R} \), noting that \( B^0_i (t ) = 1/(t_{i+1} - t_i) \) for all \( t \in \left( t_i, t_{i+ 1} \right) \). Moreover, the central B-spline \( \phi _n \) satisfies the following properties:
-
\( \phi _n \) is an even function;
-
\( \phi _n \) is non-decreasing on \( (-\infty ;0) \) and non-increasing on \( (0;+\infty ) \);
-
when \( n=2k-1 \) is an odd number, \( \phi _{2k-1} \) has integer knots \( -k \), ..., k.
2.6.2 Marsden’s identity
Suppose that \( i + n \le j \) and there is a sequence of knots
Then, for all \( z \in \mathbb {R} \) and x satisfying \( t_{i+n} < x \le t_{j+1} \), the following equality holds (Marsden 1970):
An easy consequence (Marsden 1970) is the equality
which is valid for \( x \in (t_{i+n}, t_{j+1})\).
In particular, when the knots are 1-equidistant, i.e., \( t_{i+l+1} =t_{i+l} +1 \) for all \( l \in [0 \, ..\, j-i] \), the equality \( B^n_{i+l}(x) = B^n_i(x -l ) \) holds and we obtain
If \( n \ge 1 \), then this is true also for \( x = t_{i+n} \) and \( x=t_{j+1} \).
3 Generalized fuzzy partition based on the central B-spline of odd degree
3.1 Construction of the central B-splines of odd degree
It can be easily seen from (3) that the central B-spline satisfies the following recurrence relation for all \( n \in \mathbb {N} \):
for all \( t \in \mathbb {R} \), where
and \( \phi _0(t) = 1 \), \( t \in (-0.5; \, 0.5) \). For example, it can be calculated that
Let \( k \in \mathbb {N} \). As it was mentioned previously, the central B-spline \( \phi _{2k-1} \in C^{2k-2}(\mathbb {R}) \) has integer knots \( -k \), ..., k. By (6),
and
It means that \( \phi _{2k+1}(t) \) can be expressed as
where
After simplifying these expressions we obtain that
where
Relations (8)–(9) together with (7) provide a way to construct recursively the central B-splines of odd degree.
Next we describe a construction of a generalized fuzzy partition, which will be based on the B-splines \( \phi _{2k-1} \). Note that from (7) it follows that our construction generalizes the commonly used uniform fuzzy partition given by a generating function of a triangular shape (Perfilieva et al. 2011) (and, in case \( k=1 \), gives this particular uniform partition).
3.2 Construction of a generalized fuzzy partition based on the central B-spline of odd degree
Fix \( N, k \in \mathbb {N} \) such that \( N \ge 2k-1 \). By A we denote the central B-spline of degree \( 2k-1 \).
To construct fuzzy partition based on B-splines, we use the technique introduced in Stefanini (2011) and take basic functions \( A_i \) which cover more than two consecutive subintervals (in our case, each basic function will cover 2k subintervals). This technique also requires choosing additional nodes \( t_i \) with indices \( i<0 \) and \( i>N \).
Let an interval [a, b] be fixed. Define
-
the step: \( h = (b-a)/(N+2k) \);
-
the h-equidistant nodes: \( t_i = a + h\, (i+k) \), for each \( i \in [-k \, ..\, N+k]\); notice that
$$\begin{aligned} a = t_{-k}< \cdots< t_0< \cdots< t_N< \cdots < t_{N+k} = b; \end{aligned}$$ -
the basic functions \(A_i(t) := A \left( \frac{t-t_i}{h} \right) \), \( i\in [0 \, ..\, N]\).
We consider the (generalized) fuzzy partition \( \mathbb {A}\) of [a, b] formed by the functions \( A_{0} \), ..., \( A_{N} \). Notice that this partition is (h, hk) -uniform.
From now on, we will call such partition of a given interval [a, b] as the (k, N) -FPB (fuzzy partition based on B-splines). When talking about such fuzzy partition, we also always implicitly assume that its parameters satisfy \( N \ge 2k-1 \).
3.3 Properties of the (k, N) -FPB
First we note that the support of each basic function \( A_i \) consists of 2k consecutive basic intervals:
Denote
then \( [\hat{a},\hat{b}] \subset [a,b] \) and for all \( t \in [\hat{a},\hat{b}] \) we have
where the last equality follows from (5). It follows that for the constructed fuzzy partition the Ruspini condition is fulfilled on \( [\hat{a}, \hat{b}] \). Notice that from the assumption \( N \ge 2k-1 \) it follows that \( [\hat{a}, \hat{b}] \) always consists of at least one basic interval.
From the viewpoint of fuzzy partitions, the next lemma uses only the fact that the partition is uniform. It will help to understand the relations between different components of the direct \( F^m \)-transform when it is applied to a polynomial.
Lemma 3
Let \( r\ge 0 \) be an integer and \( p^0 \), \( p^1 \), ..., \( p^r \) be polynomials satisfying \( \deg p^l = l \) for each \( l \in [0 \, ..\, r] \). Denote \( p^l_i (t) := p^l \left( \frac{t - t_i}{h} \right) \), \( i \in [0 \, ..\, N] \), \( l \in [0 \, ..\, r] \).
Fix any \( f \in \mathbb {P}_r \); let \( a_{i,l} \), for all \( i \in [0 \, ..\, N] \) and \( l\in [0 \, ..\, r] \), be real numbers such that
(such coefficients exist because \( p^0 _i\), \( p^1 _i\), ..., \( p^r _i\) span the space \( \mathbb {P}_r \)).
Then for all \( l \in [0 \, ..\, r] \) there exists a polynomial \( q_l \in \mathbb {P}_{r-l} \) such that \( a_{i,l} = q_l(i) \) for all \( i \in [0 \, ..\, N] \).
Proof in Sect. 9.1.3.
Another lemma characterizes how orthogonality of polynomials w.r.t. the weight function A translates to a similar orthogonality-like property w.r.t. sum over the values of the corresponding expression. This property is crucial to our main results, since it easily leads to Lemma 7, which in turn is the main result behind Theorem 1.
Lemma 4
Suppose that \( l \in \mathbb {N} \) and a polynomial \( P \in \mathbb {P}_l \) satisfies
-
1.
\( \int _{-k}^{k} q(t) P(t)A(t)\,\mathrm {d}t=0 \) for all polynomials q of degree at most \( \min \left\{ l-1, 2k-1-l \right\} \);
-
2.
P is either even or odd (i.e., for all \( t \in \mathbb {R} \) we have \( P(t) = \pm P(-t)\)).
Let \( p \in \mathbb {P}_{l-1}\) and suppose that \( \deg p \le 2k-1-l \). Then for all \( \tau \in [0,1] \) and \( l \in [0 \, ..\, 2k-1] \) the equality
holds.
Proof in Sect. 9.3.2.
3.4 Examples of FPBs
In Fig. 1 the (1, 8) -FPB of the interval [0, 1] is depicted. Since \( k=1 \), the basic functions are degree-1 B-splines, i.e., piecewise linear functions, commonly called triangular basic functions.
In Fig. 2 we show the (2, 8) -FPB of [0, 1] . Here \( k=2 \), thus the basic functions are given by cubic B-splines and each basic functions cover four consecutive basic intervals.
In both cases \( N=8 \), meaning that the unit interval is partitioned into 9 fuzzy sets. We also mark the endpoints of the interval where the Ruspini condition is fulfilled:
-
for the (1, 8) -FPB of [0, 1] (Fig. 1), the Ruspini condition holds on [1 / 10; 9 / 10] ;
-
for the (2, 8) -FPB of [0, 1] (Fig. 2), the Ruspini condition holds on [1 / 4; 3 / 4] .
3.5 Discrete grid associated to the FPB
For the purposes of the discrete transform we will define the discrete grid where the discrete functions will be defined. Fix a positive integer M and let
-
\( L = (N+2k)M \);
-
\( h' = h/M \);
-
\( z_{j} = a + h'\,j \), for all \( j \in [0 \, ..\, L]\).
Then
Denote
Notice that \( \varDelta \cup \left\{ a,b \right\} \) is a uniform grid which contains all fuzzy partition nodes \( t_i \), \( i\in [-k \, ..\, N+k] \).
In Fig. 3, we depict the (1, 4) -FPB of the interval [0; 1] with its basic nodes (the circles in the figure) and show the discrete set \( \varDelta \) (the asterisks in the figure). In this case \( M=2 \), i.e., each basic interval is split into two equal parts by the discrete points. It can also be noted that each fuzzy set \( A_i \) contains exactly \( 2kM-1 = 3 \) discrete points with nonzero degree.
4 Spaces related to the fuzzy transforms
4.1 Overview
As previously, we will denote by \( L_2(\mathbb {A})\) the space of all functions \( f : [a,b] \rightarrow \mathbb {R}\) such that the restriction of f onto the support of the ith basic function \( A_{ i} \) belongs to \( L_2^{}(A_{i}) \) for all \(i\in [0 \, ..\, N]\). In other words, \( L_2(\mathbb {A})\) contains all functions for which the integral \( F ^m\)-transform w.r.t. the (k, N) -FPB can be applied.
Similarly, by \( \varLambda (\mathbb {A})\) we will denote the space of all functions \( f : \varDelta \rightarrow \mathbb {R}\), i.e., all those functions for which the discrete \( F ^m\)-transform w.r.t. the (k, N) -FPB can be applied (provided that \( \varDelta \) is sufficiently dense in the FPB).
In either the integral or the discrete case function f, to which the direct transform is applied, must satisfy certain properties; in particular, the restriction must belong to a certain space (in the integral case that is the space \( L_2^{}(A_{i}) \)).
In this section we will describe both for the integral and discrete transforms a “template” space (either \( L_2^{}(A)\) or \( \varLambda (A)\), respectively), having the property that (in the integral case) all functions in the space \( L_2^{}(A_{i}) \) can be obtained from those in the template space by shifting and scaling the variables.
More precisely, \( f \in L_2^{}(A_{i}) \) iff there is a function \( F \in L_2^{}(A)\) such that we have \(F(x) = f \left( h x + t_{i} \right) \) for all \( x \in {{\mathrm{supp}}}A\). The bijection \( x \mapsto h x + t_{i} \) allows to describe objects in the space \( L_2^{}(A_{i}) \) (such as the inner product, norm, orthogonal polynomials, etc) in terms of the corresponding objects in \( L_2^{}(A)\).
In the discrete case we define spaces \( \varLambda (A_{i}) \), the discrete counterparts of \( L_2^{}(A_{i}) \), and a template space \( \varLambda (A)\), from which the space \( \varLambda (A_{i}) \) can be derived (via the bijection \( x \mapsto h x + t_{i} \)).
4.2 Spaces related to the integral transform
4.2.1 Template space \( L_2^{}(A)\)
Let \( L_2^{}(A)\) stand for the Hilbert space of square integrable functions \( f : [-k,k] \rightarrow \mathbb {R} \) with the inner product \( \left\langle \cdot ,\cdot \right\rangle \) given by
and the induced norm \( \left\| f \right\| = \left\langle f,f \right\rangle ^{0.5} \).
There exists a sequence of monic orthogonal (w.r.t the inner product \( \left\langle \cdot ,\cdot \right\rangle \)) polynomials in \( L_2^{}(A)\). This follows from a construction similar as in Perfilieva et al. (2011), or, more generally, from Gautschi (2004, Theorem 1.7). Moreover, these polynomials can be constructed via a three-term recurrence relation Gautschi (2004, Theorem 1.27) as follows:
where
and \( \beta _0 \in \mathbb {R} \) is arbitrary. However, it is easy to see that \( \alpha _l=0\); hence, the three-term recurrence relation in our case is
with initial members \( \mathtt {P}_{0} \equiv 1\), \( \mathtt {P}_{1}(t) =t\).
By inductive arguments it can be seen that \( \mathtt {P}_{l} \) is an odd function iff l is odd and \( \mathtt {P}_{l} \) is an even function iff l is even. Another obvious but important property is that \( \mathtt {P}_{l} \) is orthogonal to every polynomial of degree less than l.
4.2.2 Orthogonal polynomials for \( L_2^{}(A_{i}) \)
Let \( f,g \in L_2^{}(A)\) and denote \( f_i(x) = f\left( \frac{x-t_i}{h} \right) \) and \( g_i(x) =g\left( \frac{x-t_i}{h} \right) \), then \( f_i, g_i \in L_2^{}(A_{i}) \) (conversely, if \(\tilde{f}_i \) is an arbitrary function from \( L_2^{}(A_{i}) \), then \( \tilde{f} (x) :=\tilde{f}_i \left( xh+t_i \right) \) is in \( L_2^{}(A)\)). The inner product \( \left\langle \cdot ,\cdot \right\rangle _{i} \) can be obtained from \( \left\langle \cdot ,\cdot \right\rangle \) as follows:
It immediately follows that also \( \left\| f \right\| =h^{-0.5} \left\| f_i \right\| _{i}\).
Then the basis of \( L_2^m(A_i) \) is formed by the polynomials \( \mathtt {P}_{i,l} \), \( l\in [0 \, ..\, m]\), where
Moreover, \( \left\| \mathtt {P}_{i,l} \right\| ^2_i = h \,\left\| \mathtt {P}_{l} \right\| ^2 \).
Finally, for all \( f \in L_2^{}(A_{i}) \), \( i\in [0 \, ..\, N] \), and all \( l \ge 0 \) consider \( c_{i,l}[f] \), defined by (2). This quantity has clear interpretation in terms of projection onto the basis polynomial \( \mathtt {P}_{i,l} \): the orthogonal projection of f onto the 1-dimensional space spanned by \( \mathtt {P}_{i,l} \) is the function \( c_{i,l}[f] \mathtt {P}_{i,l}\).
4.3 Spaces related to the discrete transform
4.3.1 Template \( Z\) and the associated space \( \varLambda (A)\)
For all integers j s.t. \( \left| j \right| \le Mk -1 \), introduce numbers . The set \( Z\) has the following property:
In other words, by scaling points in \( Z\) by h and shifting by \( t_i \), we obtain all points in \( \varDelta _i \), i.e., the set \( Z\) can be thought of as a template for the discrete points in \(\varDelta _i\).
Let \( \varLambda (A)\) stand for the space of functions \( f : Z\rightarrow \mathbb {R} \). Define a bilinear form \( \left\langle \cdot ,\cdot \right\rangle : \varLambda (A)^2 \rightarrow \mathbb {R} \) with
and let \( \left\| f \right\| = \left\langle f,f \right\rangle ^{0.5}\), for all \( f \in \varLambda (A)\).
Identifying functions \( f,g \in \varLambda (A)\) which are almost everywhere equal (in the sense \( \left\| f-g \right\| = 0 \)) turns the bilinear form \( \left\langle \cdot ,\cdot \right\rangle \) into an inner product and \( \left\| \cdot \right\| \) into a norm.
Since \( Z\) consists of \( 2Mk-1 \) points (and at any of them \( A\) takes a positive value), by Gautschi (2004, Theorem 1.7) there exist polynomials \( \mathtt {P}_{l} \), \( l \in [0 \, ..\, 2Mk-2]\), such that
-
\( \deg \mathtt {P}_{l} =l \);
-
the polynomials are orthogonal with respect to \( \left\langle \cdot ,\cdot \right\rangle \), i.e., \( \left\langle \mathtt {P}_{l} ,\mathtt {P}_{l'} \right\rangle = 0 \) whenever \( l \ne l' \);
-
\( \left\| \mathtt {P}_{l} \right\| \ne 0 \) for each \( l \in [0 \, ..\, 2Mk-2]\);
-
\( \mathtt {P}_{l} \) is an even function if l is even and odd if l is odd.
Moreover, these polynomials can also be constructed via a three-term recurrence relation, see, e.g., Gautschi (2004, Theorem 1.27).
We extend this construction by defining \( \mathtt {P}_{l} \in \mathbb {P}_l\) with \( l \ge 2Mk-1 \) so that \( \mathtt {P}_{l} \equiv 0 \) on \( Z\), then the obtained infinite sequence is orthogonal w.r.t. \( \left\langle \cdot ,\cdot \right\rangle \) (even though \( \left\| \mathtt {P}_{l} \right\| =0 \) for all \( l \ge 2Mk-1 \), thus these additional polynomials are identified with 0 in the space \( \varLambda (A)\)).
For each \( l \ge 1\) the system \( \left\{ \mathtt {P}_{0}, \mathtt {P}_{1}, \ldots ,\mathtt {P}_{l} \right\} \) spans the space \( \mathbb {P}_l \); moreover, \( \mathtt {P}_{l} \) is orthogonal (w.r.t. \( \left\langle \cdot ,\cdot \right\rangle \)) to any \( p \in \mathbb {P}_{l-1} \).
4.3.2 Space \( \varLambda (A_{i}) \)
Let \( \varLambda (A_{i}) \), \( i \in [0 \, ..\, N] \), stand for the space of functions \( f : \varDelta _i \rightarrow \mathbb {R} \). Clearly, if \( f \in \varLambda (A)\), then \( f_i \), defined by \( f_i(x) = f \left( \frac{x-t_i}{h} \right) \), belongs to \( \varLambda (A_{i}) \) (and vice versa).
Each space \( \varLambda (A_{i}) \) is equipped with the discrete inner product defined by
and the associated discrete norm \( \left\| \cdot \right\| _{i} \).
We define
obtaining the sequence \( \mathtt {P}_{i,l} \), \( l \ge 0 \), of polynomials orthogonal w.r.t. \( \left\langle \cdot ,\cdot \right\rangle _{i} \).
Similarly as in the integral case (see Sect. 4.2.2), the inner product in the space \( \varLambda (A_{i}) \) is related to that in the space \( \varLambda (A)\):
Lemma 5
For all \( i \in [0 \, ..\, N]\) and \( f,g \in \varLambda (A)\) the equality \( \left\langle f_i,g_i \right\rangle _{i} = \left\langle f, g \right\rangle \) holds, where \( f_i(x)\) and \( g_i(x) \) stand for the functions \( f \left( \frac{x-t_i}{h} \right) \) and \( g\left( \frac{x-t_i}{h} \right) \), respectively.
Proof in Sect. 9.4.2.
It immediately follows that also \( \left\| f \right\| = \left\| f_i \right\| _{i}\).
Finally, for all \( f \in \varLambda (A_{i})\) , \( i\in [0 \, ..\, N] \), and all \( l \in [0 \, ..\, 2kM-2] \) we define
If no confusion can appear, this notation is simplified to \( c_{i,l} \). Again, the orthogonal projection of f onto the 1-dimensional space spanned by \( \mathtt {P}_{i,l} \) is \( c_{i,l}[f] \mathtt {P}_{i,l}\).
4.4 Relationship and common properties
From now on we will denote by
-
\( {F}_m^\rightarrow \) either the integral or the discrete direct \( F^m \)-transform (and \( F_{m,i}^\rightarrow \) stands for the ith component of the direct transform in either case);
-
\( {F}_m^\leftarrow \) either the integral or the discrete inverse \( F^m\)-transform;
-
\( \left\langle \cdot ,\cdot \right\rangle \) either the inner product in the space \( L_2^{}(A)\), defined by (11), or the inner product in the space \( \varLambda (A)\), defined by (13);
-
\( \left\langle \cdot ,\cdot \right\rangle _{i} \) either the inner product in the space \( L_2^{}(A_{i}) \), defined by (1), or the inner product in the space \( \varLambda (A_{i}) \), defined by (14);
-
\( \left\| \cdot \right\| \) the norm, associated to \( \left\langle \cdot ,\cdot \right\rangle \) (in either the discrete or the integral case); likewise, \( \left\| \cdot \right\| _{i} \) stands for the norm, associated to \( \left\langle \cdot ,\cdot \right\rangle _{i} \);
-
\( \mathtt {P}_{l} \) the orthogonal polynomials in either \( L_2^{}(A)\) or \(\varLambda (A)\);
-
\( \mathtt {P}_{i,l} \) the orthogonal polynomials in either \( L_2^{}(A_{i}) \) or \( \varLambda (A_{i})\);
-
\( c_{i,l}[f] \) (or \( c_{i,l} \)) the scalars, defined either by (2) or by (16).
Whether the integral or the discrete construction is meant will be clear from the context; in certain cases (such as in Theorem 1) notations \( {F}_m^\rightarrow \), \( \mathtt {P}_{l} \), etc., can stand for either the integral or the discrete concepts, allowing to unify the results obtained in both settings.
However, when confusion can appear (as in Lemma 6 below, where both inner products appear) the integral and discrete concepts will be distinguished by use of superscripts \( {}^{\mathrm {int}}\) and \( {}^{\mathrm {dsc}}\), respectively; for example, \( \mathtt {P}_{l}^{\mathrm {int}}\) stands for the polynomial \( \mathtt {P}_{l} \) defined in Sect. 4.2, whereas \( \mathtt {P}_{l}^{\mathrm {dsc}}\) stands for the polynomial \( \mathtt {P}_{l} \) described in Sect. 4.3.
The next lemma shows that the weighted integral (w.r.t. the weight function A) of a polynomial of degree at most \( 2k-1 \) can be replaced with a sum over the points in the template \( Z\), which establishes a relationship between the integral and discrete inner products defined previously:
Lemma 6
For all \( p \in \mathbb {P}_{2k-1} \) the following equality holds:
In terms of the inner products \( \left\langle \cdot ,\cdot \right\rangle ^{\mathrm {int}}\) and \( \left\langle \cdot ,\cdot \right\rangle ^{\mathrm {dsc}}\) this can be restated as
Proof in Sect. 9.4.1.
Finally, we have the following result, which is a consequence of Lemma 4; as mentioned previously, this is the main result leading to Theorem 1.
Lemma 7
Let \( l \in [1 \, ..\, 2k-1] \). Suppose that p is any polynomial such that \( \deg p \le \min \left\{ l-1, 2k-1-l \right\} \). Then for all \( \tau \in [0,1] \) the following equality holds:
Proof
Fix \( l \in [1 \, ..\, 2k-1] \). We claim that \( \mathtt {P}_{l} \in \mathbb {P}_l \) is orthogonal under the integral inner product \( \left\langle \cdot ,\cdot \right\rangle ^{\mathrm {int}}\) to all polynomials q of degree at most \( \min \left\{ l-1, 2k-1-l \right\} \). Then, since \( \mathtt {P}_{l} \in \mathbb {P}_l\) is either an odd or an even function, the claim follows from Lemma 4.
In the integral case we already have the required orthogonality; consider the discrete case and fix any polynomial \( q \in \mathbb {P}_{l-1} \) s.t. \( \deg q \le 2k-1-l \). Then \( \deg (q \mathtt {P}_{l}) \le 2k-1\) and, by Lemma 6,
However, since q is orthogonal to \( \mathtt {P}_{l} \) under \( \left\langle \cdot ,\cdot \right\rangle ^{\mathrm {dsc}}\), it follows that \( \int _{-k}^k q(x) \mathtt {P}_{l}(x) A(x)\,\mathrm {d}x=0 \) for all polynomials q of degree at most \( \min \left\{ l-1, 2k-1-l \right\} \), concluding the proof. \(\square \)
5 Properties of the discrete \(F^m\)-transform w.r.t. FPB
Throughout this section, fix a function \( f : \varDelta \rightarrow \mathbb {R}\); here the set \( \varDelta \) is as described in Sect. 3.5. Denote
for all , the vector \( \mathbf {Y}_i \) is the fragment of \( \mathbf {Y} \) which contains the discrete function’s values on \( \varDelta _i \).
The following lemma helps to characterize the values of m which ensures sufficient density of \( \varDelta \).
Lemma 8
Let \( m \ge 0 \). Then \( \varDelta \) is sufficiently dense in the (k, N) -FPB w.r.t. a nonnegative integer m iff \( m \le 2kM -2 \).
Proof in Sect. 9.5.2.
It also can be shown that the coefficients of the polynomial \( F_{m,i}^\rightarrow [f]\) can be computed via matrices, independent of i (except for the corresponding fragment of the vector \( \mathbf {Y} \)), which is due to the uniformness; moreover, the Vandermonde matrix \( \mathbf {X}\) can be formed by any polynomials spanning \( \mathbb {P}_m \).
Lemma 9
Let \( m \in [0 \, ..\, 2Mk-2]\) and \( p^0 \), \( p^1 \), ..., \( p^m \) be arbitrary polynomials spanning \( \mathbb {P}_m \).
Denote \( p^l_i (t) := p^l \left( \frac{t - t_i}{h} \right) \) and define matrices \( \mathbf {X}\), \( \mathbf {A} \) as follows:
Then the components of the discrete direct transform \( {F}_m^\rightarrow [f] \) satisfy
where
Proof in Sect. 9.5.3.
Alternatively, the components of the discrete \( F^m \)-transform of f can be computed similarly as in the case of the integral transform.
Corollary 1
Suppose that \( m \in [0 \, ..\, 2Mk-2]\). Then the components of the discrete direct transform \( {F}_m^\rightarrow [f] \) satisfy
where \( c_{i,l} \) are defined by (16).
Proof
We apply Lemma 9 with \( p^l = \mathtt {P}_{l} \). As in the proof of Lemma 8, notice that \( \mathbf {X}^T \mathbf {A} \mathbf {X} \) is the diagonal matrix with entries \( \left\| \mathtt {P}_{l} \right\| ^2 \), \( l \in [0 \, ..\, m] \), on its diagonal. Similarly, the lth entry of \( \mathbf {X}^T \mathbf {A} \mathbf {Y}_i \) equals
Thus the lth entry of \( (\mathbf {X}^T \mathbf {A} \mathbf {X} ) ^{-1} \mathbf {X}^T \mathbf {A} \mathbf {Y}_i \) is
concluding the proof. \(\square \)
We also obtain the following conclusion, the discrete counterpart of Lemma 1.
Corollary 2
Suppose that \( f \in \mathbb {P}_{r} \), for some \( r \in \mathbb {N}_0 \), is expressed as
for some \( i \in [0 \, ..\, N] \) and real coefficients \( \alpha _{l} \). Denote \( \alpha _l = 0 \) for all \( l > r \).
Let \( m \le 2kM-2\); then \( c_{i,l}[f] = \alpha _l \) for all \( l \in [0 \, ..\, m]\) and, consequently, the ith component of the discrete direct transform \( F^m \)-transform equals
Proof
Apply Corollary 1 and from the orthogonality of \( \mathtt {P}_{i,l} \), \( l \in [0 \, ..\, m ]\), obtain that
Since \( \left\| \mathtt {P}_{i,l} \right\| _{i} = \left\| \mathtt {P}_{l} \right\| \ne 0 \) when \( l \le m \le 2kM-2 \), it follows that \( c_{i,l}[f] = \alpha _l \). \(\square \)
6 Additional properties of \(F^m\)-transforms
In this section we show several additional results related to \( F^m \)-transforms, when the original function f is of a particular kind. Even though we formulate and prove the following results in the case of FPB, the choice of the fuzzy partition is not essential and similar results could be obtained in a more general setting.
Throughout this section, let numbers \( k, N \in \mathbb {N} \), an interval [a, b] and its (k, N) -FPB, given by the basic functions \( A_i(x) = A\left( \frac{x-t_i}{h} \right) \), be fixed. Suppose that the Ruspini condition is fulfilled on an interval \( [\hat{a}, \hat{b}] \subset [a,b] \).
Fix also a function \( f : D_f \rightarrow \mathbb {R}\), where \( D_f \subset [a,b] \) is
-
either the whole interval [a, b] (in the case of the integral \( F^m \)-transform; then we also require f to be from the space \( L_2(\mathbb {A})\)),
-
or the discrete grid \( \varDelta \), described in Section 3.5 (in the case of the discrete \( F^m \)-transform; then we also require \( m \le 2kM-2 \)).
6.1 \( F^m \)-transform of an integral transform
We consider the case when f is itself an integral transform of a function g w.r.t. some kernel K (in our consideration kernel K is piecewise continuous w.r.t. each variable). Then the \( F^m \)-transform of f can be obtained, instead applying the \( F^m \)-transform to the function g.
Lemma 10
Suppose that
where \( g \in L_1([a,b] )\) and \(K: [a,b] ^2 \rightarrow \mathbb {R}\) is piecewise continuous w.r.t. each variable. Then
Proof in Sect. 9.6.1.
6.2 \( F^m \)-transform of a hybrid function
In this subsection we consider the case when the original function is defined in a piecewise manner and show that its \( F^m \)-transform can be obtained as the \( F^m \)-transform of the corresponding sub-functions, as long as the argument is far enough from the point between pieces.
Lemma 11
Suppose that \( s \in [t_{i_0}, t_{i_0+1}] \) for some \( i_0 \in [-k \, ..\, N+k-1] \). Let f satisfy
for some functions \( g_1, g_2 : D_f \rightarrow \mathbb {R} \).
Let \( x \in [t_{j_0}, t_{j_0+1}] \subset [\hat{a}, \hat{b}] \) for some integer \( j_0 \). Then the \( F^m \)-transform of f satisfies the following properties:
-
if \( j_0 \le i_0 - 2k \), then \( {F}_m[f] (x) = {F}_m[g_1](x) \);
-
if \( j_0 \ge i_0 + 2k \), then \( {F}_m[f] (x) = {F}_m[g_2](x) \).
Proof in Sect. 9.6.2.
6.3 Affine transformations of the universe
In this subsection we consider the case when the original interval is transformed to another interval via an affine map \( \phi (t) = \alpha t + \beta \), where \( \alpha , \beta \) are some reals, \(\alpha > 0 \).
Clearly, we immediately have the (k, N) -FPB of the interval \( [ \phi (a), \phi (b)] \), with basic nodes \( \phi (t_i) \) and basic functions
for all \( i \in [-k \, ..\, N+k] \). Notice that alternatively we can write
i.e., \( \tilde{A}_i = A_i \circ \phi ^{-1} \). Clearly, this FPB fulfills the Ruspini condition on \( [\phi (\hat{a}), \phi (\hat{b})] \).
In the case of the discrete transform, the set \( \varDelta \subset [a,b] \) is transformed to the set \( \tilde{\varDelta } = \phi (\varDelta ) \). If \( \varDelta \) is a uniform grid as described in Sect. 3.5 (with parameter M), then \( \tilde{\varDelta } \) is also a uniform grid with the same parameter M.
It can be seen that the basis polynomials \( \mathtt {P}_{i,l}(x) \), corresponding to the FPB of [a, b] , are transformed to polynomials \( \tilde{\mathtt {P}}_{i,l}(u) \) for the respective FPB of \( [\phi (a), \phi (b)] \) via
For a function g (either in \( L_2(\tilde{A}_i) \) or defined over \( \tilde{\varDelta }_i \)), the scalar, defined similarly as in (2) (or (16)) w.r.t. the (k, N) -FPB of \( [\phi (a),\phi (b)] \), will be denoted by \( \tilde{c}_{i,l}[g] \).
Finally, we denote the \( F^m \)-transform of a function g w.r.t. the (k, N) -FPB of \( [\phi (a), \phi (b)] \) by \( F_{m,\phi } [g] \). Then we claim that
Lemma 12
Let \( g = f \circ \phi ^{-1} \). Then the following equality holds on \( [\phi (\hat{a}), \phi (\hat{b})] \):
Proof in Sect. 9.6.3.
As a consequence, we have the following result.
Corollary 3
Let \( \alpha >0 \) and \(\beta \) be scalars and denote \( \phi (t) = \frac{t - \beta }{\alpha } \). Suppose there is another scalar \( \gamma _\alpha \) (which may depend on \( \alpha \)) and a function g (defined on \( \phi (D_f) \)) such that f and g satisfy
Then
where \( F_{m,\phi } [g] \) stands for the \( F^m \)-transform of g w.r.t. the (k, N) -FPB of the interval \( \left[ \phi (a) , \phi (b) \right] \).
Proof
Apply the preceding lemma to obtain
where \(\tilde{f} := f \circ \phi ^{-1} \) and \( u := \phi (x) \). Notice that
i.e., \( \tilde{f} = \gamma _\alpha \cdot g \). Now from the linearity of the \( F^m \)-transforms we conclude \( F_{m,\phi } [\tilde{f}] = \gamma _\alpha F_{m,\phi } [g] \). \(\square \)
We are interested to apply this corollary to the case when [a, b] consists of only \( 4k-1 \) basic intervals and the affine transformation maps it to the symmetric interval \( [-2k + 0.5, 2k-0.5] \). For these purposes, we denote
-
by \( \mathcal {I}_k \) the interval \( [-2k + 0.5, 2k-0.5] \);
-
by \( F_{m,*}\) the \( F^m \)-transform w.r.t. the \( (k,2k-1) \)-FPB of the interval \( \mathcal {I}_k \).
Notice that the FPB of \(\mathcal {I}_k\) has unit step (i.e., independent of the initial parameter h, characterizing the FPB of [a, b] ).
7 Approximation of polynomials using \(F^m\)-transform based on B-splines
Throughout the next two sections, we fix an interval \( [a,b] \subset \mathbb {R} \) and its (k, N) -FPB (for some \( k,N \in \mathbb {N} \)), denoted by \( \mathbb {A}\). For the discrete transform we suppose that also a uniform discrete grid \( \varDelta \) is fixed as described in Sect. 3.5 (with the corresponding parameter M).
Theorem 1
Let r, m be nonnegative integers satisfying \( r \le \min \left\{ 2m+1, 2k-1 \right\} \). Suppose that \( f \in \mathbb {P}_{r} \); then
Here \( [\hat{a},\hat{b}] \) is defined by (10) and \( {F}_m[f] \) stands for either the integral or the discrete \( F^m \)-transform of f w.r.t. \( \mathbb {A}\). In the discrete case also \( m \le 2kM-2\) is required so that \( \varDelta \) is sufficiently dense.
Proof
Fix an arbitrary \( f \in \mathbb {P}_{r} \). Assume that \( r > m \), for otherwise the claim trivially follows from the \( F^m \)-transform properties.
Since polynomials \( {\mathtt {P}_{i,0}, \ldots , \mathtt {P}_{i,r}} \) for any \( i \in [0 \, ..\, N]\) form a basis of \( \mathbb {P}_r \), the polynomial f can be expressed as
for some real coefficients \( c_{i,l}\), \( l \in [0 \, ..\, r] \), \( i \in [0 \, ..\, N] \).
Now, by Lemma 1 (or Corollary 2, respectively), \( c_{i,l} \) are defined by (2) (or (16), respectively), and the ith component of \( {F}_m^\rightarrow [f]\) equals
Thus, the \( F^m \)-transform of f is given by
To prove the theorem, it is sufficient to show that for all \({ t \in [\hat{a},\hat{b}]} \) and \( {l \in [m+1 \, ..\, r] }\) the equality
holds.
From (12) and (15) the following equality is obtained:
for all \( i \in [0 \, ..\, N] \) and \( t \in \mathbb {R} \).
Fix any \( l \in [m+1 \, ..\, r] \). By applying Lemma 3 with \( p^l = \mathtt {P}_{l} \), we obtain that there is a polynomial \( q_l \in \mathbb {P}_{r-l} \) s.t. \({ c_{i,l} = q_l(i) }\) for all \( i \in [0 \, ..\, N] \).
Notice that \( \deg q_l \le r-l \le 2k-1-l\) satisfies also
Next fix any \( t \in [\hat{a}, \hat{b}] \). There exists such an index \( j \in [k-1 \, ..\, N-k] \) that \( t \in [t_{j}, t_{j+1}] \) (remark: it is possible that there are several such indices j, i.e., \( t=t_i \) for some \( i \in [k \, ..\, N-k-1] \); then any of these indices can be chosen, say, \( j=i \)).
Then \( A_i(t) = 0 \) unless \( i \in [j-k+1 \, ..\, j+k] \), thus
The right side of this equality, by the definition of \( A_i \) and either (12) or (15), can be rewritten as
Denote \( \tau = (t - t_j)/h \in [0,1] \), then \( \left( t-t_{i+j} \right) /{h}= \tau -i\) for all \( i \in [1-k \, ..\, k] \) and we have obtained that
Let \( p(x) = q_l(j+x) \), then its degree is \( \deg p =\deg q_l\), thus \( \deg p\le \min \left\{ l-1, 2k-1-l \right\} \) and, by Lemma 7,
proving (18). Now it follows that for all \( t \in [\hat{a},\hat{b}] \) the \( F^m \)-transform of f can indeed be expressed as
Here the last equality follows from the Ruspini condition.
\(\square \)
8 Approximation properties of the \(F^m\)-transform based on FPB
8.1 Estimation of the approximation error
The next theorem gives an estimation of the quality of approximation of a function f from \(L^{r+1}_q([a,b ])\) (i.e., a function s.t. \(f^{(r+1)} \in L^q([a,b ])\), \(q \ge 1\)) by its \(F^m \)-transform (\(r \le \min \left\{ 2m+1,2k-1 \right\} \)) w.r.t. the (k, N) -FPB of [a, b] .
We shall denote (\( n\in [0 \, ..\, r] \))
Then the theorem basically says that
for functions \( f \in L^{r+1}_q([a,b] ) \), i.e., the function f is approximated by its \( F^m \)-transform with an approximation order \( O(h^{r+1/q'}) \), and its derivatives are also approximated by the derivatives of the \( F^m \)-transform with a corresponding loss of the approximation order. However, the theorem also provides a tight estimate of the constant hidden in the big-O notation.
Theorem 2
Suppose that nonnegative integers r, m satisfy \(r \le \min \left\{ 2m+1,2k-1 \right\} \). Let f be a function from \(L^{r+1}_q([a,b] )\) for some \(q \ge 1,\) and \({F}_m[f]\) be its integral or discrete \(F^m\)-transform w.r.t. the (k, N) -FPB of [a, b] (in the discrete case we also require \( m \le 2kM-2\) so that \( \varDelta \) is sufficiently dense).
Then for all x from the interval \( [\hat{a}, \hat{b}] \) (defined by (10)) and \(n \in [0 \, ..\, r]\) the estimation
holds, where
and \(F_{m,*}\) is the corresponding (integral or discrete) \(F^m\)-transform w.r.t. the \( (k,2k-1) \)-FPB of the fixed interval \({\mathcal {I}_k = [-2k+0.5, 2k-0.5]}\).
Remark 1
The bound (19) is essentially tight: for all \(C > 0\) we have
where the set \(CW^{r+1}_q([a,b] )\) consists of all functions \(f \in L^{r+1}_q([a,b] )\) such that \( \left\| f^{(r+1) } \right\| _{L^q([a,b] )} \le C. \)
Remark 2
Notice that also in the case of the discrete \(F^m\)-transform the result (19) is obtained for function f defined on the whole interval [a, b], not only on the discrete grid \( \varDelta \).
Proof
We apply for function f Taylor formula with the integral form of the remainder
where \(p_r\) is the Taylor polynomial for f, \( \deg p_r \le r \), a is the point of expansion. We rewrite it
using for the remainder \( I^r_{f}\) the following form
Taking into account that from Theorem 1 it follows that \( {F}_m[p_r]\equiv p_r \) on \([\hat{a}, \hat{b}]\), we obtain that for all \(x\in [\hat{a}, \hat{b}]\)
From (20) and Lemma 10 we obtain
therefore for \( x \in [\hat{a}, \hat{b}] \) it holds
Differentiating this equality n times gives
By Hölder’s inequality,
where \(V_{n,m,r,x}(t) := D_{n,m}[K_{r}(\cdot ,t)](x) \). Moreover, for all \(C > 0\) we have
Suppose that \( x \in \left[ t_{j_0} , t_{j_0+ 1 } \right] \subset [\hat{a}, \hat{b}]\) for some index \( j_0 \in [k-1 \, ..\, N-k] \). From Lemma 11 it follows that for all \( t \notin \left[ t_{j_0-2k+1} , t_{j_0+ 2k } \right] \) we have
i.e., \( V_{n,m,r,x}(t)=0 \), since
Consequently,
Now let \( t \in \left[ t_{j_0-2k+1} , t_{j_0+ 2k } \right] \). Define an affine map \( \phi : \left[ t_{j_0-2k+1} , t_{j_0+ 2k } \right] \rightarrow \mathcal {I}_k \) by
and denote \( u = \phi (x) \), \( \tau = \phi (t) \).
Then \( K_{r}(x ,t ) = h^{r} K_{r}(u ,\tau )\). From Corollary 3,
Therefore
Now, if \( q'= \infty \), we immediately have
Consider the case \( q'< \infty \). Then
We conclude that in both cases
The theorem is proven. \(\square \)
8.2 Numerical examples
For illustrative purposes we consider approximation of functions \( f_1(x)= \sin ^2(\pi x)\) and \( f_2(x) = \sin (\exp (4x))\) over the interval [0, 1] with the discrete \( F^m \)-transform, for \( m \in \left\{ 0,1 \right\} \) and \( N\in \left\{ 12,102 \right\} \) (in the case of \( f_2 \) we also consider \( N\in \left\{ 1002,10002 \right\} \)). In these examples the fuzzy partitions will be given by cubic B-splines (i.e., we consider (2, N) -FPB of [a, b] ).
For the sake of comparability, the interval [a, b] is chosen so that [0, 1] is the interval where the Ruspini condition is fulfilled, i.e.,
-
when \( N=12\), set \( a= -0.3\), \( b=1.3 \), then
$$\begin{aligned} h=\frac{b-a}{N+2k} =\frac{1.4}{14}=0.1 \end{aligned}$$and the Ruspini condition holds on \( [\hat{a}, \hat{b}] = [0,1] \);
-
when \( N=102 \), set \( a= -0.03\), \( b=1.03 \), then
$$\begin{aligned} h=\frac{b-a}{N+2k} =\frac{1.06}{106}=0.01 \end{aligned}$$and the Ruspini condition holds on \( [\hat{a}, \hat{b}] = [0,1] \).
In either case the parameter M, characterizing the size of the discrete grid (defined in Sect. 3.5), is \( M=3 \).
For each of the described FPB we consider the error function \( \mathcal {E}_{m,N}[f] \), with \( m\in \left\{ 0,1 \right\} \), \( N\in \left\{ 12,102 \right\} \), defined by
where \( {F}_m[f]\) is the \( F^m \)-transform of f w.r.t. the corresponding FPB and \( f \in \left\{ f_1,f_2 \right\} \).
In Figs. 4 and 5 we plot \( \mathcal {E}_{0,12}[f_1] \) and \( \mathcal {E}_{0,102}[f_1] \), respectively. In this case Theorem 2 guarantees approximation of order \( O(h^{2m+2}) = O(h^2) \), which is consistent with the scaling of the error function when h decreases from 0.1 to 0.01.
In Figs. 6 and 7 we plot \( \mathcal {E}_{1,12}[f_1] \) and \( \mathcal {E}_{1,102}[f_1] \), respectively, i.e., now we use the first-degree F-transform instead of the ordinary \( F^0 \)-transform. In this case the order of approximation is \( O(h^4) \), which is again consistent with the scaling of the error function.
It can be noticed that the error in Figs. 6 and 7 oscillates; despite this phenomenon, it can be verified that the derivative of \( {F}_m[f_1] \) is a smooth function approximating the derivative of \( f_1 \). Likewise, also the second and third derivative of \( {F}_m[f_1] \) approximate the respective derivatives of \( f_1 \), as predicted by Theorem 2.
Now let us consider approximation of the function \( f_2(x) = \sin (\exp (4x))\).
In Figs. 8 and 10 we plot \( \mathcal {E}_{0,12}[f_2] \) and \( \mathcal {E}_{1,12}[f_2] \), respectively. As it can be observed, while the function is well approximated near \( x=0 \), both \( F^0 \)-transform and \( F^1 \)-transform fail to provide meaningful approximation of \( f_2 \) near \( x=1 \) (and, in fact, increasing \( m=0 \) to \( m=1 \) yields no advantage when \( h=0.1 \)).
Consider Figs. 9 and 11, where we plot \( \mathcal {E}_{0,102}[f_2] \) and \( \mathcal {E}_{1,102}[f_2] \), i.e., now \( h=0.01 \). It becomes apparent that approximation by the \( F^1 \)-transform is superior to that by the \( F^0 \)-transform. However, the approximation error is still relatively large and the approximation order \( O(h^4) \) in the case of \( F^1 \)-transform is not evident. The reason of that is that the constant hidden in the big-O notation for \( O(h^4) \) is large in this example and one has to consider larger values of N to observe the asymptotic behavior of the error.
To illustrate the asymptotic behavior of the error \( \mathcal {E}_{1,N}[f_2] \), we also plot \( \mathcal {E}_{1,1002}[f_2] \) and \( \mathcal {E}_{1,10002}[f_2] \) (Figs. 12, 13, respectively). In these cases the step size is \( h=10^{-3} \) and \( h=10^{-4}\), respectively. As before, the endpoints of the interval [a, b] are chosen so that the Ruspini condition holds on the unit interval [0, 1] .
Now it can be observed that the error rate indeed decreases proportionally to \( h^4 \), as decreasing h from \( 10^{-2} \) to \( 10^{-3}\) decreases the magnitude of the error from \( {\approx } 0.5 \times 10^0 \) to \( {\approx } 10^{-4} \). Likewise, decreasing h from \( 10^{-3} \) to \( 10^{-4} \) decreases the magnitude of the error further to \( {\approx } 10^{-8} \).
9 Proofs of the technical claims
9.1 General results
9.1.1 Central difference operator
By \( \nabla \) we will denote the central difference operator, satisfying \( \nabla f(t) = f(t+0.5) - f(t-0.5) \) for any function f defined on \( [t-0.5,t+0.5]\subset \mathbb {R} \); notation \( \nabla ^j \) stands for the jth iteration of the operator \( \nabla \), i.e., \( \nabla ^j f(t) = \nabla (\nabla ( \ldots \nabla f(t)))\).
This operator is linked to the derivatives of the central B-spline and a related construction, polynomials \( R_n \), defined in Sect. 9.2 (see Lemmas 15 and 16). This link will be used in the proof of Lemma 18, which is an important auxiliary result for the proof of Lemma 4.
It is easy to see that the central difference operator has the following property:
Lemma 13
Suppose that \( f,g : \mathbb {R} \rightarrow \mathbb {R} \) are such functions that for all \( t \in \mathbb {R} \) the integral \( \int _{\mathbb {R}}f(x)g(x+t)\,\mathrm {d}x\) exists and has a finite value.
Then for all integers \( n \ge 0\) we have
Proof
When \( n=0 \), the claim trivially holds. Suppose it holds for an integer n, then by the inductive hypothesis we have
Rewriting \( \nabla g(x)= g(x+0.5)- g(x-0.5) \) and changing the integration variable yields
and the claim follows. \(\square \)
9.1.2 A basis of the space \( \mathbb {P}_n \)
Another general claim shows a way to construct a basis of the space \( \mathbb {P}_n \). We will use this fact in the proof of Lemma 19, which is a prerequisite for Lemma 4.
Lemma 14
Suppose that \( n \in \mathbb {N} \) and a polynomial p has degree n. Let \( x_0 \), ..., \( x_n \in \mathbb {R} \) be distinct numbers. Then the polynomials \( q_j \), defined by \( q_j(x) = p(x+x_j) \), \( j \in [0 \, ..\, n]\), form a basis of the space \( \mathbb {P}_n \).
Proof
First we note that the polynomials \( p_j \), defined by \( p_j = {p^{(j)}}/{j!} \), \(j \in [0 \, ..\, n] \), form a basis of \( \mathbb {P}_n \), since \( \deg p_j =n-j \). Let us express each polynomial \( q_j \) as the unique linear combination in this basis, obtaining the basis transformation matrix. Then it will be necessary and sufficient to show that this matrix is non-singular.
Apply Taylor formula for \( q_j(x) = p(x+x_j) \) with x being the point of expansion, then we see that
i.e.,
We conclude that the coefficient matrix is
which is the Vandermonde matrix and is non-singular, since \( x_0 \), ..., \( x_n \in \mathbb {R} \) are distinct. \(\square \)
9.1.3 Proof of Lemma 3
Let \( P^l = p^l_0 \), \( l \in [0 \, ..\, r] \), then \( \deg P^l = l \) and
Therefore we have
or, equivalently,
Replacing x by \( x+i\) in this equality yields
Let us show that for each \( l \in [0 \, ..\, r] \) and \( s \in [0 \, ..\, l] \) there exists a polynomial \( \alpha _{l,s} \in \mathbb {P}_{l-s}\) s.t.
Then it will follow that (21) can be rewritten as
or, equivalently,
or
Since this holds for all \( x \in \mathbb {R} \) and the polynomials \( P^0, \ldots , P^r \) form a basis of \( \mathbb {P}_{r} \), it follows that
Let \( q_{s} \) be defined with
then \( a_{i,s} = q_{s}(i) \). Moreover, since \( \deg \alpha _{l,s} \le l-s \), we see that \( q_{s} \in \mathbb {P}_{r-s} \).
It remains to prove the existence of \( \alpha _{l,s} \) such that (22) holds. Define constants \( \gamma _{l,j,s} \) so that the jth derivative of \(P^{l} \), \( j\in [0 \, ..\, l] \), is expressed as
(this is possible since \( P^0, \ldots , P^r \) form a basis of \( \mathbb {P}_{r} \)). For any \( x,y \in \mathbb {R} \) apply Taylor formula for \( P^l(x+y) \) with x being the point of expansion, obtaining
By interchanging the order of summation we obtain
Clearly, the polynomial \( \alpha _{l,s}(y) = \sum _{j=0}^{l-s}\gamma _{l,j,s} \, y^j \) satisfies the necessary properties. \(\square \)
9.2 Claims related to the B-splines
Let us note that the derivatives of the central B-splines are closely related to the difference operator:
Lemma 15
where \( \phi _n \) stands for the central B-spline of degree n (defined in Sect. 2.6) and \( \nabla \) is the central difference operator (see Sect. 9.1).
Proof
The case \( j=0 \) is trivial; see Schoenberg (1973, Eq. 1.10) for \( j=1 \). Suppose that we have
for some \( j \in [1 \, ..\, n-1] \). Then
and the claim follows. \(\square \)
Introduce polynomials \( R_n \) (similarly defined polynomials are called dual polynomials in Lyche and Mørken (2008)), where \( R_0 \equiv 1 \) and for \( n \ge 1 \)
Notice that the polynomials \( R_n \) satisfy the following recurrence formulas:
One can show a property, similar to Lemma 15:
Lemma 16
Proof
We start with
Suppose that we have
for some \( j \in [2 \, ..\, n] \). Then
which completes the inductive step. \(\square \)
Finally, we have that polynomials \( R_n \) are orthogonal (w.r.t. the standard \( L_2 \) inner product) to the central B-spline \( \phi _n \), i.e.,
Lemma 17
Proof
From (6) we see that
where
By changing variables, we have
Now it is obvious that \( I_1 + I_2 = 0 \). \(\square \)
9.3 Claims related to the FPB
9.3.1 Auxiliary results for the proof of Lemma 4
To derive Lemma 4, we need a few auxiliary results; the next lemma shows that the convolution of the central B-spline \( \phi _{2k-1} \) and the corresponding “dual” polynomial \( R_{2k-1} \) is the polynomial \( y^{2k-1} \):
Lemma 18
Denote \( A = \phi _{2k-1} \) and \( \rho = R_{2k-1} \), i.e.,
Then for all \( y \in \mathbb {R} \) we have
Proof
It suffices to show that for all integers \( j \ge 0 \) it holds that
where \( \delta _{ji} \) stands for the Kronecker delta, or, equivalently,
The equality is obviously true for \( j \ge 2k-1 \). Fix any \( j \in [0 \, ..\, 2k-2] \), then it remains to show that
It can be shown (by partially integrating and noting that \({{\mathrm{supp}}}A =(-k,k) \)) that the left side of (25) equals \( (-1)^j \, \int \limits _{\mathbb {R}} \rho (x) A^{(j)} (x) \,\mathrm {d}x\), thus
where the last equality follows from Lemma 17. Therefore (25) holds, which completes the proof. \(\square \)
Next we show a result which relates the sum in Lemma 4 with an integral of \( f \cdot A \); as before, A stands for the central B-spline of degree \( 2k-1 \) and \( \rho \) is the corresponding ’dual’ polynomial \( R_{2k-1} \).
Lemma 19
For all \( \tau \in [0,1] \) and \( f \in \mathbb {P}_{2k-1} \) it holds that
Proof
Fix any \( \tau \in [0,1] \). Setting \( n=2k-1 \), \( x=\tau \) and
in the Marsden’s identity (4) yields
Now let \( z=y+ \tau \) for any \( y \in \mathbb {R} \), obtaining
On the other hand, it follows from Lemma 18 that for all \( y \in \mathbb {R}\) we have
The two equalities (and the fact that A is an even and \( \rho \) is an odd function) yield
for all \(y \in \mathbb {R} \). In particular, setting \( y=-i \), \( i \in [-k \, ..\, k] \), gives us
Suppose that \(\displaystyle f \in \mathbb {P}_{2k-1} \). Since \( \deg \rho = 2k-1 \), by Lemma 14 there exist real numbers \( f_i \), \( i \in [-k \, ..\, k] \), s.t.
Now we obtain
Thus
for all \( f \in \mathbb {P}_{2k-1} \) and \( \tau \in [0,1] \).\(\square \)
Now Lemma 4 easily follows from Lemma 19.
9.3.2 Proof of Lemma 4
Fix a number \( \tau \in [0,1] \) and a polynomial p such that its degree satisfies \( \deg p \le \min \left\{ l-1, 2k-1-l \right\} \). Set \( f(x) = p(x + \tau ) P(x) \), then \( f \in \mathbb {P}_{2k-1} \) and from Lemma 19 we obtain
Now the left side equals 0 (due to the first requirement on P in the statement of Lemma) and we conclude
Taking into account that P is either an odd or an even function and A is even completes the proof. \(\square \)
9.4 Claims related to the template spaces
Lemma 6 simply states that the integral \( \int _{-k} ^k p(t) A(t) \,\mathrm {d}t\) can be well approximated with the corresponding sum over the points in the template \( Z\) (defined in Sect. 4.3.1), in the sense that this approximation is precise for all polynomials \( p \in \mathbb {P}_{2k-1} \). To show that, we invoke once more Lemma 19.
9.4.1 Proof of Lemma 6
Notice that \( A(k) = 0\), hence we can rewrite the desired equality as
or, equivalently,
where
Since \( p \in \mathbb {P}_{2k-1} \) and for all \( s\in [0 \, ..\, M-1] \) we have \( s/M \in [0,1] \), from Lemma 19 it follows that
thus also
Hence (29) holds, concluding the proof. \(\square \)
9.4.2 Proof of Lemma 5
We have
Since \( z \in \varDelta _i \) if and only if \( \tilde{z} = \frac{z-t_i}{h} \) is in the set \( Z\), rewrite
for all \( z \in \varDelta _i \), \( \tilde{z} = \frac{z-t_i}{h} \in Z\). Then
and the claim follows. \(\square \)
9.5 Claims related to the FPB-based discrete transform
9.5.1 Auxiliary results for the proof of Lemmas 8 and 9
We start with an auxiliary lemma which allows to describe matrices \( \mathbf {X}_i^T \mathbf {A}_i \mathbf {X}_i \) and \( \mathbf {X}_i^T \mathbf {A}_i \mathbf {Y} \) in terms of arbitrary polynomials, spanning \( \mathbb {P}_m \).
Lemma 20
Let \( m \ge 0\) and matrices \( \mathbf {X}_i \), \( \mathbf {A}_i \), \( \mathbf {Y} \) be defined as in Definition 5. Suppose that polynomials \( p^0 \), \( p^1 \), ..., \( p^m \) span the space \( \mathbb {P}_m \).
Let \( \mathbf {P} \) be the invertible basis transformation matrix satisfying
Define also matrices
Then
and
Proof
Let \( q^l(t) = t^l\), \( l \in [0 \, ..\, m] \), and \( \hat{\mathbf {X}} \) be the matrix with entries \( \hat{X}_{r,l} = (\tilde{z}_{r})^l \) (i.e., \( \hat{\mathbf {X}} \) coincides with the matrix \( \mathbf {X} \) when \( p^l \equiv q^l\) for all l).
Notice that \( \mathbf {A}_i \) has zero rows/columns when the corresponding entry \( A_i(z_{j}) \) is zero, which happens whenever \( j \notin \varDelta _i \). Remove all rows and columns of \( \mathbf {A}_i \) which correspond to \( A_i(z_{j}) \) with \( j \notin \varDelta _i \) and all corresponding rows in \( \mathbf {X}_i \) and \( \mathbf {Y} \). Denote the obtained matrices by \(\tilde{{ \mathbf {A}}}_i \), \( \tilde{{\mathbf {X}}}_i \) and \( \mathbf {Y}_i \) (notice that we obtain the same \( \mathbf {Y}_i \) which was defined in the statement of the Lemma). Then
and
The matrix \( \tilde{{\mathbf {A}}}_i \) is the diagonal matrix with entries \( A_i(z) \), \( z \in \varDelta _i \) on its diagonal. However, these values coincide with \( A(\tilde{z}) \), where \(\tilde{z} = \frac{z-t_i}{h} \in Z\). Thus \( \tilde{{\mathbf {A}}}_i = \mathbf {A}\). Moreover, for all \( z \in \varDelta _i \) we have
Hence lth column of \( \tilde{{\mathbf {X}}}_i \) is obtained as the corresponding column of \( \hat{\mathbf {X}} \), multiplied with \( h^l \), i.e.,
Therefore
and
Finally, the basis transformation matrix \( \mathbf {P} \) satisfies
Combining this with (30) and (31) yields the desired equalities. \(\square \)
9.5.2 Proof of Lemma 8
The discrete grid \( \varDelta \) is sufficiently dense iff \( \mathbf {X}_i^T \mathbf {A}_i \mathbf {X}_i \) is non-singular for each i. We apply Lemma 20 with \( p^l = \mathtt {P}_{l} \), then \( \mathbf {X}_i^T \mathbf {A}_i \mathbf {X}_i \) is non-singular iff \( \mathbf {X}^T \mathbf {A} \mathbf {X} \) is non-singular.
Notice that the (i, j) th entry of \( \mathbf {X}^T \mathbf {A} \mathbf {X} \) equals
The RHS is nonzero iff \( i=j \), i.e., \( \mathbf {X}^T \mathbf {A} \mathbf {X} \) is the diagonal matrix with entries \( \left\| \mathtt {P}_{l} \right\| ^2 \), \( l \in [0 \, ..\, m] \), on its diagonal. Since \( \left\| \mathtt {P}_{l} \right\| =0 \) for all \( l \in [0 \, ..\, 2kM-2] \) and \( \left\| \mathtt {P}_{l} \right\| =0 \) for \( l \ge 2kM-1 \), the matrix \( \mathbf {X}^T \mathbf {A} \mathbf {X} \) is non-singular iff \( m \le 2kM-1 \).
\(\square \)
9.5.3 Proof of Lemma 9
Recall that
where
Apply Lemma 20 to get
and
Notice that
On the other hand, since \( p^l(\tilde{t}) = p_i^l(t) \), we have
It follows that
which concludes the proof. \(\square \)
9.6 Additional properties of \( F^m \)-transforms
9.6.1 Proof of Lemma 10
We prove that for all \( l \in [0 \, ..\, m] \) and \( i \in [0 \, ..\, N] \) it holds that
where \( c_{i,l}[f] \) stands for the coefficient at \( \mathtt {P}_{i,l} \) of the ith component of either the direct integral \({F}_m^{\mathrm {int}}[f]\) (in this case \( c_{i,l}[f]= c_{i,l}^{\mathrm {int}}[f] \)) or the direct discrete \({F}_m^{\mathrm {dsc}}[f]\) (in this case \(c_{i,l}[f]= c_{i,l}^{\mathrm {dsc}}[f] \)) \( F^m \)-transform of f.
The integral case
The discrete case
Then for all \(x \in [\hat{a}, \hat{b}] \)
and the claim follows. \(\square \)
9.6.2 Proof of Lemma 11
We start with an obvious observation: if \( f \equiv g \) on \( {{\mathrm{supp}}}A_i \) for some basic function \( A_i \), then for all l it holds that \( c_{i,l}[f] = c_{i,l}[g] \).
Since \( {{\mathrm{supp}}}A_i = \left( t_{i-k}, t_{i+k} \right) \) for all \( i\in [0 \, ..\, N] \), it means that in our case
-
if \( i \le i_0 - k \), then \( c_{i,l}[f] =c_{i,l}[g_1] \);
-
if \( i \ge i_0 + k +1\), then \( c_{i,l}[f] = c_{i,l}[g_2] \).
On the other hand, we have
since \( A_i(x) = 0 \) when \( i \le j_0-k \) or \( i \ge j_0 + k+1 \).
It remains to consider the following cases:
-
1.
When \( j_0 \le i_0 - 2k \), we have \( i \le i_0-k \) for all indices \( i \in [ j_0-k+1 \, ..\, j_0 + k ] \), thus in (32) all coefficients \( c_{i,l}[f] \) satisfy \( c_{i,l}[f]= c_{i,l}[g_1] \). Hence it holds that \( {F}_m[f](x) ={F}_m[g_1](x) \).
-
2.
When \( j_0 \ge i_0+2k \), we have \( i \ge i_0+k+1 \) for all indices \( i \in [ j_0-k+1 \, ..\, j_0 + k ] \), thus in (32) all coefficients \( c_{i,l}[f] \) satisfy \( c_{i,l}[f]= c_{i,l}[g_2] \). Hence \( {F}_m[f](x) ={F}_m[g_2](x) \). \(\square \)
9.6.3 Proof of Lemma 12
First let us show \( \tilde{c}_{i,l}[g] = c_{i,l}[f] \), for all \( i \in [0 \, ..\, N]\) and \( l \in [0 \, ..\, m] \).
The integral case
since \( \mathrm {d}(\phi (x)) = \alpha \,\mathrm {d}x\).
The discrete case
Now we conclude that for all \( u \in [\phi (\hat{a}), \phi (\hat{b})] \)
where \( x:= \phi ^{-1}(u) \in [\hat{a}, \hat{b}] \), which completes the proof.
\(\square \)
10 Conclusion
The error estimation proved in this paper shows that using B-splines as basic functions of the uniform fuzzy partition for the \(F^{m}\)-transform allows us to improve results on approximation by the higher-degree continuous and discrete F-transforms. According to our estimation the best result will be achieved by taking B-splines of degree \(2m+1\) (or greater) and considering the approximation error for functions from the space \(L^{2m+2}_q\). In this case we proved the error bound is characterized with \( O(h^{2m+1+ 1/q'})\), where h is the parameter of the uniform partition and \(1/q+1/q'=1\). For functions from the space \(L^{2m+2}_{\infty }\) (the case \(q=\infty \)) we obtain \( O(h^{2m+2})\). For a comparison let us note that the best result for the approximation error bound, which is known for the \(F^{m}\)-transform w.r.t. an arbitrary uniform fuzzy partition with parameter h is \( O(h^{m+1})\).
Due to a large part of applications of F-transforms in the discrete two-dimensional case, we need to extend the obtained results by considering approximation properties of the composite \( F^m \)-transform with respect to fuzzy partition given by bivariate central B-splines of odd degree for each variable. In our next paper we will describe and investigate such construction.
Another interesting direction for future research includes applications of B-spline based fuzzy partitions for solving differential equations. Taking into account that in the last years the technique of F-transform has been recognized as a tool to approximate solutions of ordinary and fuzzy differential equations (e.g., Chen and Shen 2014; Khastan et al. 2016; Radi and Stefanini 2015), we hope that the obtained benefits of B-spline based F-transforms in approximation of smooth functions and their derivatives allows us to improve the effectiveness of numerical procedures for differential equations.
References
Bede B, Rudas IJ (2011) Approximation properties of fuzzy transforms. Fuzzy Sets Syst 180(1):20–40
Chen W, Shen Y (2014) Approximate solution for a class of second-order ordinary differential equations by the fuzzy transform. J Intell Fuzzy Syst 27(1):73–82
de Boor C (2001) A practical guide to splines—revised edition, Applied mathematical sciences, vol 27. Springer, New York
Gautschi W (2004) Orthogonal polynomials. Computation and approximation. Oxford University Press, Oxford
Holčapek M, Tichý T (2014) Discrete fuzzy transform of higher degree. In: IEEE international conference on fuzzy systems, FUZZ-IEEE 2014, Beijing, China, 6–11 July 2014. IEEE, pp 604–611
Holčapek M, Perfilieva I, Novák V, Kreinovich V (2015) Necessary and sufficient conditions for generalized uniform fuzzy partitions. Fuzzy Sets Syst 277:97–121
Khastan A, Perfilieva I, Alijani Z (2016) A new fuzzy approximation method to Cauchy problems by fuzzy transform. Fuzzy Sets Syst 288:75–95
Kodorane I, Asmuss S (2013) On approximation properties of spline based F-transform with respect to fuzzy m-partition. In: J Montero, G Pasi, D Ciucci (eds) Proceedings of EUSFLAT 2013, Milan, Italy, Advances in intelligent systems research, vol 32. Atlantis Press, pp 772–779
Kokainis M, Asmuss S (2015) Approximation properties of higher degree F-transforms based on B-splines. In: IEEE international conference on fuzzy systems, FUZZ-IEEE 2015, Istanbul, Turkey, 2–5 August 2015. IEEE
Lyche T, Mørken K (2008) Spline methods: draft. www.uio.no/studier/emner/matnat/ifi/INF-MAT5340/v10/undervisningsmateriale/book.pdf. Date retrieved: 30-07-2016
Marsden MJ (1970) An identity for spline functions with applications to variation-diminishing spline approximation. J Approx Theory 3(1):7–49
Perfilieva I (2006) Fuzzy transforms: theory and applications. Fuzzy Sets Syst 157(8):993–1023
Perfilieva I (2015) F-transform. In: Kacprzyk J, Pedrycz W (eds) Springer handbook of computational intelligence. Springer, Berlin, Heidelberg, pp 113–130
Perfilieva I, Haldeeva E (2001) Fuzzy transformation and its applications. In: Proceedings of the 4th Czech–Japan seminar on data analysis and decision making under uncertainty, Czech Republic, pp 116–124
Perfilieva I, Kreinovich V (2013) F-transform in view of aggregation functions. In: Bustince H, Fernandez J, Mesiar R, Calvo T (eds) Aggregation functions in theory and in practise: Proceedings of the 7th international summer school on aggregation operators at the Public University of Navarra, Pamplona, Spain, 16–20 July 2013, Advances in intelligent systems and computing, vol 228. Springer, Berlin, Heidelberg, pp 381–389
Perfilieva I, Dan̆ková M, Bede B (2011) Towards a higher degree F-transform. Fuzzy Sets Syst 180(1):3–19
Radi D, Stefanini L (2015) Fuzzy differential equations by fuzzy-transform. In: Annual conference of the North American fuzzy information processing society NAFIPS, Redmond, USA, 17–19 August 2015. IEEE
Schoenberg IJ (1964) On interpolation by spline functions and its minimal properties. In: Butzer PL, Korevaar J (eds) On approximation theory/Über approximations theorie: proceedings of the conference held in the mathematical research institute at Oberwolfach, Black Forest, 4–10 August 1963, ISNM, vol 5. Springer, Basel, pp 109–129
Schoenberg IJ (1973) Cardinal spline interpolation, CBMS-NSF regional conference series in applied mathematics, vol 12. SIAM
Stefanini L (2009) Fuzzy transform and smooth functions. In: JP Carvalho, D Dubois, U Kaymak, JMC Sousa (eds) Proceedings of the joint 2009 international fuzzy systems association world congress and 2009 European society of fuzzy logic and technology conference, Lisbon, Portugal, 20–24 July 2009, vol 9, pp 579–584
Stefanini L (2011) F-transform with parametric generalized fuzzy partitions. Fuzzy Sets Syst 180(1):98–120
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Ethical approval
This article does not contain any studies with human participants or animals performed by any of the authors.
Additional information
Communicated by F. Di Martino, V. Novák.
Rights and permissions
About this article
Cite this article
Kokainis, M., Asmuss, S. Continuous and discrete higher-degree F-transforms based on B-splines. Soft Comput 21, 3615–3639 (2017). https://doi.org/10.1007/s00500-017-2655-y
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00500-017-2655-y