Keywords

1 Introduction

The least squares methods are used, in general, to obtain a continuous function that best fit pairs of data in a dataset [1]. The fuzzy least squares method arises when the dataset is composed by fuzzy numbers. Tanaka et al. proposed a fuzzy least squares method based on fuzzy regression models [2]. This method was used to find fuzzy parameters of a fuzzy linear function from a fuzzy dataset. However, this approach converts the problem to a classic linear programming problem which may lead to losing the notion of close distance between the fuzzy data and the obtained solution.

Celmins [3] proceeded with the same methodology of [2] but considered a intrinsic relation among the dataset based on conical membership functions that are (geometrically) similar to joint possibility distributions. In addiction, the concept of interactive fuzzy numbers [4,5,6] was only considered in [7], which improved the approach presented by [3].

In contrast to these previous methods, Diamond [8] proposed a fuzzy least squares method based on distance between functions. He used projection theorems for cones in Banach spaces to find the fuzzy linear function that best fit a dataset.

It is worth noting that all these approaches were developed for data given only by triangular fuzzy numbers and for fitting only fuzzy linear functions. Nevertheless, these methods can be used to model many phenomenons, for example, in economy [9], psychology [10], medicine [11], and logistics [12].

Data correlations arise naturally in longitudinal datasets. A dataset is said to be longitudinal if it contains the same type of information on the same itens at multiple points in time. Therefore, longitudinal data is characterized by the fact that repeated observations are correlated [13]. In this work, we suppose that this correlation is given by the notion of completely correlated fuzzy numbers [5, 14].

The method proposed here is based on the (sup-J) extension of classical numerical algorithm to the fuzzy context and does not take into account any distance between fuzzy numbers. Moreover, our method can be applied not only for triangular fuzzy numbers, but for any type of completely correlated fuzzy numbers, and it can approximate the dataset with higher orders functions.

In Sect. 2 we briefly recall the classical least squares method and some basic definitions and results from fuzzy set theory. In Sect. 3, we develop the extension of the classical least squares method for the case where dataset is composed by completely correlated fuzzy numbers. Finally, in Sect. 4, we apply the proposed method to fit a fuzzy function to the longitudinal dataset given in [15].

2 Mathematical Background

This section presents the least squares method [1] and some basic concepts of fuzzy set theory [16].

2.1 Least Square Method

Let \(f:[c,d]\rightarrow \mathbb {R}\) be a continuous function. Given n functions \(g_{1},\ldots ,g_{n}\), where \(g_{i}:\mathbb {R}\rightarrow \mathbb {R}\) for \(i=1,\ldots ,n\), we need to find n coefficients \(a_{1},\ldots ,a_{n}\in \mathbb {R}\) such that the function \(\varphi :\mathbb {R}\rightarrow \mathbb {R}\) given by

$$\begin{aligned} \varphi (x)=a_{1}g_{1}(x)+\ldots +a_{n}g_{n}(x) \end{aligned}$$

is the best approximation of the function f, i.e., \(\varphi \approx f\).

The function \(\varphi \) is obtained by minimizing the distance between f and \(\varphi \). More precisely, let \(||\cdot ||_{2}\) be the \(\mathcal {L}^{2}\)-norm defined on the class of the continuous functions from [cd] to \(\mathbb {R}\) (denoted by C([cd])) given by \(||h||_{2}=\left( \int _c^d |h(s)|^{2}ds\right) ^{{}^{1}\!/_{2}}, \ \forall h\in C([c,d])\). The coefficients \(a_{1},\ldots ,a_{n}\) of the function \(\varphi \) which produces the best fit to f are obtained by solving the following minimization problem:

$$\begin{aligned} \min _{a_{1},\ldots ,a_{n} \in \mathbb {R}} {}^{1}\!/_{2}||\varphi -f||_{2}^{2}. \end{aligned}$$

In the case some values of f are known, say \(D=\{f(x_{1})=y_{1},\ldots ,f(x_{m})=y_{m}\}\), the function \(\varphi \) must fit the data D, that is, \(\varphi (x_{i})\approx y_{i}\), for all \(i=1,\ldots ,m\). Therefore the following minimization problem must be solved.

$$\begin{aligned} \min _{a_{1},\ldots ,a_{n} \in \mathbb {R}} {}^{1}\!/_{2}||(\varphi (x_1)-y_1,\ldots ,\varphi (x_m)-y_m)||_{2}^{2}. \end{aligned}$$
(2.1)

The real coefficients \(a_{1},\ldots ,a_{n}\) that minimize the problem (2.1), i.e., that produces the best approximation \(\varphi \) of f, are obtained by solving the following matrix equation called normal equation:

$$\begin{aligned} Ma=b, \end{aligned}$$

where

$$\begin{aligned} M = \begin{bmatrix} \displaystyle \sum _{k=1}^{m}g_{1}(x_{k})g_{1}(x_{k})&\dots&\ \displaystyle \sum _{k=1}^{m}g_{1}(x_{k})g_{n}(x_{k}) \\ \vdots&\ddots&\ \vdots \\ \displaystyle \sum _{k=1}^{m}g_{n}(x_{k})g_{1}(x_{k}) \&\dots&\ \displaystyle \sum _{k=1}^{m}g_{n}(x_{k})g_{n}(x_{k}), \end{bmatrix}, \end{aligned}$$
$$\begin{aligned} a = \begin{bmatrix} a_{1} \\ \vdots \\ a_{n} \end{bmatrix}\text { and } b = \begin{bmatrix} \displaystyle \sum _{k=1}^{m}y_{k}g_{1}(x_{k}) \\ \vdots \\ \displaystyle \sum _{k=1}^{m}y_{k}g_{n}(x_{k}) \end{bmatrix}. \end{aligned}$$

If the matrix M is non singular, say \(P = M^{-1} = [p_{ij}]\), then the vector a is obtained by

$$\begin{aligned} a=Pb. \end{aligned}$$
(2.2)

Thus, each parameter \(a_i\) is given by

$$\begin{aligned} a_i= & {} p_{i1}b_1 + p_{i2}b_2 + \ldots + p_{in}b_n \\= & {} p_{i1}\left( \sum _{k=1}^{m}y_{k}g_{1}(x_{k})\right) + \ldots + p_{in}\left( \sum _{k=1}^{m}y_{k}g_{n}(x_{k})\right) \\= & {} \left( \sum _{j=1}^{n} p_{ij}g_{j}(x_{1})\right) y_1 + \ldots + \left( \sum _{j=1}^{n} p_{ij}g_{j}(x_{m})\right) y_m \\= & {} c_{i1}y_1 + \ldots + c_{im}y_m, \end{aligned}$$

where \(c_{ik} = \displaystyle \sum _{j=1}^{n} p_{ij}g_{j}(x_{k})\), for \(i=1,\ldots ,n\) and \(k=1,\ldots ,m.\) In general case, the matrix P stands for the pseudoinverse of M.

Since the parameters of the function \(\varphi \) can be obtained by computing the matrix product (2.2), we rewrite the function \(\varphi \) in terms of \(y_1,\ldots ,y_m\) as follows:

$$\begin{aligned} \nonumber \varphi (x)= & {} a_{1}g_{1}(x)+\ldots +a_{n}g_{n}(x)\\ \nonumber= & {} (c_{11}y_1 + \ldots + c_{1m}y_m)g_{1}(x)+\ldots + (c_{n1}y_1 + \ldots + c_{nm}y_m)g_{n}(x)\\ \nonumber= & {} \left( \sum _{j=1}^n g_j(x)c_{j1}\right) y_1 + \ldots + \left( \sum _{j=1}^n g_j(x)c_{jm}\right) y_m\\= & {} s_1(x)y_1 + \ldots + s_m(x)y_m, \end{aligned}$$
(2.3)

where

$$s_{i} = \left( \sum _{j=1}^{n} g_j(x)c_{ji} \right) $$

for each \(i=1,\ldots ,n.\)

2.2 Fuzzy Set Theory

A fuzzy subset A of an universe X is characterized by a function \(\mu _A:X\rightarrow [0,1]\), called membership function [16], where \(\mu _A(x)\), or simply A(x), represents the membership degree of x in A, for all \(x \in X\). The class of fuzzy sets of X is denoted by the symbol \(\mathcal {F}(X)\). Each classical subset A of X is a particular fuzzy set whose membership function is given by its characteristic function \(\chi _{A}:X\rightarrow \{0,1\}\), i.e., \(\chi _{A}(x)=1\) if and only if \(x\in A\).

The \(\alpha \)-cut of a fuzzy set A of X, denoted by \([A]^{\alpha }\), is defined as \([A]^{\alpha }=\{x\in X : A(x)\ge \alpha \}, \ \forall \alpha \in (0,1]\). If X is also a topological space, then we can define the 0-cut of A by \([A]^0=cl\{x\in X : A(x)>0\}\) [17], where \(cl \;\text {Y}\), \(Y \subseteq X\), denotes the closure of Y.

Zadeh’s extension principle [18] can be viewed as mathematical method to extend a function \(f:X \rightarrow Y\) to a function \(\hat{f} :\mathcal {F}(X) \rightarrow \mathcal {F}(Y)\).

Definition 1

(Zadeh’s extension principle [17, 18]). Let \(f:X\rightarrow Y\). The Zadeh’s extension of f at \(A \in \mathcal {F}(X)\) is the fuzzy set \(\hat{f}(A) \in \mathcal {F}(Y)\) whose membership function is given by

$$\begin{aligned} \hat{f}(A)(y)=\bigvee _{x\in f^{-1}(y)} A(x), \;\forall \;y \in Y, \end{aligned}$$

where \(f^{-1}(y)=\{x \in X:f(x)=y\}\) is the preimage of the function f at y and, by definition, \(\bigvee \emptyset = 0\).

A fuzzy set \(A\in \mathcal {F}(\mathbb {R})\) is called a fuzzy number if its \(\alpha \)-cuts are closed, bounded and non-empty intervals for all \(\alpha \in [0,1]\) [17]. Since each \(\alpha \)-cut of a fuzzy number A is an interval that satisfies the previous properties, we can write \([A]^{\alpha }=[a_{\alpha }^{-},a_{\alpha }^{+}]\). We denote the class of fuzzy numbers by the symbol \(\mathbb {R}_{\mathcal {F}}\). The next theorem indicates when a family of subsets can be uniquely associated with a fuzzy number.

Theorem 1

(Negoita-Ralescu’s characterization theorem [19, 20]). Given a family of subsets \(\{A_{\alpha }:\alpha \in [0,1]\}\) that satisfies the following conditions

  1. (a)

    \(A_{\alpha }\) is a non-empty, closed, and bounded interval for any \(\alpha \in [0,1]\);

  2. (b)

    \(A_{\alpha _{2}}\subseteq A_{\alpha _{1}}\), for all \(0\le \alpha _{1}\le \alpha _{2} \le 1\);

  3. (c)

    For any sequence \(\alpha _{n}\) which converges from below to \(\alpha \in (0,1]\) we have

    $$\bigcap _{n=1}^{\infty }A_{\alpha _{n}}=A_{\alpha };$$
  4. (d)

    For any sequence \(\alpha _{n}\) which converges from above to 0 we have

    $$A_{0}=cl\left( \bigcup _{n=1}^{\infty }A_{\alpha _{n}}\right) .$$

Then there exists a unique \(A \in \mathbb {R}_{\mathcal {F}}\), such that \([A]^{\alpha }=A_{\alpha }\), for each \(\alpha \in [0,1]\).

Conversely, let \(A \in \mathbb {R}_{\mathcal {F}}\), if \(A_\alpha = [A]^{\alpha }\) for all \(\alpha \in [0,1]\) then the family of subsets \(\{A_{\alpha }:\alpha \in [0,1]\}\) satisfies the conditions (a)–(d).

An example of fuzzy number is a triangular fuzzy number that is denoted by the triple (abc), with \(a\le b\le c\). In view of Theorem 1, the triangular fuzzy number can be defined in terms of its \(\alpha \)-cuts as follows:

$$[A]^{\alpha }=[a+\alpha (b-a),c-\alpha (c-b)],\ \forall \alpha \in [0,1].$$

Note that a real number a is a particular case of triangular fuzzy number since we have \(a \equiv (a;a;a)\).

A fuzzy relation R over \(X = X_{1}\times \ldots \times X_{n}\) is any fuzzy subset of \(X_{1}\times \ldots \times X_{n}\). Thus, a fuzzy relation R is associated with a membership function \(R:X_{1}\times \ldots \times X_{n}\rightarrow [0,1]\), where \(R(x_{1},\ldots ,x_{n})\in [0,1]\) represents the degree of relationship among \(x_{1},\ldots ,x_{n}\) with respect to R [17].

The projection of fuzzy relation \(R\in \mathcal {F}(X_{1}\times \ldots \times X_{n})\) onto \(X_{i}\), for \(i \in \{1,\ldots ,n\}\), is the fuzzy set \(\varPi _{R}^{i}\) of \(X_{i}\) given by

$$\varPi _{R}^{i}(y)=\bigvee _{x\in X:x_{i}=y} R(x_{1},\ldots ,x_{n}).$$

A fuzzy relation \(J\in \mathcal {F}(\mathbb {R}^{n})\) is said to be a joint possibility distribution of \(A_{1},\ldots ,A_{n}\in \mathbb {R}_{\mathcal {F}}\) if

$$\begin{aligned} A_{i}(y)=\varPi _{J}^{i}(y)=\bigvee _{x\in X : x_{i}=y} J(x_{1},\ldots ,x_{n}), \end{aligned}$$

for all \(y\in \mathbb {R}\) and for all \(i=1,\ldots ,n\).

Given a t-norm t, that is, a commutative, associative, and increasing operator \(t:[0,1]^2\rightarrow [0,1]\) satisfying \(t(x,1)=x\;t\;1=x\) for all \(x\in [0,1]\). A fuzzy relation \(J_{t}\) given by

$$\begin{aligned} J_{t}(x_{1},\ldots ,x_{n})=A_{1}(x_{1})\;t\;\ldots \;t\; A_{n}(x_{n}) \end{aligned}$$
(2.4)

is said to be a t-norm-based joint possibility distribution of \(A_{1},\ldots ,A_{n}\in \mathbb {R}_{\mathcal {F}}\) [4]. Well-known example of t-norm include the minimum t-norm “\(\wedge \)”. In particular, when \(J = J_\wedge \), that is, J is given by (2.4) with \(t = \wedge \), we say that \(A_{1},\ldots ,A_{n}\) are non-interactive. Otherwise, \(J\ne J_\wedge \), we say that \(A_{1},\ldots ,A_{n}\) are interactive [5, 18, 21].

Thus, the notion of interactivity between fuzzy numbers is given by means of joint possibility distributions. Carlsson et al. [5] introduced a possible type of interactivity relation between two fuzzy numbers that is not based on t-norms. Specifically, two fuzzy numbers A and B are said to be completely correlated if there exist \(q,r\in \mathbb {R}\) with \(q\ne 0\) such that the corresponding joint possibility distribution \(J_{\{q,r\}}\) is given by

$$\begin{aligned} J_{\{q,r\}}(x_{1},x_{2})= & {} A(x_{1})\chi _{\{qu+r=v\}}(x_{1},x_{2}) \nonumber \\= & {} B(x_{2})\chi _{\{qu+r=v\}}(x_{1},x_{2}), \end{aligned}$$
(2.5)

where \(\chi _{\{qu+r=v\}}\) stands for the characteristic function of the set \(\{(u,v)\in \mathbb {R}^2: qu+v=r\} \subset \mathbb {R}^2\). In addition, if \(q>0\) (\(q<0\)) then A and B are said to be completely positively (negatively) correlated. Since \(q\ne 0\) in Eq. (2.5), the membership function of B can be written as \(B(qx + r) = A(x)\) for all \(x\in \mathbb {R}\), and consequently \([B]^{\alpha }=q[A]^{\alpha }+\{r\}\) for all \(\alpha \in [0,1]\). Moreover, for each \(\alpha \in [0,1]\), the \(\alpha \)-cut of the joint possibility distribution \(J_{\{q,r\}}\) is given by [5]:

$$\begin{aligned}{}[J_{\{q,r\}}]^{\alpha }=\left\{ (x,qx+r)\ :\ x\in [A]^{\alpha }\right\} . \end{aligned}$$

Remark 1

Note that if the fuzzy numbers A and B are completely correlated by the line \(qu+r_1=v\), and we choose \(r_2=q(a_{\alpha }^-+a_{\alpha }^+)+r_1\), then A and B are also completely correlated if we consider \(J_{\{-q,r_2\}}\), that is, A and B are also completely correlated with respect to the line \(-qu+r_2=v\). Therefore, the distribution J is not unique.

The next definition is a generalization of Zadeh’s extension principle (cf. Definition 1).

Definition 2

(Sup-J Extension Principle [6]). Let \(J\in \mathcal {F}(\mathbb {R}^{n})\) be a joint possibility distribution of \(A_{1},\ldots ,A_{n}\in \mathbb {R}_{\mathcal {F}}\) and let \(f:\mathbb {R}^{n}\rightarrow \mathbb {R}\). The \(\sup -J\) extension of f at \((A_{1},\ldots ,A_{n})\) is defined by

$$\begin{aligned} f_{J}(A_{1},\ldots ,A_{n})(y) = \hat{f}(J)(y) = \bigvee _{(x_{1},\ldots ,x_{n})\in f^{-1}(y)}J(x_{1},\ldots ,x_{n}), \end{aligned}$$

where \(f^{-1}(y)=\{(x_{1},\ldots ,x_{n})\in \mathbb {R}^{n}:f(x_{1},\ldots ,x_{n})=y\}\).

From Definition 2, we can define arithmetic operations among n fuzzy numbers by taking the sup-J extension of the corresponding arithmetic operator. For example, let \(f(x_{1},\ldots ,x_{n})=x_{1}+\ldots +x_{n}\) for all \(x_1,\ldots ,x_n \in \mathbb {R}\). If \(J_\wedge \) is defined as in (2.4) with \(t=\wedge \), then \(f_{J_{\wedge }}(A_{1},\ldots ,A_{n})\) boils down to Zadeh’s extension of f at \((A_{1},\ldots ,A_{n})\), i.e.,

$$\begin{aligned} \widehat{f}(A_{1},\ldots ,A_{n})(y)=\bigvee _{(x_{1},\ldots ,x_{n})\in f^{-1}(y)}A_{1}(x_{1})\wedge \ldots \wedge A_{n}(x_{n}),\quad \forall \ y \in \mathbb {R}. \end{aligned}$$

The next proposition ensures that the completely correlation is a transitive relation of interactivity between fuzzy numbers. Moreover, under some conditions, the sup-\(J_{q,r}\) extensions of the addition operator, denoted by the symbol \(+_L\), satisfies the associative property.

Proposition 1

[22]. Let A, B, \(C \in \mathbb {R}_{\mathcal {F}}\). If A and B are completely correlated with respect to \(J_{\{q_1,r_1\}}\) and B and C are completely correlated with respect to \(J_{\{q_2,r_2\}}\), then there are real numbers \(q_3\) and \(r_3\) such that A and C are completely correlated with respect to \(J_{\{q_3,r_3\}}\).

Moreover, if each A, B, \(C\in \mathbb {R}_{\mathcal {F}}\) is completely correlated to \(D\in \mathbb {R}_{\mathcal {F}}\backslash \mathbb {R}\), then the associative property holds true, i.e., \(A+_L(B+_LC)=(A+_LB)+_LC\).

The notion of completely correlation can be extended to n fuzzy numbers as follows.

Definition 3

The fuzzy numbers \(A_{1},\ldots ,A_{n}\in \mathbb {R}_{\mathcal {F}}\) are said completely correlated if the joint possibility distribution J is given by

$$\begin{aligned} J(x_{1},\ldots ,x_{n})= & {} \chi _{U}(x_{1},\ldots ,x_{n})A_{1}(x_{1}) \\ \nonumber= & {} \chi _{U}(x_{1},\ldots ,x_{n})A_{2}(x_{2}) \\ \nonumber&\vdots&\\ \nonumber= & {} \chi _{U}(x_{1},\ldots ,x_{n})A_{n}(x_{n}) \nonumber , \end{aligned}$$
(2.6)

where \(U=\{(u,q_{2}u+r_{2},\ldots ,q_{n}u+r_{n}):u\in \mathbb {R}\}\), \(q_i,r_i\in \mathbb {R}\), with \(q_i\ne 0\), \(\forall i=1,\ldots ,n\).

From (2.5) and (2.6), one can see that \(A_1\) and \(A_i\), \(i>1\), are also completely correlated since we have \([A_{i}]^{\alpha }=q_{i}[A_{1}]^{\alpha }+\{r_{i}\}\), for all \(i=2,\ldots ,n\). This implies that, for each \(\alpha \in [0,1]\), the \(\alpha \)-cut of J is given as follows

$$\begin{aligned}{}[J]^{\alpha }=\left\{ (x,q_{2}x+r_{2},\ldots ,q_{n}x+r_{n}) \ :\ x\in [A_{1}]^{\alpha }\right\} \end{aligned}$$
(2.7)

Remark 2

From Eq. (2.7), we can note that the \(\alpha \)-cuts of the joint possibility distribution J can be expressed in terms of \(\alpha \)-cuts of \(A_{1}\) and the parameters \(q_{i}\) and \(r_{i}\), for all \(i=2,\ldots ,n\).

Theorem 2

[23, 24]. Let \(f:\mathbb {R}^{n}\rightarrow \mathbb {R}\) be a continuous function and \(J\in \mathcal {F}(\mathbb {R}^{n})\). We have that

$$[\widehat{f}_J(A_1, \ldots , A_n)]^{\alpha }=f([J]^{\alpha }), \ \forall \alpha \in [0,1].$$

By Theorem 2, if the \(\sup \)-J extension of f at \((A_1,\ldots ,A_n)\) is a fuzzy number, then the \(\alpha \)-cuts of \(\widehat{f}_J(A_1,\ldots ,A_n) = \widehat{f}(J)\) can be written as follows:

$$\begin{aligned}{}[\widehat{f}(J)]^{\alpha }=\displaystyle \left[ \bigwedge _{(x_{1},\ldots ,x_{n})\in [J]^{\alpha }}f(x_{1},\ldots ,x_{n}) \quad \bigvee _{(x_{1},\ldots ,x_{n})\in [J]^{\alpha }}f(x_{1},\ldots ,x_{n})\right] . \end{aligned}$$
(2.8)

In the next section, we consider the problem given in (2.1) for the case where the known values \(y_i\) are interactive fuzzy numbers.

3 Least Squares Method for Interactive Fuzzy Data

In this paper, we deal with least squares method to fit uncertain data given by interactive fuzzy numbers. In particular, we focus on the case where these fuzzy numbers are completely correlated. A typical example of correlated data are the well-known longitudinal data, which are widely studied in the statistical area [13].

Let \(D=\{(x_{1},Y_{1}),\ldots ,(x_{m},Y_{m})\} \subset \mathbb {R}\times \mathbb {R}_{\mathcal {F}}\) such that \(Y_1,\ldots ,Y_m\) are completely correlated fuzzy numbers, with respect to a joint possibility distribution J as in (2.6), and let \(F:\mathbb {R}\rightarrow \mathbb {R}_{\mathcal {F}}\) be a function that satisfies \(F(x_{i}) = Y_{i}\) for \(i=1,\ldots ,m\). We produce a function \(\varPhi :\mathbb {R}\rightarrow \mathbb {R}_{\mathcal {F}}\) that approximates F given by means of the sup-J extension principle of a function \(\varphi :\mathbb {R}\rightarrow \mathbb {R}\) of the form

$$\begin{aligned} \varphi (x)=a_1g_1(x)+\ldots +a_ng_n(x), \end{aligned}$$

where \(a_1,\ldots ,a_n\in \mathbb {R}\) and \(g_1,\ldots ,g_n\) are real-valued-functions. More precisely, we define the function \(\varPhi \) in terms of the sup-J extension principle of (2.3) at \((Y_{1},\ldots ,Y_{m})\). Since Eq. (2.3) is continuous with respect to \(y_1,\ldots ,y_m\), from Theorem 2 and Eq. (2.7), we have that \(\alpha \)-cuts of the fuzzy number \(\varPhi (x)\) is given by

$$\begin{aligned}{}[\varPhi (x)]^\alpha= & {} \{ s_1(x)y_1 + \ldots + s_m(x)y_m : (y_1,\ldots ,y_m) \in [J]^\alpha \} \\ \nonumber= & {} \{ s_1(x)y + s_2(x)(q_2y + r_2) + \ldots + s_m(x)(q_my + r_m) y : y \in [Y_1]^\alpha \}. \end{aligned}$$
(3.9)

Since the interval \([Y_1]^\alpha = [{y_1}_\alpha ^-,{y_1}_\alpha ^+]\) can be rewritten as the set of all convex combination of \({y_1}_\alpha ^-\) and \({y_1}_\alpha ^+\), that is, \([Y_1]^\alpha = \{(1-\lambda ){y_1}_\alpha ^- + \lambda {y_1}_\alpha ^+: \lambda \in [0,1] \}\), the \(\alpha \)-cut of J can also be expressed in terms of a parameter \(\lambda \in [0,1]\) as follows:

$$\begin{aligned}{}[J]^\alpha= & {} \{ (1-\lambda )Y_\alpha ^- + \lambda Y_\alpha ^+ : \lambda \in [0,1] \}, \end{aligned}$$

where \(Y_\alpha ^- = ({y_1}_\alpha ^-, q_2{y_1}_\alpha ^- + r_2,\ldots ,q_m{y_1}_\alpha ^- + r_m )\) and \(Y_\alpha ^+ = ({y_1}_\alpha ^+, q_2{y_1}_\alpha ^+ + r_2,\ldots ,\) \(q_m{y_1}_\alpha ^+ + r_m )\). Thus, Eq. (3.9) can be expressed as

$$\begin{aligned}{}[\varPhi (x)]^\alpha= & {} \{ (1-\lambda )\langle S(x), Y_\alpha ^- \rangle + \lambda \langle S(x), Y_\alpha ^+ \rangle : \lambda \in [0,1] \} \end{aligned}$$
(3.10)

where \(\langle \cdot , \cdot \rangle \) denotes the usual inner product of \(\mathbb {R}^m\) and \(S(x) = (s_{1}(x),s_{2}(x),\ldots ,\) \(s_{m}(x))\), \(x \in \mathbb {R}.\)

In order to characterize the endpoints of each \(\alpha \)-cut of \(\varPhi (x)\), we define the auxiliary function h by

$$\begin{aligned} h(x,\alpha ,\lambda ) = (1-\lambda )B_1(x,\alpha )+\lambda B_2(x,\alpha ),\;\forall x \in \mathbb {R}\;\text {and}\;\forall \alpha , \lambda \in [0,1], \end{aligned}$$

where

$$\begin{aligned} B_1(x,\alpha )= \langle S(x), Y_\alpha ^- \rangle \;\text {and}\; B_2(x,\alpha ) = \langle S(x), Y_\alpha ^+ \rangle . \end{aligned}$$

By Eqs. (3.10) and (2.8), we have that

$$\begin{aligned} \nonumber [\varPhi (x)]^{\alpha }= & {} \{h(x,\alpha ,\lambda ):\lambda \in [0,1]\} \\= & {} \left[ \bigwedge _{\lambda \in [0,1]}h(x,\alpha ,\lambda ),\bigvee _{\lambda \in [0,1]}h(x,\alpha ,\lambda )\right] . \end{aligned}$$
(3.11)

Note that if \(B_1(x,\alpha )\le B_2(x,\alpha )\), then the function \(h(x,\alpha , \cdot )\) assumes the minimum and the maximum values at \(\lambda =0\) and \(\lambda =1\), respectively. On the other hand, if \(B_1(x,\alpha )>B_2(x,\alpha )\) then the minimum and maximum values of \(h(x,\alpha , \cdot )\) are achieved at \(\lambda = 1\) and \(\lambda = 0\), respectively. In other words, the global minimizer and maximizer of \(h(x, \alpha , \lambda )\) for \(\lambda \in [0, 1]\) are given at \(\lambda = 0\) or \(\lambda = 1\). Therefore, for each \(x \in \mathbb {R}\), the \(\alpha \)-cuts of the fuzzy solution \(\varphi \) is given by

$$\begin{aligned}{}[\varPhi (x)]^{\alpha }= & {} [\min \{h(x,\alpha ,0),h(x,\alpha ,1)\},\max \{h(x,\alpha ,0),h(x,\alpha ,1)\}], \end{aligned}$$
(3.12)

where

$$\begin{aligned} h(x,\alpha ,0)=B_1(x,\alpha )= \langle S(x), Y_\alpha ^- \rangle \end{aligned}$$

and

$$\begin{aligned} \displaystyle h(x,\alpha ,1)=B_2(x,\alpha )= \langle S(x), Y_\alpha ^+ \rangle . \end{aligned}$$

In the next section we illustrate this proposed method by means of an example.

4 Application of Least Squares Method for Completely Correlated Fuzzy Data

In this section we apply the proposed method to determine a function that fits longitudinal data obtained from [15]. The authors discussed the association between children mortality and air pollution in São Paulo, Brazil, from 1994 to 1997. In their study were collected longitudinal data of sulfur dioxide (\(SO_2\)), carbon monoxide (CO), inhalable particulate (\(PM_{10}\)) and ozone (\(O_3\)). Here, we focus on the ozone dataset.

For simplicity, suppose that the longitudinal data are given by completely correlated triangular fuzzy numbers of the form \((M-\sigma ;M;M+\sigma )\), where M and \(\sigma \) are the mean and the standard deviation of the collected data in each year, respectively. Recall that the proposed method is not restricted to triangular fuzzy numbers, then other types of fuzzy number can be considered.

Let \(D=\{(x_1,Y_1),(x_2,Y_2),(x_3,Y_3),(x_4,Y_4)\} \subset \mathbb {R}\times \mathbb {R}_{\mathcal {F}}\) be the fuzzy dataset given in Table 1. The values \(x_{1}=1\), \(x_{2}=2\), \(x_{3}=3\), and \(x_{4}=4\) represent respectively the years 1994, 1995, 1996, and 1997. The fuzzy numbers \(Y_{1}=(17.6;57;96.4)\), \(Y_{2}=(25.3;60.7;96.1)\), \(Y_{3}=(34.8;76.3;117.8)\), and \(Y_{4}=(29.5;63;96.5)\) are completely correlated with respect to joint possibility distribution J, whose membership function is given by

$$\begin{aligned} J(v_{1},v_{2},v_{3},v_{4})=\chi _U(v_{1},v_{2},v_{3},v_{4})Y_{1}(v_{1}), \ \forall \ (v_{1},v_{2},v_{3},v_{4})\in \mathbb {R}^{4}, \end{aligned}$$

where

$$\begin{aligned} U=\{(u,0.8985u+9.4855,1.0533u+16.2619,0.8502u+14.5386)\ :\ u\in \mathbb {R}\}. \end{aligned}$$
(4.13)
Table 1. Fuzzy dataset D

Note that Eq. (4.13) suggests that \(Y_{1}\) and \(Y_{2}\) are positively completely correlated, as well as \(Y_{1}\) and \(Y_{3}\), \(Y_{1}\) and \(Y_{4}\), since \(q_{i}>0\), for all \(i=2,3,4\).

Consider the functions \(g_{1}(x)=x^{2}\), \(g_{2}(x)=x\) and \(g_{3}(x)=1\). From (3.12), for each \(\alpha \in [0,1]\) and \(x\in [1,4]\), the fuzzy function \(\varPhi \) is given by \([\varPhi (x)]^{\alpha }=[\min \{h(x,\alpha ,0),h(x,\alpha ,1)\},\max \{h(x,\alpha ,0),h(x,\alpha ,1)\}],\) where

$$\begin{aligned} h(x,\alpha ,0)=-3.24x^2+20.76x-0.75+\alpha (-x^2+3.84x+35.34) \end{aligned}$$

and

$$\begin{aligned} h(x,\alpha ,1)=-5.24x^2+28.44x+69.93-\alpha (-x^2+3.84x+35.34). \end{aligned}$$

Figure 1 exhibits the fuzzy function \(\varPhi \) produced by our proposal. One can observe in Subfigure 1(a) fits the data of Table 1 which varies from 1994 to 1997. The red triangles and the gray-scale surface depicted in Subfigure 1(b) correspond to the membership functions of fuzzy data \(Y_i\), \(i=1,\ldots ,4\), and fuzzy solution, respectively.

Fig. 1.
figure 1

Subfigures (a) and (b) exhibit respectively the top and depiction views of the fuzzy solution \(\varPhi \) where the greatest and smallest membership values are represented respectively by the black and white colors. In Subfigure (a), the red dots represent the endpoints of the \(\alpha -\)cuts of the fuzzy data \(Y_i\) for \(\alpha = 0,0.5,1\) and \(i=1,\ldots ,4.\) Each fuzzy data \(Y_i\) is represented by red lines in Subfigure (b).

Note that \(Y_1,\ldots ,Y_4\) are completely correlated with respect to \(2^3\) different joint possibility distributions. Thus, we can obtain \(2^3\) fuzzy functions \(\varPhi \). However, in general, the choice of a joint possibility distribution is not arbitrary and depends on the context. For example, if each object is measured m times with the same n measuring devices then we can assume that the obtained values depend only on the calibration of each equipment and not on the objects. This type of assumption induces the choice of specific parameters \(q_i\) and \(r_i\) in (2.7).

5 Conclusion

In this manuscript, we considered a fuzzy least squares problem based on dataset that has some type of correlation, for example a longitudinal dataset. We assumed that the dataset is composed by completely correlated fuzzy numbers [5]. In particular, we presented a method that provides a fuzzy function that fits a given fuzzy data. This fuzzy function depends on the choice of a joint possibility distributions as in (2.6).

The \(\alpha \)-cut of the fuzzy solution given by means of the \(\sup \)-J extension principle is a non-empty, bounded, closed interval whose endpoints are obtained by solving a minimization and maximization problems given in Eq. (3.11). Investigating this problem, we concluded that the endpoints of the \(\alpha \)-cut of the proposed solution can be evaluated by taking the minimum and maximum of two associated real functions (see Eq. (3.12)).

Finally, we applied the proposed method to determine a fuzzy function which fits a longitudinal air polution dataset [15]. The fuzzy data in this dataset was modelled using triangular fuzzy numbers, but it can be done with other types of completely correlated fuzzy numbers. The fuzzy solution was calculated considering polynomial functions \(g_1\), \(g_2\), and \(g_3\). For further works, we intend to investigate fuzzy least squares method for dataset with other intrinsic type of interactivity.