Least Squares Method with Interactive Fuzzy Coefficient: Application on Longitudinal Data

Pinto, Nilmara J. B.; Wasques, Vinícius F.; Esmi, Estevão; Barros, Laécio C.

doi:10.1007/978-3-319-95312-0_12

Nilmara J. B. Pinto⁷,
Vinícius F. Wasques⁷,
Estevão Esmi⁷ &
…
Laécio C. Barros⁷

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 831))

Included in the following conference series:

North American Fuzzy Information Processing Society Annual Conference

840 Accesses
5 Citations

Abstract

This work focus on the least square method to fit a fuzzy function to longitudinal data given by fuzzy numbers. In order to consider the intrinsic correlation of longitudinal data, we assume that there exits a linear relation among the involved fuzzy numbers that arises from the concept of a joint possibility distribution. We propose a numerical method to solve a fuzzy least square problem taking into account this linear correlation. To this end, we extend the classical least square method by means of the $\sup $-J extension principle, which consists of a generalization of Zadeh’s extension principle. Finally, we use our proposal method to fit a longitudinal dataset.

N. J. B. Pinto—Grantee CAPES 1691227.

V. F. Wasques—Grantee CNPq 142414/2017-4.

E. Esmi—Grantee FAPESP 2016/26040-7.

L. C. Barros—Grantee CNPq 306546/2017-5.

Access provided by CONRICYT-eBooks. Download conference paper PDF

A Fuzzy Convex Nonparametric Least Squares Method with Different Shape Constraints

Article 23 May 2023

Fuzzy Clusterwise Functional Extended Redundancy Analysis

Article 01 January 2015

M-based simultaneous inference for the mean function of functional data

Article 13 March 2018

Keywords

1 Introduction

The least squares methods are used, in general, to obtain a continuous function that best fit pairs of data in a dataset [1]. The fuzzy least squares method arises when the dataset is composed by fuzzy numbers. Tanaka et al. proposed a fuzzy least squares method based on fuzzy regression models [2]. This method was used to find fuzzy parameters of a fuzzy linear function from a fuzzy dataset. However, this approach converts the problem to a classic linear programming problem which may lead to losing the notion of close distance between the fuzzy data and the obtained solution.

Celmins [3] proceeded with the same methodology of [2] but considered a intrinsic relation among the dataset based on conical membership functions that are (geometrically) similar to joint possibility distributions. In addiction, the concept of interactive fuzzy numbers [4,5,6] was only considered in [7], which improved the approach presented by [3].

In contrast to these previous methods, Diamond [8] proposed a fuzzy least squares method based on distance between functions. He used projection theorems for cones in Banach spaces to find the fuzzy linear function that best fit a dataset.

It is worth noting that all these approaches were developed for data given only by triangular fuzzy numbers and for fitting only fuzzy linear functions. Nevertheless, these methods can be used to model many phenomenons, for example, in economy [9], psychology [10], medicine [11], and logistics [12].

Data correlations arise naturally in longitudinal datasets. A dataset is said to be longitudinal if it contains the same type of information on the same itens at multiple points in time. Therefore, longitudinal data is characterized by the fact that repeated observations are correlated [13]. In this work, we suppose that this correlation is given by the notion of completely correlated fuzzy numbers [5, 14].

The method proposed here is based on the (sup-J) extension of classical numerical algorithm to the fuzzy context and does not take into account any distance between fuzzy numbers. Moreover, our method can be applied not only for triangular fuzzy numbers, but for any type of completely correlated fuzzy numbers, and it can approximate the dataset with higher orders functions.

In Sect. 2 we briefly recall the classical least squares method and some basic definitions and results from fuzzy set theory. In Sect. 3, we develop the extension of the classical least squares method for the case where dataset is composed by completely correlated fuzzy numbers. Finally, in Sect. 4, we apply the proposed method to fit a fuzzy function to the longitudinal dataset given in [15].

2 Mathematical Background

This section presents the least squares method [1] and some basic concepts of fuzzy set theory [16].

2.1 Least Square Method

Let $f:[c,d]\rightarrow \mathbb {R}$ be a continuous function. Given n functions $g_{1},\ldots ,g_{n}$, where $g_{i}:\mathbb {R}\rightarrow \mathbb {R}$ for $i=1,\ldots ,n$, we need to find n coefficients $a_{1},\ldots ,a_{n}\in \mathbb {R}$ such that the function $\varphi :\mathbb {R}\rightarrow \mathbb {R}$ given by

$$\begin{aligned} \varphi (x)=a_{1}g_{1}(x)+\ldots +a_{n}g_{n}(x) \end{aligned}$$

is the best approximation of the function f, i.e., $\varphi \approx f$.

The function $\varphi $ is obtained by minimizing the distance between f and $\varphi $. More precisely, let $||\cdot ||_{2}$ be the $\mathcal {L}^{2}$-norm defined on the class of the continuous functions from [c, d] to $\mathbb {R}$ (denoted by C([c, d])) given by $||h||_{2}=\left( \int _c^d |h(s)|^{2}ds\right) ^{{}^{1}\!/_{2}}, \ \forall h\in C([c,d])$. The coefficients $a_{1},\ldots ,a_{n}$ of the function $\varphi $ which produces the best fit to f are obtained by solving the following minimization problem:

$$\begin{aligned} \min _{a_{1},\ldots ,a_{n} \in \mathbb {R}} {}^{1}\!/_{2}||\varphi -f||_{2}^{2}. \end{aligned}$$

In the case some values of f are known, say $D=\{f(x_{1})=y_{1},\ldots ,f(x_{m})=y_{m}\}$, the function $\varphi $ must fit the data D, that is, $\varphi (x_{i})\approx y_{i}$, for all $i=1,\ldots ,m$. Therefore the following minimization problem must be solved.

$$\begin{aligned} \min _{a_{1},\ldots ,a_{n} \in \mathbb {R}} {}^{1}\!/_{2}||(\varphi (x_1)-y_1,\ldots ,\varphi (x_m)-y_m)||_{2}^{2}. \end{aligned}$$

(2.1)

The real coefficients $a_{1},\ldots ,a_{n}$ that minimize the problem (2.1), i.e., that produces the best approximation $\varphi $ of f, are obtained by solving the following matrix equation called normal equation:

$$\begin{aligned} Ma=b, \end{aligned}$$

where

$$\begin{aligned} M = \begin{bmatrix} \displaystyle \sum _{k=1}^{m}g_{1}(x_{k})g_{1}(x_{k})&\dots&\ \displaystyle \sum _{k=1}^{m}g_{1}(x_{k})g_{n}(x_{k}) \\ \vdots&\ddots&\ \vdots \\ \displaystyle \sum _{k=1}^{m}g_{n}(x_{k})g_{1}(x_{k}) \&\dots&\ \displaystyle \sum _{k=1}^{m}g_{n}(x_{k})g_{n}(x_{k}), \end{bmatrix}, \end{aligned}$$

$$\begin{aligned} a = \begin{bmatrix} a_{1} \\ \vdots \\ a_{n} \end{bmatrix}\text { and } b = \begin{bmatrix} \displaystyle \sum _{k=1}^{m}y_{k}g_{1}(x_{k}) \\ \vdots \\ \displaystyle \sum _{k=1}^{m}y_{k}g_{n}(x_{k}) \end{bmatrix}. \end{aligned}$$

If the matrix M is non singular, say $P = M^{-1} = [p_{ij}]$, then the vector a is obtained by

$$\begin{aligned} a=Pb. \end{aligned}$$

(2.2)

Thus, each parameter $a_i$ is given by

$$\begin{aligned} a_i= & {} p_{i1}b_1 + p_{i2}b_2 + \ldots + p_{in}b_n \\= & {} p_{i1}\left( \sum _{k=1}^{m}y_{k}g_{1}(x_{k})\right) + \ldots + p_{in}\left( \sum _{k=1}^{m}y_{k}g_{n}(x_{k})\right) \\= & {} \left( \sum _{j=1}^{n} p_{ij}g_{j}(x_{1})\right) y_1 + \ldots + \left( \sum _{j=1}^{n} p_{ij}g_{j}(x_{m})\right) y_m \\= & {} c_{i1}y_1 + \ldots + c_{im}y_m, \end{aligned}$$

where $c_{ik} = \displaystyle \sum _{j=1}^{n} p_{ij}g_{j}(x_{k})$, for $i=1,\ldots ,n$ and $k=1,\ldots ,m.$ In general case, the matrix P stands for the pseudoinverse of M.

Since the parameters of the function $\varphi $ can be obtained by computing the matrix product (2.2), we rewrite the function $\varphi $ in terms of $y_1,\ldots ,y_m$ as follows:

$$\begin{aligned} \nonumber \varphi (x)= & {} a_{1}g_{1}(x)+\ldots +a_{n}g_{n}(x)\\ \nonumber= & {} (c_{11}y_1 + \ldots + c_{1m}y_m)g_{1}(x)+\ldots + (c_{n1}y_1 + \ldots + c_{nm}y_m)g_{n}(x)\\ \nonumber= & {} \left( \sum _{j=1}^n g_j(x)c_{j1}\right) y_1 + \ldots + \left( \sum _{j=1}^n g_j(x)c_{jm}\right) y_m\\= & {} s_1(x)y_1 + \ldots + s_m(x)y_m, \end{aligned}$$

(2.3)

where

$$s_{i} = \left( \sum _{j=1}^{n} g_j(x)c_{ji} \right) $$

for each $i=1,\ldots ,n.$

2.2 Fuzzy Set Theory

A fuzzy subset A of an universe X is characterized by a function $\mu _A:X\rightarrow [0,1]$, called membership function [16], where $\mu _A(x)$, or simply A(x), represents the membership degree of x in A, for all $x \in X$. The class of fuzzy sets of X is denoted by the symbol $\mathcal {F}(X)$. Each classical subset A of X is a particular fuzzy set whose membership function is given by its characteristic function $\chi _{A}:X\rightarrow \{0,1\}$, i.e., $\chi _{A}(x)=1$ if and only if $x\in A$.

The $\alpha $-cut of a fuzzy set A of X, denoted by $[A]^{\alpha }$, is defined as $[A]^{\alpha }=\{x\in X : A(x)\ge \alpha \}, \ \forall \alpha \in (0,1]$. If X is also a topological space, then we can define the 0-cut of A by $[A]^0=cl\{x\in X : A(x)>0\}$ [17], where $cl \;\text {Y}$, $Y \subseteq X$, denotes the closure of Y.

Zadeh’s extension principle [18] can be viewed as mathematical method to extend a function $f:X \rightarrow Y$ to a function $\hat{f} :\mathcal {F}(X) \rightarrow \mathcal {F}(Y)$.

Definition 1

(Zadeh’s extension principle [17, 18]). Let $f:X\rightarrow Y$. The Zadeh’s extension of f at $A \in \mathcal {F}(X)$ is the fuzzy set $\hat{f}(A) \in \mathcal {F}(Y)$ whose membership function is given by

$$\begin{aligned} \hat{f}(A)(y)=\bigvee _{x\in f^{-1}(y)} A(x), \;\forall \;y \in Y, \end{aligned}$$

where $f^{-1}(y)=\{x \in X:f(x)=y\}$ is the preimage of the function f at y and, by definition, $\bigvee \emptyset = 0$.

A fuzzy set $A\in \mathcal {F}(\mathbb {R})$ is called a fuzzy number if its $\alpha $-cuts are closed, bounded and non-empty intervals for all $\alpha \in [0,1]$ [17]. Since each $\alpha $-cut of a fuzzy number A is an interval that satisfies the previous properties, we can write $[A]^{\alpha }=[a_{\alpha }^{-},a_{\alpha }^{+}]$. We denote the class of fuzzy numbers by the symbol $\mathbb {R}_{\mathcal {F}}$. The next theorem indicates when a family of subsets can be uniquely associated with a fuzzy number.

Theorem 1

(Negoita-Ralescu’s characterization theorem [19, 20]). Given a family of subsets $\{A_{\alpha }:\alpha \in [0,1]\}$ that satisfies the following conditions

(a)
$A_{\alpha }$ is a non-empty, closed, and bounded interval for any $\alpha \in [0,1]$;
(b)
$A_{\alpha _{2}}\subseteq A_{\alpha _{1}}$, for all $0\le \alpha _{1}\le \alpha _{2} \le 1$;
(c)
For any sequence $\alpha _{n}$ which converges from below to $\alpha \in (0,1]$ we have
$$\bigcap _{n=1}^{\infty }A_{\alpha _{n}}=A_{\alpha };$$
(d)
For any sequence $\alpha _{n}$ which converges from above to 0 we have
$$A_{0}=cl\left( \bigcup _{n=1}^{\infty }A_{\alpha _{n}}\right) .$$

Then there exists a unique $A \in \mathbb {R}_{\mathcal {F}}$, such that $[A]^{\alpha }=A_{\alpha }$, for each $\alpha \in [0,1]$.

Conversely, let $A \in \mathbb {R}_{\mathcal {F}}$, if $A_\alpha = [A]^{\alpha }$ for all $\alpha \in [0,1]$ then the family of subsets $\{A_{\alpha }:\alpha \in [0,1]\}$ satisfies the conditions (a)–(d).

An example of fuzzy number is a triangular fuzzy number that is denoted by the triple (a; b; c), with $a\le b\le c$. In view of Theorem 1, the triangular fuzzy number can be defined in terms of its $\alpha $-cuts as follows:

$$[A]^{\alpha }=[a+\alpha (b-a),c-\alpha (c-b)],\ \forall \alpha \in [0,1].$$

Note that a real number a is a particular case of triangular fuzzy number since we have $a \equiv (a;a;a)$.

A fuzzy relation R over $X = X_{1}\times \ldots \times X_{n}$ is any fuzzy subset of $X_{1}\times \ldots \times X_{n}$. Thus, a fuzzy relation R is associated with a membership function $R:X_{1}\times \ldots \times X_{n}\rightarrow [0,1]$, where $R(x_{1},\ldots ,x_{n})\in [0,1]$ represents the degree of relationship among $x_{1},\ldots ,x_{n}$ with respect to R [17].

The projection of fuzzy relation $R\in \mathcal {F}(X_{1}\times \ldots \times X_{n})$ onto $X_{i}$, for $i \in \{1,\ldots ,n\}$, is the fuzzy set $\varPi _{R}^{i}$ of $X_{i}$ given by

$$\varPi _{R}^{i}(y)=\bigvee _{x\in X:x_{i}=y} R(x_{1},\ldots ,x_{n}).$$

A fuzzy relation $J\in \mathcal {F}(\mathbb {R}^{n})$ is said to be a joint possibility distribution of $A_{1},\ldots ,A_{n}\in \mathbb {R}_{\mathcal {F}}$ if

$$\begin{aligned} A_{i}(y)=\varPi _{J}^{i}(y)=\bigvee _{x\in X : x_{i}=y} J(x_{1},\ldots ,x_{n}), \end{aligned}$$

for all $y\in \mathbb {R}$ and for all $i=1,\ldots ,n$.

Given a t-norm t, that is, a commutative, associative, and increasing operator $t:[0,1]^2\rightarrow [0,1]$ satisfying $t(x,1)=x\;t\;1=x$ for all $x\in [0,1]$. A fuzzy relation $J_{t}$ given by

$$\begin{aligned} J_{t}(x_{1},\ldots ,x_{n})=A_{1}(x_{1})\;t\;\ldots \;t\; A_{n}(x_{n}) \end{aligned}$$

(2.4)

is said to be a t-norm-based joint possibility distribution of $A_{1},\ldots ,A_{n}\in \mathbb {R}_{\mathcal {F}}$ [4]. Well-known example of t-norm include the minimum t-norm “$\wedge $”. In particular, when $J = J_\wedge $, that is, J is given by (2.4) with $t = \wedge $, we say that $A_{1},\ldots ,A_{n}$ are non-interactive. Otherwise, $J\ne J_\wedge $, we say that $A_{1},\ldots ,A_{n}$ are interactive [5, 18, 21].

Thus, the notion of interactivity between fuzzy numbers is given by means of joint possibility distributions. Carlsson et al. [5] introduced a possible type of interactivity relation between two fuzzy numbers that is not based on t-norms. Specifically, two fuzzy numbers A and B are said to be completely correlated if there exist $q,r\in \mathbb {R}$ with $q\ne 0$ such that the corresponding joint possibility distribution $J_{\{q,r\}}$ is given by

$$\begin{aligned} J_{\{q,r\}}(x_{1},x_{2})= & {} A(x_{1})\chi _{\{qu+r=v\}}(x_{1},x_{2}) \nonumber \\= & {} B(x_{2})\chi _{\{qu+r=v\}}(x_{1},x_{2}), \end{aligned}$$

(2.5)

where $\chi _{\{qu+r=v\}}$ stands for the characteristic function of the set $\{(u,v)\in \mathbb {R}^2: qu+v=r\} \subset \mathbb {R}^2$. In addition, if $q>0$ ($q<0$) then A and B are said to be completely positively (negatively) correlated. Since $q\ne 0$ in Eq. (2.5), the membership function of B can be written as $B(qx + r) = A(x)$ for all $x\in \mathbb {R}$, and consequently $[B]^{\alpha }=q[A]^{\alpha }+\{r\}$ for all $\alpha \in [0,1]$. Moreover, for each $\alpha \in [0,1]$, the $\alpha $-cut of the joint possibility distribution $J_{\{q,r\}}$ is given by [5]:

$$\begin{aligned}{}[J_{\{q,r\}}]^{\alpha }=\left\{ (x,qx+r)\ :\ x\in [A]^{\alpha }\right\} . \end{aligned}$$

Remark 1

Note that if the fuzzy numbers A and B are completely correlated by the line $qu+r_1=v$, and we choose $r_2=q(a_{\alpha }^-+a_{\alpha }^+)+r_1$, then A and B are also completely correlated if we consider $J_{\{-q,r_2\}}$, that is, A and B are also completely correlated with respect to the line $-qu+r_2=v$. Therefore, the distribution J is not unique.

The next definition is a generalization of Zadeh’s extension principle (cf. Definition 1).

Definition 2

(Sup-J Extension Principle [6]). Let $J\in \mathcal {F}(\mathbb {R}^{n})$ be a joint possibility distribution of $A_{1},\ldots ,A_{n}\in \mathbb {R}_{\mathcal {F}}$ and let $f:\mathbb {R}^{n}\rightarrow \mathbb {R}$. The $\sup -J$ extension of f at $(A_{1},\ldots ,A_{n})$ is defined by

$$\begin{aligned} f_{J}(A_{1},\ldots ,A_{n})(y) = \hat{f}(J)(y) = \bigvee _{(x_{1},\ldots ,x_{n})\in f^{-1}(y)}J(x_{1},\ldots ,x_{n}), \end{aligned}$$

where $f^{-1}(y)=\{(x_{1},\ldots ,x_{n})\in \mathbb {R}^{n}:f(x_{1},\ldots ,x_{n})=y\}$.

From Definition 2, we can define arithmetic operations among n fuzzy numbers by taking the sup-J extension of the corresponding arithmetic operator. For example, let $f(x_{1},\ldots ,x_{n})=x_{1}+\ldots +x_{n}$ for all $x_1,\ldots ,x_n \in \mathbb {R}$. If $J_\wedge $ is defined as in (2.4) with $t=\wedge $, then $f_{J_{\wedge }}(A_{1},\ldots ,A_{n})$ boils down to Zadeh’s extension of f at $(A_{1},\ldots ,A_{n})$, i.e.,

$$\begin{aligned} \widehat{f}(A_{1},\ldots ,A_{n})(y)=\bigvee _{(x_{1},\ldots ,x_{n})\in f^{-1}(y)}A_{1}(x_{1})\wedge \ldots \wedge A_{n}(x_{n}),\quad \forall \ y \in \mathbb {R}. \end{aligned}$$

The next proposition ensures that the completely correlation is a transitive relation of interactivity between fuzzy numbers. Moreover, under some conditions, the sup-$J_{q,r}$ extensions of the addition operator, denoted by the symbol $+_L$, satisfies the associative property.

Proposition 1

[22]. Let A, B, $C \in \mathbb {R}_{\mathcal {F}}$. If A and B are completely correlated with respect to $J_{\{q_1,r_1\}}$ and B and C are completely correlated with respect to $J_{\{q_2,r_2\}}$, then there are real numbers $q_3$ and $r_3$ such that A and C are completely correlated with respect to $J_{\{q_3,r_3\}}$.

Moreover, if each A, B, $C\in \mathbb {R}_{\mathcal {F}}$ is completely correlated to $D\in \mathbb {R}_{\mathcal {F}}\backslash \mathbb {R}$, then the associative property holds true, i.e., $A+_L(B+_LC)=(A+_LB)+_LC$.

The notion of completely correlation can be extended to n fuzzy numbers as follows.

Definition 3

The fuzzy numbers $A_{1},\ldots ,A_{n}\in \mathbb {R}_{\mathcal {F}}$ are said completely correlated if the joint possibility distribution J is given by

$$\begin{aligned} J(x_{1},\ldots ,x_{n})= & {} \chi _{U}(x_{1},\ldots ,x_{n})A_{1}(x_{1}) \\ \nonumber= & {} \chi _{U}(x_{1},\ldots ,x_{n})A_{2}(x_{2}) \\ \nonumber&\vdots&\\ \nonumber= & {} \chi _{U}(x_{1},\ldots ,x_{n})A_{n}(x_{n}) \nonumber , \end{aligned}$$

(2.6)

where $U=\{(u,q_{2}u+r_{2},\ldots ,q_{n}u+r_{n}):u\in \mathbb {R}\}$, $q_i,r_i\in \mathbb {R}$, with $q_i\ne 0$, $\forall i=1,\ldots ,n$.

From (2.5) and (2.6), one can see that $A_1$ and $A_i$, $i>1$, are also completely correlated since we have $[A_{i}]^{\alpha }=q_{i}[A_{1}]^{\alpha }+\{r_{i}\}$, for all $i=2,\ldots ,n$. This implies that, for each $\alpha \in [0,1]$, the $\alpha $-cut of J is given as follows

$$\begin{aligned}{}[J]^{\alpha }=\left\{ (x,q_{2}x+r_{2},\ldots ,q_{n}x+r_{n}) \ :\ x\in [A_{1}]^{\alpha }\right\} \end{aligned}$$

(2.7)

Remark 2

From Eq. (2.7), we can note that the $\alpha $-cuts of the joint possibility distribution J can be expressed in terms of $\alpha $-cuts of $A_{1}$ and the parameters $q_{i}$ and $r_{i}$, for all $i=2,\ldots ,n$.

Theorem 2

[23, 24]. Let $f:\mathbb {R}^{n}\rightarrow \mathbb {R}$ be a continuous function and $J\in \mathcal {F}(\mathbb {R}^{n})$. We have that

$$[\widehat{f}_J(A_1, \ldots , A_n)]^{\alpha }=f([J]^{\alpha }), \ \forall \alpha \in [0,1].$$

By Theorem 2, if the $\sup $-J extension of f at $(A_1,\ldots ,A_n)$ is a fuzzy number, then the $\alpha $-cuts of $\widehat{f}_J(A_1,\ldots ,A_n) = \widehat{f}(J)$ can be written as follows:

$$\begin{aligned}{}[\widehat{f}(J)]^{\alpha }=\displaystyle \left[ \bigwedge _{(x_{1},\ldots ,x_{n})\in [J]^{\alpha }}f(x_{1},\ldots ,x_{n}) \quad \bigvee _{(x_{1},\ldots ,x_{n})\in [J]^{\alpha }}f(x_{1},\ldots ,x_{n})\right] . \end{aligned}$$

(2.8)

In the next section, we consider the problem given in (2.1) for the case where the known values $y_i$ are interactive fuzzy numbers.

3 Least Squares Method for Interactive Fuzzy Data

In this paper, we deal with least squares method to fit uncertain data given by interactive fuzzy numbers. In particular, we focus on the case where these fuzzy numbers are completely correlated. A typical example of correlated data are the well-known longitudinal data, which are widely studied in the statistical area [13].

Let $D=\{(x_{1},Y_{1}),\ldots ,(x_{m},Y_{m})\} \subset \mathbb {R}\times \mathbb {R}_{\mathcal {F}}$ such that $Y_1,\ldots ,Y_m$ are completely correlated fuzzy numbers, with respect to a joint possibility distribution J as in (2.6), and let $F:\mathbb {R}\rightarrow \mathbb {R}_{\mathcal {F}}$ be a function that satisfies $F(x_{i}) = Y_{i}$ for $i=1,\ldots ,m$. We produce a function $\varPhi :\mathbb {R}\rightarrow \mathbb {R}_{\mathcal {F}}$ that approximates F given by means of the sup-J extension principle of a function $\varphi :\mathbb {R}\rightarrow \mathbb {R}$ of the form

$$\begin{aligned} \varphi (x)=a_1g_1(x)+\ldots +a_ng_n(x), \end{aligned}$$

where $a_1,\ldots ,a_n\in \mathbb {R}$ and $g_1,\ldots ,g_n$ are real-valued-functions. More precisely, we define the function $\varPhi $ in terms of the sup-J extension principle of (2.3) at $(Y_{1},\ldots ,Y_{m})$. Since Eq. (2.3) is continuous with respect to $y_1,\ldots ,y_m$, from Theorem 2 and Eq. (2.7), we have that $\alpha $-cuts of the fuzzy number $\varPhi (x)$ is given by

$$\begin{aligned}{}[\varPhi (x)]^\alpha= & {} \{ s_1(x)y_1 + \ldots + s_m(x)y_m : (y_1,\ldots ,y_m) \in [J]^\alpha \} \\ \nonumber= & {} \{ s_1(x)y + s_2(x)(q_2y + r_2) + \ldots + s_m(x)(q_my + r_m) y : y \in [Y_1]^\alpha \}. \end{aligned}$$

(3.9)

Since the interval $[Y_1]^\alpha = [{y_1}_\alpha ^-,{y_1}_\alpha ^+]$ can be rewritten as the set of all convex combination of ${y_1}_\alpha ^-$ and ${y_1}_\alpha ^+$, that is, $[Y_1]^\alpha = \{(1-\lambda ){y_1}_\alpha ^- + \lambda {y_1}_\alpha ^+: \lambda \in [0,1] \}$, the $\alpha $-cut of J can also be expressed in terms of a parameter $\lambda \in [0,1]$ as follows:

$$\begin{aligned}{}[J]^\alpha= & {} \{ (1-\lambda )Y_\alpha ^- + \lambda Y_\alpha ^+ : \lambda \in [0,1] \}, \end{aligned}$$

where $Y_\alpha ^- = ({y_1}_\alpha ^-, q_2{y_1}_\alpha ^- + r_2,\ldots ,q_m{y_1}_\alpha ^- + r_m )$ and $Y_\alpha ^+ = ({y_1}_\alpha ^+, q_2{y_1}_\alpha ^+ + r_2,\ldots ,$ $q_m{y_1}_\alpha ^+ + r_m )$. Thus, Eq. (3.9) can be expressed as

$$\begin{aligned}{}[\varPhi (x)]^\alpha= & {} \{ (1-\lambda )\langle S(x), Y_\alpha ^- \rangle + \lambda \langle S(x), Y_\alpha ^+ \rangle : \lambda \in [0,1] \} \end{aligned}$$

(3.10)

where $\langle \cdot , \cdot \rangle $ denotes the usual inner product of $\mathbb {R}^m$ and $S(x) = (s_{1}(x),s_{2}(x),\ldots ,$ $s_{m}(x))$, $x \in \mathbb {R}.$

In order to characterize the endpoints of each $\alpha $-cut of $\varPhi (x)$, we define the auxiliary function h by

$$\begin{aligned} h(x,\alpha ,\lambda ) = (1-\lambda )B_1(x,\alpha )+\lambda B_2(x,\alpha ),\;\forall x \in \mathbb {R}\;\text {and}\;\forall \alpha , \lambda \in [0,1], \end{aligned}$$

where

$$\begin{aligned} B_1(x,\alpha )= \langle S(x), Y_\alpha ^- \rangle \;\text {and}\; B_2(x,\alpha ) = \langle S(x), Y_\alpha ^+ \rangle . \end{aligned}$$

By Eqs. (3.10) and (2.8), we have that

$$\begin{aligned} \nonumber [\varPhi (x)]^{\alpha }= & {} \{h(x,\alpha ,\lambda ):\lambda \in [0,1]\} \\= & {} \left[ \bigwedge _{\lambda \in [0,1]}h(x,\alpha ,\lambda ),\bigvee _{\lambda \in [0,1]}h(x,\alpha ,\lambda )\right] . \end{aligned}$$

(3.11)

Note that if $B_1(x,\alpha )\le B_2(x,\alpha )$, then the function $h(x,\alpha , \cdot )$ assumes the minimum and the maximum values at $\lambda =0$ and $\lambda =1$, respectively. On the other hand, if $B_1(x,\alpha )>B_2(x,\alpha )$ then the minimum and maximum values of $h(x,\alpha , \cdot )$ are achieved at $\lambda = 1$ and $\lambda = 0$, respectively. In other words, the global minimizer and maximizer of $h(x, \alpha , \lambda )$ for $\lambda \in [0, 1]$ are given at $\lambda = 0$ or $\lambda = 1$. Therefore, for each $x \in \mathbb {R}$, the $\alpha $-cuts of the fuzzy solution $\varphi $ is given by

$$\begin{aligned}{}[\varPhi (x)]^{\alpha }= & {} [\min \{h(x,\alpha ,0),h(x,\alpha ,1)\},\max \{h(x,\alpha ,0),h(x,\alpha ,1)\}], \end{aligned}$$

(3.12)

where

$$\begin{aligned} h(x,\alpha ,0)=B_1(x,\alpha )= \langle S(x), Y_\alpha ^- \rangle \end{aligned}$$

and

$$\begin{aligned} \displaystyle h(x,\alpha ,1)=B_2(x,\alpha )= \langle S(x), Y_\alpha ^+ \rangle . \end{aligned}$$

In the next section we illustrate this proposed method by means of an example.

4 Application of Least Squares Method for Completely Correlated Fuzzy Data

In this section we apply the proposed method to determine a function that fits longitudinal data obtained from [15]. The authors discussed the association between children mortality and air pollution in São Paulo, Brazil, from 1994 to 1997. In their study were collected longitudinal data of sulfur dioxide ($SO_2$), carbon monoxide (CO), inhalable particulate ($PM_{10}$) and ozone ($O_3$). Here, we focus on the ozone dataset.

For simplicity, suppose that the longitudinal data are given by completely correlated triangular fuzzy numbers of the form $(M-\sigma ;M;M+\sigma )$, where M and $\sigma $ are the mean and the standard deviation of the collected data in each year, respectively. Recall that the proposed method is not restricted to triangular fuzzy numbers, then other types of fuzzy number can be considered.

Let $D=\{(x_1,Y_1),(x_2,Y_2),(x_3,Y_3),(x_4,Y_4)\} \subset \mathbb {R}\times \mathbb {R}_{\mathcal {F}}$ be the fuzzy dataset given in Table 1. The values $x_{1}=1$, $x_{2}=2$, $x_{3}=3$, and $x_{4}=4$ represent respectively the years 1994, 1995, 1996, and 1997. The fuzzy numbers $Y_{1}=(17.6;57;96.4)$, $Y_{2}=(25.3;60.7;96.1)$, $Y_{3}=(34.8;76.3;117.8)$, and $Y_{4}=(29.5;63;96.5)$ are completely correlated with respect to joint possibility distribution J, whose membership function is given by

$$\begin{aligned} J(v_{1},v_{2},v_{3},v_{4})=\chi _U(v_{1},v_{2},v_{3},v_{4})Y_{1}(v_{1}), \ \forall \ (v_{1},v_{2},v_{3},v_{4})\in \mathbb {R}^{4}, \end{aligned}$$

where

$$\begin{aligned} U=\{(u,0.8985u+9.4855,1.0533u+16.2619,0.8502u+14.5386)\ :\ u\in \mathbb {R}\}. \end{aligned}$$

(4.13)

Table 1. Fuzzy dataset D

Full size table

Note that Eq. (4.13) suggests that $Y_{1}$ and $Y_{2}$ are positively completely correlated, as well as $Y_{1}$ and $Y_{3}$, $Y_{1}$ and $Y_{4}$, since $q_{i}>0$, for all $i=2,3,4$.

Consider the functions $g_{1}(x)=x^{2}$, $g_{2}(x)=x$ and $g_{3}(x)=1$. From (3.12), for each $\alpha \in [0,1]$ and $x\in [1,4]$, the fuzzy function $\varPhi $ is given by $[\varPhi (x)]^{\alpha }=[\min \{h(x,\alpha ,0),h(x,\alpha ,1)\},\max \{h(x,\alpha ,0),h(x,\alpha ,1)\}],$ where

$$\begin{aligned} h(x,\alpha ,0)=-3.24x^2+20.76x-0.75+\alpha (-x^2+3.84x+35.34) \end{aligned}$$

and

$$\begin{aligned} h(x,\alpha ,1)=-5.24x^2+28.44x+69.93-\alpha (-x^2+3.84x+35.34). \end{aligned}$$

Figure 1 exhibits the fuzzy function $\varPhi $ produced by our proposal. One can observe in Subfigure 1(a) fits the data of Table 1 which varies from 1994 to 1997. The red triangles and the gray-scale surface depicted in Subfigure 1(b) correspond to the membership functions of fuzzy data $Y_i$, $i=1,\ldots ,4$, and fuzzy solution, respectively.

Note that $Y_1,\ldots ,Y_4$ are completely correlated with respect to $2^3$ different joint possibility distributions. Thus, we can obtain $2^3$ fuzzy functions $\varPhi $. However, in general, the choice of a joint possibility distribution is not arbitrary and depends on the context. For example, if each object is measured m times with the same n measuring devices then we can assume that the obtained values depend only on the calibration of each equipment and not on the objects. This type of assumption induces the choice of specific parameters $q_i$ and $r_i$ in (2.7).

5 Conclusion

In this manuscript, we considered a fuzzy least squares problem based on dataset that has some type of correlation, for example a longitudinal dataset. We assumed that the dataset is composed by completely correlated fuzzy numbers [5]. In particular, we presented a method that provides a fuzzy function that fits a given fuzzy data. This fuzzy function depends on the choice of a joint possibility distributions as in (2.6).

The $\alpha $-cut of the fuzzy solution given by means of the $\sup $-J extension principle is a non-empty, bounded, closed interval whose endpoints are obtained by solving a minimization and maximization problems given in Eq. (3.11). Investigating this problem, we concluded that the endpoints of the $\alpha $-cut of the proposed solution can be evaluated by taking the minimum and maximum of two associated real functions (see Eq. (3.12)).

Finally, we applied the proposed method to determine a fuzzy function which fits a longitudinal air polution dataset [15]. The fuzzy data in this dataset was modelled using triangular fuzzy numbers, but it can be done with other types of completely correlated fuzzy numbers. The fuzzy solution was calculated considering polynomial functions $g_1$, $g_2$, and $g_3$. For further works, we intend to investigate fuzzy least squares method for dataset with other intrinsic type of interactivity.

References

Buckingham, R.A.: Numerical Methods. Sir Isaac Pitman & Sons Ltd., London (1966)
Google Scholar
Tanaka, H., Uejima, S., Asai, K.: Linear regression analysis with fuzzy model. IEEE Trans. Syst. 6, 903–907 (1982)
MATH Google Scholar
Celmins, A.: Least squares model fitting to fuzzy vector data. Fuzzy Sets Syst. 22, 245–269 (1987)
Article MathSciNet Google Scholar
Dubois, D., Prade, H.: Additions of interactive fuzzy numbers. IEEE (1981)
Google Scholar
Carlsson, C., Fullér, R., Majlender, P.: Additions of completely correlated fuzzy numbers. In: IEEE International Conference on Fuzzy Systems, vol. 1, pp. 535–539 (2004)
Google Scholar
Fullér, R., Majlender, P.: On interactive fuzzy numbers. Fuzzy Sets Syst. 143, 355–369 (2004)
Article MathSciNet Google Scholar
Tanaka, H., Ishibuchi, H.: Identification of possibilistic linear systems by quadratic membership functions of fuzzy parameters. Fuzzy Sets Syst. 41, 145–160 (1991)
Article MathSciNet Google Scholar
Diamond, P.: Fuzzy least squares. Inf. Sci. 46, 141–157 (1988)
Article MathSciNet Google Scholar
Wu, B., Tseng, N.-F.: A new approach to fuzzy regression models with application to business cycle analysis. Fuzzy Sets Syst. 130, 33–42 (2002)
Article MathSciNet Google Scholar
Takemura, K.: Fuzzy least squares regression analysis for social judgment study. J. Adv. Comput. Intell. Intell. Inform. 9(5), 461–466 (2005)
Article Google Scholar
Seng, K.-Y., Nestorov, I., Vicini, P.: Fuzzy least squares for identification of individual pharmacokinetic parameters. IEEE Trans. Biomed. Eng. 56(12), 2796–2805 (2009)
Article Google Scholar
Torfi, F., Farahani, R.Z., Mahdavi, I.: Fuzzy least-squares linear regression approach to ascertain stochastic demand in the vehicle routing problem. Appl. Math. 2, 64–73 (2011)
Article Google Scholar
Zeger, S.L., Liang, K.Y., Albert, P.S.: Models for longitudinal data: a generalized estimating equation approach. Biometrics 4, 1049–1060 (1988)
Article MathSciNet Google Scholar
Barros, L.C., Pedro, F.S.: Fuzzy differential equations with interactive derivative. Fuzzy Sets Syst. 309, 64–80 (2017)
Article MathSciNet Google Scholar
Conceição, G.M.S., Miraglia, S.G.E.K., Kishi, H.S., Saldiva, P.H.N., Singer, J.M.: Air pollution and child mortality: a time-series study in São Paulo Brazil. Environ. Health Perspect. 109, 347–350 (2001)
Google Scholar
Zadeh, L.A.: Fuzzy sets. Inf. Control 8, 338–353 (1965)
Article Google Scholar
Barros, L.C., Bassanezi, R.C., Lodwick, W.A.: A First Course in Fuzzy Logic, Fuzzy Dynamical Systems, and Biomathematics. Springer, Heidelberg (2017). https://doi.org/10.1007/978-3-662-53324-6
Book MATH Google Scholar
Zadeh, L.A.: Concept of a linguistic variable and its application to approximate reasoning, i, ii, iii. Inf. Sci. 8, 199–249, 301–357 (1975)
Google Scholar
Negoita, C., Ralescu, D.: Application of Fuzzy Sets to Systems Analysis. Wiley, New York (1975)
Book Google Scholar
Bede, B.: Mathematics of Fuzzy Sets and Fuzzy Logic. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-35221-8
Book MATH Google Scholar
Dubois, D., Prade, H.: Possibility Theory: An Approach to the Computerized Processing of Information. Springer, New York (1988). https://doi.org/10.1007/978-1-4684-5287-7
Book MATH Google Scholar
Pedro, F.S.: On differential equations for linearly correlated fuzzy processes: applications in population dynamics. Ph.D. thesis, Portuguese (2017)
Google Scholar
Nguyen, H.T.: A note on the extension principle for fuzzy sets. J. Math. Anal. Appl. 64, 369–380 (1978)
Article MathSciNet Google Scholar
Barros, L.C., Bassanezi, R.C., Tonelli, P.A.: On the continuity of the Zadeh’s extension. Presented at Proceedings of the IFSA Congress (1997)
Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Mathematics, Statistics and Scientific Computing, State University of Campinas, Campinas, São Paulo, 13081-970, Brazil
Nilmara J. B. Pinto, Vinícius F. Wasques, Estevão Esmi & Laécio C. Barros

Authors

Nilmara J. B. Pinto
View author publications
You can also search for this author in PubMed Google Scholar
Vinícius F. Wasques
View author publications
You can also search for this author in PubMed Google Scholar
Estevão Esmi
View author publications
You can also search for this author in PubMed Google Scholar
Laécio C. Barros
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Nilmara J. B. Pinto or Vinícius F. Wasques .

Editor information

Editors and Affiliations

Department of Teleinformatics Engineering, Federal University of Ceará, Fortaleza, Ceará, Brazil
Guilherme A. Barreto
Department of Statistics & Applied Mathematics, Federal University of Ceará, Fortaleza, Ceará, Brazil
Ricardo Coelho

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Pinto, N.J.B., Wasques, V.F., Esmi, E., Barros, L.C. (2018). Least Squares Method with Interactive Fuzzy Coefficient: Application on Longitudinal Data. In: Barreto, G., Coelho, R. (eds) Fuzzy Information Processing. NAFIPS 2018. Communications in Computer and Information Science, vol 831. Springer, Cham. https://doi.org/10.1007/978-3-319-95312-0_12

Download citation

DOI: https://doi.org/10.1007/978-3-319-95312-0_12
Published: 04 July 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-95311-3
Online ISBN: 978-3-319-95312-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Least Squares Method with Interactive Fuzzy Coefficient: Application on Longitudinal Data

Abstract

Similar content being viewed by others

A Fuzzy Convex Nonparametric Least Squares Method with Different Shape Constraints

Fuzzy Clusterwise Functional Extended Redundancy Analysis

M-based simultaneous inference for the mean function of functional data

Keywords

1 Introduction

2 Mathematical Background

2.1 Least Square Method

2.2 Fuzzy Set Theory

Definition 1

Theorem 1

Remark 1

Definition 2

Proposition 1

Definition 3

Remark 2

Theorem 2

3 Least Squares Method for Interactive Fuzzy Data

4 Application of Least Squares Method for Completely Correlated Fuzzy Data

5 Conclusion

References

Author information

Authors and Affiliations

Corresponding authors

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Least Squares Method with Interactive Fuzzy Coefficient: Application on Longitudinal Data

Abstract

Similar content being viewed by others

A Fuzzy Convex Nonparametric Least Squares Method with Different Shape Constraints

Fuzzy Clusterwise Functional Extended Redundancy Analysis

M-based simultaneous inference for the mean function of functional data

Keywords

1 Introduction

2 Mathematical Background

2.1 Least Square Method

2.2 Fuzzy Set Theory

Definition 1

Theorem 1

Remark 1

Definition 2

Proposition 1

Definition 3

Remark 2

Theorem 2

3 Least Squares Method for Interactive Fuzzy Data

4 Application of Least Squares Method for Completely Correlated Fuzzy Data

5 Conclusion

References

Author information

Authors and Affiliations

Corresponding authors

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation