1 Introduction

Since it was proposed by Zadeh [1], the theory of fuzzy set (FS) has achieved a great success due to its capability of handling uncertainty. Therefore, over the last decades, several higher order fuzzy sets have been introduced in the literature. Intuitionistic fuzzy set (IFS), as one of the higher order fuzzy sets, was proposed by Atanassov [2] to deal with vagueness. The main advantage of the IFS is its property to cope with the uncertainty that may exist due to information impression. Because it assigns to each element a membership degree, a non-membership degree and a hesitation degree, and thus, IFS constitutes an extension of Zadeh’s fuzzy set which only assigns to each element a membership degree [3]. So IFS is regarded as a more effective way to deal with vagueness than fuzzy set. Although Gau and Buehrer later presented vague set [4], it was pointed out by Bustince and Burillo that the notion of vague sets was the same as that of IFS [5].

The definition of similarity measure between two IFSs is one of the most interesting topics in IFSs theory. A similarity measure is defined to compare the information carried by IFSs. Measures of similarity between IFSs, as an important tool for decision making, pattern recognition, machine learning, and image processing, has received much attention in recent years [627]. Among the similarity measures proposed, a few of them come from the well-known distance measures [812], such as the Hamming distance, the Euclidian distance and the Hausdorff distance. Other similarity measures are defined based on the linear or non-linear relationship of the membership and non-membership functions of IFSs [1320]. There are also other kinds of similarity measures, e.g., similarity defined by entropy measures for IFSs, similarity induced by interval comparison and cosine similarity [7, 2127].

If taking a closer examination on the existing similarity measures between IFSs, we can find that some of those cannot fully satisfy the axiomatic definition of similarity by providing counter-intuitive cases; others are lack of definitude physical meaning with complicated expressions. Therefore, the definition of similarity measure is still an open problem achieving more interest. In this paper, we propose a new similarity measures with relative simple expression. The proposed similarity measure can be considered as the consistency of two IFSs. We define it by the direct operation on the membership function, non-membership function, hesitation function and the upper bound of membership function of two IFS, rather than defining it based on the distance measure or the relationship of membership and non-membership functions. The computation of our proposed similarity involves operations of multiplication and evolution without choosing other parameters, which is relatively simple and concise. Illustrative examples reveal that the proposed measures satisfy the properties of the axiomatic definition for similarity measures. In addition, several comparative examples are provided to show the performance of the proposed similarity measure.

The remainder of this paper is organized as follows. Section 2 presents the definitions related to the IFSs, similarity measure between IFSs, and existing similarity measures together. The new similarity measure, along with its interpretations is presented in Section 3. Comparison between the proposed similarity measure and the existing similarity measures is carried out in Section 4. The application of the proposed similarity measure to pattern recognition is presented in Section 5, followed by the conclusion of this paper in Section 6.

2 Intuitionistic fuzzy set and similarity measures

In this section, we firstly recall the basic definitions related to IFS and similarity measure to facilitate subsequent interpretation. Critical analyses on the existing similarity measures are then presented.

2.1 Basic definitions

Definition 1

Let X = {x 1, x 2, … , x n } be a universe of discourse, then a fuzzy set A in Xis defined as follows:

$$ A=\left\{ {\left\langle {x,\mu_{A} (x)} \right\rangle \left| {x\in X} \right.} \right\} $$
(1)

where μ A (x) : X → [0, 1]is the membership degree.

Definition 2

An IFS A in X defined by Atanassov can be written as:

$$ A=\left\{ {\left\langle {x,\mu_{A} (x),v_{A} (x)} \right\rangle \left| {x\in X} \right.} \right\} $$
(2)

where μ A (x):X → [0, 1]and v A (x) : X → [0, 1]are membership degree and non-membership degree, respectively, with the condition:

$$ 0\le \mu_{A} (x)+v_{A} (x)\le 1 $$
(3)

π A (x)determined by the following expression:

$$ \pi_{A} (x) = 1-\mu_{A} (x)-v_{A} (x) $$
(4)

is called the hesitancy degree of the element xX to the set A, and π A (x) ∈ [0, 1], ∀xX.

π A (x)is also called the intuitionistic index of x to A. Greater π A (x) indicates more vagueness on x. Obviously, when π A (x) = 0, ∀xX, the IFS degenerates into an ordinary fuzzy set.

In the sequel, the couple\(\left \langle {\mu _{A} (x),v_{A} (x)} \right \rangle \) is called an IFS or intuitionistic fuzzy value (IFV) for clarity. Let I F S s(X)denote the set of all IFSs inX.

It is worth noting that besides Definition 2 there are other possible representations of IF sets proposed in the literature. Hong and Choi [29] proposed to use an interval representation \(\left [ {\mu _{A} (x),1-v_{A} (x)} \right ]\) of intuitionistic fuzzy set A in X instead of pair\(\left \langle {\mu _{A} (x),v_{A} (x)} \right \rangle \). This approach is equivalent to the interval valued fuzzy sets interpretation of IF set, where μ A (x) and 1 − v A (x)represent the lower bound and upper bound of membership degree, respectively. Obviously, \(\left [ {\mu _{A} (x),1-v_{A} (x)} \right ]\) is a valid interval, since μ A (x) ≤ 1 − v A (x)always holds for μ A (x)+v A (x) ≤ 1.

Definition 3

For AI F S s(X) and BI F S s(X), some relations between them are defined as:

  1. (R1)

    \(A\subseteq B\) iff ∀xX μ A (x) ≤ μ B (x),v A (x) ≥ v B (x);

  2. (R2)

    A = B iff ∀xX μ A (x) = μ B (x),v A (x) = v B (x);

  3. (R3)

    \(A^{C}=\left \{ {\left \langle {x,v_{A} (x),\mu _{A} (x)} \right \rangle \left | {x\in X} \right .} \right \}\),where A C is the complement of A.

Definition 4

Let D denote a mapping D : I F S × I F S → [0, 1], if D(A, B) satisfies the following properties, D(A, B) is called a distance between AI F S s(X)and BI F S s(X).

  1. (DP1)

    0 ≤ D(A, B) ≤ 1;

  2. (DP2)

    D(A, B) = 0, if and only if A = B;

  3. (DP3)

    D(A, B) = D(B, A);

  4. (DP4)

    If \(A\subseteq B\subseteq C\), then D(A, B) ≤ D(A, C), and D(B, C) ≤ D(A, C).

Definition 5

A mapping S : I F S × I F S → [0, 1] is called a degree of similarity between AI F S s(X) and BI F S s(X), if S(A, B) satisfies the following properties:

  1. (SP1)

    0 ≤ S(A, B) ≤ 1;

  2. (SP2)

    S(A, B) = 1, if and only if A = B;

  3. (SP3)

    S(A, B) = S(B, A);

  4. (SP4)

    If \(A\subseteq B\subseteq C\), then S(A, B) ≥ S(A, C), and S(B, C) ≥ S(A, C).

Because distance and similarity measures are complementary concepts, similarity measures can be used to define distance measures, and vice versa.

2.2 Existing similarity measures

Because of the relationship between distance measure and similarity measure for IFSs, they are always defined together. For the sake of convenience, we would not differentiate these two concepts in this subsection.

The first study was carried out by Szmidt and Kacprzyk [8] extending the well-known distances measures, such as the Hamming Distance, the Euclidian Distance, to IFS and comparing them with the approaches used for ordinary fuzzy sets. However, Wang and Xin [9] implied that the distance measure of Szmidt and Kacprzyk [8] were not effective in some cases. Therefore, several new distance measures were proposed and applied to pattern recognition. Grzegorzewski [10] also extended the Hamming distance, the Euclidean distance, and their normalized counterparts to IFS. Later, Chen [11] pointed out that some errors existed in Grzegorzewski [10] by showing some counter examples. Hung and Yang [12] extended the Hausdorff distance to IFSs and proposed three similarity measures. On the other hand, instead of extending the well-known measures, some studies defined new similarity measures for IFSs. Li and Cheng [13] suggested a new similarity measure for IFSs based on the membership degree and the non-membership degree. Afterwards, Li [26] defined another two dissimilarity measures between intuitionistic fuzzy sets of a finite set, and it was proved that both of the measures are metrical. Mitchell [14] showed that the similarity measure of Li and Cheng [13] had some counter-intuitive cases and modified that similarity measure based on statistical point of view. Moreover, Liang and Shi [15] presented some examples to show that the similarity measure of Li and Cheng [13] was not reasonable for some conditions, and therefore proposed several new similarity measures for IFSs. Li et al. [16] analyzed, compared and summarized the existing similarity measures between IFSs/vague sets by their counter-intuitive examples in pattern recognition. Ye [7] conducted a similar comparative study of the existing similarity measures between IFSs and proposed a cosine similarity measure and a weighted cosine similarity measure. Hwang et al. [17] proposed a similarity measure for IFSs in which Sugeno integral was used for aggregation. The proposed similarity measure was applied to clustering problem. Xu [18] introduced a series of similarity measures for IFSs and applied them to multiple attribute decision making problem based on intuitionistic fuzzy information. Xu and Chen [19] introduced a series of distance and similarity measures, which are various combinations and generalizations of the weighted Hamming distance, the weighted Euclidean distance and the weighted Hausdorff distance. Xu and Yager [20] developed a similarity measure between IFSs and applied the developed similarity measure for consensus analysis in group decision making based on intuitionistic fuzzy preference relations. Xia and Xu [6] proposed a series of distance measures based on the intuitionistic fuzzy point operators.

As an addition to aforementioned studies, some attempts have been done to define similarity measures based on the relationships between distance measure, similarity measure and entropy of IFSs. Zeng and Guo [21] investigated the relationship among the normalized distance, the similarity measure, the inclusion measure, and the entropy of interval-valued fuzzy sets. It was also showed that the similarity measure, the inclusion measure, and the entropy of interval-valued fuzzy sets could be induced by the normalized distance of interval-valued fuzzy sets based on their axiomatic definitions. Wei et al. [22] introduced an entropy measure generalizing the existing entropy measures for IFS. They also introduced an approach to construct similarity measures using entropy measures for IFS and IFSs.

Besides, many other kinds of similarity measure between IFSs are emerging. Boran and Akay [23] proposed a new general type of similarity measure for IFS with two parameters, expressing L p norm and the level of uncertainty, respectively. This similarity measure can also make sense in terms of counter-intuitive cases. Zhang and Yu [24] presented a new distance measure based on interval comparison, where the IFSs were respectively transformed into the symmetric triangular fuzzy numbers. Comparison with the widely used methods indicated that the proposed method contained more information, with much less loss of information. Li et al. [25] introduced an axiomatic definition of the similarity measure of IFSs. The relationship between the entropy and the similarity measure of IFS was investigated in detail. It was proved that the similarity measure and the entropy of IFS can be transformed into each other based on their axiomatic definitions. G.A. Papakostas et al. [27] investigated the main theoretical and computational properties of the measures, as well as the relationships between them. A comparison of the distance and similarity measures was carried out by them, from a pattern recognition point of view.

As a summation, Table 1 presents several well-known similarity measures that will be analyzed in this paper. In this table, we let X = {x 1, x 2, ⋯ , x n } be a universe of discourse, AI F S s(X) and BI F S s (X) be two IFSs in X, denoted by \(A=\left \{ {\left \langle {x,\mu _{A} (x),v_{A} (x)} \right \rangle \left | {x\in X} \right .} \right \}\) and \(B=\left \{ {\left \langle {x,\mu _{B} (x),v_{B} (x)} \right \rangle \left | {x\in X} \right .} \right \}\), respectively. For clarity, we only give the expressions of similarity measures, with an absence of the interpretations of other intermediate variables, which can be found in related references. Since a comprehensive analysis on most of these similarity measures has been carried out by Li et al. in [16], it is trivial to present such analysis repeatedly. More discussion about the drawbacks of these similarity measures will be detailed in Section 4.

Table 1 Similarity measures proposed by other authors

3 A new similarity measure

Let\(A=\left \{ {\left \langle {x,\mu _{A} (x),v_{A} (x)} \right \rangle \left | {x\in X} \right .} \right \}\) and \(B=\left \{ {\left \langle {x,\mu _{B} (x),v_{B} (x)} \right \rangle \left | {x\in X} \right .} \right \}\)be two IFSs inX. We propose a new similarity measure. If we consider A and Bas intervals representation, the information carried by them is determined by not only the lower and upper bounds, but also the span of the interval. So we can define a similarity measure between A and B as:

$$\begin{array}{@{}rcl@{}} S_{Y} (A,B)&=&\frac{1}{2n}\sum\nolimits_{i=1}^{n} \left( \sqrt {\mu_{A} (x_{i} )\mu_{B} (x_{i} )} +2\sqrt {v_{A} (x_{i} )v_{B} (x_{i} )} \right.\\ &+&\left.\sqrt{\pi_{A} (x_{i} )\pi_{B} (x_{i} )} +\sqrt {\left( 1-v_{A} (x_{i} ) \right)\left( 1-v_{B} (x_{i} ) \right)} \right) \end{array} $$
(5)

Theorem 1

S Y (A, B) is a similarity measure between two IFSs A and B in X.

Proof

For the sake of simplicity, IFSs A and B are denoted by \(A=\left \{ {\left \langle {\mu _{A} (x_{i} ),v_{A} (x_{i} )} \right \rangle } \right \}\) and \(B=\left \{ {\left \langle {\mu _{B} (x_{i} ),v_{B} (x_{i} )} \right \rangle } \right \}\), respectively.

  1. (SP1)

    For each \(x,y\in [0,+\infty ]\), we have \(0\le \sqrt {xy} \le \frac {x+y}{2}\).

For 0≤μ(x i )≤1, 0≤v(x i )≤1,0≤π(x i )≤1 and 0≤1−v(x i )≤1, we can get:

  1. (i)

    \(\le \sqrt {\mu _{A} (x_{i} )\mu _{B} (x_{i} )} +2\sqrt {v_{A} (x_{i} )v_{B} (x_{i} )} +\sqrt {\pi _{A} (x_{i} )\pi _{B} (x_{i} )} +\sqrt {\left ({1-v_{A} (x_{i} )} \right )\left ({1-v_{B} (x_{i} )} \right )} \) \(\le \frac {\mu _{A} (x_{i} )+\mu _{B} (x_{i} )}{2}+2\cdot \frac {v_{A} (x_{i} )+v_{B} (x_{i} )}{2}+\frac {\pi _{A} (x_{i} )+\pi _{B} (x_{i} )}{2}+\frac {1-v_{A} (x_{i} )+1-v_{B} (x_{i} )}{2} \), \(=1+\frac {\mu _{A} (x_{i} )+v_{A} (x_{i} )+\pi _{A} (x_{i} )}{2}+\frac {\mu _{B} (x_{i} )+v_{B} (x_{i} )+\pi _{B} (x_{i} )}{2} \) =2

and

$$\begin{array}{@{}rcl@{}} &&0\le \sum\nolimits_{i=1}^{n} \left( \sqrt {\mu_{A} (x_{i} )\mu_{B} (x_{i} )} +2\sqrt {v_{A} (x_{i} )v_{B} (x_{i} )}\right. \\ &&\left.+\sqrt {\pi_{A} (x_{i} )\pi_{B} (x_{i} )} +\sqrt {\left( {1-v_{A} (x_{i} )} \right)\left( {1-v_{B} (x_{i} )} \right)} \right) \le 2n. \end{array} $$

So we have 0≤S Y (A,B)≤1.

  1. (SP2)

    We know that \(\sqrt {xy}\) achieves its maximum value \(\frac {x+y}{2}\) when x=y. Therefore, we have:

  1. (ii)

    \(S_{Y} (A,B)=1\Leftrightarrow \sqrt {\mu _{A} (x_{i} )\mu _{B} (x_{i} )} +2\sqrt {v_{A} (x_{i} )v_{B} (x_{i} )} +\sqrt {\pi _{A} (x_{i} )\pi _{B} (x_{i} )} +\sqrt {\left ({1-v_{A} (x_{i} )} \right )\left ({1-v_{B} (x_{i} )} \right )} =2 \) \(\Leftrightarrow \mu _{A} (x_{i} )=\mu _{B} (x_{i} ),v_{A} (x_{i} )=v_{B} (x_{i} ),\pi _{A} (x_{i} )=\pi _{B} (x_{i} ),1-v_{A} (x_{i} )=1-v_{B} (x_{i} ) \) \(\Leftrightarrow A=B \)

Thus, S Y (A,B)=1, if and only if A=B.

  1. (SP3)

    It is easy to note that the expression of S Y (A,B) is commutative. So we have S Y (A,B)=S Y (B,A).

  2. (SP4)

    Let \(C=\left \{ {\left \langle {\mu _{C} (x_{i} ),v_{C} (x_{i} )} \right \rangle } \right \}\) be another IFS in X, satisfying \(A\subseteq B\subseteq C\). We have 0≤μ A (x i )≤μ B (x i )≤μ C (x i )≤1 and 0≤v C (x i )≤v B (x i )≤v A (x i )≤1, for ∀xX. Based on (5), the similarity measures between (B,C) and (A,C), can be written as:

$$\begin{array}{@{}rcl@{}}S_{Y} (B,C)=\frac{1}{2n}\sum\nolimits_{i=1}^{n} \left( \sqrt {\mu_{B} (x_{i} )\mu_{C} (x_{i} )} +\sqrt {v_{B} (x_{i} )v_{C} (x_{i} )} \right.\\ &&\left.+\sqrt {\pi_{B} (x_{i} )\pi_{C} (x_{i} )} +\sqrt {\left( {1-v_{B} (x_{i} )} \right)\left( {1-v_{C} (x_{i} )} \right)} \right) \end{array} $$
$$\begin{array}{@{}rcl@{}}S_{Y} (A,C)=\frac{1}{2n}\sum\nolimits_{i=1}^{n} \left( \sqrt {\mu_{A} (x_{i} )\mu_{C} (x_{i} )} +\sqrt {v_{A} (x_{i} )v_{C} (x_{i} )} \right.\\ &&\left.+\sqrt {\pi_{A} (x_{i} )\pi_{C} (x_{i} )} +\sqrt {\left( {1-v_{A} (x_{i} )} \right)\left( {1-v_{C} (x_{i} )} \right)} \right) \end{array} $$

For a,b∈[0,1],a+b≤1, we define a function f as:

$$\begin{array}{@{}rcl@{}}f(x,y)=\sqrt {ax} +2\sqrt {by} \\ &&+\sqrt {(1-a-b)(1-x-y)} +\sqrt {(1-b)(1-y)} \end{array} $$

where x,y∈[0,1],x+y∈[0,1].

Then we have:

$$\begin{array}{@{}rcl@{}} &&\frac{\partial f}{\partial x}=\frac{\sqrt a} {2\sqrt x} -\frac{\sqrt {1-a-b}} {2\sqrt {1-x-y}} \\ &&=\frac{(a-x)(1-b)}{2\sqrt {x(1-x-y)} \left( {\sqrt {a(1-x-y)} +\sqrt {(1-a-b)x}} \right)},\\ &&\frac{\partial f}{\partial y}=\frac{\sqrt b} {\sqrt y} -\frac{\sqrt {1-a-b}} {2\sqrt {1-x-y}} -\frac{\sqrt {1-b}} {2\sqrt {1-y}} \\ &&=\frac{(b-y)(1-a)}{2\sqrt {y(1-x-y)} \left( {\sqrt {b(1-x-y)} +\sqrt{(1-a-b)y}} \right)}\\ &&+\frac{b-y}{2\sqrt {y(1-y)} \left( {\sqrt {b(1-y)} +\sqrt {(1-b)y}} \right)}. \end{array} $$

Given ax≤1,b≤1, we have \(\frac {\partial f}{\partial x}\le 0\), which means that f is a decreasing function of x, when xa.

For 0≤xa,b≤1, we can get \(\frac {\partial f}{\partial x}\ge 0\), which means that f is an increasing function of x, when, xa.

Similarly, we can also get \(\frac {\partial f}{\partial y}\ge 0\) for 0≤yb,a≤1 and \(\frac {\partial f}{\partial y}\le 0\) for by≤1,a≤1. These indicate that f is an increasing function of y for yb, but a decreasing function when yb.

Given a=μ A (x i ),b=v A (x i ) and two couples \(\left ({\mu _{B} (x_{i} ),v_{B} (x_{i} )} \right ),\left ({\mu _{C} (x_{i} ),v_{C} (x_{i} )} \right )\), satisfying a=μ A (x i )≤μ B (x i )≤μ C (x i ) and v C (x i )≤v B (x i )≤v A (x i )=b, we can get:

$$f(\mu_{C} (x_{i} ),v_{C} (x_{i} ))\le f(\mu_{B} (x_{i} ),v_{C} (x_{i} ))\le f(\mu_{B} (x_{i} ),v_{B} (x_{i} )). $$

And then

  1. (iii)

    \(\sqrt {\mu _{A} (x_{i} )\mu _{C} (x_{i} )} +\sqrt {v_{A} (x_{i} )v_{C} (x_{i} )} \\+\sqrt {\pi _{A} (x_{i} )\pi _{C} (x_{i} )} +\sqrt {\left ({1-v_{A} (x_{i} )} \right )\left ({1-v_{C} (x_{i} )} \right )} \\ \le \sqrt {\mu _{A} (x_{i} )\mu _{B} (x_{i} )} +2\sqrt {v_{A} (x_{i} )v_{B} (x_{i} )} \\+\sqrt {\pi _{A} (x_{i} )\pi _{B} (x_{i} )} +\sqrt {\left ({1-v_{A} (x_{i} )} \right )\left ({1-v_{B} (x_{i} )} \right )} \)

Therefore, S Y (A,B)≥S Y (A,C).

In such a way, if we suppose a=μ C (x i )b=v C (x i ), considering another two couples \(\left ({\mu _{B} (x_{i} ),v_{B} (x_{i} )} \right )\) and \(\left ({\mu _{A} (x_{i} ),v_{A} (x_{i} )} \right )\), we have: μ A (x i )≤μ B (x i )≤μ C (x i )=a,b=v C (x i )≤v B (x i )≤v A (x i ).

Hence, it follows that

f(μ A (x i ),v A (x i ))≤f(μ B (x i ),v A (x i ))≤f(μ B (x i ),v B (x i )), which can be written as:

  1. (iv)

    \(\sqrt {\mu _{A} (x_{i} )\mu _{C} (x_{i} )} +\sqrt {v_{A} (x_{i} )v_{C} (x_{i} )} +\sqrt {\pi _{A} (x_{i} )\pi _{C} (x_{i} )} \\+\sqrt {\left ({1-v_{A} (x_{i} )} \right )\left ({1-v_{C} (x_{i} )} \right )} \\ \le \sqrt {\mu _{B} (x_{i} )\mu _{C} (x_{i} )} +2\sqrt {v_{B} (x_{i} )v_{C} (x_{i} )} +\sqrt {\pi _{B} (x_{i} )\pi _{C} (x_{i} )} \\+\sqrt {\left ({1-v_{B} (x_{i} )} \right )\left ({1-v_{C} (x_{i} )} \right )} \).

Then we have S Y (B,C)≥S Y (A,C).

So the similarity measure S Y (A,B) satisfies all properties in Definition 5. It is a similarity measure between IFSs. □

Considering the weights of x i , we can define the weighted similarity between two IFSs as:

$$\begin{array}{@{}rcl@{}} &&S_{WY} (A,B) = \frac{1}{2}\sum\nolimits_{i=1}^{n} w_{i} \left( \sqrt {\mu_{A} (x_{i} )\mu_{B} (x_{i} )} +2\sqrt {v_{A} (x_{i} )v_{B} (x_{i} )} \right.\\ &&\left.+\sqrt{\pi_{A} (x_{i} )\pi_{B} (x_{i} )} +\sqrt {\left( {1-v_{A} (x_{i} )} \right)\left( {1-v_{B} (x_{i} )} \right)} \right) \end{array} $$
(6)

where w i is the weights factor of the features x i , \(w_{i} \in [0,1]\textit {and}\sum \nolimits _{i=1}^{n} {w_{i}} =1\)

Theorem 2

S WY (A,B)is the similarity measure between two IFSs AandBinX.

4 Numerical comparisons

In order to illustrate the superiority of the proposed similarity measure, a comparison between the proposed similarity measure and all the existing similarity measures is conducted based on the numerical cases in [23], which are widely used as counter-intuitive examples. Table 2 presents the result with p = 1for \(S_{HB} ,{S_{e}^{p}} ,{S_{s}^{p}} ,{S_{h}^{p}} \textit {and}p=1\ t=2\textit {for}{S_{t}^{p}} \).

Table 2 The comparison of similarity measures (counter-intuitive cases are in bold type)
Table 3 The similarity measures between the known patterns and the unknown pattern in Example 2 (Patterns not discriminated are in bold type). (\(p=1\textit {for}S_{HB} ,{S_{e}^{p}} ,{S_{s}^{p}} ,{S_{h}^{p}} \textit {and}p=1\),\(t=2\textit {for}{S_{t}^{p}} \))

We can see that S C (A, B) = S D C (A, B) = C I F S (A, B) = 1for two different IF sets\(A=\left \langle {0.3,0.3} \right \rangle \)and \(B=\left \langle {0.4,0.4} \right \rangle \).This indicates that the second axiom of similarity measure (S2) is not satisfied by S C (A, B),S D C (A, B)and C I F S (A, B). This also can be illustrated by S C (A, B) = S D C (A, B) = 1when \(A=\left \langle {0.5,0.5} \right \rangle ,B=\left \langle {0,0} \right \rangle \textit {and}A=\left \langle {0.4,0.2} \right \rangle ,B=\left \langle {0.5,0.3} \right \rangle \). As for\(S_{H} ,S_{O} ,S_{HB} ,{S_{e}^{p}} ,{S_{s}^{p}} \textit {and}{S_{h}^{p}} \), different pairs of A, Bmay provide the identical results, which cannot satisfy the application of pattern recognition. It can be read from Table 2 that S H B =0.9for both\(A=\left \langle {0.3,0.3} \right \rangle ,B=\left \langle {0.4,0.4} \right \rangle \textit {and}A=\left \langle {0.3,0.4} \right \rangle ,B=\left \langle {0.4,0.3} \right \rangle \). Such situation seems to be worse for\(S_{HY}^{1} ,S_{HY}^{2} \)and \(S_{HY}^{3} \), where all the cases take the same similarity degree except case.3 and case.4. \({S_{t}^{p}} \)seems to be reasonable without any counter-intuitive results, but it bring new problem with the choice of parameters p and t, which is still an open problem. Moreover, we can notice an interesting situation when comparing case.3 and case.4. For three IF sets\(A=\left \langle {1,0} \right \rangle \), \(B=\left \langle {0.5,0.5} \right \rangle \textit {and}C=\left \langle {0,0} \right \rangle \), intuitively, it is more reasonable to take the similarity degree between them as: S F (A, C) = 0.15,S F (B, C) = 0.25than taking \({S_{t}^{p}} (A,C) = 0.5\)and \({S_{t}^{p}} (B,C) = 0.833\). In such a sense, the proposed similarity measure is the most reasonable one with a relative simple expression, and has none of the counter-intuitive cases. Three IF sets \(A=\left \langle {0.4,0.2} \right \rangle \), \(B=\left \langle {0.5,0.3} \right \rangle \textit {and}C=\left \langle {0.5,0.2} \right \rangle \)can be written in forms of interval values as: \(A=\left [ {0.4,0.8} \right ]\), \(B=\left [ {0.5,0.7} \right ]\)and \(C=\left [ {0.5,0.8} \right ]\), respectively. In such a sense, we can say that the similarity degree between A and C should not be less than the similarity degree between A and B, which is also illustrated by other similarity measures except \(S_{C} ,S_{DC} \textit {and}{S_{t}^{p}} \)(underlined cases). Therefore, our proposed similarity measure is in agreement with this analysis. The proposed similarity measure is the most reasonable similarity measure without any counter-intuitive cases. We must note that, among the measures listed in Table 2, \({S_{t}^{p}} \) seems to be another metric measure without any counter-intuitive cases. However, it brings a new problem with the choice of the parameter p and t, which is also an important open problem facing by similarity measures \(S_{HB} ,{S_{e}^{p}} ,{S_{s}^{p}} \textit {and}{S_{h}^{p}} \). Therefore, we can say that our proposal is a satisfactory similarity measure satisfying all axiomatic properties, without any counter-intuitive cases and the problem of choosing other parameters.

In order to study the effectiveness of the proposed similarity measure for IFS in the application of pattern recognition, we consider the pattern recognition problem discussed in [7, 13].

Suppose there are mpatterns, which can be represented by IFSs \(A_{j} =\left \{ {\left \langle {x_{i} ,\mu _{A_{j}} (x_{i} ),v_{A_{j}} (x_{i} )} \right \rangle \left | {x_{i} \in X} \right .} \right \},A_{j} \in IFSs(X),j=1,2,\mathellipsis ,m\). Let the sample to be recognized be denoted as \(B=\left \{ {\left \langle {x_{i} ,\mu _{B} (x_{i} ),v_{B} (x_{i} )} \right \rangle \left | {x_{i} \in X} \right .} \right \}\). According to the recognition principle of maximum degree of similarity between IFSs, the process of assigning B to A k is described by:

$$ k=\arg \max\limits_{j=1,2,\mathellipsis ,m} \{S(A_{j} ,B)\} $$
(7)

Example 1

Assume that there exists three known patterns A 1, A 2 and A 3, with class labels C 1, C 2 and C 3, respectively. Each pattern can be expressed by IFS in X = {x 1, x 2, x 3}as:

$$A_{1} =\left\{ {\left\langle {x_{1} ,1,0} \right\rangle ,\left\langle {x_{2} ,0.8,0} \right\rangle ,\left\langle {x_{3} ,0.7,0.1} \right\rangle} \right\}, $$
$$A_{2} =\left\{ {\left\langle {x_{1} ,0.8,0.1} \right\rangle ,\left\langle {x_{2} ,1,0} \right\rangle ,\left\langle {x_{3} ,0.9,0} \right\rangle} \right\}, $$
$$A_{3} =\left\{ {\left\langle {x_{1} ,0.6,0.2} \right\rangle ,\left\langle {x_{2} ,0.8,0} \right\rangle ,\left\langle {x_{3} ,1,0} \right\rangle} \right\}. $$

The sampleBneed to be recognized is:

$$B=\left\{ {\left\langle {x_{1} ,0.5,0.3} \right\rangle ,\left\langle {x_{2} ,0.6,0.2} \right\rangle ,\left\langle {x_{3} ,0.8,0.1} \right\rangle} \right\}. $$

The similarity degree between A i (i = 1,2,3) and Bcalculated by (5) is:

$$S_{Y} (A_{1} ,B)\,=\,0.887, S_{Y} (A_{2} ,B)\,=\,0.913, S_{Y} (A_{3} ,B)\,=\,0.936. $$

It can be observed that the pattern B should be classified to A 3 with a class label C 3.according to the recognition principle of maximum degree of similarity between IFSs. This result is in agreement with the one obtained in [7, 13].

Let’s assume that the weights of x 1, x 2and x 3 are 0.5, 0.3, and 0.2, respectively, as they were assumed in [7]. Considering (6), we can get:

$$\begin{array}{@{}rcl@{}} S_{WY} (A_{1} ,B)&=&0.853, S_{WY} (A_{2} ,B)\\ &=&0.919, S_{WY} (A_{3} ,B) = 0.949. \end{array} $$

According to (7), B can be recognized as A 3, which is identical to the result obtained in [7, 13].

To make our similarity measure more transparent and comparable with the measures proposed earlier by other authors, the example analyzed in [17] will be discussed next.

Example 2

Assume that there are three IFS patterns in X = {x 1, x 2, x 3}.The three patterns are denoted as follows:

$$A_{1} =\left\{ {\left\langle {x_{1} ,0.3,0.3} \right\rangle ,\left\langle {x_{2} ,0.2,0.2} \right\rangle ,\left\langle {x_{3} ,0.1,0.1} \right\rangle} \right\}, $$
$$A_{2} =\left\{ {\left\langle {x_{1} ,0.2,0.2} \right\rangle ,\left\langle {x_{2} ,0.2,0.2} \right\rangle ,\left\langle {x_{3} ,0.2,0.2} \right\rangle} \right\} $$
$$A_{3} =\left\{ {\left\langle {x_{1} ,0.4,0.4} \right\rangle ,\left\langle {x_{2} ,0.4,0.4} \right\rangle ,\left\langle {x_{3} ,0.4,0.4} \right\rangle} \right\} $$

Assume that a sample \(B=\left \{ {\left \langle {x_{1} ,0.3,0.3} \right \rangle ,\left \langle {x_{2},0.2,0.2} \right \rangle }\right . ,\) \(\left . {\left \langle {x_{3} ,0.1,0.1} \right \rangle } \right \}\)is to be classified.

The similarity degrees of S(A 1, B), S(A 2, B)and S(A 3, B) calculated for all similarity measures listed in Table 1 are shown in Table 3.

The proposed similarity measure S Y can be calculated by (5) as:

$$S_{Y} (A_{1} ,B) = 1, S_{Y} (A_{2} ,B) = 0.990, S_{Y} (A_{3} ,B) = 0.932 $$

It is obvious that Bis equal to A 1. This indicates that sample B should be classified to A 1. However, the similarity degrees of S(A 1, B), S(A 2, B)and S(A 3, B)are equal to each other when S C , S H , S D C and C I F S are employed. These four similarity measures are not capable of discriminating the difference between the three patterns. Fortunately, the results of S Y (A i , B)(i = 1,2,3) can be used to make correct classification conclusion. This means that the proposed similarity measure shows an identical performance with majority of the existing measures.

5 Applications in pattern recognition

Along with the previous investigation of classification capabilities of the proposed measure, an additional experiment discussed in [7, 22, 23, 3236] will be presented as an application in pattern recognition. In this paper, we propose an alternative approach to medical diagnosis using the newly defined similarity measure.

Suppose that there are four patients Al, Bob, Joe, Ted, represented as P = { Al, Bob, Joe, Ted} . Their symptoms are S = { Temperature, Headache, Stomach pain, Cough, Chest pain} . The set of diagnoses is defined as D = { Viral fever, Malaria, Typhoid, Stomach problem, Chest problem} . The intuitionistic fuzzy relation PS is presented in Table 4. Table 5 gives the intuitionistic fuzzy relation SD. Each element of the tables is given in the form of IFV, which is a pair of numbers corresponding to the membership and non-membership values, respectively.

Table 4 Symptoms characteristic for the patients
Table 5 Symptoms characteristic for the diagnoses

In order to make a proper diagnosis for each patient, we calculate the similarity degree between each patient and each diagnose. According to the principle of maximum similarity degree, the higher similarity degree indicates a proper diagnosis. In Table 6, the similarity degree S Y between patients and diagnoses are presented. According to the similarity degrees in Table 6, a conclusion can be made that Al suffers from Viral Fever, Bob suffers from Stomach problem, Joe suffers from Typhoid, and Ted suffers from Viral Fever. The diagnosis results for this case obtained in previous study have been presented in [23]. It is clear that our proposed method provides the same results obtained by Vlachos in [33], Own in [36] and Boran in [23]. Moreover, our proposed similarity measure is calculated based on the IFNs, without any other parameters such as p, t in [23]. So it can reduce the computation complexity.

Table 6 The proposed similarity measure S Y between each patient’s symptoms and the considered set of possible diagnoses

6 Conclusion

Even though several similarity measures between IFSs have been proposed to cope with uncertainty in information systems, most of them have provided counter-intuitive results. In this study, a new similarity measure and weighted similarity measure between IFSs are proposed. The new similarity measure is calculated based on the operations on the membership degree μ A (x), non-membership degree v A (x), hesitancy degree π A (x), as well as the upper bound of membership 1 − v A (x). In some special cases where some of the existing similarity measure cannot provide reasonable results, the proposed similarity measure shows great capacity for discriminating IFSs. Moreover, investigation of the new measure’s classification capability is carried out based on two numerical examples and medical diagnosis. It has been illustrated that the proposed similarity measure performs as well as or better than previous measures. However, our proposed similarity is not an absolute perfect one. It is stuck with the lack of definitude physical meaning. Efforts are continuing to look for a more excellent similarity measure for much better exploration and exploitation on IFS.