1 Introduction

Formal concept analysis (FCA) [1] is a mathematical tool for analyzing data and representing knowledge by means of concept lattice [2] and implication [3,4,5,6]. A concept in FCA is a pair of extent and intent, where the extent refers to the objects covered by the concept and the intent refers to the common features of the objects in the extent. In FCA, the intents and extents of concepts are uniquely determined by each other, and the complete lattice formed by all concepts is called concept lattice. Concept lattice has been successfully applied in data mining [7, 8], machine learning [9,10,11], feature selection [12, 13] and concept cognitive learning [14,15,16,17].

In classical FCA, the relations between objects and attributes are precise. However, in practical applications, most of the relations are fuzzy and uncertain. Thus, Burusco [18] developed the theory of L fuzzy concept lattice and presented a construction method of fuzzy concept lattice. The concept operators in [18] are based on t-conorm, and L fuzzy concepts can be obtained by calculating the fixed point sets of the concept operators. Such concept operators, however, do not form Galois connection, and some basic properties of classical formal concepts in [1] do not hold in the fuzzy case. Therefore, Bělohlávek [19] generalized Galois connection from the perspective of fuzzy logic and extended FCA to the fuzzy case [20]. Since the number of formal concepts extracted from fuzzy data can be very large, Bělohlávek [21] introduced true-stressing hedge and constructed true-stressing hedge based fuzzy concept lattice. It was shown [21] that the stronger the hedge, the smaller the fuzzy concept lattice. In addition, in order to efficiently build fuzzy concept lattice, various methods and algorithms have been proposed. For example, Mao [22] utilized weighted graph to generate all crisp-fuzzy concepts. Singh [23] proposed a method to generate fuzzy concepts based on maximal acceptance of fuzzy attributes. Shemis [24] employed Shannon entropy to update fuzzy concept lattice in massive data.

On the other hand, Zhang [25] introduced threshold based fuzzy concepts and constructed variable threshold concept lattice. Thus, one may adjust threshold to control the scale of variable threshold concept lattice. In addition, the number of formal concepts in a variable threshold concept lattice is far less than that in a fuzzy concept lattice. Since threshold was used as a different strategy from hedge, Bělohlávek analyzed the relationship between true-stressing hedge and threshold in [26], and showed that the thresholdbased approach can be considered as a special case of the true-stressing hedge approaches.

From the perspective of theory, it seems that the fuzzy concept lattice model proposed in [26] is the most general one for visualizing fuzzy data. From the perspective of application, however, the more general the models, the more time-consuming the algorithms of constructing the models, because the general model will contain more concepts than other models. It may be an issue of application to choose a proper model. Therefore, from the perspective of application, variable threshold concept lattice may be of more advantages in visualizing fuzzy data. Thus, in order to control the number of fuzzy concepts within a proper scale, the paper chooses variable threshold concept lattice as a baseline.

In variable threshold model proposed by [25], whether an attribute belongs to an object is determined by the degree of the object having the attribute, which should be greater than or equal to a given threshold. However, in the real setting, since data may fluctuate within a range, relying only on a threshold may be susceptible to noise and reduce the robustness of the model. For example, setting the threshold to 0.60, then only when the degree of an object having an attribute is greater than or equal to 0.60, one may regard the object having the attribute; if the degree decreases to 0.59, the object will never have the attribute. Since the values 0.59 and 0.60 are very close, it may be inappropriate to diagnose that a patient has some disease if the possibility is greater than or equal to 0.60, instead of 0.59. In other words, variable threshold concept lattice is not robust to noise.

In order to solve the problems of variable threshold concept lattice, the paper introduces another threshold, called the tolerance threshold, to variable threshold concept lattice and proposes the ROBust variable threshold fuzzy Concept Lattice (RobCL). The properties of RobCL show that, distinctive from other concept lattice models (including variable threshold concept lattice), RobCL exhibits the incremental characteristics when identifying attributes for objects or recognizing objects for attributes. The incremental characteristics reflect the deepening of cognition and can be expressed as the so-called incremental sequences. We show that such sequences will end in finite steps and are able to mark the final concepts with their incremental sequences. Such final concepts form the robust variable threshold concepts. A further comparison with variable threshold concept lattice shows that when the two thresholds coincide with each other, RobCL degenerates to variable threshold concept lattice and the incremental characteristics vanish. In other words, variable threshold model is actually a special case of RobCL. Therefore, our model is an improvement and development of Zhang’s thought.

The main contributions of the paper can be summarized as follows: (1) we introduced the tolerance threshold to variable threshold concept lattice and proposed RobCL to improve the robustness of variable threshold concept lattice; (2) we studied the properties of RobCL and showed that RobCL exhibits the incremental characteristics in constructing robust concepts; (3) we applied RobCL to the heart disease data set in UCI and showed its robustness in medical diagnosis.

This paper is organized as follows. Some related notions and properties in FCA and variable threshold concept lattice will be reviewed in Sect. 2, and the limitations of variable threshold concept lattice will be discussed in Sect. 3. In Sect. 4, we will introduce double threshold operators and analyze their properties. Then fuzzy concepts in RobCL can be captured and RobCL related properties will be proved. This paper is concluded in Sect. 5.

2 Preliminaries

2.1 Basic notions of FCA

In FCA [27], data are represented by formal context.

Definition 1

[27] A formal context is a triple \(K=(G,M,I)\), where G and M are the set of objects and the set of attributes respectively, and \(I\subseteq G\times M\) is a binary relation between G and M. For \(g\in G\) and \(m\in M\), \((g,m)\in I\) expresses that the object g has the attribute m.

Definition 2

[27] Let \(K = \mathrm{{(}}G,M,I)\) be a formal context. For \(A \subseteq G\), \(B \subseteq M\), one can define two operators:

$$\begin{aligned}{} & {} {A^I} =\{ m \in M \mid \forall g \in A,gIm \} \\ {}{} & {} {B^I} = \{g \in G \mid \forall m \in B,gIm\}. \end{aligned}$$

If \(A^{I}=B\) and \(B^{I}=A\) then the pair (AB) is called a formal concept, where A is the extent of the concept and B is the intent of the concept. For two concepts \((A_{1},B_{1})\) and \((A_{2},B_{2})\), defining the order

$$\begin{aligned} (A_{1},B_{1})\le (A_{2},B_{2})\Leftrightarrow A_{1}\subseteq A_{2}(\Leftrightarrow B_{1}\subseteq B_{2}) \end{aligned}$$

will lead to a complete lattice, called the concept lattice of K.

Concept lattice can be regarded as a visual representation of formal context. Another representation of formal context, which can be regarded as knowledge representation, is attribute implication; one may refer to [27] for the details of attribute implication and [3,4,5,6, 28] for the details of decision implication.

2.2 Variable threshold concept lattice

Variable threshold concept lattice is constructed on fuzzy formal context.

Definition 3

[25] A fuzzy formal context is a triple \(K=(G,M,I)\), where G is a non-empty finite set of objects, M is a non-empty finite set of attributes, and \(I:G\times M\rightarrow [0,1]\) is a fuzzy relation between G and M. For \(g\in G\), \(m\in M\), I(gm) denotes the degree at which the object g has the attribute m.

Definition 4

[25] Let \(K=(G,M,I)\) be a fuzzy formal context and \(\alpha \in (0,1]\). For \(X\subseteq G\), \(B\subseteq M\), we define the variable threshold operators as:

$$\begin{aligned}{} & {} X^{\alpha }=\{m\in M \mid I(x,m)\ge \alpha ,\forall x\in X\} \\{} & {} B^{\alpha }=\{x\in G \mid I(x,m)\ge \alpha ,\forall m\in B\}. \end{aligned}$$

If \(X^{\alpha }=B\) and \(B^{\alpha }=X\), the pair (XB) is called a variable threshold (formal) concept, where X is the extent and B is the intent. The set of all the variable threshold concepts of K constitutes a complete lattice, denoted by \(L^{\alpha }(K)\) and called the variable threshold concept lattice of K, under the partial order “\(\le _{\alpha }\)”: \((X_{1},B_{1})\le _{\alpha }(X_{2},B_{2})\Leftrightarrow X_{1}\subseteq X_{2}(\Leftrightarrow B_{1}\subseteq B_{2})\). The infimums and supremums are given by:

$$\begin{aligned}{} & {} (X_{1},B_{1})\wedge (X_{2},B_{2})=(X_{1}\cap X_{2},(B_{1}\cup B_{2})^{\alpha \alpha }) \\{} & {} (X_{1},B_{1})\vee (X_{2},B_{2})=((X_{1}\cup X_{2})^{\alpha \alpha },B_{1}\cap B_{2}). \end{aligned}$$

Proposition 1

[25] Let (GMI) be a fuzzy formal context. For X, \(X_{1}\), \(X_{2}\subseteq G\), B, \(B_{1}\), \(B_{2}\subseteq M\), and \(\alpha \in (0,1]\), we have:

  1. 1.

    \(X_{1}\subseteq X_{2}\Rightarrow X_{2}^{\alpha }\subseteq X_{1}^{\alpha }\), \(B_{1}\subseteq B_{2}\Rightarrow B_{2}^{\alpha }\subseteq B_{1}^{\alpha }\)

  2. 2.

    \(X\subseteq X^{\alpha \alpha }\), \(B\subseteq B^{\alpha \alpha }\)

  3. 3.

    \(X^{\alpha }=X^{\alpha \alpha \alpha }\), \(B^{\alpha }=B^{\alpha \alpha \alpha }\)

  4. 4.

    \(X\subseteq B^{\alpha }\Leftrightarrow B\subseteq X^{\alpha }\)

  5. 5.

    \({({X_1} \cup {X_2})^\alpha } = {X_1}^\alpha \cap {X_2}^\alpha\), \({({B_1} \cup {B_2})^\alpha } = {B_1}^\alpha \cap {B_2}^\alpha\)

  6. 6.

    \({({X_1} \cap {X_2})^\alpha } \supseteq {X_1}^\alpha \cup {X_2}^\alpha\), \({({B_1} \cap {B_2})^\alpha } \supseteq {B_1}^\alpha \cup {B_2}^\alpha\).

3 Limitations of variable threshold concept lattice

As stated in the Introduction section, variable threshold concept lattice is not robust to noise because it employs a single threshold, instead of an interval to derive formal concepts. In order to reveal the limitations of variable threshold concept lattice, we take the heart disease data set in UCI as an example to illustrate the generation process of concepts on a single threshold.

After normalization, the corresponding fuzzy formal context is obtained and shown in Table 1, where \(G=\{x_{1},x_{2},x_{3},x_{4},x_{5},x_{6},x_{7},x_{8},x_{9},x_{10}\}\) represents 10 patients, and \(M=\{a,b,c,d,e\}\) represents the age level, the degree of chest pain, the maximum heart rate, the ST inhibition and the serum cholesterol level. In Table 1, patients \(x_{1}-x_{5}\) do not have heart disease and \(x_{6}-x_{10}\) have heart disease.

Table 1 A fuzzy formal context

When setting \(\alpha =0.60\), we can obtain all the variable threshold concepts of Table 1 as follows: \((x_{1}x_{2}x_{3}x_{4}x_{5}x_{6}x_{7}x_{8}x_{9}x_{10},\emptyset )\), \((x_{2}x_{4}x_{6}x_{7}x_{8}x_{9}x_{10},c)\), \((x_{1}x_{6}x_{9}x_{10},e)\), \((x_{6}x_{7}x_{8}x_{9}x_{10},bcd)\), \((x_{6}x_{8}x_{9},bcde)\), \((x_{7}x_{8}x_{9}x_{10},abcd)\), \((x_{2}x_{6}x_{7}x_{8}x_{9}x_{10},bc)\), \((x_{9}x_{10},abcde)\). All the concepts are meaningful because they capture specific groups of patients with their common symptoms. For example, \((x_{6}x_{7}x_{8}x_{9}x_{10},bcd)\) implies that at the threshold level 0.60, all the possible common symptoms that the patients \(\{x_{6}x_{7}x_{8}x_{9}x_{10}\}\) are suffering from are \(\{bcd\}\) and conversely, all the patients in G who are suffering from \(\{bcd\}\) are \(\{x_{6}x_{7}x_{8}x_{9}x_{10}\}\). In other words, at the threshold level 0.60, the pair \((x_{6}x_{7}x_{8}x_{9}x_{10},bcd)\) represents a disease with \(\{bcd\}\) as its salient symptoms and \(\{x_{6}x_{7}x_{8}x_{9}x_{10}\}\) as its patients.

Variable threshold concept lattice is not robust to noise because any threshold, regardless of whether it is determined by experts or obtained adaptively from data, is a concise number and is not robust to slight disturbance. For example, if setting \(\alpha =0.60\), 0.60 will be regarded as a threshold at which an attribute belongs to a set of objects. If an attribute, however, belongs to some object in the set at the threshold less than 0.60, say 0.58, the object is regarded as not having the attribute, which may cause the object to be excluded from the set of objects having the attribute.

For example, at the threshold level 0.60, the variable threshold concept \((x_{2}x_{6}x_{7}x_{8}x_{9}x_{10},bc)\) represents a disease with \(\{bc\}\) as its salient symptoms and \(\{x_{2}x_{6}x_{7}x_{8}x_{9}x_{10}\}\) as its patients. Thus, all the patients in G who are suffering from \(\{bc\}\) at the threshold level 0.60 are \(\{x_{2}x_{6}x_{7}x_{8}x_{9}x_{10}\}\). For the patient \(x_{4}\), however, since \(I(x_{4},b)=0.58\) and \(I(x_{4},c)=0.60\), \(x_{4}\) is excluded from \(\{x_{2}x_{6}x_{7}x_{8}x_{9}x_{10}\}\) because \(x_{4}\) has b (the degree of chest pain) at the degree 0.58. It is well known that human body is a changing system and any unexpected factors such as personal emotions may cause the measures of symptoms to fluctuate in a range. Therefore, a tolerance threshold \(\beta\) may help to reduce the impact of noise and make variable threshold concept lattice more robust.

At the threshold level 0.60, i.e., \(\alpha =0.60\), set the tolerance threshold to be 0.58, i.e., \(\beta =0.58\). For \(X=\{x_{2}x_{6}x_{7}x_{8}x_{9}x_{10}\}\), we can compute \(X^{\alpha }=\{bc\}\), i.e., the salient symptoms of \(\{x_{2}x_{6}x_{7}x_{8}x_{9}x_{10}\}\) are \(\{bc\}\); in this case, we compute \(X^{\alpha \beta }\) instead of \(X^{\alpha \alpha }\), where \(X^{\alpha \beta }=\{g\in G\mid I(g,m)\ge \beta ,\forall m\in X^{\alpha }\}\), i.e., we collect all the objects that have the salient symptoms in \(X^{\alpha }\) at the degree greater than or equal to \(\beta\). Since \(\beta \le \alpha\), all the objects having the salient symptoms in \(X^{\alpha }\) at the degree greater than or equal to \(\alpha\) will be included in \(X^{\alpha \beta }\); at the same time, all the objects having the salient symptoms in \(X^{\alpha }\) at the degree in \([\beta ,\alpha )\) will be also included in \(X^{\alpha \beta }\). For \(X=\{x_{2}x_{6}x_{7}x_{8}x_{9}x_{10}\}\), we have \(X^{\alpha \beta }=\{x_{2}x_{4}x_{6}x_{7}x_{8}x_{9}x_{10}\} \supseteq \{x_{2}x_{6}x_{7}x_{8}x_{9}x_{10}\}=X^{\alpha \alpha }\). Thus, the patient \(x_{4}\) is also included in \(X^{\alpha \beta }\), because \(I(x_{4},b)=0.58\) falls in [0.58, 0.60).

4 Robust variable threshold fuzzy concept lattice

In this section, we will construct the robust version of variable threshold concept lattice, called ROBust variable threshold fuzzy Concept Lattice (RobCL).

4.1 Double threshold operators

First, we extends the operators in Definition 4 to the double threshold operators.

Definition 5

Let (GMI) be a fuzzy formal context, \(\alpha \in (0,1]\) and \(\beta \in (0,1]\). For \(X\subseteq G\), \(B\subseteq M\), we can define double threshold operators as follows:

$$\begin{aligned}{} & {} X^{\alpha }=\{m\in M\mid I(g,m)\ge \alpha ,\forall g\in X\}\\{} & {} B^{\beta }=\{g\in G \mid I(g,m)\ge \beta ,\forall m\in B\}. \end{aligned}$$

According to FCA, \(X^{\alpha }\) in Definition 5 represents the common attributes that X has at the threshold \(\alpha\), and \(B^{\beta }\) represents the objects having B at the threshold \(\beta\). In order to further clarify the implication of Definition 5, let us analyze the case of \(\alpha \ge \beta\). In the case of \(\alpha \ge \beta\), \(\alpha\) can be regarded as the threshold of objects having attributes and \(\beta\) can be regarded as the tolerance threshold of objects having attributes, or equivalently, \(\alpha\) and \(\beta\) can be regarded as the thresholds of objects having salient or general features respectively. Similarly, when \(\beta \ge \alpha\), \(\beta\) can be regarded as the threshold of objects having attributes and \(\alpha\) can be regarded as the tolerance threshold.

Specifically, for \(g\in G\) and \(m\in M\), if \(I(g,m)<\beta\), it can be considered that the object g does not have the attribute m; if \(I(g,m)\ge \beta\), the object g has the attribute m, where if \(I(g,m)\ge \alpha\), the object g has the attribute m as its salient feature, and if \(\beta \le I(g,m)<\alpha\), the object g has the attributes m as its general feature. Thus, given a set X of objects, the salient features of X can be obtained by \(X^{\alpha }\); for the salient features in \(X^{\alpha }\), the objects that have \(X^{\alpha }\) as their general features can be obtained by \(X^{\alpha \beta }\). Since the values in \([\beta ,\alpha )\) are tolerance values of objects having attributes, the objects having \(X^{\alpha }\) at the degree greater than or equal to \(\beta\) should be included in \(X^{\alpha \beta }\). More implications of double threshold operators will be explored in the following.

For brevity, we write \(\{x\}^{\alpha }\) as \(x^{\alpha }\) for \(x\in G\) and \(\{m\}^{\beta }\) as \(m^{\beta }\) for \(m\in M\). Operating \(\alpha \beta\) on X or \(\beta \alpha\) on B with n times will lead to \(X^{(\alpha \beta )^{n}}\) and \(B^{(\beta \alpha )^{n}}\) respectively.

The operators in Definition 5 have the following properties.

Proposition 2

Let (GMI) be a fuzzy formal context. For X, \(X_{1}\), \(X_{2}\subseteq G\), B, \(B_{1}\), \(B_{2}\subseteq M\), \(\alpha \in (0,1]\) and \(\beta \in (0,1]\), the following conclusions hold.

  1. 1.

    \(X_{1}\subseteq X_{2}\Rightarrow \mathop {X}\nolimits _{2}^{\alpha }\subseteq \mathop {X}\nolimits _{1}^{\alpha }\), \(B_{1}\subseteq B_{2}\Rightarrow \mathop {B}\nolimits _{2}^{\beta }\subseteq \mathop {B}\nolimits _{1}^{\beta }\)

  2. 2.

    If \(\alpha \ge \beta\) then \(X\subseteq X^{\alpha \beta }\)

  3. 3.

    If \(\alpha \le \beta\) then \(B\subseteq B^{\beta \alpha }\).

Proof

(1) Let \(m\in X_{2}^{\alpha }\). Then we have \(I(g,m)\ge \alpha\) for any \(g\in X_{2}\), implying that \(I(x,m)\ge \alpha\) for any \(x\in X_{1}\) since \(X_{1}\subseteq X_{2}\). Thus we have \(m\in X_{1}^{\alpha }\) and \(\mathop {X}\nolimits _{2}^{\alpha }\subseteq \mathop {X}\nolimits _{1}^{\alpha }\). Similarly, we have \(\mathop {B}\nolimits _{2}^{\beta }\subseteq \mathop {B}\nolimits _{1}^{\beta }\).

(2) Suppose \(\alpha \ge \beta\). For \(g\in X\), we have \(I(g,m)\ge \alpha\) for any \(m\in X^{\alpha }\), implying that \(I(g,m)\ge \alpha \ge \beta\) for any \(m\in X^{\alpha }\), since \(\alpha \ge \beta\). Thus, we have \(g\in X^{\alpha }\) and \(X\subseteq X^{\alpha }\).

(3) Similar to the proof of (2). \(\square\)

Proposition 2 (1) implies that the more objects a set contains, the fewer attributes the objects have; similarly, the more attributes a set contains, the fewer objects the attributes pertain to. Proposition 2 (2) implies that when \(\alpha \ge \beta\), all the objects in X have the attributes in \(X^{\alpha }\) at the threshold \(\beta\). As shown in Fig. 1, this is obvious, because \(X^{\alpha \beta }\) contains the objects that have the attributes in \(X^{\alpha }\) at the threshold \(\beta\) and the objects in X have the attributes in \(X^{\alpha }\) at the threshold \(\alpha \ge \beta\). From the perspective of robustness, Proposition 2 (2) implies that introducing tolerance threshold allows more objects having attributes \(X^{\alpha }\) to be considered. The similar explanation applies to Proposition 2 (3), as shown in Fig. 2.

Fig. 1
figure 1

The relationship between X and \(X^{\alpha \beta }\) when \(\alpha \ge \beta\)

Fig. 2
figure 2

The relationship between B and \(B^{\beta \alpha }\) when \(\beta \ge \alpha\)

It should be noted that if \(\alpha \ge \beta\), Proposition 2 (2) cannot apply to \(B\subseteq M\), and that if \(\alpha \le \beta\), Proposition 2 (3) cannot apply to \(X\subseteq G\), as shown in Example 1. However, if \(\alpha =\beta\), both Proposition 2 (2) and (3) hold for any X and B, i.e., \(X\subseteq X^{\alpha \beta}\) and \(B\subseteq B^{\beta \alpha }\). In this case, double threshold operators degenerate to variable threshold operators.

Example 1

Set \(\alpha =0.50\) and \(\beta =0.40\). In Table 1, for \(X=\{x_{6}x_{7}x_{8}x_{10}\}\), we can compute \(X^{\alpha }=\{bcd\}\) and \(X^{\alpha \beta }=\{x_{2}x_{3}x_{6}x_{7}x_{8}x_{9}x_{10}\}\), i.e., \(X\subseteq X^{\alpha \beta }\).

If \(B=\{bcde\}\), we have \(B^{\beta }=\{x_{2}x_{3}x_{6}x_{7}x_{8}x_{9}x_{10}\}\) and \(B^{\beta \alpha }=\{bc\}\), i.e., \(B\not \subseteq B^{\beta \alpha }\). This is because for \(e\in B\), there exists the object \(x_{8}\in G\) satisfying \(0.40\le I(x_{8},e)<0.50\) and thus \(e\notin B^{\beta \alpha }\).

Similarly, if setting \(\alpha =0.30\) and \(\beta =0.40\) and letting \(X=\{x_{2}x_{4}x_{5}x_{7}\}\), we have \(\{x_{2}x_{4}x_{5}x_{7}\}^{\alpha }=\{ace\}\) and \(\{x_{2}x_{4}x_{5}x_{7}\}^{\alpha \beta }=\{x_{2}x_{3}x_{4}x_{6}x_{7}x_{8}x_{9}x_{10}\}\), i.e., \(X\nsubseteq X^{\alpha \beta }\). This is because for \(x_{5}\in X\), there exists the attribute \(a\in M\) satisfying \(0.30\le I(x_{5},a)<0.40\) and thus \(x_{5}\notin X^{\alpha \beta }\).

If setting \(\alpha =\beta =0.50\), the double threshold operators will degenerate to variable threshold operators and both Proposition 2 (2) and (3) hold for any X and B. For example, for \(X=\{x_{6}x_{7}x_{8}x_{10}\}\), we have \(X^{\alpha \beta }=\{x_{2}x_{6}x_{7}x_{8}x_{9}x_{10}\}\) and for \(B=\{bcde\}\), we have \(B^{\beta \alpha }=\{bcde\}\), satisfying Proposition 2 (2) and (3).

In addition, set \(\alpha =0.50\) and \(\beta =0.40\), and let \(X=\{x_{6}x_{7}x_{8}x_{10}\}\). Then, we have \(X^{\alpha \beta \alpha }=\{bc\}\) and \(X^{\alpha \beta \alpha \beta }=\{x_{2}x_{3}x_{4}x_{6}x_{7}x_{8}x_{9}x_{10}\}\), i.e., \(X^{\alpha \beta }\subset X^{\alpha \beta \alpha \beta }\). In the case of variable threshold concept lattice, however, setting \(\alpha =\beta =0.50\), we have \(X^{\alpha \beta }=X^{\alpha \beta \alpha \beta }=\{x_{2}x_{6}x_{7}x_{8}x_{9}x_{10}\}\).

By Example 1, if \(\alpha \ge \beta\), then \(X^{(\alpha \beta )^{n}}\subset X^{(\alpha \beta )^{n+1}}\) may hold, a distinctive characteristic of double threshold operators from variable threshold operators, as verified by Proposition 3.

Proposition 3

Let (GMI) be a fuzzy formal context and \(X\subseteq G\). If \(\alpha \ge \beta\), then

  1. 1.

    \(X^{(\alpha \beta )^{n}}\subset X^{(\alpha \beta )^{n+1}}\) if and only if there exists \(g\in X^{(\alpha \beta )^{n+1}}\) such that \(I(g,m)\ge \beta\) for any \(m\in X^{(\alpha \beta )^{n}\alpha }\) and \(\beta >I(g,m_{1})\) for some attribute \(m_{1}\in X^{(\alpha \beta )^{n-1}\alpha }\).

  2. 2.

    If \(X^{(\alpha \beta )^{n}}\subset X^{(\alpha \beta )^{n+1}}\), all attributes \(m_{1}\) in \(X^{(\alpha \beta )^{n-1}\alpha }\backslash X^{(\alpha \beta )^{n}\alpha }\) satisfy \(\alpha >I(g_{1},m_{1})\ge \beta\) for some object \(g_{1}\in X^{(\alpha \beta )^{n}}\backslash X^{(\alpha \beta )^{n-1}}\).

Proof

(1) By Proposition 2 (2), we know that \(X^{(\alpha \beta )^{n}}\subseteq X^{(\alpha \beta )^{n+1}}\), and thus \(X^{(\alpha \beta )^{n}}\subset X^{(\alpha \beta )^{n+1}}\) if and only if there exists \(g\in X^{(\alpha \beta )^{n+1}}\backslash X^{(\alpha \beta )^{n}}\). By definition of \(X^{(\alpha \beta )^{n+1}}\), \(g\in X^{(\alpha \beta )^{n+1}}\) if and only if \(I(g,m)\ge \beta\) for any \(m\in X^{(\alpha \beta )^{n}\alpha }\). By definition of \(X^{(\alpha \beta )^{n}}\), \(g\notin X^{(\alpha \beta )^{n}}\) if and only if \(g\notin X^{((\alpha \beta )^{n-1}\alpha )\beta }\), if and only if there exists \(m_{1}\in X^{(\alpha \beta )^{n-1}\alpha }\) such that \(\beta >I(g,m_{1})\). Thus, the conclusion holds.

(2) For each \(m_{1}\in X^{(\alpha \beta )^{n-1}\alpha }\backslash X^{(\alpha \beta )^{n}\alpha }\), since \(m_{1}\notin X^{(\alpha \beta )^{n}\alpha }\), there exists some object \(g_{1}\in X^{(\alpha \beta )^{n}}\) such that \(\alpha >I(g_{1},m_{1})\). Because \(m_{1}\in X^{(\alpha \beta )^{n-1}\alpha }\) and \(g_{1}\in X^{(\alpha \beta )^{n}}=X^{((\alpha \beta )^{n-1}\alpha )\beta }\), we have \(I(g_{1},m_{1})\ge \beta\) and thus \(\alpha >I(g_{1},m_{1})\ge \beta\). \(\square\)

Compared with variable threshold concept lattice, Proposition 3 (1) says that the concept generation of RobCL is not a one-step process. This is because there may exist an object \(g\in X^{(\alpha \beta )^{n+1}}\backslash X^{(\alpha \beta )^{n}}\) such that \(I(g,m)\ge \beta\) for any \(m\in X^{(\alpha \beta )^{n}\alpha }\) and \(\beta >I(g,m_{1})\) for some \(m_{1}\in X^{(\alpha \beta )^{n-1}\alpha }\) (see Fig. 3).

Fig. 3
figure 3

The change from \(X^{(\alpha \beta )^{n}}\) to \(X^{(\alpha \beta )^{n+1}}\)

If \(X^{(\alpha \beta )^{n}}\subset X^{(\alpha \beta )^{n+1}}\), we have \(X^{(\alpha \beta )^{n-1}\alpha }\ne X^{(\alpha \beta )^{n}\alpha }\) and \(X^{(\alpha \beta )^{n}}\ne X^{(\alpha \beta )^{n-1}}\)Footnote 1 and thus, by Proposition 3 (2), there must exist some attribute \(m_{1}\in X^{(\alpha \beta )^{n-1}\alpha }\backslash X^{(\alpha \beta )^{n}\alpha }\) such that \(\alpha >I(g_{1},m_{1})\ge \beta\) for some \(g_{1}\in X^{(\alpha \beta )^{n}}\backslash X^{(\alpha \beta )^{n-1}}\) (see Fig. 3). In other words, if \(X^{(\alpha \beta )^{n}}\subset X^{(\alpha \beta )^{n+1}}\), there must exist \(g_{1}\in X^{(\alpha \beta )^{n}}\backslash X^{(\alpha \beta )^{n-1}}\) with \(\alpha >I(g_{1},m_{1})\ge \beta\) for \(m_{1}\in X^{(\alpha \beta )^{n-1}\alpha }\backslash X^{(\alpha \beta )^{n}\alpha }\).

From the perspective of cognition, starting from \(X^{(\alpha \beta )^{n-1}}\), one can obtain the salient features of \(X^{(\alpha \beta )^{n-1}}\), i.e., \(X^{(\alpha \beta )^{n-1}\alpha }\); in order to collect the objects that also have \(X^{(\alpha \beta )^{n-1}\alpha }\), one should employ the tolerance threshold \(\beta\) and thus include \(g_{1}\) in \(X^{(\alpha \beta )^{n}}\). Since \(g_{1}\in X^{(\alpha \beta )^{n}}\) and \(m_{1}\) is not a salient feature of \(g_{1}\), \(m_{1}\) is excluded from \(X^{(\alpha \beta )^{n}\alpha }\) and is not a salient feature of \(X^{(\alpha \beta )^{n}}\). Thus, Proposition 3 (2) says that if \(X^{(\alpha \beta )^{n}}\subset X^{(\alpha \beta )^{n+1}}\), there exists attribute \(m_{1}\in X^{(\alpha \beta )^{n-1}\alpha }\backslash X^{(\alpha \beta )^{n}\alpha }\) that is a general but not salient feature of \(X^{(\alpha \beta )^{n}}\). The same conclusion applies to \(X^{(\alpha \beta )^{n}}\). Since \(X^{(\alpha \beta )^{n}}\subset X^{(\alpha \beta )^{n+1}}\), there may exist some attribute in \(X^{(\alpha \beta )^{n}\alpha }\) that is not a salient feature of \(X^{(\alpha \beta )^{(n+1)}\alpha }\).

Such incremental generating processes can be visually represented by the symbol “\(\rightarrow\)”. For example, “\(X\rightarrow B\)" denotes the process from X to B (i.e. \(X^{\alpha }\)), and “\(B\rightarrow X\)" denotes the process from B to X (i.e. \(B^{\beta }\)).

Example 2

For the fuzzy formal context in Table 1, set \(\alpha =0.70\) and \(\beta =0.65\). Let \(X=\{x_{7}\}\). Since \(X^{\alpha }=\{abcd\}\), \(X^{\alpha \beta }=\{x_{7}x_{9}\}\), \(X^{\alpha \beta \alpha }=\{bcd\}\), \(X^{\alpha \beta \alpha \beta }=\{x_{6}x_{7}x_{8}x_{9}x_{10}\}\) and \(X^{\alpha \beta \alpha \beta \alpha }=\{bc\}\), the incremental sequence of X is:

$$\begin{aligned}{} & {} \{x_{7}\}\rightarrow \{abcd\}\rightarrow \{x_{7}x_{9}\}\rightarrow \{bcd\}\rightarrow \{x_{6}x_{7}x_{8} x_{9}x_{10}\}\\{} & {} \quad \rightarrow \{bc\}\rightarrow \{x_{6}x_{7}x_{8}x_{9}x_{10}\}. \end{aligned}$$

In this process, since \(\{x_{7}\}^{\alpha \beta }=\{x_{7}x_{9}\}\subset \{x_{6}x_{7}x_{8}x_{9}x_{10}\} =\{x_{7}\}^{(\alpha \beta )^{2}}\), there exists \(x_{6}\in \{x_{7}\}^{(\alpha \beta )^{2}}\backslash \{x_{7}\}^{\alpha \beta }\) such that \(I(x_{6},m)\ge \beta\) for any \(m\in X^{\alpha \beta \alpha }\) and \(\beta >I(x_{6},a)\) for \(a\in \{x_{7}\}^{\alpha }\). Moreover, for each attribute \(m_{1}\) in \(X^{\alpha }\backslash X^{(\alpha \beta )\alpha }\), i.e., the attribute a, there exists an object \(x_{9}\) such that \(\alpha >I(x_{9},a)\ge \beta\).

From the perspective of cognition, for X, \(X^{\alpha }=\{abcd\}\) captures all the salient symptoms of X at the threshold 0.70, and \(X^{\alpha \beta }=\{x_{7}x_{9}\}\) collected all the patients who have the symptoms \(\{abcd\}\) at the threshold 0.65. It should be noted that in the process, the patient \(x_{9}\) having the symptom a with \(\alpha =0.70>I(x_{9},a)=0.66\ge 0.65=\beta\) has been included in \(X^{\alpha \beta }\), because although \(x_{9}\) has the symptom a as its general feature it should not be excluded from \(X^{\alpha \beta }\), since the value 0.66 is very close to 0.70. From the perspective of medical science, the operation can avoid missed diagnoses. Since \(x_{9}\) is included in \(X^{\alpha \beta }\), the salient features of \(X^{\alpha \beta }\) should be updated to \(\{bcd\}\), where the attribute a has been removed from \(X^{\alpha }\) because a is not a salient feature of \(x_{9}\). In order to collect all the objects that have \(\{bcd\}\) as their symptoms at the tolerance threshold 0.65, one should compute \(X^{\alpha \beta \alpha \beta }\) and include \(x_{6}\), \(x_{8}\) and \(x_{10}\) as its members. Finally, one can obtain the salient features \(\{bc\}\) of the patients \(\{x_{6}x_{7}x_{8}x_{9}x_{10}\}\) and conversely, no patient can be added to \(\{x_{6}x_{7}x_{8}x_{9}x_{10}\}\) because no patient has the symptoms \(\{bc\}\) at the degree greater than or equal to \(\beta =0.65\). In fact, since \(\{x_{6}x_{7}x_{8}x_{9}x_{10}\}\) are the patients with heart disease, the pair \((x_{6}x_{7}x_{8}x_{9}x_{10},bc)\) is the concept of heart disease that contains \(\{bc\}\) as its salient symptoms and \(\{x_{6}x_{7}x_{8}x_{9}x_{10}\}\) as its patients.

Similarly, if starting from the sets \(\{x_{6}x_{7}\}\) and \(\{x_{7}x_{8}\}\), one can obtain the following incremental sequences:

$$\begin{aligned}{} & {} \{x_{6}x_{7}\}\rightarrow \{bcd\}\rightarrow \{x_{6}x_{7}x_{8}x_{9}x_{10}\}\\{} & {} \quad \rightarrow \{bc\} \rightarrow \{x_{6}x_{7}x_{8}x_{9}x_{10}\} \\{} & {} \{x_{7}x_{8}\}\rightarrow \{bcd\}\rightarrow \{x_{6}x_{7}x_{8}x_{9}x_{10}\}\rightarrow \{bc\}\\{} & {} \quad \rightarrow \{x_{6}x_{7}x_{8}x_{9}x_{10}\}. \end{aligned}$$

Both of the processes lead to the concept of heart disease.

4.2 Incremental Sequence

By Example 2, one may find that the incremental sequences may end at some point. Specifically, if \(\alpha \ge \beta\), for the set X of objects, we may have the following incremental sequence:

$$\begin{aligned}{} & {} X\rightarrow X^{(\alpha )}\rightarrow X^{(\alpha \beta )}\rightarrow X^{(\alpha \beta )\alpha }\rightarrow X^{(\alpha \beta )^{2}}\\{} & {} \quad \rightarrow X^{(\alpha \beta )^{2}\alpha }\rightarrow \cdots \rightarrow X^{(\alpha \beta )^{n}} \end{aligned}$$

satisfying \(X\subseteq X^{(\alpha \beta )}\subseteq X^{(\alpha \beta )^{2}}\cdots \subseteq X^{(\alpha \beta )^{n}}\) and \(X^{\alpha }\supseteq X^{(\alpha \beta )\alpha }\supseteq X^{(\alpha \beta )^{2}\alpha }\cdots \supseteq X^{(\alpha \beta )^{n}\alpha }\). Thus, in the incremental sequence, X will collect more and more objects that have the same attributes with X and capture less and less common attributes of them. In the paper, when \(\alpha \ge \beta\), we refer to the incremental sequence from X to \(X^{(\alpha \beta )^{n}}\) as the object incremental sequence of X.

Similarly, if \(\beta \ge \alpha\), the attribute incremental sequence of B is as follows:

$$\begin{aligned}{} & {} B\rightarrow B^{(\beta )}\rightarrow B^{(\beta \alpha )}\rightarrow B^{(\beta \alpha )\beta }\rightarrow B^{(\beta \alpha )^{2}}\\{} & {} \quad \rightarrow B^{(\beta \alpha )^{2}\beta }\rightarrow \cdots \rightarrow B^{(\beta \alpha )^{n}}. \end{aligned}$$

Obviously, if both G and M are finite sets, both object and attribute incremental sequences will end at some points. Even G or M is infinite, we have the following results.

Proposition 4

Let (GMI) be a fuzzy formal context, \(X\subseteq G\), and \(B\subseteq M\). Then, the following conclusions hold.

  1. 1.

    If there exists a non-negative integer n such that \(X^{(\alpha \beta )^{n}}=X^{(\alpha \beta )^{n+1}}\), then we have \(X^{(\alpha \beta )^{n}}=X^{(\alpha \beta )^{n+k}}\) for any non-negative integer \(k\ge 0\).

  2. 2.

    If there exists a non-negative integer m such that \(B^{(\beta \alpha )^{m}}=B^{(\beta \alpha )^{m+1}}\), then we have \(B^{(\beta \alpha )^{m}}=B^{(\beta \alpha )^{m+k}}\) for any non-negative integer \(k\ge 0\).

In order to further analyze incremental sequence, we will discuss the relationships between incremental sequences. To this end, we will take object incremental sequences as examples and classify object incremental sequences into two types of relationships, convergence or parallel.

Definition 6

For \(X_{1}\), \(X_{2}\subseteq G\), we say that (the incremental sequences of) \(X_{1}\) and \(X_{2}\) converge in the object-object way at (ij), if (ij) satisfies \(X_{1}^{(\alpha \beta )^{i}}=X_{2}^{(\alpha \beta )^{j}}\); if (ij) further satisfies \(X_{1}^{(\alpha \beta )^{i-1}\alpha }\ne X_{2}^{(\alpha \beta )^{j-1}\alpha }\), \(X_{1}^{(\alpha \beta )^{i}}\ne X_{2}^{(\alpha \beta )^{j-1}}\) and \(X_{1}^{(\alpha \beta )^{i-1}}\ne X_{2}^{(\alpha \beta )^{j}}\), (ij) is the smallest object-object convergent unit of \(X_{1}\) and \(X_{2}\). Similarly, for \(X_{1}\), \(X_{2}\subseteq G\), if there exists (ij) satisfying \(X_{1}^{(\alpha \beta )^{i}\alpha }=X_{2}^{(\alpha \beta )^{j}\alpha }\), \(X_{1}\) and \(X_{2}\) converge in the object-attribute way at (ij); if (ij) further satisfies \(X_{1}^{(\alpha \beta )^{i}}\ne X_{2}^{(\alpha \beta )^{j}}\), \(X_{1}^{(\alpha \beta )^{i}\alpha }\ne X_{2}^{(\alpha \beta )^{j-1}\alpha }\) and \(X_{1}^{(\alpha \beta )^{i-1}\alpha }\ne X_{2}^{(\alpha \beta )^{j}\alpha }\), (ij) the smallest object-attribute convergent unit of \(X_{1}\) and \(X_{2}\).

By Definition 6, if \(X_{1}\) and \(X_{2}\) converge in the object-attribute way at (ij), then we have \(X_{1}^{(\alpha \beta )^{i}\alpha }=X_{2}^{(\alpha \beta )^{j}\alpha }\) and thus \(X_{1}^{(\alpha \beta )^{i}\alpha \beta }=X_{2}^{(\alpha \beta )^{j}\alpha \beta }\), i.e., \(X_{1}^{(\alpha \beta )^{i+1}}=X_{2}^{(\alpha \beta )^{j+1}}\). Therefore, \(X_{1}\) and \(X_{2}\) converge in the object-object way at \((i+1,j+1)\). Similarly, if \(X_{1}\) and \(X_{2}\) converge in the object-object way at (ij), \(X_{1}\) and \(X_{2}\) will also converge in the object-attribute way at \((i+1,j+1)\). Thus, for brevity, we will not distinguish the two types of convergence of object incremental sequences in the following.

Example 3

In Table 1, let \(X_{1}=\{x_{6}x_{7}x_{9}x_{10}\}\) and \(X_{2}=\{x_{7}x_{8}x_{9}\}\), and set \(\alpha =0.50\) and \(\beta =0.40\). The object incremental sequences of \(X_{1}\) and \(X_{2}\) are as follows:

$$\begin{aligned}{} & {} \{x_{6}x_{7}x_{9}x_{10}\}\rightarrow \{bcde\}\rightarrow \{x_{2}x_{3}x_{6}x_{7}x_{8}x_{9}x_{10}\}\\{} & {} \quad \rightarrow \{bc\}\rightarrow \{x_{2}x_{3}x_{4}x_{6}x_{7}x_{8}x_{9}x_{10}\}\\{} & {} \{x_{7}x_{8}x_{9}\}\rightarrow \{abcd\}\rightarrow \{x_{2}x_{3}x_{6}x_{7}x_{8}x_{9}x_{10}\}\rightarrow \{bc\}\\{} & {} \quad \rightarrow \{x_{2}x_{3}x_{4}x_{6}x_{7}x_{8}x_{9}x_{10}\}. \end{aligned}$$

Because there exists (1, 1) such that \(X_{1}^{\alpha }\ne X_{2}^{\alpha }\), \(X_{1}^{\alpha \beta }=X_{2}^{\alpha \beta }\), \(X_{1}\ne X_{2}^{\alpha \beta }\) and \(X_{1}^{\alpha \beta }\ne X_{2}\), (1, 1) is the smallest object-object convergent unit of \(X_{1}\) and \(X_{2}\).

The convergence of attribute incremental sequences can be defined similarly.

Definition 7

For \(B_{1}\), \(B_{2}\subseteq G\), we say that \(B_{1}\) and \(B_{2}\) converge in the attribute-attribute way at (ij), if (ij) satisfies \(B_{1}^{(\beta \alpha )^{i}}=B_{2}^{(\beta \alpha )^{j}}\); if (ij) further satisfies \(B_{1}^{(\beta \alpha )^{i-1}\beta }\ne B_{2}^{(\beta \alpha )^{j-1}\beta }\), \(B_{1}^{(\beta \alpha )^{i-1}}\ne B_{2}^{(\beta \alpha )^{j}}\) and \(B_{1}^{(\beta \alpha )^{i}}\ne B_{2}^{(\beta \alpha )^{j-1}}\), (ij) is said to be the smallest attribute-attribute convergent unit of \(B_{1}\) and \(B_{2}\). Similarly, if (ij) satisfies \(B_{1}^{(\beta \alpha )^{i}\beta }=B_{2}^{(\beta \alpha )^{j}\beta }\), \(B_{1}\) and \(B_{2}\) converge in the attribute-object way at (ij); if (ij) further satisfies \(B_{1}^{(\beta \alpha )^{i-1}}\ne B_{2}^{(\beta \alpha )^{j-1}}\), \(B_{1}^{(\beta \alpha )^{i-1}\beta }\ne B_{2}^{(\beta \alpha )^{j}\beta }\) and \(B_{1}^{(\beta \alpha )^{i}\beta }\ne B_{2}^{(\beta \alpha )^{j-1}\beta }\), (ij) is the smallest attribute-object convergent unit of \(B_{1}\) and \(B_{2}\).

Next, we will explore the properties of incremental sequence. First, we need the following results.

Lemma 1

Let \(X_{1}\), \(X_{2}\subseteq G\) be two sets of objects such that \(X_{1}\subset X_{1}^{\alpha \beta }\) and \(X_{2}\subset X_{2}^{\alpha \beta }\). If \(\alpha \ge \beta\) and (ij) is the smallest object-object convergent unit of \(X_{1}\) and \(X_{2}\), then we have \(X_{1}\subset X_{1}^{(\alpha \beta )}\subset X_{1}^{(\alpha \beta )^{2}}\cdots \subset X_{1}^{(\alpha \beta )^{i}}\) and \(X_{2}\subset X_{2}^{(\alpha \beta )}\subset X_{2}^{(\alpha \beta )^{2}}\cdots \subset X_{2}^{(\alpha \beta )^{j}}\).

Proof

If \(\alpha \ge \beta\), by Proposition 2 (2) we have \(X_{1}\subseteq X_{1}^{(\alpha \beta )}\subseteq X_{1}^{(\alpha \beta )^{2}}\cdots \subseteq X_{1}^{(\alpha \beta )^{i-1}}\subseteq X_{1}^{(\alpha \beta )^{i}}\). Since \(X_{1}\subset X_{1}^{\alpha \beta }\), suppose that there exists \(1\le k<i\) such that \(X_{1}^{(\alpha \beta )^{k}}=X_{1}^{(\alpha \beta )^{k+1}}\). Since \(k<i\), by Proposition 4 we have \(X_{1}^{(\alpha \beta )^{k}}=X_{1}^{(\alpha \beta )^{k+(i-k-1)}}=X_{1}^ {(\alpha \beta )^{i-1}}=X_{1}^{(\alpha \beta )^{k+(i-k)}}=X_{1}^{(\alpha \beta )^{i}}\). However, because (ij) is smallest object-object convergent unit of \(X_{1}\) and \(X_{2}\), we have \(X_{1}^{(\alpha \beta )^{i}}=X_{2}^{(\alpha \beta )^{j}}\) and \(X_{1}^{(\alpha \beta )^{i-1}}\ne X_{2}^{(\alpha \beta )^{j}}\), a contradiction with \(X_{1}^{(\alpha \beta )^{i-1}}=X_{2}^{(\alpha \beta )^{i}}=X_{2}^{(\alpha \beta )^{j}}\). Similarly, we have \(X_{2}\subset X_{2}^{(\alpha \beta )}\subset X_{2}^{(\alpha \beta )^{2}}\cdots \subset X_{2}^{(\alpha \beta )^{j}}\). \(\square\)

Similarly, the smallest attribute-attribute convergent unit has the following property.

Lemma 2

Let \(B_{1}\), \(B_{2}\subseteq M\) be two subsets of attributes such that \(B_{1}\subset B_{1}^{\beta \alpha }\) and \(B_{2}\subset B_{2}^{\beta \alpha }\). If \(\alpha \le \beta\) and (ij) is the smallest object-object convergent unit of \(B_{1}\) and \(B_{2}\), then we have \(B_{1}\subset B_{1}^{(\beta \alpha )}\subset B_{1}^{(\beta \alpha )^{2}}\cdots \subset B_{1}^{(\beta \alpha )^{i}}\) and \(B_{2}\subset B_{2}^{(\beta \alpha )}\subset B_{2}^{(\beta \alpha )^{2}}\cdots \subset B_{2}^{(\beta \alpha )^{j}}\).

Now, we can present the conditions of determining the convergence of object incremental sequences.

Theorem 1

If \(\alpha \ge \beta\) and two sets \(X_{1}\), \(X_{2}\subseteq G\) converge, then there exist integers \(m,n,k\ge 0\) such that \(X_{1}^{(\alpha \beta )^{m}}\subseteq X_{2}^{(\alpha \beta )^{k}}\subseteq X_{1}^{(\alpha \beta )^{n}}\). Furthermore, if both G and M are finite, the converse also hold, i.e., \(X_{1}\) and \(X_{2}\) converge if and only if there exist integer \(m,n,k\ge 0\) such that \(X_{1}^{(\alpha \beta )^{m}}\subseteq X_{2}^{(\alpha \beta )^{k}}\subseteq X_{1}^{(\alpha \beta )^{n}}\).

Proof

If \(X_{1}\) and \(X_{2}\) converge, then there exist \(i,j\ge 0\) satisfying \(X_{1}^{(\alpha \beta )^{i}}=X_{2}^{(\alpha \beta )^{j}}\). Setting \(m=n=i\), \(k=j\), then we have \(X_{1}^{(\alpha \beta )^{m}}\subseteq X_{2}^{(\alpha \beta )^{k}}\subseteq X_{1}^{(\alpha \beta )^{n}}\).

Conversely, suppose that both G and M are finite. Since G is finite, there exists a non-negative integer i such that \(X_{1}^{(\alpha \beta )^{i}}=X_{1}^{(\alpha \beta )^{i+1}}\). In this case, by Proposition 4 (1) we have \(X_{1}^{(\alpha \beta )^{i}}=X_{1}^{(\alpha \beta )^{i+m}}=X_{1}^{(\alpha \beta )^{i+n}}\), where \(m,n\ge 0\). From the condition \(X_{1}^{(\alpha \beta )^{m}}\subseteq X_{2}^{(\alpha \beta )^{k}}\subseteq X_{1}^{(\alpha \beta )^{n}}\) and Proposition 2 (2), we can derive \(X_{1}^{(\alpha \beta )^{m+i}}\subseteq X_{2}^{(\alpha \beta )^{k+i}}\subseteq X_{1}^{(\alpha \beta )^{n+i}}\). Thus, we have \(X_{1}^{(\alpha \beta )^{i}}=X_{1}^{(\alpha \beta )^{i+m}}\subseteq X_{2}^{(\alpha \beta )^{i+k}}\subseteq X_{1}^{(\alpha \beta )^{i+n}}=X_{1}^{(\alpha \beta )^{i}}\), which yields \(X_{2}^{(\alpha \beta )^{i+k}}=X_{1}^{(\alpha \beta )^{i}}\), i.e., \(X_{1}\) and \(X_{2}\) converge. \(\square\)

Theorem 1 presents a sufficient and necessary condition for determining the convergence of \(X_{1}\) and \(X_{2}\). In fact, even if \(X_{1}\) and \(X_{2}\) will converge, there may be no any relation between \(X_{1}\) and \(X_{2}\). However, Theorem 1 says that if, after a certain number of double threshold operations, the incremental results of \(X_{1}\) and \(X_{2}\) have the inclusion relation \(X_{1}^{(\alpha \beta )^{m}}\subseteq X_{2}^{(\alpha \beta )^{k}}\subseteq X_{1}^{(\alpha \beta )^{n}}\), \(X_{1}\) and \(X_{2}\) will converge. Intuitively, if after a certain number of double threshold operations, \(X_{1}\) and \(X_{2}\) satisfy \(X_{1}^{(\alpha \beta )^{m}}\subseteq X_{2}^{(\alpha \beta )^{k}}\), the inclusion relation will be maintained in the subsequent incremental results; in this case, however, \(X_{1}\) and \(X_{2}\) may not converge even if both G and M are finite. This is because the final incremental results of \(X_{1}\) and \(X_{2}\) may have a proper inclusion relation (see Example 4). Thus, the condition \(X_{2}^{(\alpha \beta )^{k}}\subseteq X_{1}^{(\alpha \beta )^{n}}\) further ensures that \(X_{1}\), after a certain number of double threshold operations, will collect more objects than \(X_{2}^{(\alpha \beta )^{k}}\), excluding the proper inclusion relation between \(X_{1}\) and \(X_{2}\).

Example 4

For Table 1, set \(\alpha =0.50\) and \(\beta =0.40\). Let \(X_{1}=\{x_{7}\}\) and \(X_{2}=\{x_{5}\}\). The object incremental sequences of \(X_{1}\) and \(X_{2}\) are as follows:

$$\begin{aligned}{} & {} \begin{aligned} \{x_{7}\}\rightarrow \{abcde\}\rightarrow \{x_{2}x_{3}x_{6}x_{7}x_{8}x_{9}x_{10}\} \rightarrow \{bc\}\rightarrow \\\{x_{2}x_{3}x_{4}x_{6}x_{7}x_{8}x_{9}x_{10}\}\rightarrow \{bc\} \end{aligned} \\{} & {} \begin{aligned} \{x_{5}\}\rightarrow \{ce\}\rightarrow \{x_{2}x_{3}x_{4}x_{5}x_{6}x_{7}x_{8}x_{9}x_{10}\}\rightarrow \{c\} \rightarrow \\\{x_{2}x_{3}x_{4}x_{5}x_{6}x_{7}x_{8}x_{9}x_{10}\}. \end{aligned} \end{aligned}$$

If setting \(m=k=1\), we have \(X_{1}^{\alpha \beta }\subseteq X_{2}^{\alpha \beta }\). When cognition ends, however, we also have \(X_{1}^{(\alpha \beta )^{3}}=X_{1}^{(\alpha \beta )^{2}}\subset X_{2}^{\alpha \beta }=X_{2}^{(\alpha \beta )^{2}}\), i.e., the condition \(X_{1}^{(\alpha \beta )^{m}}\subseteq X_{2}^{(\alpha \beta )^{k}}\) in Theorem 1 cannot ensure the convergence of \(X_{1}\) and \(X_{2}\).

Similarly, one can determine the convergence of attribute incremental sequences by the following conclusion.

Theorem 2

If \(\beta \ge \alpha\) and two sets \(B_{1}\), \(B_{2}\subseteq M\) converge, then there exist integers \(m,n,k\ge 0\) such that \(B_{1}^{(\beta \alpha )^{m}}\subseteq B_{2}^{(\beta \alpha )^{k}}\subseteq B_{1}^{(\beta \alpha )^{n}}\). Furthermore, if both G and M are finite, the converse also hold, i.e., \(B_{1}\) and \(B_{2}\) converge if and only if there exist integers \(m,n,k\ge 0\) such that \(B_{1}^{(\beta \alpha )^{m}}\subseteq B_{2}^{(\beta \alpha )^{k}}\subseteq B_{1}^{(\beta \alpha )^{n}}\).

We have discussed the convergence of object and attribute incremental sequences and presented some results to judge the convergence of object and attribute incremental sequences. The results are useful especially when both G and M are finite, a general case in data mining. If some incremental sequences do not meet the definitions of convergence, the incremental sequences are considered to be parallel to each other.

4.3 Robust variable threshold fuzzy concept lattice

If two sets converge, the two sets can be considered to be with the same information. In other words, after a certain number of double threshold operations, the incremental results of the two sets tend to be stable and coincide. Next, we will analyze the relationships of object and attribute incremental sequences and then construct the robust variable threshold fuzzy concept lattice based on double threshold operators.

In the following, we also suppose \(\alpha \ge \beta\) and take object incremental sequence as example to establish the equivalence relation R. Similar discussions can be applied to attribute incremental sequences.

Definition 8

For the power set P(G) of G, we can define the relation R on P(G) as follows: for \(X_{1}\), \(X_{2}\in P(G)\), \(\left\langle X_{1},X_{2}\right\rangle \in R\) if and only if \(X_{1}\) and \(X_{2}\) converge.

Example 5

For Table 1, set \(\alpha =0.50\) and \(\beta =0.40\). Let \(X_1=\{x_2x_8x_{10}\}\) and \(X_2=\{x_2x_7x_8x_9\}\). The object incremental sequences of \(X_{1}\) and \(X_{2}\) can be computed as follows:

$$\begin{aligned}{} & {} \begin{aligned} \{x_2x_8x_{10}\} \rightarrow \{bcd\} \rightarrow \{x_2x_3x_6x_7x_8x_9x_{10}\} \rightarrow \{bc\} \rightarrow \\ \{x_2x_3x_4x_6x_7x_8x_9x_{10}\} \rightarrow \{bc\} \end{aligned}\\{} & {} \begin{aligned} \{x_2x_7x_8x_9\} \rightarrow \{bcd\} \rightarrow \{x_2x_3x_6x_7x_8x_9x_{10}\} \rightarrow \{bc\} \rightarrow \\ \{x_2x_3x_4x_6x_7x_8x_9x_{10}\}\rightarrow \{bc\}. \end{aligned} \end{aligned}$$

Thus, we have \(\langle X_1,X_2\rangle \in R\).

The relation R in Definition 8 has the following properties.

Proposition 5

Let \(K=(G,M,I)\) be a fuzzy formal context. The relation R in Definition 8 is an equivalence relation, i.e., for any \(X_{1}\), \(X_{2}\in P(G)\)

  1. 1.

    \(\left\langle X_{1},X_{1}\right\rangle \in R\)

  2. 2.

    If \(\left\langle X_{1},X_{2}\right\rangle \in R\), then \(\left\langle X_{2},X_{1}\right\rangle \in R\)

  3. 3.

    If \(\left\langle X_{1},X_{2}\right\rangle \in R\) and \(\left\langle X_{2},X_{3}\right\rangle \in R\), then \(\left\langle X_{1},X_{3}\right\rangle \in R\).

Proof

The proofs of self-reflexivity and symmetry are obvious.

If \(\left\langle X_{1},X_{2}\right\rangle \in R\) and \(\left\langle X_{2},X_{3}\right\rangle \in R\), there exists (ij) satisfying \(X_{1}^{(\alpha \beta )^{i}}=X_{2}^{(\alpha \beta )^{j}}\) and (mn) satisfying \(X_{2}^{(\alpha \beta )^{m}}=X_{3}^{(\alpha \beta )^{n}}\). Setting \(k=\max (i,j,m,n)\), by Proposition 4 we have \(X_{1}^{(\alpha \beta )^{k}}=X_{2}^{(\alpha \beta )^{k}}\) and \(X_{2}^{(\alpha \beta )^{k}}=X_{3}^{(\alpha \beta )^{k}}\), and hence \(X_{1}^{(\alpha \beta )^{k}}=X_{3}^{(\alpha \beta )^{k}}\), i.e., \(X_{1}\) and \(X_{3}\) converge. Thus, we have \(\left\langle X_{1},X_{3}\right\rangle \in R\). \(\square\)

According to Definition 8, the equivalence class of \(X\in P(G)\) can be denoted as \([X]_{R}=\{X_{1} \mid \left\langle X,X_{1}\right\rangle \in R\}\), i.e., the sets that converge with X.

In the following, we suppose that G is finite. Thus, for \(X\in P(G)\), there exists a non-negative integer n satisfying \(X^{(\alpha \beta )^{n}}=X^{(\alpha \beta )^{n+1}}\), denoted as \([X]_{\max }=X^{(\alpha \beta )^{n}}\). Clearly, for any non-negative integer k, we have \([X]_{\max }=([X]_{\max })^{(\alpha \beta )^{k}}\). Furthermore, for any \(X_{1}\in [X]_{R}\), we have \([X_{1}]_{\max }=[X]_{\max }\), meaning that all the sets in \([X]_{R}\) end at \([X]_{\max }\).

Definition 9

Let \(K=(G,M,I)\) be a fuzzy formal context. If \(\alpha \ge \beta\), for \(X\subseteq G\), denote \({\overline{X}}=[X]_{\max }\) and \({\overline{X}}^{\alpha }=[X]_{\max }^{\alpha }\) and call \(({\overline{X}},{\overline{X}}^{\alpha })\) a robust variable threshold concept of (GMI), where \({\overline{X}}\) is the extent and \({\overline{X}}^{\alpha }\) is the intent. Denote by \(L^{\alpha \beta }(K)\) all the robust variable threshold concepts of K of thresholds \({\alpha }\) and \(\beta\).

By Definition 9, the extent \({\overline{X}}\) of \(({\overline{X}},{\overline{X}}^{\alpha })\) collects all the objects that have the salient features in \({\overline{X}}{}^{\alpha }\), and the intent \({\overline{X}}^{\alpha }\) captures the salient features common to all the objects in \({\overline{X}}\). In this case, the pair \(({\overline{X}},{\overline{X}}^{\alpha })\) is stable because they can be identified by each other, i.e., we have \({\overline{X}}^{\alpha }=({\overline{X}})^{\alpha }\) and \({\overline{X}}=({\overline{X}}^{\alpha })^{\beta }\).

Robust variable threshold concepts in Definition 9 have the following properties.

Proposition 6

Let \(K=(G,M,I)\) be a fuzzy formal context with G and M being finite, and \(X,X_{1}\), \(X_{2}\subseteq G\). If \(\alpha \ge \beta\), then the following conclusions hold.

  1. 1.

    \(X\subseteq {\overline{X}}\)

  2. 2.

    \(X_{1}\subseteq X_{2}\Rightarrow \overline{X_{1}}\subseteq \overline{X_{2}}\)

  3. 3.

    \({\overline{X}}={\overline{X}}^{\alpha \beta }=\overline{{\overline{X}}}\)

  4. 4.

    \(\overline{X_{1}}\cup \overline{X_{2}}\subseteq \overline{\overline{X_{1}} \cup \overline{X_{2}}}\)

  5. 5.

    \(\overline{X_{1}}\cap \overline{X_{2}}=\overline{\overline{X_{1}}\cap \overline{X_{2}}}\)

  6. 6.

    \({\overline{X}}^{\alpha }={\overline{X}}^{\alpha \beta \alpha }\)

  7. 7.

    \(X_{1}\subseteq X_{2}\Rightarrow \overline{X_{2}}^{\alpha }\subseteq \overline{X_{1}}^{\alpha }\)

  8. 8.

    \(\overline{X_{1}}\subseteq \overline{X_{2}}\Leftrightarrow \overline{X_{2}} ^{\alpha }\subseteq \overline{X_{1}}^{\alpha }\)

  9. 9.

    \((\overline{X_{1}}\cup \overline{X_{2}})^{\alpha }=\overline{X_{1}}^{\alpha } \cap \overline{X_{2}}^{\alpha }\)

  10. 10.

    \((\overline{X_{1}}^{\alpha }\cup \overline{X_{2}}^\alpha )^\beta = \overline{X_{1}}^{\alpha \beta }\cap \overline{X_{2}}^{\alpha \beta }\).

Proof

The conclusions of (1)–(3) and (6) follow from the definitions of \({\overline{X}}\) and \([X]_{\max }\) and Proposition 2. The conclusion of (4) follows from (1). The conclusion of (7) follows from (2) and Proposition 2.

(5) Since \(\overline{X_{1}}\cap \overline{X_{2}}\subseteq \overline{X_{1}}\) and \(\overline{X_{1}}\cap \overline{X_{2}}\subseteq \overline{X_{2}}\), by (2) and (3) we have \(\overline{\overline{X_{1}}\cap \overline{X_{2}}}\subseteq \overline{\overline{X_{1}}}=\overline{X_{1}}\) and \(\overline{\overline{X_{1}}\cap \overline{X_{2}}}\subseteq \overline{\overline{X_{2}}}=\overline{X_{2}}\), and thus \(\overline{\overline{X_{1}}\cap \overline{X_{2}}}\subseteq \overline{X_{1}}\cap \overline{X_{2}}\). Conversely, by (1), we have \(\overline{X_{1}}\cap \overline{X_{2}}\subseteq \overline{\overline{X_{1}} \cap \overline{X_{2}}}\) and thus \(\overline{X_{1}}\cap \overline{X_{2}}=\overline{(\overline{X_{1}}\cap \overline{X_{2}})}\).

(8) If \(\overline{X_{1}}\subseteq \overline{X_{2}}=\overline{X_{2}}^{\alpha \beta }\), by Proposition 2 we have \(\overline{X_{2}}^{\alpha \beta \alpha }\subseteq \overline{X_{1}}^{\alpha }\). Since by (6) we have \(\overline{X_{2}}^{\alpha \beta \alpha }=\overline{X_{2}}^{\alpha }\), we obtain \(\overline{X_{2}}^{\alpha }\subseteq \overline{X_{1}}^{\alpha }\). Conversely, by \(\overline{X_{2}}^{\alpha }\subseteq \overline{X_{1}}^{\alpha }\) and Proposition 2 we have \(\overline{X_{1}}^{\alpha \beta }\subseteq \overline{X_{2}}^{\alpha \beta }\). Since \(\overline{X_{1}}=[X_{1}]_{\max }\) and \(\overline{X_{2}}=[X_{2}]_{\max }\), we have \(\overline{X_{1}}=\overline{X_{1}}^{\alpha \beta }\) and \(\overline{X_{2}}=\overline{X_{2}}^{\alpha \beta }\), and thus \(\overline{X_{1}}\subseteq \overline{X_{2}}\).

(9) Since \(\overline{X_{1}}\subseteq \overline{X_{1}}\cup \overline{X_{2}}\) and \(\overline{X_{2}}\subseteq \overline{X_{1}}\cup \overline{X_{2}}\), by Proposition 2 we have \((\overline{X_{1}}\cup \overline{X_{2}})^{\alpha }\subseteq \overline{X_{1}}^{\alpha }\), \((\overline{X_{1}}\cup \overline{X_{2}})^{\alpha }\subseteq \overline{X_{2}}^{\alpha }\) and \((\overline{X_{1}}\cup \overline{X_{2}})^{\alpha }\subseteq \overline{X_{1}}^{\alpha }\cap \overline{X_{2}}^{\alpha }\). Suppose that there exists \(m\in (\overline{X_{1}}^{\alpha }\cap \overline{X_{2}}^{\alpha })\) \\((\overline{X_{1}}\cup \overline{X_{2}})^{\alpha }\). Since \(m\notin (\overline{X_{1}}\cup \overline{X_{2}})^{\alpha }\), there exists \(x\in (\overline{X_{1}}\cup \overline{X_{2}})\) satisfying \(I(x,m)<\alpha\). By \(m\in (\overline{X_{1}}^{\alpha }\cap \overline{X_{2}}^{\alpha })\), all elements \(x_{1}\in \overline{X_{1}}\) and \(x_{2}\in \overline{X_{2}}\) satisfy \(I(x_{1},m)\ge \alpha\) and \(I(x_{2},m)\ge \alpha\), implying that for any \(x'\in (\overline{X_{1}}\cup \overline{X_{2}})\), we have \(I(x',m)<\alpha\), a contradiction with \(I(x,m)<\alpha\) for \(x\in (\overline{X_{1}}\cup \overline{X_{2}})\).

(10) Since \(\overline{X_{1}}^{\alpha }\subseteq (\overline{X_{1}}^{ {\alpha }}\cup \overline{X_{2}}^{\alpha })\) and \(\overline{X_{2}}^{\alpha }\subseteq (\overline{X_{1}}^{{\alpha }} \cup \overline{X_{2}}^{\alpha })\), by Proposition 2 we have \(\overline{X_{1}}^{\alpha \beta }\subseteq (\overline{X_{1}}^{\alpha } \cup \overline{X_{2}}^{\alpha })^{\beta }\), \((\overline{X_{1}}^{\alpha }\cup \overline{X_{2}}^{\alpha })^{\beta } \subseteq \overline{X_{2}}^{\alpha \beta }\) and \((\overline{X_{1}}^{\alpha }\cup \overline{X_{2}}^{\alpha })^{\beta } \subseteq \overline{X_{1}}^{\alpha \beta }\cap \overline{X_{2}}^{\alpha \beta }\). Suppose that there exists \(g\in (\overline{X_{1}}^{\alpha \beta }\cap \overline{X_{2}}^{\alpha \beta }) \backslash \overline{(X_{1}}^{\alpha }\cup \overline{X_{2}}^{\alpha })^{\beta }\). Since \(g\in \overline{X_{1}}^{\alpha \beta }\cap \overline{X_{2}}^{\alpha \beta }\), by definition of \({\overline{X}}^{\alpha \beta }\), all elements \(m_{1}\in \overline{X_{1}}^{\alpha }\) and \(m_{2}\in \overline{X_{2}}^{\alpha }\) satisfy \(I(g,m_{1})\ge \beta\) and \(I(g,m_{2})\ge \beta\). Therefore, for \(g\in \overline{X_{1}}^{\alpha \beta }\cap \overline{X_{2}}^{\alpha \beta }\), any \(m\in (\overline{X_{1}}^{\alpha }\cup \overline{X_{2}}^{\alpha })\) satisfies \(I(g,m)\ge \beta\), i.e., \(g\in\) \((\overline{X_{1}}^{\alpha }\cup \overline{X_{2}}^{\alpha })^{\beta }\), a contradiction with \(g\notin (\overline{X_{1}}^{\alpha }\cup \overline{X_{2}}^{\alpha })^{\beta }\). \(\square\)

By Proposition 6 (1), (2) and (3), the operator \(\overline{(\cdot )}\) is a closure operator on P(G), which is similar to the closure operators \((\cdot )^{II}\) in FCA and \((\cdot )^{\alpha \alpha }\) in variable threshold concept lattice. Basically, the properties enable the system of robust variable concepts to be a closure system and thus a complete lattice, as shown in Theorem 3. The property (5) in Proposition 6 says that the meet of two extents is also an extent, a property also similar to FCA and variable threshold concept lattice. The property simplifies the definition of infimum in \(L^{\alpha \beta }(K)\) of Theorem 3 from \(\overline{\overline{X_{1}}\cap \overline{X_{2}}}\) to \(\overline{X_{1}}\cap \overline{X_{2}}\). The property (8) in Proposition 6 shows the equivalence between the two inequalities, enabling the two equivalent definitions of the partial order defined in Eq. (1). The properties (5) and (9) in Proposition 6 indicate the applicability of the distributive property, similar to the property (5) in Proposition 1.

One distinctive characteristic of robust variable threshold fuzzy concept lattice from FCA and variable threshold concept lattice is its lack of symmetry between object and attribute. This is because in Proposition 2 the operator \((\cdot )^{\alpha \beta }\) does not satisfy \(X\subseteq X^{\alpha \beta }\) if \(\alpha \le \beta\), and the operator \((\cdot )^{\beta \alpha }\) does not satisfy \(B\subseteq B^{\beta \alpha }\) if \(\alpha \ge \beta\). Thus, in robust variable threshold fuzzy concept lattice, we construct our model from sets of objects for \(\alpha \le \beta\) and from sets of attributes for \(\alpha \ge \beta\), and all the properties in Proposition 6 concerning intents are also starting from sets of objects.

Thus, by Proposition 6, for two robust variable threshold concepts \((\overline{X_{1}},\overline{X_{1}}^{\alpha })\), \((\overline{X_{2}},\overline{X_{2}}^{\alpha })\), we can define

$$\begin{aligned}{} & {} (\overline{X_{1}},\overline{X_{1}}^{\alpha })\le (\overline{X_{2}}, \overline{X_{2}}^{\alpha })\Leftrightarrow \overline{X_{1}}\subseteq \overline{X_{2}}\nonumber \\{} & {} \quad (\Leftrightarrow \overline{X_{2}}^{\alpha }\subseteq \overline{X_{1}}^{\alpha }) \end{aligned}$$
(1)

where \((\overline{X_{1}},\overline{X_{1}}^{\alpha })\) is called a sub-concept of \((\overline{X_{2}},\overline{X_{2}}^{\alpha })\) and \((\overline{X_{2}},\overline{X_{2}}^{\alpha })\) is a super-concept of \((\overline{X_{1}},\overline{X_{1}}^{\alpha })\).

Theorem 3 shows that the above defined partial order leads to a complete lattice.

Theorem 3

Let \(K=(G,M,I)\) be a fuzzy formal context with G and M being finite. For two robust variable threshold concepts \((\overline{X_{1}},\overline{X_{1}}^{\alpha })\), \((\overline{X_{2}},\overline{X_{2}}^{\alpha })\in L^{\alpha \beta }(K)\), define the infimum and the supremum by

$$\begin{aligned}&(\overline{X_{1}},\overline{X_{1}}^{\alpha })\wedge (\overline{X_{2}}, \overline{X_{2}}^{\alpha }) \\&\quad =(\overline{X_{1}}\cap \overline{X_{2}}, (\overline{X_{1}}\cap \overline{X_{2}})^{\alpha })\\&(\overline{X_{1}},\overline{X_{1}}^{\alpha })\vee (\overline{X_{2}}, \overline{X_{2}}^{\alpha })\\&\quad =(\overline{\overline{X_{1}}\cup \overline{X_{2}}},(\overline{\overline{X_{1}} \cup \overline{X_{2}}})^{\alpha }). \end{aligned}$$

Then, \((L^{\alpha \beta }(K),\wedge ,\vee )\) forms a complete lattice, called the ROBust variable threshold fuzzy Concept Lattice (RobCL) of K.

Proof

By Proposition 6 (5), we have \(({\overline{X}}_{1}\cap {\overline{X}}_{2},\left( {\overline{X}}_{1} \cap {\overline{X}}_{2}\right) ^{\alpha })\in L^{\alpha \beta }(K)\) and \((\overline{{\overline{X}}_{1}\cup \overline{X_{2}}},\left( \overline{{\overline{X}}_{1}\cup \overline{X_{2}}}\right) ^{\alpha })\in L^{\alpha \beta }(K)\). Furthermore, it is obvious that \(({\overline{X}}_{1}\cap {\overline{X}}_{2},\left( {\overline{X}}_{1} \cap {\overline{X}}_{2}\right) ^{\alpha })\) is a sub-concept of \((\overline{X_{1}},\overline{X_{1}}^{\alpha })\) and \((\overline{X_{2}},\overline{X_{2}}^{\alpha })\), and that \((\overline{{\overline{X}}_{1}\cup \overline{X_{2}}},\left( \overline{ {\overline{X}}_{1}\cup \overline{X_{2}}}\right) ^{\alpha })\) is a super-concept of \((\overline{X_{1}},\overline{X_{1}}^{\alpha })\) and \((\overline{X_{2}},\overline{X_{2}}^{\alpha })\).

Next, we prove that any sub-concept of \((\overline{X_{1}},\overline{X_{1}}^{\alpha })\) and \((\overline{X_{2}},\overline{X_{2}}^{\alpha })\) is a sub-concept of \(({\overline{X}}_{1}\cap {\overline{X}}_{2},\left( {\overline{X}}_{1} \cap {\overline{X}}_{2}\right) ^{\alpha })\), i.e., \(({\overline{X}}_{1}\cap {\overline{X}}_{2},\left( {\overline{X}}_{1} \cap {\overline{X}}_{2}\right) ^{\alpha })\) is the infimum of \((\overline{X_{1}},\overline{X_{1}}^{\alpha })\) and \((\overline{X_{2}},\overline{X_{2}}^{\alpha })\). For any sub-concept \((\overline{X_{3}},\overline{X_{3}}^{\alpha })\) of \((\overline{X_{1}},\overline{X_{1}}^{\alpha })\) and \((\overline{X_{2}},\overline{X_{2}}^{\alpha })\), since \(\overline{X_{3}}\subseteq \overline{X_{1}}\) and \(\overline{X_{3}}\subseteq \overline{X_{2}}\), we have \(\overline{X_{3}}\subseteq \overline{X_{1}}\cap \overline{X_{2}}\) and thus \((\overline{X_{3}},\overline{X_{3}}^{\alpha })\) is a sub-concept of \(({\overline{X}}_{1}\cap {\overline{X}}_{2},\left( {\overline{X}}_{1} \cap {\overline{X}}_{2}\right) ^{\alpha })\).

Similarly, for any super-concept \((\overline{X_{4}},\left( \overline{X_{4}}\right) ^{\alpha })\) of \((\overline{X_{1}},\overline{X_{1}}^{\alpha })\) and \((\overline{X_{2}},\overline{X_{2}}^{\alpha })\), since \(\overline{X_{1}}\cup \overline{X_{2}}\subseteq \overline{X_{4}}\), by Proposition 6 (2) and (3), we have \(\overline{({\overline{X}}_{1}\cup {\overline{X}}_{2}}) \subseteq {\overline{\overline{X_{4}}}=\overline{X_{4}}}\), i.e., \(({\overline{X}}_{1}\cup {\overline{X}}_{2},(\overline{{\overline{X}}_{1} \cup \overline{X_{2}}})^{\alpha })\) is the supermum of \((\overline{X_{1}},\overline{X_{1}}^{\alpha })\) and \((\overline{X_{2}},\overline{X_{2}}^{\alpha })\). \(\square\)

Definition 10

Let \(K=(G,M,I)\) be a fuzzy formal context. If \(\beta \ge \alpha\), for \(B\subseteq M\), denote \({\overline{B}}=[B]_{\max }\) and \({\overline{B}}{}^{\beta }=[B]_{\max }^{\beta }\) and call \(({\overline{B}}{}^{\beta },{\overline{B}})\) a robust variable threshold concept of (GMI), where \({\overline{B}}{}^{\beta }\) is the extent and \({\overline{B}}\) is the intent.

By Definition 10, the extent \({\overline{B}}^{\beta }\) of \(({\overline{B}}^{\beta },{\overline{B}})\) collects all the objects that possess the salient features in \({\overline{B}}\), and the intent \({\overline{B}}\) captures the salient features common to all the objects in \({\overline{B}}^{\beta }\). In this case, the pair \(({\overline{B}}{}^{\beta },{\overline{B}})\) is stable because they can be identified by each other. Obviously, if we change the set X of objects in Proposition 6 with the set B of attributes and the operator \(\alpha \beta\) with \(\beta \alpha\), the equations in Proposition 6 still hold.

Example 6

Setting \(\alpha =0.54\) and \(\beta =0.51\), we can obtain the robust variable threshold fuzzy concept lattice of Table 1, as shown in Fig. 4. For comparison, the variable threshold concept lattices with \(\alpha =\beta =0.54\) and \(\alpha =\beta =0.51\) of Table 1 are shown in Fig. 5 and Fig. 6.

Fig. 4
figure 4

RobCL with \(\alpha =0.54\) and \(\beta =0.51\)

Fig. 5
figure 5

Variable threshold concept lattice with \(\alpha =\beta =0.54\)

Fig. 6
figure 6

Variable threshold concept lattice with \(\alpha =\beta =0.51\)

In Fig. 4, all the meets of extents are also extents. For example, the meet of the extents of \((x_{7}x_{8}x_{9}x_{10},abcd)\) and \((x_{6}x_{9}x_{10},bcde)\) is the extent of \((x_{9}x_{10},abcde)\). Comparing Fig. 4 with Fig. 5, we can find that the variable threshold concept \((x_2x_{4}x_{6}x_{7}x_{8}x_{9}x_{10},bc)\) has been removed from Fig. 4. This is because the object incremental sequence of \(\{x_2x_{4}x_{6}x_{7}x_{8}x_{9}x_{10}\}\) is

$$\begin{aligned} \begin{aligned} \{x_2x_{4}x_{6}x_{7}x_{8}x_{9}x_{10}\}\rightarrow \{bc\}\rightarrow \{x_{2} x_{3}x_{4}x_{6}x_{7}x_{8}x_{9}x_{10}\} \rightarrow \\ \emptyset \rightarrow \{x_{1}x_{2}x_{3}x_{4}x_{5}x_{6}x_{7}x_{8}x_{9}x_{10}\} \end{aligned} \end{aligned}$$

i.e., the corresponding robust variable threshold concept of \(\{x_2x_{4}x_{6}x_{7}x_{8}x_{9}x_{10}\}\) is \((x_{1}x_{2}x_{3}x_{4}x_{5}x_{6}x_{7}x_{8}x_{9}x_{10},\emptyset )\). In the incremental sequence of \(\{x_2x_{4}x_{6}x_{7}x_{8}x_{9}x_{10}\}\), \(\{x_2x_{4}x_{6}x_{7}x_{8}x_{9}x_{10}\}^{\alpha }=\{bc\}\) captures all the salient symptoms of \(\{x_2x_{4}x_{6}x_{7}x_{8}x_{9}x_{10}\}\), and \(\{x_2x_{4}x_{6}x_{7}x_{8}x_{9}x_{10}\}^{\alpha \beta }=\{x_{2}x_{3} x_{4}x_{6}x_{7}x_{8}x_{9}x_{10}\}\) collects all the patients who have the symptoms \(\{bc\}\) at the degree greater than or equal 0.51. Compared with variable threshold concept lattice, \(x_{3}\) has been included in \(\{x_2x_{4}x_{6}x_{7}x_{8}x_{9}x_{10}\}^{\alpha \beta }\), whereas \(\{x_2x_{4}x_{6}x_{7}x_{8}x_{9}x_{10}\}^{\alpha \alpha }=\{x_2x_{4}x_{6} x_{7}x_{8}x_{9}x_{10}\}\) does not take \(x_{3}\) as its member. This is because for the symptoms \(\{x_2x_{4}x_{6}x_{7}x_{8}x_{9}x_{10}\}^{\alpha }=\{bc\}\), as stated above, if one employs \(\alpha\) for the following computation, i.e. \(\{x_2x_{4}x_{6}x_{7}x_{8}x_{9}x_{10}\}\), one may collect only the patients who have the symptoms \(\{bc\}\) at the degree greater than or equal to 0.54, missing the patients who have the symptoms \(\{bc\}\) less than but close to 0.54, i.e. \(x_{3}\), where \(\beta \le I(x_{3},b)=0.52\le \alpha\) and \(\beta \le I(x_{3},c)=0.53\le \alpha\). Continuing the process, we can obtain the robust variable threshold concept \((x_{1}x_{2}x_{3}x_{4}x_{5}x_{6}x_{7}x_{8}x_{9}x_{10},\emptyset )\) for \(\{x_2x_{4}x_{6}x_{7}x_{8}x_{9}x_{10}\}\), i.e., there is no common salient symptoms for \(\{x_2x_{4}x_{6}x_{7}x_{8}x_{9}x_{10}\}\) at the threshold \(\alpha =0.54\) and the tolerance threshold \(\beta =0.51\).

Similarly, comparing Fig. 4 with Fig. 6, one can find that the variable threshold concept \((x_2x_3x_4x_6x_7x_8x_9x_{10},bc)\) has been removed from Fig. 6. In Fig. 6, for \(\{x_2x_3x_4x_6x_7x_8x_9x_{10}\}\), since \(\alpha =0.51\), we have \(\{x_2x_3x_4x_6x_7x_8x_9x_{10}\}^{\alpha } = \{bc\}\). In Fig. 4, however, because \(0.51< I(x_3,b)=0.52 < 0.54\) and \(0.51< I(x_3,c)=0.53 < 0.54\), we have \(\{x_2x_3x_4x_6x_7x_8x_9x_{10}\}^\alpha =\emptyset\), where \(\alpha =0.54\). Thus, \((x_2x_3x_4x_6x_7x_8x_9x_{10},bc)\) is a variable threshold concept of Fig. 6 with 0.51 being the threshold of objects having attributes, whereas \((x_1x_2x_3x_4x_5x_6x_7x_8x_9x_{10},\emptyset )\) is a robust variable threshold concept of Fig. 4 with 0.54 being the threshold of objects having attributes and 0.51 being the tolerance threshold.

5 Conclusion and Further Work

In this paper, we introduced the tolerance threshold to variable threshold concept lattice, solving the problem that variable threshold concept lattice is easily disturbed by noise. Then, we constructed RobCL and proved that RobCL is a complete lattice. In addition, we further made a comparison with variable threshold concept lattice and the results show that when the two thresholds coincide with each other, RobCL degenerates to variable threshold concept lattice and thus variable threshold concept lattice should be regarded as a special case of RobCL.

In the process of constructing RobCL, by introducing the tolerance threshold, an object may have a salient feature, a general feature or no feature; in other words, there are three possible states between an object and an attribute, considering with the idea of three-way decision [29,30,31,32]. It is easy to find the difference between RobCL and three-way concept lattice [33,34,35,36], because the latter is constructed on a binary formal context by identifying the common attributes not shared by extents, whereas the former is constructed on a fuzzy formal context by taking into account the general attributes shared by extents. Certainly, the relationship between RobCL and three-way concept lattice may be further clarified after determining whether the three states can be replaced by three values, say 0, 0.5 and 1, without altering the structure of RobCL. The latter, however, seems not straightforward because the incremental characteristics of incremental sequence make it difficult to capture the underlying principle. For example, Proposition 3 still holds for this case, with substituting \(I(g,m)\ge \beta\) by \(I(g,m)\ge 0\) and \(\alpha >I(g_{1},m_{1})\ge \beta\) by \(I(g_{1},m_{1})=0.5\).

Another further work concerns the relationship between RobCL and variable threshold concept lattice. Although it is clear that RobCL is a general case of variable threshold concept lattice, the preliminary results show that the robust version of variable threshold concept lattice, i.e. RobCL may reduce the fuzzy concepts in variable threshold concept lattice, a surprising result. The basic explanation for this is that the robustness reduce the fuzzy concepts that are not robust; however, further clarification is also needed.

It should be noted that although RobCL was applied only in medical diagnosis in the paper, RobCL can be applied in all the fields whose data can be represented as fuzzy two-dimensional tables. For example, in uncertain group decision-making, the set of objects usually consists of different experts, the set of attributes consists of different solutions, and the values between objects and attributes are the evaluating results of solutions by experts. In this case, if only a simple threshold is chosen, the experts’ attitudes towards the solutions will be either ‘agree’ or ‘disagree’. Thus, a tolerance threshold may allow the evaluating results to fluctuate within a certain small range and improve the robustness of group decision-making in the uncertain case.