Keywords

1 Introduction

Concepts are important and basic constituents in human’s cognition process. Consequently, they are crucial in many psychological processes, such as categorization, inference, memory, learning, and decision-making. In philosophy, there are different views or structures of concepts. In classical view, a concept contains two parts, extension and intension. The extension is a group of objects belonging to the concept and the intension is a family of attributes characterizing the properties of the concept. The classical view holds that all instances of a concept share common properties, which are necessary and sufficient conditions for defining the concept. In order to apply the philosophical concept into data processing, Wille [22] proposed a new field, formal concept analysis (FCA), giving a mathematical format of the classical view of concepts.

FCA [22] shows a mathematical format of classical view of concepts, named formal concepts. A formal concept consists of a pair of an object set (extent) and an attribute set (intent). The objects in extent possess all the attributes in intent and the attributes in intent are possessed by all the objects in extent. Based on the partial order theory, Wille and Ganter [8] presented a lattice structure of formal concepts named a concept lattice which reveals hierarchical structure of concepts with respect to the generalization and the specialization of concepts. However, the formal concept is an all-or-none phenomenon. That is, if an object possesses all the attributes in the intent of a formal concept, it is definitely in the extent of this formal concept, but if an object does not possess all the attributes in the intent, even though it possesses most attributes in the intent, this object is definitely not in this formal concept. In other words, if two objects are in the extent of same concept, they must have same degree of typicality in this formal concept. That is, the objects in the extent of a concept are equally important in people’s understanding of the concept. This view of concepts is mostly used in machine-oriented concept learning [1, 10, 12, 25, 27], but not always consistent with human’s understanding of concepts. Classical formal concepts have been extended to other types, such as preconcepts [23], semiconcepts [24], protoconcepts [21], property oriented concepts [4], object oriented concepts [26], dual concepts [2, 13], monotone concepts [3], RS-definable concepts [28] and three-way concepts [15,16,17].

There is increasing evidence that memberships of objects in semantic categories which are expressed by words of natural languages can be graded rather than all-or-none. Lakoff [11], Rosch [19] and Zadeh [29] argued that some natural categories are analog and must be represented logically in a manner which reflects their analog structure. Rosch [19] has further characterized some natural analog categories as internally structured into a prototype (clearest cased, best examples of the category) and non-prototype members, with non-prototype members tending toward an order from better to poorer examples. For example, chair is a more reasonable exemplar than radio of the concept furniture, or we can say that the chair has a larger membership than radio of the concept furniture. When we talk about color, vermilion, fuchsia, pink, cerise, peach, garnet, cardinal, rose, wine all belong to concept red. However, rose is more typical than pink. This kind of view of concepts are called prototype view of concepts.

In this paper, we try to give a mathematical representation of the prototype view of concepts [6, 7]. Considering the cognitive process of recognizing concepts, we firstly choose an object as the prototype of a concept, which is the most typical object and can be a representative of this concept. Then the similarities between other objects and prototype are given according to a similarity measurement. Since the prototype is described by a group of attributes [9], the similarity measurement is defined based on the description of objects. The objects with high similarity to the prototype can be put into the concept. In order to quantitatively define high similarity, we preset a threshold k and the corresponding prototype view concepts are called k-cutting concepts. Since prototype o is the most typical object of this concept, the description of prototype o is regarded as the intent of this k-cutting concept and the objects whose similarity to prototype o is bigger than k are put into the extent of this k-cutting concept. Furthermore we study the properties of k-cutting concepts.

The rest of the paper is organized as follows. Section 2 gives the basic notions in formal concept analysis. Then Sect. 3 presents the similarity measurement between two different objects and defines the k-cutting concept. Furthermore we show the properties of k-cutting concepts. Finally, this paper is concluded in Sect. 4.

2 Formal Concept Analysis

This section reviews basic notions in FCA. FCA, proposed by Wille in 1982 [22], gives a mathematical way to represent a concept with a pair of objects set (called the extent) and attributes set (called the intent). The data source of FCA is called formal context defined as follows [8, 22].

Definition 1

A formal context \((OB, AT, \mathbf{I})\) consists of two sets \(O\!B\) and AT, and a relation \(\mathbf{I}\) between \(O\!B\) and AT. The elements of \(O\!B\) are called the objects and the elements of AT are called the attributes of the context. In order to express that an object o is in a relation \(\mathbf{I}\) with an attribute a, we write \(o\mathbf{I}a\) or \((o,a) \in \mathbf{I}\) and read it as “the object o has the attribute a”.

Based on the formal context, the set of attributes possessed by an object o and the set of objects possessing an attribute a are given as

$$\begin{aligned} o\mathbf{I}.= & {} \{a \in AT \mid o \mathbf{I} a \} \subseteq AT, \nonumber \\ {.}{} \mathbf{I}a= & {} \{o \in O\!B \mid o \mathbf{I} a\}\subseteq O\!B. \end{aligned}$$
(1)

Actually, \(o\mathbf{I}.\) can be regarded as the description of object o and \({.}{} \mathbf{I}a\) can be understood as a set of objects which can be described by attribute a or a set of representatives of description \(\{a\}\). Given a formal context \((O\!B, AT, \mathbf{I})\), if for any \(o\in O\!B\), we have \(o\mathbf{I}.\ne \emptyset \), \(o\mathbf{I}.\ne AT\), and for any \(a\in AT\), we have \({.}{} \mathbf{I}a\ne \emptyset \), \({.}{} \mathbf{I}a\ne O\!B\), then the formal context \((O\!B, AT, \mathbf{I})\) is called canonical. If for any objects \(o_{1}, o_{2}\in O\!B\), from \(o_{1}{} \mathbf{I}.=o_{2}{} \mathbf{I}.\), it always follows that \(o_{1}=o_{2}\) and, consequently, \({.}{} \mathbf{I}a_{1}=.\mathbf{I}a_{2}\) implies \(a_{1}=a_{2}\) for all \(a_{1}, a_{2}\in AT\). We call this context a clarified formal context. In this paper, we suppose all formal contexts are canonical, clarified and finite. Based on the description of an object and the representatives of an attribute, a pair of operators called derivation operators are defined on an objects set \(O \subseteq O\!B\) and an attributes set \(A \subseteq AT\), respectively, in \((O\!B,AT,\mathbf{I})\) [8]:

$$\begin{aligned} O^*= & {} \{a \in AT \mid \forall o\in O(o \mathbf{I} a)\} = \{a \in AT \mid O \subseteq \,{.}{} \mathbf{I}a\} = {\textstyle \bigcap } \{ o \mathbf{I}. \mid o \in O\}, \nonumber \\ A^*= & {} \{o \in O\!B \mid \forall a \in A (o \mathbf{I} a)\} = \{o \in O\!B \mid A \subseteq o\mathbf{I}.\} = {\textstyle \bigcap } \{ {.}{} \mathbf{I}a \mid a \in A\}. \end{aligned}$$
(2)

It is obvious to see that, for any object \(o\in O\!B\) and any attribute \(a\in AT\), it always follows \(o\mathbf{I}.=\{o\}^* \) and \({.}{} \mathbf{I}a=\{a\}^* \). Then based on above derivation operators, a formal concept is obtained [8].

Definition 2

A formal concept of the context \((O\!B,AT,\mathbf{I})\) is a pair (OA) with \( O^* = A\) and \(O=A^*\) \((O\subseteq O\!B, A\subseteq AT)\). We call O the extent and A the intent of the formal concept (OA).

The formal concepts of a formal context \((O\!B,AT,\mathbf{I})\) are ordered by

$$\begin{aligned} (O_1,A_1 ) \le (O_2,A_2 ) \Leftrightarrow O_1 \subseteq O_2\ ( \Leftrightarrow A_1 \supseteq A_2). \end{aligned}$$
(3)

All formal concepts of \((O\!B,AT,\mathbf{I})\) can form a complete lattice called the formal concept lattice of \((O\!B,AT,\mathbf{I})\), denoted by \(L(O\!B,AT,\mathbf{I})\). The infimum and supremum are given by

$$\begin{aligned}&(O_{1},A_{1})\wedge (O_{2},A_{2})=(O_{1}\cap O_{2},(A_{1}\cup A_{2})^{**}), \nonumber \\&(O_{1},A_{1})\vee (O_{2},A_{2})=((O_{1}\cup O_{2})^{**},A_{1}\cap A_{2}) \end{aligned}$$
(4)

In a formal context, there is a kind of important concept, named object concept [8].

Definition 3

Let \((O\!B, AT, \mathbf{I})\) be a formal context, \((o^{**}, o^{*})\) is a formal concept for all \(o\in O\!B\), which is called an object concept. Here, for convenience, we write \(o^{*}\) instead of \(\{o\}^{*}\) for any \(o\in O\!B\).

The object concept \((o^{**}, o^{*})\) can be understood as a concept induced by object o, which means the object o is a typical object (prototype) of concept \((o^{**}, o^{*})\). Specifically, the description (intent) of concept \((o^{**}, o^{*})\) is the description of object o and the extent of this concept is a set of objects which can be described by the description of object o. In order to show the importance of the object concept, the notion of join-dense is recalled in next definition [8].

Definition 4

Let P be an ordered set and let \(Q\subseteq P\). Then Q is called join-dense in P if for every element \(a\in P\) there is a subset A of Q such that \(a=\vee _{P} A\).

Following theorem shows that any formal concept can be constructed based on a set of object concepts, so the object concepts can be regarded as the fundamental elements in concept construction [8].

Theorem 1

Let \((O\!B, AT, \mathbf{I})\) be a formal context and \(L(O\!B,AT,\mathbf{I})\) the associated complete lattice of concepts. Then the set of all the object concepts is join-dense in \(L(O\!B, AT, \mathbf{I})\). Specifically, for a formal concept (OA),

$$\begin{aligned} \bigvee \{(o^{**}, o^{*})\mid o\in O\}=(O, A) \end{aligned}$$
(5)

holds.

Finally, we give an example to illustrate the definitions and theorems presented in this section.

Example 1

Table 1 is a formal context \((O\!B, AT, \mathbf{I})\) with four objects \(O\!B=\{o_{1},o_{2},o_{3},o_{4}\}\) and four attributes \(AT=\{a,b,c,d\}\). The description of every object and the representatives of every attribute are as follows:

$$\begin{aligned}&o_{1}{} \mathbf{I}.=\{a,d\},~o_{2}{} \mathbf{I}.=\{b,d\},~ o_{3}{} \mathbf{I}.=\{a,b,c\},~o_{4}{} \mathbf{I}.=\{b,c\}.\\&{.}{} \mathbf{I}a=\{o_{1},o_{3}\},~{.}{} \mathbf{I}b=\{o_{2},o_{3},o_{4}\},~ {.}{} \mathbf{I}c=\{o_{3},o_{4}\},~{.}{} \mathbf{I}d=\{o_{1},o_{2}\}. \end{aligned}$$
Table 1. A formal context \((O\!B, AT, \mathbf{I})\)

We can see that for any object \(o_{i}\in O\!B\), its description is neither whole attribute set AT nor the empty set. Also, for any attribute in AT, its representatives set is neither whole object set \(O\!B\) nor the empty set. Thus the formal context \((O\!B, AT, \mathbf{I})\) is canonical. Moreover, for any two different objects, their descriptions are different, and for any two different attributes, their representatives sets are different. Thus the formal context \((O\!B, AT, \mathbf{I})\) is clarified.

The formal concept lattice of context \((O\!B, AT, \mathbf{I})\) is shown in Fig. 1.

The object concepts are: \((o_{1}^{**},o_{1}^{*})=(o_{1},ad),~(o_{2}^{**},o_{2}^{*})=(o_{2},bd), ~(o_{3}^{**},o_{3}^{*})=(o_{3},abc),~(o_{4}^{**},o_{4}^{*})=(o_{3}o_{4},bc)\). After calculation, we have

$$\begin{aligned}&(o_{2}o_{3}o_{4},b)=(o_{2},bd)\vee (o_{3},abc)\vee (o_{3}o_{4},bc),\\&(o_{1}o_{3},a)=(o_{1},ad)\vee (o_{3},abc),\\&(o_{1}o_{2},d)=(o_{1},ad)\vee (o_{2},bd),\\&(O\!B,\emptyset )=(o_{1},ad)\vee (o_{2},bd)\vee (o_{3},abc)\vee (o_{3}o_{4},bc). \end{aligned}$$

That is, any formal concept can be constructed by joining a set of object concepts. Thus, the set of all object concepts is join-dense in formal concept lattice.

Fig. 1.
figure 1

The formal concept lattice \(L(O\!B, AT, \mathbf{I})\)

3 The Prototype View of Concept

Section 2 shows the importance of object concepts in concept construction. However, the definition of object concepts is too strict. According to Definition 3, the extent of an object concept \((o^{**}, o^{*})\) is a set of objects which can be fully described by the description of object o. Actually, the semantic concept in our daily life is based on a typical object (prototype), but the extent of semantic concept is not required to be fully described by the description of the typical object. Based on the similarity [5, 18, 20] to the typical object, the typicality of objects in extent can be defined. In this section, we give a mathematical way to represent the semantic concept and discuss its properties. Firstly, the similarity measurement of one object to another is shown in Sect. 3.1.

3.1 Similarity Measurement Between Two Objects

In a formal context \((O\!B, AT, \mathbf{I})\), an object o can be described by a set of attributes \(o\mathbf{I}.\) (called description of object o). And if two objects have same description, they can be regarded as same one [8]. Thus, in order to measure the similarity of object \(o_{i}\) to object o, we only need to measure the similarity between the descriptions of these two objects. The more similar the descriptions of two objects are, the more similar the two objects are.

Definition 5

Let \((O\!B, AT, \mathbf{I})\) be a formal context and o be a reference object. For any object \(o_{i}\in O\!B\), the similarity measurement of \(o_{i}\) to o is defined as

$$\begin{aligned} Sim(o_{i}, o)=\frac{|o_{i}{} \mathbf{I}.\,\cap \,o\mathbf{I}.{|}}{|o\mathbf{I}.\!{|}}. \end{aligned}$$
(6)

The range of the value of this similarity measurement is \(0\le Sim(o_{i},o)\le 1\). The closer the similarity is to 1, the more similar object \(o_{i}\) is to object o; the closer the similarity is to 0, the less similar object \(o_{i}\) is to object o. In other words, the more attributes in description of object o can be used to describe object \(o_{i}\), the more similar object \(o_{i}\) is to object o. Now let us consider the similarity of objects in the extent of an object concept \((o^{**}, o^{*})\) to the object o.

Proposition 1

Let \((O\!B, AT, \mathbf{I})\) be a formal context and \((o^{**}, o^{*})\) be an object concept induced by object o. Then the value of similarity measurement of any object in \(o^{**}\) to object o is 1.

Proof

 Suppose \((o^{**}, o^{*})\) is an object concept of formal context \((O\!B, AT, \mathbf{I})\) and \(o_{i}\in o^{**}\) is an object from the extent of this concept. Since \(o_{i}\in o^{**}\), according to the properties of operator \(*\), we have \(o^{***}\subseteq o_{i}^{*}\) and \(o^{*}=o^{***}\). Thus, we obtain \(o^{*}\subseteq o_{i}^{*}\). That is, \(o\mathbf{I}.\subseteq o_{i}{} \mathbf{I}.\) Hence we have \(o_{i}{} \mathbf{I}.\cap o\mathbf{I}.=o\mathbf{I}\). Thus, \(Sim(o_{i}, o)=\frac{|o_{i}{} \mathbf{I}.\,\cap \,o\mathbf{I}.|}{|o\mathbf{I}.|}= \frac{|o\mathbf{I}.|}{|o\mathbf{I}.|}=1\).

By Proposition 1, the extent \(o^{**}\) of object concept \((o^{**}, o^{*})\) consists of objects whose value of similarity measurement to object o is 1. If we regard object o as a typical object (prototype) of an semantic concept, in order to get all the objects in this prototype view of concepts, we should not only consider objects with similarity value of 1, but also objects with similarity value less than 1.

We use a simple example to illustrate the basic notions and ideas introduced so far.

Example 2

(Continued with Example 1) We set object \(o_{3}\) as a reference object. In the following, we will compute the similarity of each object in \(O\!B\) to reference object \(o_{3}\).

$$\begin{aligned}&Sim(o_{1},o_{3})&=\frac{|o_{1}{} \mathbf{I}.\,\cap \,o_{3}{} \mathbf{I}.\!{|}}{|o_{3}{} \mathbf{I}.\!{|}}=\frac{|\{a,d\}\cap \{a,b,c\}|}{|\{a,b,c\}|}=\frac{1}{3},\\&Sim(o_{2},o_{3})&=\frac{|o_{2}{} \mathbf{I}.\,\cap \,o_{3}{} \mathbf{I}.\!{|}}{|o_{3}{} \mathbf{I}.\!{|}}=\frac{|\{b,d\}\cap \{a,b,c\}|}{|\{a,b,c\}|}=\frac{1}{3},\\&Sim(o_{3},o_{3})&=\frac{|o_{3}{} \mathbf{I}.\,\cap \,o_{3}{} \mathbf{I}.\!{|}}{|o_{3}{} \mathbf{I}.\!{|}}=\frac{|\{a,b,c\}\cap \{a,b,c\}|}{|\{a,b,c\}|}=1,\\&Sim(o_{4},o_{3})&=\frac{|o_{4}{} \mathbf{I}.\,\cap \,o_{3}{} \mathbf{I}.\!{|}}{|o_{3}{} \mathbf{I}.\!{|}}=\frac{|\{b,c\}\cap \{a,b,c\}|}{|\{a,b,c\}|}=\frac{2}{3}. \end{aligned}$$

Then, we check the correctness of Proposition 1. In Example 1, we get four object concepts \((o_{1},ad)\), \((o_{2},bd)\), \((o_{3},abc)\) and \((o_{3}o_{4},bc)\) inducing by objects \(o_{1}\), \(o_{2}\), \(o_{3}\) and \(o_{4}\), respectively. The similarities of objects in extent of object concepts to the objects inducing these concepts are computed as follows:

$$\begin{aligned}&Sim(o_{1},o_{1})=\frac{|o_{1}{} \mathbf{I}.\,\cap \,o_{1}{} \mathbf{I}.\!{|}}{|o_{1}{} \mathbf{I}.\!{|}}=\frac{|\{a,d\}\cap \{a,d\}|}{|\{a,d\}|}=1,\\&Sim(o_{2},o_{2})=\frac{|o_{2}{} \mathbf{I}.\,\cap \,o_{2}{} \mathbf{I}.\!{|}}{|o_{2}{} \mathbf{I}.\!{|}}=\frac{|\{b,d\}\cap \{b,d\}|}{|\{b,d\}|}=1,\\&Sim(o_{3},o_{3})=\frac{|o_{3}{} \mathbf{I}.\,\cap \,o_{3}{} \mathbf{I}.\!{|}}{|o_{3}{} \mathbf{I}.\!{|}}=\frac{|\{a,b,c\}\cap \{a,b,c\}|}{|\{a,b,c\}|}=1,\\&\left\{ \begin{array}{lr} Sim(o_{3},o_{4})=\frac{|o_{3}{} \mathbf{I}.\,\cap \,o_{4}{} \mathbf{I}.\!{|}}{|o_{4}{} \mathbf{I}.\!{|}}=\frac{|\{a,b,c\}\,\cap \,\{b,c\}|}{|\{b,c\}|}=\frac{|\{b,c\}|}{|\{b,c\}|}=1,&{}\\ Sim(o_{4},o_{4})=\frac{|o_{4}{} \mathbf{I}.\,\cap \,o_{4}{} \mathbf{I}.\!{|}}{|o_{4}{} \mathbf{I}.\!{|}}=\frac{|\{b,c\}\,\cap \,\{b,c\}|}{|\{b,c\}|}=1.&{} \end{array} \right. \end{aligned}$$

The computation results are consistent to Proposition 1. That is, the value of similarity of objects in extent of object concept to the object inducing this concept is 1.

3.2 The k-cutting Concept Induced by One Typical Object

The classical view of concepts holds that all instances of a concept share common properties that are necessary and sufficient conditions for defining the concept [14]. However, in our daily life, semantic category in our nature language is not an all-or-none phenomenon. For example, we know that a chair is a more reasonable exemplar of the category furniture than a radio. In other words, the chair is more typical than radio in the category furniture. This is contrary to the assumption that categories are necessarily logical, bounded entities, Rosch [19] has characterized some natural analog categories as internally structured into a prototype (clearest cases, best example of the category) and nonprototype members, with nonprototype members tending toward an order from better to poorer example. Based on the results of Rosch’s study, we summarize the process of human to recognize a semantic concept as follows:

$$\begin{aligned}&step~1{:}~\text {Pick up the typical object (prototype) of the concept }{;}\\&step~2{:}~\text {Calculate the characterized attributes (description) of the typical object }{;}\\&step~3{:}~\text {Calculate the similarity of each object to the typical object }{;}\\&step~4{:}~\text {Put the objects with high similarity into the extent of concept }{.} \end{aligned}$$

The above steps are just a qualitative description of process to obtain semantic concepts. If we want to express this process in a mathematical way, some quantitative index is needed. For example, in step 4, high is a qualitative description. In order to determine which object has high similarity to the typical object, we give a preset threshold k. If the similarity measurement of an object to typical object is bigger than k, this object can be regarded as being highly similar to typical object. Thus, this object can be put into the extent and the corresponding concept is called the k-cutting concept. Since the objects in one extent belong to the same concept, they should possess some common attributes with each other. Thus we strict that k should satisfy \(k>\frac{1}{2}\). The following use of k satisfies these settings.

The above process is easy for us to understand, but it is hard to give a mathematical definition of k-cutting concept directly. In the following, we show the mathematical definition of the k-cutting concept. Firstly, a pair of k-cutting derivation operators are given as follows.

Definition 6

Let \((O\!B, AT, \mathbf{I})\) be a formal context. A pair of k-cutting derivation operators (\(k>\frac{1}{2}\)) for objects set \(O\subseteq O\!B\) and attributes set \(A\subseteq AT\) are defined as:

$$\begin{aligned} O^{*k}= & {} \{a\in AT \mid |a^{*}\cap O|\ge k\cdot |O|\}, \nonumber \\ A^{*k}= & {} \{o\in O\!B \mid |o^{*}\cap A|\ge k\cdot |A|\} \end{aligned}$$
(7)

In Definition 6, the attribute shared by more than \(k\cdot |O|\) objects in O belongs to attributes set \(O^{*k}\); the object possessing more than \(k\cdot |A|\) attributes in A belongs to objects set \(A^{*k}\). In the following, we present the properties of k-cutting derivation operators.

Property 1

Let \((O\!B, AT, \mathbf{I})\) be a formal context. The following properties hold for any objects sets \(O, O_{1}, O_{2}\subseteq O\!B\) and attributes sets \(A, A_{1}, A_{2}\subseteq AT\):

Proof

 The results in G1 and G2 are obvious.

G3. If object set O is a singleton set, then there exists an object \(o_{i}\in O\!B\) satisfying \(O=\{o_{i}\}\). Since \(\{o_{i}\}\) is a singleton set, the result of \(|a^{*}\cap \{o_{i}\}|\), for any \(a\in AT\), is either 0 or 1. According to Definition 6, for any \(k>\frac{1}{2}\), we have \(O^{*k}=\{a\in AT \mid |a^{*}\cap O|\ge k\cdot |O|\}=\{a\in AT \mid |a^{*}\cap \{o_{i}\}|\ge k|\{o_{i}\}|\}=\{a\in AT \mid |a^{*}\cap \{o_{i}\}|\ge k\}\). That is, if attribute \(a\in O^{*k}\), then \(|a^{*}\cap \{o_{i}\}|\ge k\). That means \(|a^{*}\cap \{o_{i}\}|=1\). Thus \(O^{*k}=\{a\in AT \mid |a^{*}\cap \{o_{i}\}|=1\}=\{a\in AT \mid \{o_{i}\}\subseteq a^{*}\}=\{o_{i}\}^{*}=O^{*}\). The rest part can be proved similarly.

G4. For any \(a\in O^{*h}\), according to Definition 6, we have \(|a^{*}\cap O|\ge h\cdot |O|\). Since \(k\le h\), we have \(h\cdot |O|\ge k\cdot |O|\). Thus, \(|a^{*}\cap O|\ge h\cdot |O|\ge k\cdot |O|\). That is, \(a\in O^{*k}\). Because of the arbitrariness of attribute a, we obtain \(O^{*h}\subseteq O^{*k}\). The formula \(k\le h\Rightarrow A^{*h}\subseteq A^{*k}\) can be proved similarly.

G5. From property G4, for any \(k_{i}\ge k\), we have \(O^{*k_{i}}\subseteq O^{*k}\). Hence, we obtain \(\cup _{k\le k_{i}}O^{*k_{i}}\subseteq O^{*k}\). Also, since \(k\le k\), we can get \(O^{*k}\subseteq \cup _{k\le k_{i}}O^{*k_{i}}\). Thus, we obtain \(O^{*k}=\cup _{k\le k_{i}}O^{*k_{i}}\). Analogously, from property G4, for any \(k_{j}\le k\), we have \(O^{*k}\subseteq O^{*k_{j}}\). Hence, we obtain \(O^{*k}\subseteq \cap _{k_{j}\le k}O^{*k_{j}}\). Also, since \(k\le k\), we can get \(\cap _{k_{j}\le k}O^{*k_{j}}\subseteq O^{*k}\). Thus, we obtain \(O^{*k}=\cap _{k_{j}\le k}O^{*k_{j}}\). The rest part \(A^{*k}=\cup _{k\le k_{i}}A^{*k_{i}}=\cap _{k_{j}\le k}A^{*k_{j}}\) can be proved similarly.

G6. For any \(o_{i}\in O\), from property of operator \(*\), we have \(O^{*}\subseteq o_{i}^{*}\). Consequently, we obtain \(o_{i}^{*}\cap O^{*}=O^{*}\). Thus, \(|o_{i}^{*}\cap O^{*}|=|O^{*}|\ge k|O^{*}|\) holds no matter what value k has. According to Definition 6, we can get \(o_{i}\in O^{**k}\). The rest part \(A\subseteq A^{**k}\) can be proved similarly.

Then based on the k-cutting derivation operators, the definition of k-cutting concept induced by one prototype (we will simply call it k-cutting concept if there is no confusion) is given as follows.

Definition 7

Let \((O\!B, AT, \mathbf{I})\) be a formal context. The k-cutting concept induced by one typical object o is defined as \((\hat{o}^{**k}, o^{*})\). \(\hat{o}^{**k}\) and \(o^{*}\) are called extent and intent of k-cutting concept \((\hat{o}^{**k}, o^{*})\). Here, \(\hat{o}^{**k}= (o_{i}, m(o_{i})),~o_{i}\in o^{**k}\) is a set of objects in \(o^{**k}\) accompanied with a membership value.

Specifically, the element of \(\hat{o}^{**k}\) is an object-membership pair \((o_{i}, m(o_{i}))\), in which \(o_{i}\in o^{**k}\) and \(m(o_{i})\) is the membership of object \(o_{i}\) belonging to the k-cutting concept \((\hat{o}^{**k}, o^{*})\). The membership can be measured in different ways, and the most common way is using the similarity measurement value of object \(o_{i}\) to typical object o. In following analysis, for convenience, we can regard \(o^{**k}\) instead of \(\hat{o}^{**k}\) as an extent of k-cutting concept \((\hat{o}^{**k}, o^{*})\). That is, we can rewrite k-cutting concept \((\hat{o}^{**k}, o^{*})\) as \((o^{**k}, o^{*})\).

From Definition 7, the intent of the k-cutting concept \((o^{**k}, o^{*})\) is a set of attributes which is the description of typical object o and the extent of the concept is a set of object-membership pairs in which the description of object contains more than \(k\cdot |o^{*}|\) attributes in description of o. The set of all k-cutting concepts induced by one typical object in formal context \((O\!B, AT, \mathbf{I})\) is denoted by \(OCC_{k}(O\!B, AT, \mathbf{I})\). Now we check the similarity of any object in k-cutting concept given in Definition 7 to verify its rationality.

Theorem 2

Let \((O\!B, AT, \mathbf{I})\) be a formal context and \((o^{**k}, o^{*})\) is a k-cutting concept. An object \(o_{i}\in O\!B\) belongs to \(o^{**k}\) if and only if the similarity of \(o_{i}\) to o is bigger than k. That is, \(Sim(o_{i}, o)\ge k\).

Proof

 According to Definition 6, we have \(o_{i}\in o^{**k}\) is equivalent to \(|o_{i}^{*}\cap o^{*}|\ge k\cdot |o^{*}|\). Since we assumed that the formal context in this paper is canonical, we have \(o^{*}\ne \emptyset \), that is, \(|o^{*}|\ne 0\). Thus, both sides of the inequality \(|o_{i}^{*}\cap o^{*}|\ge k\cdot |o^{*}|\) can be divided by \(|o^{*}|\). The result is \(\frac{|o_{i}^{*}\cap o^{*}|}{ |o^{*}|}\ge k\). That is, \(Sim(o_{i}, o)\ge k\). Thus, \(o_{i}\in o^{**k}\) holds if and only if \(Sim(o_{i}, o)\ge k\) holds.

Remark 1

The higher the value of similarity the object has, the closer it is to the typical object. However, the similarity measure can not be used to decide whether or not the object is a prototype or typical object. That is, for some object, the value of similarity measurement is 1, but it is not a prototype of this concept, since it has more attributes than the attributes in intent.

At the beginning of Sect. 3.2, we discussed that since all the objects in one extent belong to a same concept, they should possess some common attributes with each other. Hence, we restrict the value of \(k>\frac{1}{2}\). The following proposition shows that the restriction of k guarantees the existence of common attributes of a concept.

Proposition 2

Let \((o^{**k}, o^{*})\) be a k-cutting concept. If \(k>\frac{1}{2}\), then, for any \(o_{1}, o_{2}\in o^{**k}\), we have \(o_{1}^{*}\cap o_{2}^{*}\cap o^{*}\ne \emptyset \).

Proof

 Because of \(o_{1}, o_{2}\in o^{**k}\), based on Definition 6, we have \(|o_{1}^{*}\cap o^{*}|\ge k\cdot |o^{*}|\) and \(|o_{2}^{*}\cap o^{*}|\ge k\cdot |o^{*}|\). Since we suppose \(k>\frac{1}{2}\), the formulas \(|o_{1}^{*}\cap o^{*}|>\frac{|o^{*}|}{2}\) and \(|o_{2}^{*}\cap o^{*}|>\frac{|o^{*}|}{2}\) can be obtained. Thus, \(|o_{1}^{*}\cap (o_{2}^{*}\cap o^{*})|=|(o_{1}^{*}\cap o^{*})\cap (o_{2}^{*}\cap o^{*})|=|(o_{1}^{*}\cap o^{*})|+ |(o_{2}^{*}\cap o^{*})|-|(o_{1}^{*}\cap o^{*})\cup (o_{2}^{*}\cap o^{*})|>\frac{|o^{*}|}{2}+\frac{|o^{*}|}{2}-|(o_{1}^{*}\cap o^{*})\cup (o_{2}^{*}\cap o^{*})|>|o^{*}|-|o^{*}|=0\). That is, \(o_{1}^{*}\cap o_{2}^{*}\cap o^{*}\ne \emptyset \).

The result \(o_{1}^{*}\cap o_{2}^{*}\cap o^{*}\ne \emptyset \) in Proposition 2 can be rewritten as \((o_{1}^{*}\cap o^{*})\cap (o_{2}^{*}\cap o^{*})\ne \emptyset \). This proposition shows that in order to let objects in the same concept have common attributes, the value of k should satisfy \(k>\frac{1}{2}\). These common attributes are the most important characters of the concept, since they can reflect the commonness of objects in extent.

We continue with Example 2 to demonstrate the ideas of k-cutting concept induced by one typical object and to verify the correctness of Theorem 2 and Proposition 2.

Example 3

According to Definition 7, the \(\frac{2}{3}\)-cutting concepts induced by typical object \(o_{1}\), \(o_{2}\), \(o_{3}\), and \(o_{4}\) are \((\{(o_{1},1)\},ad)\), \((\{(o_{2},1)\},bd)\), \((\{(o_{3},1),(o_{4},\frac{2}{3})\},abc)\), and \((\{(o_{3},1),(o_{4},1)\},bc)\). Compared with the classical object concept induced by object \(o_{3}\) whose extent only contains object \(o_{3}\), the \(\frac{2}{3}\)-cutting concepts induced by object \(o_{3}\) contains objects \(o_{3}\) and \(o_{4}\). And these two objects have different memberships. Since object \(o_{3}\) is the prototype of this concept, its membership is 1. The membership of object \(o_{4}\) is \(\frac{2}{3}\). The \(\frac{2}{3}\)-cutting concept can be regarded as a more general concept than the classical concept. The object in k-cutting concept induced by object o does not need to possess all the attributes in description of object o. We only restrict that the description of any object in extent of k-cutting concept contains more than \(k\cdot |o|\) attributes in the description of prototype o.

We use \((\{(o_{3},1),(o_{4},\frac{2}{3})\},abc)\), the \(\frac{2}{3}\)-cutting concept induced by typical object \(o_{3}\), as an example to show the correctness of Theorem 2. According to the similarity measurement calculated in Example 2, we have \(Sim(o_{1},o_{3})=\frac{1}{3}<\frac{2}{3}\), \(Sim(o_{2},o_{3})=\frac{1}{3}<\frac{2}{3}\), \(Sim(o_{3},o_{3})=1\ge \frac{2}{3}\) and \(Sim(o_{4},o_{3})=\frac{2}{3}\ge \frac{2}{3}\). Based on Theorem 2, only objects \(o_{3}\) and \(o_{4}\) belong to the extent of \(\frac{2}{3}\)-cutting concept \((\{(o_{3},1),(o_{4},\frac{2}{3})\},abc)\), which is consistent with our calculation by Definition 7. Also, from \((\{(o_{3},1),(o_{4},1)\},bc)\), the \(\frac{2}{3}\)-cutting concept induced by typical object \(o_{4}\), we can see that the similarity of object \(o_{3}\) to typical object \(o_{4}\) is 1, but object \(o_{3}\) is not the typical object of this concept, since its description is \(\{a,b,c\}\) which is bigger than \(\{b,c\}\), description of typical object \(o_{4}\).

We will check the correctness of Proposition 2 in the following. Since \(k=\frac{2}{3}>\frac{1}{2}\), every two objects in \(o_{3}^{**\frac{2}{3}}\) should have common attributes. Based on Table 1, we have \(o_{3}^{**\frac{2}{3}}=\{o_{3},o_{4}\}\), and we can calculate \((o_{3}^{*}\cap o_{3}^{*})\cap (o_{4}^{*}\cap o_{3}^{*})=\{a,b,c\}\cap \{b,c\}=\{b,c\}\ne \emptyset \). The results are consistent to Proposition 2.

4 Conclusion

In formal concept analysis, the formal concept is a mathematical formation of the classical view of concept and reflects a semantic meaning “commonly possessing”. However, in our daily life, the prototype view of concepts is more common and just reflects the meaning of “mostly possessing”. In this paper, we discussed the similarity between two objects and defined the mathematical formation of the prototype of concepts, named k-cutting concepts. Moreover, the properties of this newly proposed concept are studied and its rationality is discussed.

The results of this paper suggest several future research topics. It is interesting to investigate the structure of k-cutting concepts and the k-cutting concepts can be generalized as the k-cutting concepts induced by a group of typical objects.