Keywords

1 Introduction

In the real application, depending on the context, a given object may be characterized by different values of attributes or different set of attributes. In this case, different set of attributes are considered as different contexts in which they may also provide different values of attributes for a given object based on different perceptions. Here, context may be viewed as background or situation by which somehow it is necessary to group some attributes as a subset of attributes and consider the subset as a certain context. For instance, considering human beings as a universal set of objects, every person (object) might be characterized by several sets of attributes corresponding to some contexts such as his or her status as employee, student, club member, family member, etc. In the context of student, student’s set of attributes might be regarded as {ID-Number, Name, Address, Supervisor, Major, etc.}. We may consider different sets of attributes in the relation to the contexts of both employee and family member. Using the same example of human beings as universal set of objects, in the relation to perception-based data especially for fuzzy data, attributes such as weight, age and height might have different fuzzy values in describing a given certain object depending on contexts (perceptions) of Japanese, American and so on. For instance, Japanese may consider height of 175 cm as {high}, but American may consider it as {medium}. Therefore, it is necessary to consider multi-contextual information systems as an extension of information system (see Sect. 3.3). Here, every context as represented by a certain set of attributes may provide a certain partition of objects. Consequently, n contexts (n subsets of attributes) will provide n partitions. A given set of object, X, may then be approximated into n pairs of lower and upper approximations, called multi rough sets of X as already proposed in [4, 5]. In the relation to the concept of multi rough sets, more properties and more set operations are proposed and examined. Primary concern is also given to the generalization of contexts in the presence of multi-contextual information systems. Furthermore, three general contexts, namely AND-general context, OR-general context and OR+ general context, are recalled. It can be proved that AND-general context and OR+-general context provide (disjoint) partitions. On the other hand, OR+-general context provides covering of the universe. Then, a summarized rough set of a given crisp set of objects is able to be derived from partitions as well as covering of the general contexts. Finally, relations among three general contexts are examined and summarized.

2 Concept of Rough Sets

Rough set theory, introduced by Pawlak in 1982 [6], plays essential roles in many applications of Data Mining and Knowledge Discovery. The theory offers mathematical tools to discover hidden patterns in data, recognize partial or total dependencies in data bases, remove redundant data, and others [7]. Rough set generalizes classical (crisp) set by providing an alternative to formulate sets with imprecise boundaries. A rough set is basically an approximate expression of a given crisp set in terms of two crisp subsets derived from a crisp partition defined on the universal set involved [3]. Two subsets are denoted as the lower and upper approximation. Every elemets in the crisp partition is related based on equivalence relation. Thus, an element belongs to the only one equivalence class and two distinct equivalence classes are disjoint. Formally, the concept of rough sets is defined precisely as follows. Let U denotes a non-empty universal set, and let R be an equivalence relation on U. The partition of the universe is referred to as the quotient set and is denoted by U/R, where [x] R represents the equivalence class in U/R that contains xU. A rough set of subset A ⊆ U is represented by a pair of lower and upper approximation. The lower approximation [6],

$$ \begin{aligned} Lo\left( A \right) & = \{ x \in U|\left[ x \right]_{R} \subseteq A\} , \\ & = \{ \left[ x \right]_{R} \in U/R|\left[ x \right]_{R} \subseteq A\} , \\ \end{aligned} $$
(3.1)

is the union of all equivalence classes in U/R that are contained in A. The upper approximation [6],

$$ \begin{aligned} Up\left( A \right) & = \left\{ {x \in U|\left[ x \right]_{R} { \cap }A = \emptyset } \right\}, \\ & = \left\{ {\left[ x \right]_{R} \in U/R|\left[ x \right]_{R} { \cap }A = \emptyset } \right\}, \\ \end{aligned} $$
(3.2)

is the union of all equivalence classes in U/R that overlap with A.

3 Multi-contextual Information Systems

A Multi Rough Sets is considered as a generalized concept of rough set. A multi rough sets is constructed when a given set of objects is approximated into several partitions of objects in which every partition is constructed by a certain context of attributes. Every context of attributes is represented by a set of attributes. Clearly, multi-contexts of attributes may provide partitions of multi-contexts that are generated from multi-contextual information systems. Here, multi-contextual information systems [4, 5] is formally defined by a pair I = (U, A), where U is a universal set of objects and A is a non-empty set of contexts such as A = {A 1 , …, A n }. A i A is a certain set of attributes and denoted as a context. Every attribute, aA i , is associated with a set of V a as its attribute values called domain of a. It is not necessary for i ≠ j ⇒ A i ∩ A j  = . Attributes such as weight and height might be represented by different contexts (i.e. context of Japanese and context of American) in which they may provide different perceptions or values of a certain attribute concerning a given object. Therefore, for all xU, a(x)iV a is denoted as the value of objects x in terms of attribute a based on the context aA i . An equivalence relation (indiscernibility relation) is then defined in terms of context A i as follows.

For x, yU [4],

$$ R_{{A_{i} }} (x,y) \Leftrightarrow a(x)^{i} = a(y)^{i} ,\quad a(x)^{i} ,a(y)^{i} \in V_{a} ,\quad {\forall }a{ \in }A_{i} . $$
(3.3)

Equivalence class of xU in the context A i is given by [4]

$$ [x]_{{A_{i} }} = \left\{ {y \in U|R_{{A_{i} }} \left( {x,y} \right)} \right\}. $$
(3.4)

It should be proved that for i ≠ j, ∃xU, [x] Ai  ≠ [x] Aj , otherwise A i and A j are redundants in terms of providing similar crisp partitions. By eliminating all redundant contexts, the number of contexts in the relation to the number of objects satisfies the following equation [4].

$$ {\text{For}}|U| = m,\quad |{\mathbf{A}}| \le B(m),\quad B(m) = \sum\limits_{i = 0}^{m - 1} {C(m - 1,i) \times B(i),} $$
(3.5)

where B(0) = 1 and C(n, k) is combination of size k from n elements given by:

$$ C\left( {n,k} \right) = \frac{n!}{{k!\left( {n - k} \right)!}}. $$

|U| and |A| are cardinalities of U and A representing the number of objects and the number of contexts, respectively. It can be clearly seen that a set of contexts A derives set of partitions of universal objects as given by {U/A 1 , …, U/A n }, where U/A i expresses a partition of the universe based on context A i . Here, U/A i contains all equivalence classes of [x] Ai , xU.

4 Concept of Multi Rough Set

A multi rough set is introduced as an approximation of a given crisp set in the presence of a set of partitions derived from multi-contextual information systems providing set of rough sets corresponding to the set of partitions. Here, the multi rough sets may be provided regardless of redundant contexts in multi-contextual information systems. Clearly, every element of the multi rough set is a pair of lower and upper approximation in the relation to a given context of attributes. Formally, multi rough set is given by the following definition.

Definition 3.1

Let U be a non-empty universal set of objects. \( R_{{A_{i} }} \) and \( U/R_{{A_{i} }} \) represent equivalence relation and partition with respect to set of attributes in the context of A i , respectively. For X ⊆ U, in the relation to a set of contexts, A = {A 1 , A 2 , …, A n }, X is multi rough set of X as given by the following equation [4, 5].

$$ X = \left\{ {\left( {Lo\left( {X_{1} } \right),Up\left( {X_{1} } \right)} \right),\left( {Lo\left( {X_{2} } \right),Up\left( {X_{2} } \right)} \right), \ldots ,\left( {Lo\left( {X_{n} } \right),Up\left( {X_{n} } \right)} \right)} \right\}. $$
(3.6)

Thus, a pair of lower and upper approximations, \( \left( {Lo\left( {X_{i} } \right),Up\left( {X_{i} } \right)} \right) \) is an element of multi rough set in terms of context A i . Similar to the definition of classical rough set, \( Lo\left( {X_{i} } \right) \) and \( Up\left( {X_{i} } \right) \) are then defined by following equations [4]

$$ Lo(X_{i} ) = \{ u{ \in }U|[u]_{{A_{i} }} \subseteq X\} = \bigcup {\{ [u]_{{A_{i} }} { \in }U/A_{i} } |[u]_{{A_{i} }} \subseteq X\} , $$
(3.7)

and

$$ Up(X_{i} ) = \{ u \in U|[u]_{{A_{i} }} { \cap }X \ne {\emptyset }\} = \bigcup {\{ [u]_{{A_{i} }} \in U/A_{i} } |[u]_{{A_{i} }} { \cap }X \ne {\emptyset }\} , $$
(3.8)

respectively. Similar to crisp multi-set (bags) as discussed in [8], a multi rough set, X is characterized by a counting function Σ X as given by:

$$ \sum\nolimits_{\text{X}} {\mathbf{P}} (U)^{2} \to {\text{N,}} $$
(3.9)

where P (U) is power set of U and N is a set of non-negative integers. Here, Σ X ((M,N)) counts number of occurrences the pair (M,N) in the multi rough set X for any pair of lower and upper approximations (M,N) ∈ P (U)2. It should be verified that

$$ (M,N) \notin {\mathbf{X}} \Rightarrow \sum\nolimits_{{\mathbf{X}}} {((M,N)) = 0} $$

Also, X* denotes a support set of X. In its definition, X* satisfies the following equation:

$$ (M,N){ \in }{\mathbf{X}}* \Leftrightarrow \sum\nolimits_{{\mathbf{X}}} {((M,N)) > 0,} $$
(3.10)

where \( {\forall }(M,N){ \in }{\mathbf{X}}^{*} ,\sum\nolimits_{{{\mathbf{X}}*}} {((M,N)) = 1.} \) It can be also verified that if X = X* then there is no redundancy in the set of contexts A, not vice versa. It is necessary to define some basic relations and operations concerning sets of pair lower and upper approximations as elements of multi rough set. For X and Y are two multi rough sets on U drawn from multi-contextual information systems A, where |A| = n [4, 5]:

  1. i.

    Containment: \( {\mathbf{X}} \subseteq {\mathbf{Y}} \Leftrightarrow (Lo(X_{i} ) \subseteq Lo(Y_{i} ),Up(X_{i} ) \subseteq Up(Y_{i} )),{\forall }i{ \in }{\text{N}}_{n} ; \)

  2. ii.

    Equality: \( {\mathbf{X}} = {\mathbf{Y}} \Leftrightarrow (Lo(X_{i} ) = Lo(Y_{i} ),Up(X_{i} ) = Up(Y_{i} )),\forall i \in {\text{N}}_{n} ; \)

  3. iii.

    Complement: \( {\mathbf{Y}} = \neg {\mathbf{X}} \Leftrightarrow (Lo(Y_{i} ) = U - Up(X_{i} ),Up(Y_{i} ) = U - Lo(X_{i} )),{\forall }i{ \in }{\text{N}}_{n} ; \)

  4. iv.

    Union: \( {\mathbf{X}}{ \cup }{\mathbf{Y}} \Leftrightarrow \{ (Lo(X_{i} ){ \cup }Lo(Y_{i} ),Up(X_{i} ){ \cup }Up(Y_{i} ))|{\forall }i{ \in }{\text{N}}_{n} \} ; \)

  5. v.

    Intersection: \( {\mathbf{X}}{ \cap }{\mathbf{Y}} \Leftrightarrow \{ (Lo(X_{i} ){ \cap }Lo(Y_{i} ),Up(X_{i} ){ \cap }Up(Y_{i} ))|{\forall }i{ \in }{\text{N}}_{n} \} ; \)

where N n is a non-negative set of integers which is less or equal to n. Obviously, the operations given in (i)–(v) are strongly related to the order of elements corresponding to set of contexts. In the relation to the occurrences of elements with regardless of the order of elements in the multi rough set, some more basic operations are defined as follows.

  1. (a)

    Containment: \( {\mathbf{X}} \prec {\mathbf{Y}} \Leftrightarrow \sum\nolimits_{\text{X}} {((M,N))} \le \sum\nolimits_{\text{Y}} {((M,N))} ,{\forall }(M,N); \)

  2. (b)

    Equality: \( {\mathbf{X}} \equiv {\mathbf{Y}} \Leftrightarrow \sum\nolimits_{\text{X}} {((M,N))} = \sum\nolimits_{\text{Y}} {((M,N))} ,{\forall }(M,N); \)

  3. (c)

    Union: \( \sum\nolimits_{{{\mathbf{X}} \oplus {\mathbf{Y}}}} {((M,N))} = {\mathbf{max}}\left[ {\varSigma_{{\mathbf{X}}} ((M,N)),\varSigma_{{\mathbf{Y}}} ((M,N))} \right],{\forall }(M,N); \)

  4. (d)

    Intersection: \( \sum\nolimits_{{\text{X} \otimes \text{Y}}} {((M,N))} = {\mathbf{min}}\left[ {\varSigma_{\text{X}} ((M,N)),\varSigma_{\text{Y}} ((M,N))} \right],\forall (M,N); \)

  5. (e)

    Insertion: \( \sum\nolimits_{{{\mathbf{X}} + {\mathbf{Y}}}} {((M,N))} = \varSigma_{{\mathbf{X}}} ((M,N)) + \varSigma_{{\mathbf{Y}}} ((M,N)),{\forall }(M,N){ \in }{\mathbf{X}} \oplus {\mathbf{Y}}; \)

  6. (f)

    Minus: \( \sum\nolimits_{{{\mathbf{X}} - {\mathbf{Y}}}} {((M,N))} = {\mathbf{max}}\left[ {\varSigma_{{\mathbf{X}}} ((M,N)) - \varSigma_{{\mathbf{Y}}} ((M,N)),0} \right],{\forall }(M,N){ \in }{\mathbf{X}}; \)

It can be proved that the above basic operations satisfy the following properties:

  1. 1.

    Idempotent laws:

    $$ {\mathbf{X}}{ \cup }{\mathbf{X}} = {\mathbf{X}},\quad {\mathbf{X}}{ \cap }{\mathbf{X}} = {\mathbf{X}},\quad {\mathbf{X}} \oplus {\mathbf{X}} = {\mathbf{X}},\quad {\mathbf{X}} \otimes {\mathbf{X}} = {\mathbf{X}}; $$
  2. 2.

    Commutative laws:

    $$ \begin{aligned} & {\mathbf{X}}{ \cup }{\mathbf{Y}} = {\mathbf{Y}}{ \cup }{\mathbf{X}},\quad {\mathbf{X}}{ \cap }{\mathbf{Y}} = {\mathbf{X}}{ \cap }{\mathbf{Y}},\quad {\mathbf{X}} \oplus {\mathbf{Y}} = {\mathbf{Y}} \oplus {\mathbf{X}}, \\ & {\mathbf{X}} \otimes {\mathbf{Y}} = {\mathbf{Y}} \otimes {\mathbf{X}},\quad {\mathbf{X}} + {\mathbf{Y}} = {\mathbf{X}} + {\mathbf{Y}}; \\ \end{aligned} $$
  3. 3.

    Associative laws:

    $$ \begin{aligned} & {\mathbf{W}}{ \cup }({\mathbf{X}}{ \cup }{\mathbf{Y}}) = ({\mathbf{W}}{ \cup }{\mathbf{Y}}){ \cup }{\mathbf{X}},\quad {\mathbf{W}}{ \cap }({\mathbf{X}}{ \cap }{\mathbf{Y}}) = ({\mathbf{W}}{ \cap }{\mathbf{X}}){ \cap }{\mathbf{Y}}, \\ & {\mathbf{W}} \oplus ({\mathbf{X}} \oplus {\mathbf{Y}}) = ({\mathbf{W}} \oplus {\mathbf{Y}}) \oplus {\mathbf{X}},\quad {\mathbf{W}} \otimes ({\mathbf{X}} \otimes {\mathbf{Y}}) = ({\mathbf{W}} \otimes {\mathbf{Y}}) \otimes {\mathbf{X}}, \\ & {\mathbf{W}} + ({\mathbf{X}} + {\mathbf{Y}}) = ({\mathbf{W}} + {\mathbf{Y}}) + {\mathbf{X}}; \\ \end{aligned} $$
  4. 4.

    Absorption laws:

    $$ \begin{aligned} & {\mathbf{X}}{ \cap }({\mathbf{X}}{ \cup }{\mathbf{Y}}) = {\mathbf{X}},\quad {\mathbf{X}}{ \cup }({\mathbf{X}}{ \cap }{\mathbf{Y}}) = {\mathbf{X}}, \\ & {\mathbf{X}}{ \otimes }({\mathbf{X}} \oplus {\mathbf{Y}}) = {\mathbf{X}},\quad {\mathbf{X}} \oplus ({\mathbf{X}} \otimes {\mathbf{Y}}) = {\mathbf{X}}, \\ & {\mathbf{X}} \otimes ({\mathbf{X}} + {\mathbf{Y}}) = {\mathbf{X}},\quad {\mathbf{X}} \oplus ({\mathbf{X}} + {\mathbf{Y}}) = {\mathbf{X}} + {\mathbf{Y}}; \\ \end{aligned} $$
  5. 5.

    Distributive laws:

    $$ \begin{aligned} & {\mathbf{W}}{ \cup }({\mathbf{X}} \cap Y) = ({\mathbf{W}} \cup {\mathbf{X}}) \cap ({\mathbf{W}} \cup {\mathbf{Y}}), \, \\ &{\mathbf{W}}{ \cap }({\mathbf{X}} \cup Y) = ({\mathbf{W}} \cap {\mathbf{X}}) \cup ({\mathbf{W}} \cap {\mathbf{Y}}), \, \\ & {\mathbf{W}} \oplus ({\mathbf{X}} \otimes Y) = ({\mathbf{W}} \oplus {\mathbf{X}}) \otimes ({\mathbf{W}} \oplus {\mathbf{Y}}), \, \\& {\mathbf{W}} \otimes ({\mathbf{X}} \oplus Y) = ({\mathbf{W}} \otimes {\mathbf{X}}) \oplus ({\mathbf{W}} \otimes {\mathbf{Y}}), \, \\ &{\mathbf{W}} + ({\mathbf{X}} \otimes Y) = ({\mathbf{W}} + {\mathbf{X}}) \otimes ({\mathbf{W}} + {\mathbf{Y}}), \, \\& {\mathbf{W}} + ({\mathbf{X}} \oplus Y) = ({\mathbf{W}} + {\mathbf{X}}) \oplus ({\mathbf{W}} + {\mathbf{Y}}); \, \end{aligned} $$
  6. 6.

    Additive laws:

    $$ {\mathbf{X}} \oplus {\mathbf{Y}} = {\mathbf{X}} + {\mathbf{Y}} - ({\mathbf{X}} \otimes {\mathbf{Y}}); $$
  7. 7.

    Double negation law:

    $$ \neg \neg {\mathbf{X}} = {\mathbf{X}}; $$
  8. 8.

    De Morgan laws:

    $$ \neg ({\mathbf{X}}{ \cap }{\mathbf{Y}}) = \neg {\mathbf{X}}{ \cup }\neg {\mathbf{Y}},\quad \neg ({\mathbf{X}}{ \cup }{\mathbf{Y}}) = \neg {\mathbf{X \cap }}\neg {\mathbf{Y}}; $$
  9. 9.

    Maximum multi rough sets (U = {(U,U),…}, |U| = |X|);

    $$ {\mathbf{X}}{ \cap }{\mathbf{U}} = {\mathbf{X}},\quad {\mathbf{X}}{ \cup }{\mathbf{U}} = {\mathbf{U}}; $$
  10. 10.

    Minimum multi rough sets (E = {(∅,∅),…}, |E| = |X|);

    $$ {\mathbf{X}}{ \cap }{\mathbf{E}} = {\mathbf{E}},\quad {\mathbf{X}}{ \cup }{\mathbf{E}} = {\mathbf{X}}; $$
  11. 11.

    Kleene’s laws:

    $$ ({\mathbf{X}}{ \cap }\neg X){ \cap }({\mathbf{Y}}{ \cup }\neg {\mathbf{Y}}) = ({\mathbf{X}}{ \cap }\neg {\mathbf{X}}),\quad ({\mathbf{X \cap }}\neg {\mathbf{X}}){ \cup }({\mathbf{Y}}{ \cup }\neg {\mathbf{Y}}) = ({\mathbf{Y}}{ \cup }\neg {\mathbf{Y}}); $$

Since basic operations defined in (iii)–(v) do not satisfy complementary laws (\( \left( {{\mathbf{X}}{ \cap }\neg {\mathbf{X}}} \right) \ne {\mathbf{E}} \) and \( \left( {{\mathbf{X}}{ \cup }\neg {\mathbf{X}}} \right) \ne {\mathbf{U}} \)), they do not satisfy Boolean algebra but just Kleene algebra instead. We may apply operations of union and intersection to all pair elements of multi rough sets X in order to achieve a summary of the multi rough set as given by the following definition [4, 5]:

$$ \Gamma \left( X \right) = \bigcup\limits_{i} {Up\left( {X_{i} } \right),} $$
(3.11)
$$ \Theta \left( X \right) = \bigcap\limits_{i} {Up\left( {X_{i} } \right)} , $$
(3.12)
$$ \Phi \left( X \right) = \bigcup\limits_{i} {Lo\left( {X_{i} } \right),} $$
(3.13)
$$ \Psi (X) = \bigcap\limits_{i} {Lo\left( {X_{i} } \right)} , $$
(3.14)

where \( Lo\left( {\mathbf{X}} \right) = \left\{ {\left( {\Phi \left( X \right),\Psi \left( X \right)} \right)} \right\} \) and \( Up\left( {\mathbf{X}} \right) = \left\{ {\left( {\Gamma \left( X \right),\Theta \left( X \right)} \right)} \right\} \) are summarized multi rough set by which they have only one pair elements. It can be easily proved that their relationship satisfies the following equation:

$$ \Psi (X) \subseteq\Phi (X) \subseteq X \subseteq\Theta (X) \subseteq\Gamma (X). $$

Here, pair of \( (\Phi (X),\;\Theta (X)) \) may be considered as a finer approximation of \( X \subseteq U \). On the other hand, pair of \( (\Psi (X),\;\Gamma (X)) \) is a worse approximation of \( X \subseteq U \). Moreover, it can be followed that the definition of summary multi rough set satisfies some properties such as [4]:

  1. (1)

    \( X \subseteq Y \Leftrightarrow [\Psi \left( X \right) \subseteq\Psi \left( Y \right),\,\Phi \left( X \right) \subseteq\Phi \left( Y \right),\,\Theta \left( X \right) \subseteq\Theta \left( Y \right),\,\Gamma \left( X \right) \subseteq\Gamma \left( Y \right)], \)

  2. (2)

    \( \Psi \left( X \right) = \neg\Gamma \left( {\neg X} \right),\,\Phi \left( X \right) = \neg\Theta \left( {\neg X} \right),\,\Theta \left( X \right) = \neg\Phi \left( {\neg X} \right),\,\Gamma \left( X \right) = \neg\Psi \left( {\neg X} \right), \)

  3. (3)

    \( \Psi \left( U \right) =\Phi \left( U \right) =\Theta \left( U \right) =\Gamma \left( U \right) = U,\,\Psi (\emptyset ) =\Phi (\emptyset ) =\Theta (\emptyset ) =\Gamma (\emptyset ) = \emptyset , \)

  4. (4)

    \( \begin{aligned}\Psi \left( {X{ \cap }Y} \right) = &\Psi \left( X \right){ \cap \varPsi }\left( Y \right), \\\Phi \left( {X{ \cap }Y} \right) = &\Phi \left( X \right){ \cap \varPhi }\left( Y \right), \\\Theta \left( {X{ \cap }Y} \right) \le &\Theta \left( X \right){ \cap \varTheta }\left( Y \right), \\\Gamma \left( {X{ \cap }Y} \right) \le &\Gamma \left( X \right) \le\Gamma \left( Y \right), \\ \end{aligned} \)

  5. (5)

    \( \begin{aligned}\Psi (X{ \cup }Y) \ge &\Psi \left( X \right) +\Psi \left( Y \right) -\Psi \left( {X{ \cap }Y} \right), \\\Phi (X{ \cup }Y) \ge &\Phi \left( X \right) +\Phi \left( Y \right) -\Phi \left( {X \cap Y} \right), \\\Phi (X \cup Y) \le &\Phi \left( X \right) +\Phi \left( Y \right) -\Phi \left( {X{ \cap }Y} \right), \\\Gamma (X{ \cup }Y) \le &\Gamma \left( X \right) +\Gamma \left( Y \right) -\Gamma \left( {X{ \cap }Y} \right). \\ \end{aligned} \)

Furthermore, we may consider two special characteristics of context, namely total ignorance and identity, as follows.

  1. 1.

    A i is called total ignorance (τ) if xU, [x] τ  = U.

    Therefore ∀X ⊆ U, X ≠ ∅ ⇒ Lo(X τ ) = ∅, Up(X τ ) = U.

  2. 2.

    A i is called identity (ι) if ∀x ∈ U, [x] ι  = {x}.

    Therefore, ∀X ⊆ U ⇒ Lo(X ι ) = Up(X ι ) = X.

Clearly, in the relation to intersection and union operations, it is also satisfied some properties as follows. ∀A i A, X ⊆ U [4],

  • Union: X ≠ ∅ ⇒ Up(X i ) ∪ Up(X τ ) = U, Lo(X i ) ∪ Lo(X τ ) = Lo(X i ),

    $$ Up\left( {X_{i} } \right){ \cup }Up\left( {X_{\iota } } \right) = Up\left( {X_{i} } \right),Lo\left( {X_{i} } \right){ \cup }Lo\left( {X_{\iota } } \right) = X. $$
  • Intersection: Up(X i ) ∩ Up(X τ ) = Up(X i ), Lo(X i ) ∩ Lo(X τ ) = ∅,

$$ Up\left( {X_{i} } \right){ \cap }Up\left( {X_{\iota } } \right) = X,\quad Lo\left( {X_{i} } \right){ \cap }Lo\left( {X_{\iota } } \right) = Lo\left( {X_{i} } \right). $$

From the above relations dealing with union and intersection operations, ι is considered as identity context for intersection operation of lower approximation as well as for union operation of upper approximation. On the other hand, τ is considered as an identity context for intersection operation of upper approximation as well as for union operation of lower approximation.

Moreover, it is necessary to define two count functions in order to characterize multi rough sets based on the number of objects (elements of U) as follows:

Definition 3.2

η X : U  N n and σ X : U  N n are defined as two functions to characterize multi rough set by counting total number of copies of a given element of U in upper and lower sides of multi rough set X, respectively, as given by:

$$ \eta_{X} \left( x \right) = \sum\limits_{i}^{n} {\theta_{{Up\,\left( {X_{i} } \right)}} \left( x \right),} $$
(3.15)
$$ \sigma_{X} \left( x \right) = \sum\limits_{i}^{n} {\theta_{{Lo(X_{i} )}} \left( x \right),} $$
(3.16)

where |A| = n and \( \theta_{M} \left( x \right) = 1 \Leftrightarrow x \in M \), otherwise \( \theta_{M} \left( x \right) = 0 \).

These count functions are similar to one proposed in (Yager 1990) talking about bags (multi-set). Similar results will be found by firstly taking insertion operation to all lower side yielding a multi-set of lower side as well as all upper side yielding a multi-set of upper side. Then, the counting function is used to calculate number of copies each element in both multi-sets. In the relation to summary rough sets as discussed before, these two count functions, η and σ, satisfy some properties such as for X, YU, |A| = n [4]:

  1. 1.

    \( \eta_{{\mathbf{X}}} \left( y \right) \ge \sigma_{{\mathbf{X}}} \left( y \right),{\forall }y{ \in }U, \)

  2. 2.

    \( \sigma_{{\mathbf{X}}} \left( y \right) > 0 \Rightarrow y{ \in }X, \)

  3. 3.

    \( y{ \in }X \Rightarrow \eta_{{\mathbf{X}}} \left( y \right) = n, \)

  4. 4.

    \( y{ \in \varTheta }\left( X \right) \Leftrightarrow \eta_{{\mathbf{X}}} \left( y \right) = n, \)

  5. 5.

    \( y{ \in \varPsi }\left( X \right) \Leftrightarrow \sigma_{{\mathbf{X}}} \left( y \right) = n, \)

  6. 6.

    \( \eta_{{\mathbf{X}}} \left( y \right) > 0 \Leftrightarrow\Gamma \left( X \right), \)

  7. 7.

    \( \sigma_{{\mathbf{X}}} \left( y \right) > 0 \Leftrightarrow\Phi \left( X \right), \)

  8. 8.

    \( {\mathbf{X}} \subseteq {\mathbf{Y}} \Rightarrow \eta_{{\mathbf{X}}} \left( y \right) \le \eta_{{\mathbf{Y}}} \left( y \right),\;\sigma_{{\mathbf{X}}} \left( y \right) \le \sigma_{{\mathbf{Y}}} \left( y \right),{\forall }y{ \in }U, \)

  9. 9.

    \( {\mathbf{X}} = {\mathbf{Y}} \Rightarrow \eta_{{\mathbf{X}}} \left( y \right) = \eta_{{\mathbf{Y}}} \left( y \right),\;\sigma_{{\mathbf{X}}} \left( y \right) = \sigma_{{\mathbf{Y}}} \left( y \right),{\forall }y\, \in \,U, \)

  10. 10.

    \( \eta_{{{\mathbf{X}}{ \cup }{\mathbf{Y}}}} \left( y \right) = \eta_{{\mathbf{X}}} \left( y \right) + \eta_{{\mathbf{X}}} \left( y \right) - \eta_{{{\mathbf{X}}{ \cap }{\mathbf{Y}}}} \left( y \right), \)

  11. 11.

    \( \sigma_{{{\mathbf{X}}{ \cup }{\mathbf{Y}}}} \left( y \right) = \sigma_{{\mathbf{X}}} \left( y \right) + \sigma_{{\mathbf{X}}} \left( y \right) - \sigma_{{{\mathbf{X}}{ \cap }{\mathbf{Y}}}} \left( y \right), \)

  12. 12.

    \( \eta_{{{\mathbf{X}} \oplus {\mathbf{Y}}}} \left( y \right) = \eta_{{\mathbf{X}}} \left( y \right) + \eta_{{\mathbf{X}}} \left( y \right) - \eta_{{{\mathbf{X}} \otimes {\mathbf{Y}}}} \left( y \right), \)

  13. 13.

    \( \sigma_{{{\mathbf{X}} \oplus {\mathbf{Y}}}} \left( y \right) = \sigma_{{\mathbf{X}}} \left( y \right) + \sigma_{{\mathbf{X}}} \left( y \right) - \sigma_{{{\mathbf{X}} \otimes {\mathbf{Y}}}} \left( y \right), \)

Simply, two membership functions denoted by µ X (y): U → [0,1] and ν X (y): U → [0,1] might be defined by dividing two previous count functions with total number of contexts (|A| = n) as follows [4].

$$ \mu_{X} (y) = \frac{{\eta_{X} (y)}}{n}, $$
(3.17)
$$ \nu_{X} (y) = \frac{{\sigma_{X} (y)}}{n}, $$
(3.18)

where \( \mu_{X} (y) \) and \( \nu_{X} (y) \) represent membership value of y in upper and lower multi set X, respectively. Actually, μ and ν are nothing but another representation of the count functions. However, \( \left( {\nu_{X} (y),\,\mu_{X} (y)} \right) \) might be considered as pair of an interval membership function of yU in the presence of multi-contexts of attributes. Similarly, by changing n to 1 in Property (3)–(5), μ and ν have exactly the same properties as given by η and σ, respectively.

5 Generalization of Contexts

The objective behind generalization of contexts is to combine all contexts of attributes into one general context. In this case, three kinds of general context are introduced, namely AND-general context, OR-general context and OR+-general context.

First, AND-general context is a general context which is provided by AND logic operator to all attributes of all contexts. It is simply built by unifying all elements of attributes of all contexts into one general context as given by the following definition.

Definition 3.3

Let A = {A 1 , A 2 , …, A n } be set of contexts. \( A_{ \wedge } \) is defined as AND-general context by \( A_{ \wedge } = A_{1} \wedge A_{2} \wedge \cdots \wedge A_{n} \), where \( A_{ \wedge } \) is a result of summation of all conditions as given by all attributes of A i , \( \forall i \in {\text{N}}_{n} \) or simply,

$$ A_{ \wedge } = A_{1} + A_{2} + \cdots + A_{n} $$
(3.19)

Similar to what has been discussed in Sect. 3.3, In Definition 3.3, it is also not necessary \( i \ne j \Rightarrow A_{i} { \cap }A_{j} = {\emptyset }. \) In this case, every attribute is assumed to provide unique and independent value of the attribute in the relation to a given object in terms of a certain context of attribute. It can be verified that \( A_{ \wedge } \) satisfies \( |A_{ \wedge } | = \sum\nolimits_{i = 1}^{n} {|A_{i} |} \). Also, \( {\forall }[u]_{{A_{ \wedge } }} ,{\forall }i{ \in }{\text{N}}_{n} ,\exists [u]_{{A_{i} }} \) such that \( [u]_{{A_{ \wedge } }} \,\, \subseteq \,\,[u]_{{A_{i} }} . \)

For a given X ⊆ U, \( Lo\left( {X_{ \wedge } } \right) \) and \( Up\left( {X_{ \wedge } } \right) \) are denoted as lower and upper approximation of X based on set of attributes, \( A_{ \wedge } \). Approximation space constructed by AND-general context is considered as the finest disjoined partition by combining all partition of contexts and considering every possible area of intersection among equivalence classes as a equivalence class of AND-general context (see Fig. 3.1c). It can be clearly seen that AND-general context provides the finest approximation of rough set.

Fig. 3.1
figure 1

Generalization contexts [4]. a U/A 1, b U/A 2, c U/A ^, d U/A v, e \( U/A_{v}^{+} \)

Second, the independency of every context persists in the process of generalization, if relationships among contexts are related by OR logic operator. Obviously, instead of a disjoint partition, it may provide a covering of the universal objects. Since OR-general context provides a covering [9, 1], it is also called Cover-general context (C-general context, for short) as given by the following definition.

Definition 3.4

Let A = {A 1 , A 2 , …, A n } be set of contexts. \( A_{ \vee } \) is defined as C-general context as given by: \( A_{ \vee } = A_{1} \vee A_{2} \vee \cdots \vee A_{n} \), such that [4]

$$ U/A_{ \vee } = \bigcup\limits_{i = 1}^{n} {U/A_{i} } $$
(3.20)

where \( U/A_{ \vee } \) is a covering of the universe as union of all equivalence classes in \( U/A_{i} ,\,i \in {\text{N}}_{n} \).

Consequently, \( |U/A_{ \vee } | \le \sum\nolimits_{i = 1}^{n} {|U/A_{i} |} \) and \( {\forall }C\, { \in }U/A_{ \vee } ,{\forall }i{ \in }{\text{N}}_{n} ,\exists [u]_{{A_{i} }} \) such that \( C = [u]_{{A_{i} }} \), where C is a similarity class in covering and \( [u]_{{A_{i} }} \) is an equivalence class in the partition of U/A i . Here C is called similarity class in order to distinguish it from equivalence class provided by equivalence relation as usually used in partition. Here, every similarity class may take overlap one to each other in providing a covering. Therefore, a given object uU is possibly a member or an element of more than one similarity classes. It can be proved that for \( X \subseteq U \), \( Lo\left( {X_{ \vee } } \right) \) and \( Up\left( {X_{ \vee } } \right) \), as a pair of lower and upper approximations of X in terms of \( A_{ \vee } \), are calculated by the following equations [4],

$$ Lo\left( {X_{ \vee } } \right) = \bigcup\limits_{i = 1}^{n} {Lo\left( {A_{i} } \right),} $$
(3.21)
$$ Up\left( {X_{ \vee } } \right) = \bigcup\limits_{i = 1}^{n} {Up\left( {A_{i} } \right),} $$
(3.22)

where \( Lo\left( {X_{i} } \right) \) and \( Up\left( {X_{i} } \right) \) are lower and upper approximation of X dealing with the context A i . It can be easily verified that iterative operation is applied in operation of the upper approximation as given by \( Up\left( {X_{ \vee } } \right) \subseteq Up\left( {Up\left( {X_{ \vee } } \right)} \right) \). In this case, \( M\left( {Up\left( {X_{ \vee } } \right)} \right) \) is then considered as a maximum upper approximation as given by \( Up\left( {X_{ \vee } } \right) \subseteq Up\left( {Up\left( {X_{ \vee } } \right)} \right) \subseteq \cdots \subseteq M\left( {Up\left( {X_{ \vee } } \right)} \right) \) in which the iterative operation is no longer applied in the maximum upper approximation as shown in the following relation: \( Up\left( {M\left( {Up\left( {X_{ \vee } } \right)} \right)} \right) = M\left( {Up\left( {X_{ \vee } } \right)} \right) \). In the relation to the covering of the univeral objects, some properties have been already given and discussed in [2]. Furthermore, related to summary of multi rough set as discussed in the previous section, it can be verified that \( Up\left( {X_{ \vee } } \right) =\Gamma \left( X \right) \) and \( Lo\left( {X_{ \vee } } \right) =\Phi \left( X \right) \).

The third general context is called OR+-general context. In OR+-general context, transitive closure operation is applied to the covering as provided by OR-general context or C-general context. In other words, similarity classes of OR+-general context are constructed by union of all equivalence classes of all partitions (of all contexts) that overlap one to each other. Similarity classes of OR+-general context is then defined by the following definition.

Definition 3.5

Let A = {A 1 , A 2 , …, A n } be set of contexts. \( A_{ \vee }^{ + } \) is defined as OR+-general context by: \( A_{ \vee }^{ + } = A_{1} \circ A_{2} \circ \cdots \circ A_{n} \), such that \( y \in [x]_{{A_{ \vee }^{ + } }} \) iff [4]

$$ \begin{aligned} & \left( {\exists C_{i} { \in }U/A_{ \vee } ,\,x,y{ \in }C_{i} } \right)\quad {\text{OR}}\quad (\exists C_{i1} ,C_{i2} , \ldots ,C_{im} { \in }U/A_{ \vee } , \\ & x{ \in }C_{i1} ,C_{ik} { \cap }C_{i(k + 1)} \ne {\emptyset },k = 1, \ldots ,m - 1,y{ \in }C_{im)} . \\ \end{aligned} $$
(3.23)

where m ≤ n and \( \left[ x \right]_{{A_{ \vee }^{ + } }} \) is an equivalence class containing x in terms of \( A_{ \vee }^{ + } \).

For \( U/A_{ \vee } \) be a set of equivalence classes provided by all contexts, equivalence classes generated by \( A_{ \vee }^{ + } \) are able to be constructed by the following algorithm [4, 5]:

Finally, algorithm in Definition 3.5 shows that there will be p equivalence classes. Possibly, p might be equal to 1 in case all elements in \( U/A_{ \vee } \) transitively join each other. It can be verified that all equivalence classes in \( U/A_{ \vee }^{ + } \) are disjoint. Also, \( {\forall }S{ \in }U/A_{ \vee }^{ + } \) such that \( \forall i \in {\text{N}}_{n} ,\exists M \in U/A_{i} ,M \subseteq S. \) For a given X ⊆ U, \( Lo\left( {A_{ \vee }^{ + } } \right) \) and \( Up\left( {A_{ \vee }^{ + } } \right) \) are defined as lower and upper approximation of X, respectively provided by set of attributes, \( A_{ \vee }^{ + } \). It can be easily seen that OR+-general context will construct the worst disjoined partition. Hence, it will provide the worst approximation of rough set. In the relation to the maximum upper approximation based on OR-general context, it can be easily proved that apr \( Up\left( {A_{ \vee }^{ + } } \right) = M\left( {Up\left( {A_{ \vee } } \right)} \right) \). Compare to summary multi rough set in previous discussion and approximation based on AND-general context, we have the following relation [4].

$$ Lo(A_{ \vee }^{ + } ) \subseteq\Psi (X) \subseteq\Phi (X) \subseteq Lo(A_{ \wedge } ) \subseteq X \subseteq Up(A_{ \wedge } ) \subseteq\Theta (X) \subseteq\Gamma (X) \subseteq Up(A_{ \vee }^{ + } ) $$

To be more clearly understandable how generalization of contexts applied in the approximation of X, they are illustrated by Fig. 3.1. To simplify the problem, two contexts are given, A 1 and A 2, such as approximation of X is represented in Fig. 3.1a, b. Approximations of X based on AND, OR and OR+-general context are given in Fig. 3.1c–e, respectively.

6 Conclusion

The concept of multi rough sets based on multi contextual information systems was proposed by Intan et al. (2003) [4, 5]. Here, context can be viewed as situation or background by which somehow it is necessary to group some attributes as a subset of attributes and consider the subset as a certain context. Multi rough sets were considered as a generalization of rough sets. This paper was an extended work by exploring more properties and set operations of the concept of multi rough sets based on multi-contextual information systems. Basic operations and some properties were examined. Two count functions as well as their properties were defined and examined to characterize multi rough sets. Finally, three types of general contexts, namely AND-general context, C-general context and OR+-general context were proposed and discussed in order to aggregate contexts into one general context. This paper also discussed briefly relation among all approximations provided by the general contexts. In the future work, it is necessary to apply and implement the concept of multi rough sets in the real world application.