1 Introduction

In the last decade, formal concept analysis (FCA) has been applied in various research fields for knowledge processing tasks (Poelmans et al. 2013a, b). FCA was introduced by Wille (1982) using applied lattice theory. FCA processes an input data set, given in context format, to discover formal concepts and a concept lattice (Ganter and Wille 1999). A formal context represents a binary relation between a set of objects and the corresponding set of attributes as a row–column matrix; a cell of the matrix contains × if the object possesses the corresponding attribute and is empty otherwise. From the given context, FCA discovers patterns in the form of objects and their common attributes, called formal concepts. A formal concept is a maximal pair of a set of objects (extent) and their corresponding attributes (intent), closed under a Galois connection. All discovered formal concepts can be visualized in a hierarchically ordered structure called a concept lattice (Aswani Kumar and Prem Kumar 2014). There are numerous interesting extensions of the concept lattice in the fuzzy setting (Burusco and Fuentes-Gonzales 1994), fuzzy graphs (Ghosh et al. 2010), the interval-valued fuzzy setting (Prem Kumar et al. 2016a; Yao 2016), the bipolar fuzzy setting (Prem Kumar and Aswani Kumar 2014a, b), the three-polar setting (Prem Kumar 2016a, b), and other mathematical models (Macko 2013; Poelmans et al. 2013b; Ignatov et al. 2015). In each orientation, the concept lattice generated from a large number of attributes may contain some unimportant formal concepts, as demonstrated by Prem Kumar et al. (2016a, b). In this case, selecting some of the important concepts from the large number of generated concepts is a major concern for researchers. Recently, attention has been paid to reducing the size of the concept lattice using K-means clustering (Aswani Kumar and Srinivas 2010), non-negative matrix factorization (Aswani Kumar et al. 2015), stability index (Babin and Kuznetsov 2012), weight computation (Bělohlávek and Macko 2012; Bělohlávek and Trnecka 2012), Junction Based Object Similarity (JBOS) (Dias and Viera 2013), entropy (Li et al. 2013; Prem Kumar and Abdullah Gani 2015; Zhang et al. 2012), K-medoids (Li et al. 2016), and homomorphisms (Prem Kumar and Aswani Kumar 2014a, b). None of the available approaches provides a way to process a large context based on user-defined information granules (Bart et al. 2012; Dias and Viera 2015; Li et al. 2016). The reason is that a user or expert needs certain important concepts based on his/her own requirements, which may differ from expert to expert. To deal with this issue, a study on concept lattice reduction based on chosen information granules is much needed.

Recently, some researchers have turned their attention to concept lattice representation via defined information granules, processing a large context into several small contexts for precise analysis in knowledge processing tasks (Yao 2004; Pedrycz 2013; Li et al. 2015). Information granules have also been used to find frequent item sets based on tree (Vo et al. 2013; Yao 2016a), orthopair (Cucci 2016), and triarchy (Yao 2016a) structures for adequate analysis of situation awareness (Loia et al. 2013), intelligent systems (Pedrycz and Chen 2011), big data (Pedrycz and Chen 2015a), decision making processes (Pedrycz and Chen 2015b), neural networks (Song and Wang 2016), and other fields of human–data interaction (Wilke and Portmann 2016; Zadeh 2008). Recently, the properties of information granules have been extended to handle data with binary (Bělohlávek et al. 2014), fuzzy (Kang et al. 2012; Li et al. 2015), interval-valued (Yao 2016b), and bipolar fuzzy attributes (Prem Kumar and Aswani Kumar 2014a, b), refining some of the important concepts based on user-defined granulation (Prem Kumar and Aswani Kumar 2012; Prem Kumar and Abdullah Gani 2015). These recent studies have given an interactive (Skowron et al. 2016) and a new format of granular computing (Dubois and Prade 2016), which can be useful to bridge its gap with knowledge reduction tasks (Wu et al. 2009). These analyses motivated us to focus in this paper on another form of granular computing, namely subsets of attributes as information granules, in order to reduce the size of the concept lattice.

The purpose of using subsets of attributes as information granules is to find important patterns (i.e., concepts) through their closeness (Dias and Viera 2013), functional relationships (Prem Kumar and Aswani Kumar 2014a, b), crisp ordering (Prem Kumar and Aswani Kumar 2015), similarity (Prem Kumar and Abdullah Gani 2015), or defined complex granules (Skowron et al. 2016). The selection of granules is based on the shape and size of the given problem, which should be commensurate with the user's requirements for resolving it. The chosen level of granulation thus provides a way to process a large context efficiently by modularizing the complex problem into a series of well-defined sub-problems (modules) at minimal computation cost, as discussed by Loia et al. (2016). In this paper, shape refers to the formal context and size to its dimension, whereas subsets of attributes are considered as small information granules. The level of granulation for choosing a particular subset can be defined by the user based on his/her requirements. For example, suppose a context has three attributes \(\{ 1,2,3\}\). In this case, the following subsets can be generated: \(\phi\), \(\left\{ 1\right\}\), \(\left\{ 2\right\}\), \(\left\{ 3\right\}\), \(\left\{ 1,2\right\}\), \(\left\{ 1,3\right\}\), \(\left\{ 2,3\right\}\), \(\left\{ 1,2,3\right\}\). Among these, the user can choose any subsets as the level of granulation. The level of granulation indicates the number of attributes reduced by the chosen subsets, as given below:

  • Granulation level 0 means none of the attributes are reduced, so the user selects {1}, {2}, {3}.

  • Granulation level 1 means one attribute is reduced. In this case, the user may select one of the following combinations:

    1. ({1}, {2, 3})

    2. ({2}, {1, 3})

    3. ({1, 2}, {3})

  • Granulation level 2 means two attributes are reduced. In this case, the user can select the subset {1, 2, 3}.
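The enumeration of granulation levels above can be sketched in code. The following Python snippet (an illustrative sketch, not part of the formal method) generates all partitions of the attribute set {1, 2, 3} into granules and groups them by granulation level, taken here as the number of attributes merged away:

```python
def partitions(items):
    """Recursively yield all set partitions of a list."""
    if len(items) == 1:
        yield [items]
        return
    first, rest = items[0], items[1:]
    for p in partitions(rest):
        for i in range(len(p)):            # put `first` into an existing block
            yield p[:i] + [[first] + p[i]] + p[i + 1:]
        yield [[first]] + p                # or give `first` its own block

attrs = [1, 2, 3]
by_level = {}
for p in partitions(attrs):
    level = len(attrs) - len(p)            # number of attributes "reduced"
    by_level.setdefault(level, []).append(p)
```

Running this yields exactly the enumeration above: one partition at level 0 (the three singletons), three at level 1, and one at level 2 (the single block {1, 2, 3}).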

The above illustration shows that choosing subsets of attributes as information granules provides a mechanism to reduce a large context by changing the size of the subsets. In this process, it is possible to hide or reveal a certain amount of detail through the chosen subsets of attributes, depending on the complexity and requirements of the particular problem. The reason is that each chosen subset of attributes, as an information granule, provides a specific way to describe a particular part of the problem. The chosen subsets of attributes can also be visualized as vertices of a graph (Berry and Sigayret 2004), as applied in mathematical searching (Nguyen et al. 2012), preference analysis (Obiedkov 2012), item set mining (Troiano and Scibelli 2014), AFS algebra (Wang and Liu 2008), and interval-set approximation (Yao 2016b). The complexity of concept lattice visualization and its processing time increase with the number of attributes in the given context. A problem therefore arises when a user wants to visualize the data using some potential subsets of attributes, so as to find important concepts which may or may not be detectable when using all the attributes. To achieve this goal, this paper aims at the following:

  1. To propose a method that processes a large context using chosen subsets of attributes as granulation.

  2. To reduce the size of the concept lattice based on the chosen granulation level for the subsets of attributes.

  3. To find some of the important concepts from the obtained context at different granulations of their computed weight using entropy.

  4. To provide an empirical comparison of the proposed method with the granular tree method given by Bělohlávek et al. (2014), which also provides a way to control the size of the concept lattice using spatial neighborhoods of attributes.

The rest of this paper is organized as follows: Sect. 2 provides a brief background on FCA. Section 3 contains the proposed method, and Sect. 4 illustrates it. The empirical comparison of the proposed method with the granular tree method is demonstrated in Sect. 5, followed by conclusions, acknowledgements, and references.

2 Formal concept analysis

Definition 1

(Formal context) A formal context F = (X, Y, R) consists of a set of objects (X), a set of attributes (Y), and a binary relation (R) between them, given as a row–column matrix. A cell of the matrix contains × if the object possesses the corresponding attribute and is empty otherwise.

Definition 2

(Concept-forming operators) The operators \(\uparrow\): \(2^{X} \rightarrow 2^{Y}\) and \(\downarrow\): \(2^{Y} \rightarrow 2^{X}\) are defined for every A \(\subseteq\) X and B \(\subseteq\) Y by

\(A^{\uparrow }\) = \(\left\{ y\in Y | \forall x \in A: (x, y) \in R \right\}\),

\(B^{\downarrow }\) = \(\left\{ x\in X | \forall y \in B: (x, y) \in R \right\}\),

\(A^{\uparrow }\) is the set of all attributes shared by all objects from A. Similarly, \(B^{\downarrow }\) is the set of all objects sharing all attributes from B.
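Definition 2 translates directly into code. The sketch below (assuming the relation R is stored as a set of (object, attribute) pairs; the toy context is hypothetical) implements the two concept-forming operators:

```python
def up(A, Y, R):
    """A↑: the attributes shared by every object in A."""
    return {y for y in Y if all((x, y) in R for x in A)}

def down(B, X, R):
    """B↓: the objects possessing every attribute in B."""
    return {x for x in X if all((x, y) in R for y in B)}

# Hypothetical toy context with two objects and two attributes.
X, Y = {"x1", "x2"}, {"y1", "y2"}
R = {("x1", "y1"), ("x1", "y2"), ("x2", "y1")}
```

For instance, up({"x1", "x2"}, Y, R) returns {"y1"}, the only attribute common to both objects, and down({"y1"}, X, R) returns both objects.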

Definition 3

(Formal concept) A formal concept is a pair (A, B) of maximal subsets of objects and attributes, respectively, where A \(\subseteq\) X and B \(\subseteq\) Y, closed as follows: \(A^{\uparrow }\) = B and \(B^{\downarrow }\) = A. This Galois connection pairs the objects and attributes sharing the same properties, called extent and intent. The collection of all such pairs forms a concept lattice under the closure operation.

Definition 4

(Concept lattice) The concept lattice determines the hierarchy of formal concepts under the partial order \((A_{1},B_{1})\le (A_{2},B_{2})\Longleftrightarrow A_{1}\subseteq A_{2}(\Longleftrightarrow B_{2}\subseteq B_{1})\). In this case, the concept \((A_{1}, B_{1})\) is more specific than \((A_{2}, B_{2})\) (i.e., \((A_{2}, B_{2})\) is more general than \((A_{1}, B_{1})\)). From this ordering it follows that every concept lattice contains two special nodes at its top and bottom boundaries, representing the most general and the most specific concepts, respectively. Generalized concepts contain more objects, while specialized concepts contain more attributes. The attributes of each formal concept are inherited from the most general maximum node, while the objects are inherited from the most specific minimum node, with infimum and supremum given by Ganter and Wille (1999):

  • \(\wedge _{j\in J}(A_{j}, B_{j})\) = \((\bigcap _{j\in J} A_{j}, (\bigcup _{j\in J}B_{j})^{\downarrow \uparrow })\),

  • \(\vee _{j\in J} (A_{j}, B_{j})\) = \(((\bigcup _{j \in J} A_{j})^{\uparrow \downarrow },\bigcap _{j\in J} B_{j})\).
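With these definitions, all formal concepts of a small context can be enumerated naively by closing every attribute subset. The following self-contained Python sketch (exponential in |Y|, so didactic only; the toy context is hypothetical) illustrates the idea:

```python
from itertools import chain, combinations

def concepts(X, Y, R):
    """Enumerate all formal concepts of (X, Y, R) by closing each B ⊆ Y."""
    up = lambda A: frozenset(y for y in Y if all((x, y) in R for x in A))
    down = lambda B: frozenset(x for x in X if all((x, y) in R for y in B))
    subsets = chain.from_iterable(combinations(sorted(Y), r)
                                  for r in range(len(Y) + 1))
    found = set()
    for B in subsets:
        A = down(frozenset(B))       # extent of the closure of B
        found.add((A, up(A)))        # (extent, intent) pair
    return found

# Hypothetical toy context.
X, Y = {"x1", "x2"}, {"y1", "y2"}
R = {("x1", "y1"), ("x1", "y2"), ("x2", "y1")}
```

On this toy context the routine finds two concepts: the top concept ({x1, x2}, {y1}) and the more specific ({x1}, {y1, y2}), consistent with the ordering of Definition 4.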

Definition 5

(Granular computing) Granular computing is an important tool for processing large chunks of information via small information granules, and for analyzing data sets with large attribute sets. An information granule is a collection of attributes grouped by similarity, functional adjacency, or indistinguishability; in one way or another, it quantifies the lack of numeric precision in a large attribute data set. In this paper, subsets of attributes are used as information granules to detect important patterns in the given data set. The level of granulation thus provides a way to process a large context efficiently by modularizing the complex problem into a series of well-defined sub-problems (modules) at minimal computation cost. The importance of a sub-module or information granule can be defined by its computed weight (w), where \(0\le w \le 1\), so that the user can select some of the concepts based on his/her requirements at different granulations \(\theta\) (\(0\le \theta \le 1\)). The selection of particular granules is based on the user's or expert's choice, or on the requirements of the problem.

The concept lattice provides a hierarchically ordered visualization of formal concepts to accelerate knowledge processing tasks using FCA. However, FCA discovers a large number of formal concepts even for a medium-sized formal context, so selecting some of the important formal concepts is a major concern for practical applications of FCA. To address this problem, a method is proposed in the next section based on chosen subsets of attributes as information granules and their computed weight at a defined granulation.

3 Proposed method

3.1 Granulation based subset of attributes

The method proposed in this paper focuses on controlling the size of the concept lattice based on chosen subsets of attributes, used as information granules. The step-by-step procedure is given below:

Step 1 Let us consider a formal context F = (X, Y, R) with n objects and m attributes.

Step 2 Find all the subsets of the attribute set (Y), i.e., \(2^{m}\) subsets of the given formal context.

Step 3 Now consider subsets of attributes (\(S_{j}\)) at a granulation level, where the level of granularity is defined as follows:

  • Granulation level 0 means none of the attributes are reduced by the chosen subsets of attributes.

  • Granulation level 1 means one attribute is reduced by the chosen subsets of attributes.

  • Granulation level 2 means two attributes are reduced by the chosen subsets of attributes.

  • Similarly, granulation level \(m-1\) means \(m-1\) attributes are reduced by the chosen subsets of attributes.

It can be observed that the level of granulation indicates only the number of attributes reduced by the chosen subsets. Choosing the right subsets is another issue; to resolve it, the next step provides a condition to verify the chosen level of granulation.

Step 4 The previous step shows that chosen subsets with equal numbers of attributes may share the same granulation level. In this case, the user should choose subsets of attributes satisfying the following equality: \(S_{1}\) \(\cup\) \(S_{2}\) \(\cup \cdots \cup\) \(S_{j}\) = Y, where the number of chosen subsets is at most \(2^{m}\).

Step 5 The chosen subsets of attributes, together with their relationship to the given object set, provide another formal context F\(_{S}\) = \((X, S_{j}, R_{1})\), where \(|S_{j}| <|Y| = m\) and \(|R_{1}| <|R|\). The size of the new formal context can be controlled through the chosen level of granulation for the subsets of attributes, as shown in Step 3.

Step 6 The concepts can now be generated from the newly obtained context F\(_{S}\) = \((X, S_{j}, R_{1})\) for knowledge processing tasks. Note that, in reducing the size of the formal context, the proposed method does not remove any attributes or objects; hence it incurs comparatively less information loss than other available approaches to FCA in the binary setting.
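Steps 4–6 can be sketched as follows. Note that the text does not fix how an object relates to a multi-attribute granule; this Python sketch assumes, as one plausible reading, that an object possesses a granule exactly when it possesses every attribute in it:

```python
def reduce_context(X, Y, R, granules):
    """Build the reduced context F_S = (X, S_j, R1) from chosen granules.

    `granules` is the list of chosen attribute subsets S_1, ..., S_j;
    the membership rule below (an object is related to a granule when it
    has ALL of its attributes) is an assumption, not stated explicitly
    in the method.
    """
    granules = [frozenset(S) for S in granules]
    # Step 4 check: the chosen subsets must jointly cover Y.
    assert frozenset().union(*granules) == frozenset(Y)
    R1 = {(x, S) for x in X for S in granules
          if all((x, y) in R for y in S)}
    return set(X), granules, R1
```

On a hypothetical context with Y = {y1, y2, y3} and granules {y1, y2} and {y3}, this produces a two-column reduced context without discarding any object or attribute.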

Table 1 Proposed algorithm for concept lattice reduction using chosen subset of attributes

The steps of the proposed algorithm are shown in Table 1. The algorithm first computes all the subsets of the attributes of the given context (Steps 1 and 2) and represents each subset of attributes as a defined set (Step 3). The computed subsets of attributes can be ordered by their level of granulation (Step 4). A user can choose any of the subsets of attributes as information granules to reduce the size of the concept lattice according to his/her requirements for solving the given problem; the chosen subsets should satisfy \(S_{1}\) \(\cup\) \(S_{2}\) \(\cup \cdots \cup\) \(S_{j}\) = Y (Step 5). The chosen subsets of attributes are then represented as a context with the given object set and the corresponding relation (Steps 6 and 7). This newly obtained context can be written as a formal context F\(_{S}\) = \((X, S_{j}, R_{1})\), where \(|S_{j}| <|Y| = m\) and \(|R_{1}| <|R|\), for further processing using the properties of FCA (Step 8). From this formal context, all formal concepts can be generated for knowledge processing tasks (Step 9). Similarly, the size of the concept lattice can be controlled by choosing different subsets of attributes as granulation (Step 10). In accomplishing these tasks, the proposed method does not remove any attributes or objects, which assures comparatively less information loss than other available approaches to FCA in the binary setting.

3.2 Proposed algorithm to choose some important concepts generated from subset of attributes

In this section, a method is proposed to find some of the important concepts generated from the context obtained via the chosen subsets of attributes as information granules (Table 1). Since this choice may introduce some randomness, Shannon entropy is utilized to measure it and to compute a weight for each chosen subset of attributes (\(S_{j}\)), as follows. Consider any object \(x_{i}\in X\) of the reduced context F\(_{S}\) = \((X, S_{j}, R_{1})\). The probability of object \(x_{i}\) possessing the chosen subset of attributes \(S_{j}\) is denoted P\((S_{j}/x_{i})\), where \(S_j\) and \(x_i\) represent the j-th attribute subset and the i-th object, respectively. The average information weight of the chosen subset of attributes \(S_{j}\) is then computed as \(E(S_{j})\), and from it the weight value \(w_{j}\) for the chosen subset of attributes. The weight of each generated concept is obtained by summing the weights of the subsets of attributes contained in its intent, and an average weight of concepts is computed by dividing by the total number of newly generated concepts, as given below:

  1. \(E(S_{j}) = -\sum _{i}\) P\((S_{j}/x_{i})\) log\(_{2}\)(P\((S_{j}/x_{i}))\), where i ranges over the objects and j over the subsets of attributes selected to make the new context F\(_{S}\); this selection is entirely based on the user's or expert's choice.

  2. \(w_{j} = E(S_{j})/ \sum _{j}E(S_{j})\).

  3. \(Weight(k) = \sum ^{k}_{j = 1}(w_j)/k\).
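A possible concrete reading of these formulas is sketched below in Python. The paper leaves the exact form of P(S_j / x_i) open; here it is taken, as an assumption, to be the fraction of objects related to granule S_j in the reduced context, and a concept's weight is taken as the sum of w_j over the granules in its intent:

```python
import math

def granule_weights(X, granules, R1):
    """E(S_j) and w_j; P(S_j) is assumed to be the fraction of objects
    possessing granule S_j (0 log 0 is taken as 0)."""
    E = []
    for S in granules:
        p = sum(1 for x in X if (x, S) in R1) / len(X)
        E.append(-p * math.log2(p) if 0.0 < p < 1.0 else 0.0)
    total = sum(E)
    return [e / total for e in E] if total else [0.0] * len(E)

def concept_weight(intent, granules, w):
    """Weight of a concept: sum of w_j over the granules in its intent."""
    return sum(w[j] for j, S in enumerate(granules) if S in intent)
```

For example, with four objects split evenly between two granules, each granule gets entropy 0.5 and normalized weight 0.5, so a concept whose intent contains one granule has weight 0.5.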

In this way, the proposed method helps in deciding the importance of the concepts generated from the context built from the chosen subsets of attributes, at different granulations of their weight. The selection of granules is entirely based on the user's requirements for finding important concepts, via the shape and size of the given problem.

Table 2 Proposed algorithm to reduce the concepts generated from Table 1 at different granulation

The steps of the proposed algorithm are shown in Table 2. The algorithm starts from the newly obtained context F\(_{S}\) = \((X, S_{j}, R_{1})\) and its generated concepts. Since the newly obtained context is based on choosing subsets of attributes (\(S_{j}\)) as granulation, these subsets are used for computing the weights (Step 1). The algorithm first computes the probability of the chosen subsets of attributes (\(S_{j}\)) possessing the corresponding objects (Step 2). The average information weight of the chosen subsets of attributes is then computed using entropy (Steps 3 and 4). The total weight of the concepts generated from F\(_{S}\) = \((X, S_{j}, R_{1})\) is computed in Steps 5 to 7. A user can select some of the important formal concepts based on a defined granulation of their computed weight (Steps 8–10). In this way, the proposed method provides an in-depth analysis of the concept lattice generated from the chosen subsets of attributes, which is another of its advantages.

Complexity Let n be the number of objects and m the number of attributes in the given formal context. Finding the subsets of attributes (\(S_j\)) takes \(O(2^{m})\) time, where \(j<m\). The proposed method provides a way to control the size of the concept lattice using subsets of attributes as information granules (Table 1) and to find important concepts based on their computed weight at different granulations (Table 2), which takes \(O(j \ln (j))\) time, where j is the number of chosen subsets of attributes, i.e., \(j<m\). In this way, the proposed method has lower complexity than the granular tree method (Bělohlávek et al. 2014), which is NP-hard. Moreover, the granular tree method is expert-based, as discussed by Bělohlávek et al. (2014), whereas the proposed method can be used by any expert or non-expert user, as illustrated in the next section.

4 Concept lattice reduction using granular based subset of attributes

Several methods have been proposed for concept lattice reduction to increase the applicability of FCA in various research fields (Bart et al. 2012; Dias and Viera 2015). In recent years, research trends have turned towards granulation-based concept lattice reduction using subsets (Prem Kumar et al. 2016a, b; Yao 2016). Pandey et al. (2016) attempted the classification of an Indian algae data set based on common subsets of attributes (http://indianalgae.co.in). In this paper, we focus on concept lattice reduction using subsets of attributes as information granules to reveal important patterns in the given data set. For this purpose, a method is proposed in Table 1 and illustrated by the following example:

Example 1

Let us consider the binary context shown in Table 3, where \(x_{1}, x_{2}, x_{3}, x_{4}, x_{5}, x_{6}, x_{7}\) represent the objects and \(y_{1}, y_{2}, y_{3}, y_{4}\) the attributes. The concept lattice generated from this context is shown in Fig. 1 and contains nine concepts. Our aim now is to reduce the size of this concept lattice using granulation-based subsets of attributes to process the knowledge.

Table 3 A binary formal context
Fig. 1
figure 1

Concept lattice generated from context shown in Table 3

Table 4 16 possible subsets of attributes for \(\left\{ y_{1}, y_{2}, y_{3}, y_{4}\right\}\) shown in Table 3
Table 5 Possible selection of subset of attributes based on the level of granulation

Table 3 contains four attributes \(y_{1}\), \(y_{2}\), \(y_{3}\), \(y_{4}\), for which all \(2^{4} = 16\) possible subsets are shown in Table 4. The user can choose any of the subsets to control the size of the concept lattice according to his/her requirements at a defined level of granulation for the subsets of attributes, as shown in Table 5. Each chosen subset of attributes refines specific information based on the shape and size of the particular problem (Loia et al. 2016). In this paper, the levels of granulation shown in Table 5 are defined as follows:

  1. Granulation level 0 means none of the attributes are reduced by the chosen subsets.

  2. Granulation level 1 means one attribute is reduced by the chosen subsets when compared to the original context.

  3. Granulation level 2 means two attributes are reduced by the chosen subsets when compared to the original context.

To demonstrate the proposed method, let us consider granulation level 1 in Table 5, which offers combinations 2, 3, 4, 5, 6, and 7 of Table 5 to select from. The user can select any of them to process the context; suppose the user has chosen combination 2, \(\left\{ S_{2}, S_{3}, S_{11} \right\}\). The selected subsets of attributes reduce the original context to three attributes. The reduced context, based on the relationship with the corresponding object set, is shown in Table 6, and the concept lattice generated from it is shown in Fig. 2. It can be observed that Fig. 2 reduces the lattice of Fig. 1 from nine concepts to seven using granulation level 1. Similarly, the concept lattice can be reduced using the other levels of granulation shown in Table 5. In this process, the proposed method does not reduce or discard any of the objects or attributes, which assures a lower possibility of information loss. To verify this, the knowledge represented by the reduced concept lattice of Fig. 2 is compared with that of Fig. 1 in Table 7, which confirms that the reduced concept lattice of Fig. 2 preserves the knowledge represented by the original concept lattice of Fig. 1.

Table 6 Context shown in Table 3 based on chosen subset–\(\left\{ S_{2}, S_{3}, S_{11} \right\}\)
Fig. 2
figure 2

Concept lattice generated from the context shown in Table 6

Table 7 Knowledge discovered by Fig. 1 and its reduced lattice shown in Fig. 2

Furthermore, some important concepts can be selected from the reduced concept lattice shown in Fig. 2. To accomplish this task, the method shown in Table 2 provides a way to select some of the concepts based on their computed weight at different granulations. To illustrate this process, the context shown in Table 6 is considered. Table 8 shows the computed weight for the chosen subsets of attributes \(S_{2}\), \(S_{3}\), and \(S_{11}\), and Table 9 the computed weight of each concept of Fig. 2 generated from them. The selection of concepts based on their computed weight is shown in Table 10. In this way, the proposed method reduces the concept lattice using granulation-based chosen subsets of attributes and their computed weight at different granulations.

Table 8 Computed weight for each subset of attributes shown in Table 6
Table 9 Weight of formal concepts shown in Fig. 2 using their intent
Table 10 Some selected formal concepts from the Fig. 2 using granulation

It can be observed that the proposed method provides two successive ways to reduce the concept lattice and select some of the important data. To validate its results, the granular tree method (Bělohlávek et al. 2014), whose functionality is closest to that of the proposed method, is considered; for this purpose, the same car data set is adapted from Bělohlávek et al. (2014), as shown in the next section.

5 Empirical analysis

In this paper, a method is proposed to control the size of the concept lattice using subsets of attributes as granules, since the properties of granulation give a way to refine a large context into various smaller contexts. To exploit these advantages of granular computing, several methods have recently been proposed to discover specific patterns in a formal context based on closeness (Dias and Viera 2013), functional relationships (Wu et al. 2009), similar weight (Prem Kumar et al. 2016a), Huffman coding (Prem Kumar and Abdullah Gani 2015), crisp ordering (Prem Kumar and Aswani Kumar 2015), and interval-valued subsets (Yao 2016). The cognitive viewpoint of the concept lattice through granular computing is also discussed by Li et al. (2015). Bělohlávek et al. (2014) introduced another method to control the size of the concept lattice using a granular tree defined by an expert. Among these available methods, the proposed method and its analysis are closest to granular tree concept lattice reduction. To illustrate the difference between the proposed method and the granular tree, an example is given below:

Example 2

Let us consider the car accident data set shown in Table 11. It records information about accidents by car reference number, driver name, cause of accident (alcohol; priority, meaning office-time failure to yield; the driver not using the steering in the right way or the car steering not being correct; the car brakes not working properly), and time of accident. From this data set, the following important patterns were investigated by Bělohlávek et al. (2014) using the granular tree method:

  1. A significant number of “night accidents caused by alcohol”.

  2. A significant number of “morning accidents caused by priority (failure to yield way)”.

Our goal now is to compare the above findings with the analysis derived from the proposed method. The data shown in Table 11 are in raw format and may yield some unimportant or unusual patterns (concepts) along with the usual patterns, which can affect precise knowledge processing. For this purpose, the given data set needs to be prepared based on its object and attribute sets as given below:

Table 11 Data with binary attributes for the car accident

Table 11 contains the following distinct attributes to process for knowledge discovery tasks:

  1. Cause of accident: Alcohol, Brakes, Priority (like office time), Steering, and

  2. Time of accident: 1 AM, 6 AM, 7 AM, 9 AM, 10 AM, 12 AM, 8 PM, 9 PM, 10 PM.

For the above attributes, the corresponding subsets are shown in Tables 12 and 13, respectively.

Table 12 Possible subset for the attributes cause of accidents shown in Table 11
Table 13 Possible subset of attributes for time of accident shown in Table 11

Some of the possible combinations of subsets of attributes from Tables 12 and 13 which satisfy \(S_{1}\) \(\cup\) \(S_{2}\) \(\cup \cdots \cup\) \(S_{j}\) = Y are:

  1. \(\left\{ S_{2}, S_{3}, S_{4}, S_{5}, AM, PM \right\}\)

  2. \(\left\{ S_{4}, S_{5}, S_{6}, AM, PM \right\}\)

  3. \(\left\{ S_{3}, S_{5}, S_{7}, AM, PM \right\}\)

  4. \(\left\{ S_{3}, S_{4}, S_{8}, AM, PM \right\}\)

  5. \(\left\{ S_{2}, S_{5}, S_{9}, AM, PM \right\}\)

  6. \(\left\{ S_{2}, S_{4}, S_{10}, AM, PM \right\}\)

  7. \(\left\{ S_{2}, S_{1}, S_{11}, AM, PM \right\}\)

  8. \(\left\{ S_{6}, S_{11}, AM, PM \right\}\)

  9. \(\left\{ S_{7}, S_{10}, AM, PM \right\}\)

  10. \(\left\{ S_{8}, S_{9}, AM, PM \right\}\)

  11. \(\left\{ S_{2}, S_{15}, AM, PM \right\}\)

  12. \(\left\{ S_{3}, S_{14}, AM, PM \right\}\)

  13. \(\left\{ S_{4}, S_{13}, AM, PM \right\}\)

  14. \(\left\{ S_{5}, S_{12}, AM, PM \right\}\)

  15. \(\left\{ S_{16}, AM, PM \right\}\)

The user can now select any of the subsets from the above list to find important patterns according to his/her requirements, as given below:

  1. If the user wants to analyze the patterns of car accidents based on Alcohol, Brakes, Priority, and Steering, then the user can select \(\left\{ S_{2}, S_{3}, S_{4}, S_{5}, AM, PM \right\}\).

  2. If the user wants to analyze the patterns of car accidents based on \(\left\{ Alcohol, Priority \right\}\) and \(\left\{ Brakes, Steering \right\}\), then the user can select the subsets of attributes \(\left\{ S_{7}, S_{10}, AM, PM \right\}\). Similarly, other subsets can be chosen based on the user's requirements for analyzing patterns in the car accidents.

Example 2.1

Let us suppose the user has chosen combination 1, \(\left\{ S_{2}, S_{3}, S_{4}, S_{5}, AM, PM \right\}\), which includes the following attributes as per Table 12: \(\left\{ Alcohol, Brakes, Priority, Steering, AM, PM \right\}\). The formal context for the chosen subsets of attributes and the corresponding object set is shown in Table 14. The list of concepts generated from this context is shown in Table 15, and their hierarchically ordered visualization in the concept lattice in Fig. 3.

Table 14 The formal context for the car accident data set example of Table 11
Fig. 3

Concept lattice generated from the context shown in Table 14

Table 15 Extent and intent of the formal concepts shown in Fig. 3

From Table 15, the following important patterns can be discovered:

  1. Most of the accidents happen in the AM.

  2. A significant number of accidents that happen in the AM are due to Priority (e.g., office hours).

  3. A significant number of accidents happen in the PM due to Alcohol or Brakes.

Furthermore, if the user wants to refine some specific pattern among the concepts shown in Table 15, this can be done through their computed weight at different granulations as per the proposed algorithm shown in Table 2. Based on this algorithm, the computed weight for each attribute of Table 14 is given in Table 16. Table 17 shows the computed weight for each of the concepts listed in Table 15 based on their intent. Table 18 shows the selection of specific concepts at different granulations of their computed weight.
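The exact weight formula of the proposed algorithm (Table 2) is not reproduced in this excerpt; the following is a minimal sketch of the general workflow under stand-in assumptions: each attribute is weighted by its relative frequency in the context, a concept is scored by the mean weight of its intent, and a user-chosen granulation threshold selects the concepts to keep. Context values and concepts are illustrative, not those of Tables 14-17.

```python
# Hypothetical context; values are illustrative, not Table 14.
context = {
    "o1": {"Alcohol", "PM"},
    "o2": {"Brakes", "Priority", "AM"},
    "o3": {"Brakes", "AM"},
    "o4": {"Steering", "PM"},
}

def attribute_weights(ctx):
    """Stand-in weight: relative frequency of each attribute over objects."""
    counts = {}
    for attrs in ctx.values():
        for a in attrs:
            counts[a] = counts.get(a, 0) + 1
    return {a: c / len(ctx) for a, c in counts.items()}

def concept_weight(intent, w):
    """Score a concept by the mean weight of its intent attributes."""
    return sum(w[a] for a in intent) / len(intent) if intent else 0.0

def select(concepts, w, threshold):
    """Keep only concepts whose weight reaches the granulation threshold."""
    return [(ext, it) for ext, it in concepts
            if concept_weight(it, w) >= threshold]

w = attribute_weights(context)
concepts = [({"o2", "o3"}, {"Brakes", "AM"}),
            ({"o1", "o4"}, {"PM"}),
            ({"o2"}, {"Brakes", "Priority", "AM"})]
important = select(concepts, w, threshold=0.5)
```

Raising the threshold yields a coarser granulation (fewer, more general concepts); lowering it refines the view, which mirrors how Table 18 selects concepts at different granulation levels.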

Table 16 Computed weight for each attribute shown in Table 14
Table 17 Weight of formal concepts shown in Fig. 3 using their intent
Table 18 Some important concepts from Fig. 3 using granulation

From Table 18, the following information can be extracted:

  1. A significant number of accidents happen at night (PM) due to Alcohol or Brakes.

  2. A significant number of accidents happen in the morning (AM) due to Priority or Brakes.

We can observe that the analysis derived from the proposed method is in good agreement with the granular tree given by Bělohlávek et al. (2014) for the chosen subset of attributes. Further, if the user wants to analyze the patterns based on other subsets shown in Tables 12 and 13, another subset can be chosen as given below:

Example 2.2

If the user wants to analyze the patterns of car accidents based on \(\left\{ Alcohol, Priority \right\}\) and \(\left\{ Brakes, Steering \right\}\), then the user can select subset 9, \(\left\{ S_{7}, S_{10}, AM, PM \right\}\). The formal context based on this chosen subset is shown in Table 19. The concept lattice generated from this context is shown in Fig. 4, whereas the list of concepts is shown in Table 20.

Table 19 The formal context based on subset of attributes: \(\left\{ S_{7}, S_{10}, AM, PM \right\}\)
Fig. 4

Concept lattice generated from the context shown in Table 19

Table 20 Extent and intent of concepts shown in Fig. 4

From Table 20, we can find some interesting patterns as follows:

  1. Most of the accidents happen in the AM.

  2. A significant number of accidents happen due to \(\left\{ Alcohol, Priority \right\}\).

  3. A significant number of accidents happen in the PM due to Brakes and Steering.

It can be observed that the reduced concept lattice shown in Fig. 4 preserves the knowledge represented by the original concept lattice shown in Fig. 3. Further, the proposed method provides a way to refine some of the important concepts from Table 20 based on user requirements using their computed weight at different granulations (the proposed algorithm shown in Table 2). Table 21 shows the computed weight for each subset of the attributes shown in Table 19. Table 22 represents the computed weight for each of the concepts listed in Table 20. Some of the important concepts can then be selected based on their weight at different granulations, as shown in Table 23.

Table 21 Computed weight for each attribute shown in Table 19
Table 22 Computed weight for each concept of Fig. 4 shown in Table 20
Table 23 Choosing some important concepts from Table 22 based on granulation

We can observe that the analysis derived from the proposed method agrees with the granular tree method given by Bělohlávek et al. (2014). However, the proposed method can be used by any expert or non-expert user, whereas the granular tree method is more suitable for expert users. Furthermore, the proposed method provides a way to refine some of the specific or important concepts at different granulations of their computed weight based on user requirements, with complexity O(\(j \ln (j)\)), where j is the number of chosen attribute subsets. The number of chosen attribute subsets (j) is smaller than the number of attributes (m) in the original context, i.e., \(j<m\). Due to this fact, the proposed method reduces the size of the concept lattice at a lower computational cost than the recently published method by Prem Kumar et al. (2016a) as well as the granular tree given by Bělohlávek et al. (2014). For a deeper understanding, the proposed method is compared with the granular tree method on several parameters, as shown in Table 24.

Table 24 Comparison of granular tree and the proposed method

From Table 24, the following observations can be made:

  • The granular tree method is useful if the user is an expert, whereas the proposed method can be used by non-expert users as well.

  • The granular tree method focuses on partitions of attributes, whereas the proposed method focuses on subsets of attributes. Computing a subset is easier than finding a partition.

  • Computing the granular tree is an NP-hard problem, whereas the proposed method takes O(\(j \ln (j)\)) time, where \(j<m\).

  • The granular tree method can be applied on contexts having related attributes, whereas the proposed method can be applied on any binary context.

  • The granular tree method does not provide any way to encode the concepts or reduce the space complexity. In contrast, the proposed method provides a numerical representation of the formal concepts, which helps in encoding the data.

6 Conclusions and future work

This paper aimed at reducing the size of the concept lattice by considering subsets of attributes as information granules. The proposed method defines a level of granulation for each chosen subset, providing many ways of refining the knowledge. Furthermore, the proposed method gives another way to find specific patterns (concepts) in the obtained context by computing their weight at different granulations. In this process, none of the objects or attributes is removed by the proposed method. To complete these tasks, the proposed method takes O(\(j \ln (j)\)) time, which is computationally less expensive than the granular tree method (Bělohlávek et al. 2014). Moreover, the analysis derived from the proposed method agrees with the granular tree method and provides a more in-depth analysis to refine the knowledge. In future, the work will focus on applications of the proposed method beyond binary attributes and on its extension to interval-valued subset selection.