1 Introduction

A fuzzy association rule, such as {high blood pressure}\(\xrightarrow {0.6}\) {having diabetes}, means that if a person suffers from high blood pressure, then there is about a 60% probability that they also have diabetes. Fuzzy association rules contain fuzzy sets, such as {having diabetes, 0.6}, which can express various types of vagueness and uncertainty in relations.

Fuzzy association rule mining has been applied in Web Intelligence [1], since it can provide interpretable relations for quantitative data with fuzzy sets [2]. It has been widely applied in web mining [3], recommendation systems [4], data analytics [5, 6], decision making [7, 8], natural language processing [9] and so on.

Improving the metrics of fuzzy association rules is a popular way to raise their quality. However, there is usually more than one metric for measuring fuzzy association rules, including quantity metrics (e.g., the number of rules and the number of frequent itemsets) and quality metrics (e.g., fuzzy support and confidence) [10], and each of them can be equally important. Thus, it is essential to optimise multiple objective functions to discover association rules of both high quantity and high quality.

Several existing works address the problem of optimising multiple objective functions for association rule mining. Some apply stochastic optimisation methods, such as the genetic algorithm (GA), ant colony optimisation (ACO), the cross-entropy (CE) method, and reactive search optimisation (RSO) [11]. Others apply deterministic optimisation methods, such as linear programming (LP), mixed-integer linear programming (MILP), non-linear programming (NLP), and mixed-integer non-linear programming (MINLP) [12]. Some, such as [13], propose the simulated water-stream algorithm (SWA) by combining stochastic and deterministic optimisation; the SWA algorithm can solve objective functions that are non-differentiable.

However, two challenges remain for existing work: 1) finding high-performance rules by selecting appropriate partitioning points for the fuzzy sets of a given dataset; and 2) optimising multiple metrics simultaneously to enhance both the quality and the quantity of fuzzy association rules.

Therefore, this paper proposes a multi-objective optimisation based fuzzy association rule mining (MOOFARM) method that improves all metrics in every optimisation step, so that it can generate fuzzy association rules of both high quality and high quantity. Our contribution is threefold.

  • The proposed multi-objective optimisation method, which improves all objectives simultaneously in each optimisation step (Section 3.2), can also be applied to other applications.

  • Our MOOFARM method improves both the quantity and the quality of fuzzy association rules by locating partitioning points for changeable fuzzy sets and division sets, as shown in Section 3.3.

  • Experimental results also show that our MOOFARM method outperforms its counterparts in terms of the quality and quantity of fuzzy association rules, as shown in Section 4.

The rest of this paper is organised as follows. Section 2 reviews existing fuzzy association rule mining methods with an optimisation process. Section 3 proposes our multi-objective optimisation based fuzzy association rule mining (MOOFARM) method. Following this, Section 4 presents our experiments and compares the proposed MOOFARM method with its counterparts. Finally, Section 5 concludes this paper.

2 Related works

The current fuzzy association rule mining methods for multi-objective optimisation fall into three categories: 1) multi-objective approaches; 2) fuzzy logic-based approaches; 3) fuzzy-based multi-objective optimisation approaches.

We first enumerate multi-objective optimisation works. Reference [14] applies a multi-objective genetic approach that can find association rules and improve the accuracy of classification; the fuzzy system generates a high-performance classifier for a real-life power system. Reference [15] inaugurates the concept of fuzzy association rule mining and proposes a multi-objective genetic algorithm (GA). The authors demonstrate improvements in strength, interestingness and comprehensibility. Reference [16] uses a multi-objective genetic approach to extract and optimise fuzzy rules, and tests its effectiveness on a computer activity dataset.

In addition, more efforts have been devoted to multi-objective optimisation. The authors of [17] apply the bat algorithm and introduce a cooperative master-slave strategy for different subpopulations. Various experiments illustrate that the bat algorithm outperforms other bio-inspired algorithms for association rule mining. Furthermore, the same authors improve the bat algorithm based association rule mining method in [18]; their study of cooperative strategies between swarms results in a multi-swarm optimisation method. Reference [17] also applies a hybrid topology that merges the ring and master-slave strategies. The results of multi-swarm optimisation show a clear superiority over its counterparts in terms of time and rule quality. Reference [19] discusses and compares two genetic algorithms: 1) the Genetic Cooperative-Competitive Learning algorithm (GFS.GCCL); 2) the Structural Learning algorithm (SLAVE). The authors employ two public datasets, the Iris dataset and the Wine dataset, in their experiments. The experimental results show that the SLAVE algorithm performs better than the GFS.GCCL algorithm on these two datasets. Reference [20] demonstrates a dimension-reduction technique for transactions and itemsets. The authors apply fuzzy and whale optimisation for frequent itemset discovery and association rule mining. The experimental results illustrate that the proposed algorithm is comparable to the particle swarm optimisation genetic algorithm and the frequent fuzzy itemset miner.

Next, we discuss fuzzy logic-based optimisation. Reference [21] applies a learning automata (LA) representation with a reinforcement-based optimisation method (LA-OMF) to discover membership functions of the time spent by users on each web page (TMFs) when mining fuzzy association rules. The experimental evaluation on a real dataset shows that the LA-OMF method is more efficient in discovering fuzzy association rules with optimised membership functions. Ai et al. [22] present an optimisation method that adaptively optimises partitioning points for fuzzy sets in the fuzzy association rule mining process. The proposed strategy can predict behaviours with unknown system dynamics. The authors implement an FNN-based soft-sensor to adjust the number of fuzzy sets and the optimal image features for better control performance. The strategy is validated in both simulations and real-life experiments.

There is also some work on fuzzy-based multi-objective optimisation approaches. Reference [23] proposes a multi-objective optimisation method and applies it to fuzzy association rule mining, improving the rules’ quality and quantity compared with the FARM method proposed in [24]. The proposed method can reach a better result for at least one of the objectives in each optimisation step. The authors refine this strategy and achieve higher quality and quantity of fuzzy association rules in [25], which inspires us to refine the procedures of multi-objective optimisation. However, it remains an open problem to obtain better results for all objectives in each optimisation step of association rule mining.

3 Multiple objective optimisation for association rule performance metrics

The multi-objective optimisation method based on gradient descent and Schmidt orthogonal projection includes the following two key steps: 1) clarify the rule-quality criteria to be optimised, that is, the objective functions of the multi-objective optimisation; 2) use the Schmidt orthogonal projection and its matrix decomposition to obtain a direction vector that can improve multiple objective functions at the same time, and prove by theorem that the multi-objective optimisation algorithm improves all objective functions in each iteration.

Note that the proposed multi-objective optimisation method can solve the general problem of optimising multiple objectives. The objective functions relating to association rules are just one of its applications.

3.1 Association rules of interestingness and objective functions

Before discussing our objective functions, we borrow the concept of interestingness proposed by Silberschatz and Tuzhilin [26]. The definition of interestingness is shown in Definition 1.

Definition 1

Interestingness in association rule mining describes how valuable an object is on a given occasion. It usually indicates that a condition of an association rule, or an item, is unexpected or actionable for the user in a fixed dataset or on a given occasion.

To measure the ‘interestingness’ of an association rule r, we apply the following function from our previous work [23] in the multi-objective optimisation process.

$$\varphi(r) = \min\left\{ \begin{array}{l} \text{Supp}(r) - \text{minSupp}\\ \text{Conf}(r) - \text{minConf}\\ \text{CF}(r)-\text{minCF} \end{array} \right\}.$$
(1)

Formula (1) is applied to control the performance of mined association rules in this paper, where minSupp, minConf and minCF denote the corresponding user-defined thresholds. A larger value of φ(r) means that the mined association rule is of better quality. To further optimise the quality of mined rules and to compare with our previous work, we also apply the following φ(r)-based objective functions from [23].

  • \({\varPhi }_{1} = \max \limits _{r} \varphi (r)\): the measurement of the best-quality rule among all mined fuzzy association rules;

  • \({\varPhi }_{3} = \max \limits _{r_{1}, r_{2}, r_{3}} \frac{1}{3}{\sum }_{i=1}^{3} \varphi (r_{i})\): the average measurement of the three best-quality rules among all mined fuzzy association rules;

  • Φ5: the average measurement of the five best quality rules among all mined fuzzy association rules;

  • Φ10: the average measurement of the ten best quality rules among all mined fuzzy association rules;

  • Φn/2: the average measurement of the best half of the rules among all mined fuzzy association rules.

When fewer rules are mined than an objective requires, we pad the deficient ones with the worst-quality rules when calculating these objective functions in our experiments. The performance of these objectives will also be compared as measurements of the quality of fuzzy association rules.
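To make these objectives concrete, here is a minimal Python sketch (our own illustration; the Rule container, its attribute names, and the exact padding rule are assumptions consistent with the description above) that computes φ(r) from formula (1) and a generic Φk:

```python
import numpy as np
from dataclasses import dataclass

@dataclass
class Rule:          # hypothetical container for one mined rule
    supp: float      # fuzzy support Supp(r)
    conf: float      # fuzzy confidence Conf(r)
    cf: float        # certainty factor CF(r)

def phi(r, min_supp, min_conf, min_cf):
    """Interestingness phi(r) of formula (1): the worst margin over the thresholds."""
    return min(r.supp - min_supp, r.conf - min_conf, r.cf - min_cf)

def Phi_k(rules, k, min_supp, min_conf, min_cf):
    """Average phi over the k best rules; deficient rules are padded with the worst one."""
    scores = sorted((phi(r, min_supp, min_conf, min_cf) for r in rules), reverse=True)
    scores += [scores[-1]] * max(0, k - len(scores))  # pad with the worst-quality rule
    return float(np.mean(scores[:k]))

rules = [Rule(0.30, 0.75, 0.60), Rule(0.25, 0.70, 0.55), Rule(0.15, 0.62, 0.40)]
print(Phi_k(rules, 1, 0.1, 0.5, 0.3))   # Phi_1 = 0.2
print(Phi_k(rules, 5, 0.1, 0.5, 0.3))   # Phi_5 = 0.1, padded with the worst rule
```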

3.2 Construction of our multiple objective optimisations

How to optimise multiple objective functions at the same time is the primary problem in designing multi-objective optimisation algorithms. The basic idea is to construct a matrix from the gradients of all objective functions and perform Schmidt orthogonal decomposition on this matrix. Our optimisation process then finds an iterative direction vector that improves all objective functions according to the Schmidt orthogonal decomposition.

Suppose there are m objective functions, denoted as

$$\varphi_{1}, \varphi_{2}, {\ldots} , \varphi_{m}$$

respectively. The gradients of these m objective functions are denoted as

$$\nabla \varphi_{1}, \nabla \varphi_{2}, {\ldots} , \nabla \varphi_{m}.$$

All gradients of the objective functions form an n × m matrix G. Then, we have

$$G = [ \nabla \varphi_{1}, \nabla \varphi_{2}, {\ldots} , \nabla \varphi_{m} ].$$

Matrix G has n rows, which means that the matrix GT has n columns.

We seek an n-dimensional vector x such that

$${G^{T}} \mathbf{x} = \mathbf{y} = { [ y_{1}, y_{2}, \ldots, y_{m} ]^{T}},$$

satisfies y1 > 0, y2 > 0, …, ym > 0. Such an n-dimensional vector x is a gradient direction that improves all m objective functions simultaneously.
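As a toy illustration (our own example, not taken from the cited works), let n = m = 2 with gradients ∇φ1 = (1, 0)T and ∇φ2 = (1, 1)T. Then x = (1, 1)T gives

$$G^{T} \mathbf{x} = \begin{bmatrix} 1 & 0 \\ 1 & 1 \end{bmatrix} \begin{bmatrix} 1 \\ 1 \end{bmatrix} = \begin{bmatrix} 1 \\ 2 \end{bmatrix},$$

so both components of y are positive, and a small step along x improves both objectives to first order. To construct such a gradient vector x in general, we introduce the following definitions and lemmas [27, 28].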

Definition 2 (Vector origin perpendicular)

In an n-dimensional linear space, for any non-zero n-dimensional vector v, the hyperplane through the origin consisting of all vectors perpendicular to v is called the origin perpendicular of v [29], denoted ⊥v.

Definition 3 (Vector group of vertical subspace)

In an n-dimensional linear space, for any k < n non-zero n-dimensional vectors v1,v2,…,vk, the vertical subspace of the vector group [28] is defined as follows:

$$\perp_{\mathbf{v}_{1}, \mathbf{v}_{2}, \ldots, \mathbf{v}_{k}} := \perp_{\mathbf{v}_{1}} \cap \perp_{\mathbf{v}_{2}} \cap \ldots \cap \perp_{\mathbf{v}_{k}}.$$

Definition 4 (Vertical projection of vector on vector group)

In an n-dimensional linear space, for any k < n non-zero n-dimensional vectors v1,v2,…,vk and another n-dimensional vector x, the vertical projection of x onto the vector group v1,v2,…,vk is defined as the projection of x onto \(\perp _{\mathbf {v}_{1}, \mathbf {v}_{2}, \ldots , \mathbf {v}_{k}}\) [27], written

$$\mathcal{P}_{\perp_{\mathbf{v}_{1}, \mathbf{v}_{2}, \ldots, \mathbf{v}_{k}}} \mathbf{x}.$$

Lemma 1 (Calculation method of vector vertical projection on vector group)

In an n-dimensional linear space, let k < n non-zero n-dimensional vectors v1,v2,…,vk and another n-dimensional vector x form a linearly independent group of k + 1 vectors; then Schmidt orthogonalisation yields the matrix decomposition

$$\left[ \mathbf{v}_{1}, \mathbf{v}_{2}, \ldots, \mathbf{v}_{k}, \mathbf{x} \right] = Q U,$$

where

$$Q = \left[\mathbf{q}_{1}, \mathbf{q}_{2}, \ldots, \mathbf{q}_{k+1}\right]$$

is an n × (k + 1) matrix satisfying

$${\mathbf{q}_{i}^{T}}{\mathbf{q}_{j} } = \left\{ \begin{array}{c} 1, \quad i = j \\ 0, \quad i \neq j \end{array} \right.$$
(2)
$$U= \begin{bmatrix} u_{1,1} & u_{1,2} & {\ldots} & u_{1,k} & u_{1,k+1} \\ 0 & u_{2,2} & {\ldots} & u_{2,k} & u_{2,k+1} \\ \vdots& {\vdots} &{\ddots} & {\vdots} & {\vdots} \\ 0 & 0 & {\ldots} & u_{k,k} & u_{k,k+1} \\ 0 & 0 & {\ldots} & 0 & u_{k+1,k+1} \end{bmatrix}$$

is a (k + 1) × (k + 1) upper triangular matrix with all positive diagonal elements. Moreover, the following equation holds:

$$\mathcal{P}_{\perp_{\mathbf{v}_{1}, \mathbf{v}_{2}, \ldots, \mathbf{v}_{k}}} \mathbf{x} = u_{k+1,k+1} \mathbf{q}_{k+1}.$$

Proof

First, consider the Schmidt orthogonalisation process, shown below:

$$u_{1,1} = \sqrt{\left \langle \mathbf{v}_{1}, \mathbf{v}_{1} \right \rangle},$$
$$\mathbf{q}_{1} = \frac{1}{u_{1,1}} \mathbf{v}_{1},$$
$$u_{1,2} = \left \langle \mathbf{q}_{1}, \mathbf{v}_{2} \right \rangle,$$
$$\tilde{\mathbf{v}}_{2} = \mathbf{v}_{2} - u_{1,2} \mathbf{q}_{1},$$
$$u_{2,2} = \sqrt{\left \langle \tilde{\mathbf{v}}_{2}, \tilde{\mathbf{v}}_{2} \right \rangle},$$
$$\mathbf{q}_{2} = \frac{1}{u_{2,2}} \tilde{\mathbf{v}}_{2},$$
$$\vdots$$
$$u_{i,j} = \left \langle \mathbf{q}_{i}, \mathbf{v}_{j} \right \rangle\quad \forall i = 1,\ldots,j-1$$
$$\tilde{\mathbf{v}}_{j} = \mathbf{v}_{j} - {\sum}_{i=1}^{j-1} u_{i,j} \mathbf{q}_{i},$$
$$u_{j,j} = \sqrt{\left \langle \tilde{\mathbf{v}}_{j}, \tilde{\mathbf{v}}_{j} \right \rangle},$$
$$\mathbf{q}_{j} = \frac{1}{u_{j,j}} \tilde{\mathbf{v}}_{j},$$
$$\vdots$$
$$u_{i,k+1} = \left \langle \mathbf{q}_{i}, \mathbf{x} \right \rangle\quad \forall i = 1,\ldots,k$$
$$\tilde{\mathbf{x}} = \mathbf{x} - {\sum}_{i=1}^{k} u_{i,k+1} \mathbf{q}_{i},$$
$$u_{k+1,k+1} = \sqrt{\left \langle \tilde{\mathbf{x}}, \tilde{\mathbf{x}} \right \rangle},$$
$$\mathbf{q}_{k+1} = \frac{1}{u_{k+1,k+1}} \tilde{\mathbf{x}},$$

The linear independence of the vector group v1,v2,…,vk,x ensures that ui,i > 0 for all i = 1,2,…,k + 1, so that ui,i can be used as a denominator when calculating qi.

From the above Schmidt orthogonalisation process, we also obtain the following relations:

$$\mathbf{v}_{j} \in \text{span}\left\{ \mathbf{q}_{1},\ldots, \mathbf{q}_{j} \right\} \quad \forall j = 1,2,\ldots,k$$

In addition, the following orthonormality relations hold.

$${\mathbf{q}_{i}^{T}}{\mathbf{q}_{j} } = \left\{ \begin{array}{c} 1, \quad i = j \\ 0, \quad i \neq j \end{array} \right.$$

Therefore,

$$\mathbf{q}_{k+1} \perp \mathbf{v}_{j} \quad \forall j = 1,2,\ldots,k,$$

In other words,

$$\mathbf{q}_{k+1} \in \perp_{\mathbf{v}_{1}, \mathbf{v}_{2}, \ldots, \mathbf{v}_{k}},$$

Since q1,q2,…,qk can be expressed as linear combinations of v1,v2,…,vk, we have

$$\mathbf{q}_{j} \perp \perp_{\mathbf{v}_{1}, \mathbf{v}_{2}, \ldots, \mathbf{v}_{k}},$$

where j < k + 1, then,

$${\sum}_{i=1}^{k} u_{i,k+1} \mathbf{q}_{i} \perp \perp_{\mathbf{v}_{1}, \mathbf{v}_{2}, \ldots, \mathbf{v}_{k}}$$

Finally, with

$$\mathbf{x} = {\sum}_{i=1}^{k+1} u_{i,k+1} \mathbf{q}_{i},$$
$$\mathbf{q}_{k+1} \in \perp_{\mathbf{v}_{1}, \mathbf{v}_{2}, \ldots, \mathbf{v}_{k}},$$

and

$${\sum}_{i=1}^{k} u_{i,k+1} \mathbf{q}_{i} \perp \perp_{\mathbf{v}_{1}, \mathbf{v}_{2}, \ldots, \mathbf{v}_{k}},$$

we have

$$\mathcal{P}_{\perp_{\mathbf{v}_{1}, \mathbf{v}_{2}, \ldots, \mathbf{v}_{k}}} \mathbf{x} = u_{k+1,k+1} \mathbf{q}_{k+1}.$$
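Lemma 1 maps directly onto a thin QR decomposition. The following NumPy sketch (our own illustration; it assumes the columns of V together with x are linearly independent, so all diagonal elements of U are non-zero) computes the vertical projection as uk+1,k+1qk+1:

```python
import numpy as np

def perp_projection(V, x):
    """Vertical projection of x onto the subspace orthogonal to the columns of V,
    computed as u_{k+1,k+1} q_{k+1} via thin QR decomposition (Lemma 1)."""
    k = V.shape[1]
    M = np.column_stack([V, x])   # [v_1, ..., v_k, x], assumed linearly independent
    Q, U = np.linalg.qr(M)        # M = Q U, with U upper triangular
    signs = np.sign(np.diag(U))   # flip signs so diag(U) > 0, as Lemma 1 requires
    Q, U = Q * signs, (U.T * signs).T
    return U[k, k] * Q[:, k]

# Sanity check: the result is orthogonal to every column of V.
V = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])  # two vectors in R^3
x = np.array([1.0, 2.0, 3.0])
p = perp_projection(V, x)
assert np.allclose(V.T @ p, 0.0)
```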

Lemma 2

Let m ≤ n and let the m × n real matrix A have full row rank, that is, r(A) = m. Then a non-zero vector x can be found such that

$$A \mathbf{x} = \mathbf{y} = (y_{1}, y_{2}, \ldots, y_{m})^{T},$$
(3)

satisfies y1 ≥ 0, y2 ≥ 0, …, ym ≥ 0, with at least one yi > 0.

Proof

Denote the i-th row of A by ai,∗. For any integer i in 1,2,…,m, take

$$\mathbf{x} = \mathcal{P}_{\perp_{\mathbf{a}_{1,*}, \ldots, \mathbf{a}_{i-1,*}, \mathbf{a}_{i+1,*}, \ldots, \mathbf{a}_{m,*}}} \mathbf{a}_{i,*},$$

Then, by Lemma 1, we can ensure that x is non-zero, and the components y1,y2,…,ym of

$$\mathbf{y} = A \mathbf{x}$$

satisfy

$$y_{j} = \left\{ \begin{array}{c} \left \langle x, x \right \rangle, \quad i = j \\ 0, \quad i \neq j \end{array} \right.$$

Theorem 1

Suppose m ≤ n and the m × n real matrix A has full row rank; define

$$\mathbf{x} := {\sum}_{i=1}^{m} \mathcal{P}_{\perp_{\mathbf{a}_{1,*}, \ldots, \mathbf{a}_{i-1,*}, \mathbf{a}_{i+1,*}, \ldots, \mathbf{a}_{m,*}}} \mathbf{a}_{i,*}$$
(4)

Then the components y1,y2,…,ym of

$$\mathbf{y} = A \mathbf{x},$$

satisfy

$$y_{1} > 0, y_{2} > 0, \ldots, y_{m} > 0$$

Proof

According to the proof of Lemma 2, for

$$\mathbf{x}_{i} := \mathcal{P}_{\perp_{\mathbf{a}_{1,*}, \ldots, \mathbf{a}_{i-1,*}, \mathbf{a}_{i+1,*}, \ldots, \mathbf{a}_{m,*}}} \mathbf{a}_{i,*}$$

only the i-th component of the vector Axi equals \(\left \langle \mathbf {x}_{i}, \mathbf {x}_{i} \right \rangle > 0\), while all other components are 0. Then, the sum

$$\mathbf{x} = {\sum}_{i=1}^{m} \mathbf{x}_{i}$$

satisfies the conclusion of this theorem. □

Note that Theorem 1 not only proves the existence of x but also gives a method for constructing it, and this construction is easy to implement in a computer program. Based on this theorem, determining the search direction for multi-objective optimisation amounts to constructing the gradient matrix corresponding to the objective functions,

$$A = \begin{bmatrix} \nabla {\varphi_{1}^{T}} \\ \nabla {\varphi_{2}^{T}} \\ {\vdots} \\ \nabla {\varphi_{m}^{T}} \end{bmatrix},$$
(5)

We can then solve for the corresponding x of formula (4) via Theorem 1; that is, Ax = y = (y1,y2,…,ym)T satisfies y1 > 0, y2 > 0, …, ym > 0. The vector y in Theorem 1 corresponds to the degree of improvement of each objective function, and the direction vector x given by the theorem improves all objectives simultaneously. The direction vector for optimising the multi-objective function is therefore obtained.

The requirement r(A) = m does limit the applicability of the theorem. In practical applications, however, r(A) = m holds with high probability during the iterative optimisation process. Theoretically, at the optimal solution of the optimisation problem the matrix A degenerates to r(A) = 1; however, the optimal solution is not reached in a finite number of steps, and it can only be detected by comparison. Moreover, solving for x is easier when the row vectors of A do not have full rank.

Algorithm 1

Algorithm 1 shows the process of finding the direction for multi-objective optimisation in lines 1–3. Line 1 is the initialisation step, line 2 calculates the direction according to Theorem 1, and line 3 returns the direction vector η.
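Since Algorithm 1 is only available as a figure here, the following NumPy sketch (our reconstruction, reusing perp_projection from the Lemma 1 sketch above) implements the Theorem 1 construction of the direction from the gradient matrix A of formula (5):

```python
def find_direction(A):
    """Theorem 1: eta = sum over i of the projection of row a_i onto the subspace
    orthogonal to all other rows; A is the m x n gradient matrix of formula (5),
    assumed to have full row rank."""
    m, n = A.shape
    eta = np.zeros(n)
    for i in range(m):
        others = np.delete(A, i, axis=0).T  # columns: the other m-1 gradients
        eta += perp_projection(others, A[i])
    return eta

# Toy usage: two objective gradients in R^3.
A = np.array([[1.0, 0.0, 1.0],
              [0.0, 1.0, 1.0]])
eta = find_direction(A)
assert np.all(A @ eta > 0)  # every objective improves along eta
```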

Theorem 2

In each optimisation iteration process of multi-objective optimisation, if the vector group formed by the gradient vectors of these objective functions is linearly independent, there is a direction vector η that can optimise all objective functions at the same time.

Proof

This is a direct corollary of Theorem 1. Note that the linear independence of the vector group here simply ensures that the corresponding matrix has full row rank. □

Combined with Algorithm 1, Theorem 2 is further explained as follows:

  • The selected interval division points can be gradually optimised through the iterative optimisation process of Algorithm 1.

  • The fuzzy sets and their membership values are not updated within Algorithm 1; their update process needs further research.

Determining the objective functions of the multi-objective optimisation algorithm is the process of continuously optimising the quality evaluation metrics. This process is the core of the multi-objective optimisation algorithm and is how better association rules are discovered. Therefore, while maintaining the current multi-objective optimisation process, the construction of fuzzy sets and the related membership functions must also be considered.

3.3 Find the direction for global multi-objective optimisation

This subsection further improves our multi-objective optimisation method by applying random partitioning points for the division sets. We also compare the optimisation parameters of our MOOFARM approach with those of our previous work, DOFARM.

Algorithm 2
Fig. 1 Fuzzy set optimisation comparison of our proposed method and the most related work

Algorithm 2 shows our global multi-objective optimisation process in lines 1–15. Lines 1–4 are the initialisation steps. In lines 5–14, we draw random parameters for the multiple objectives and apply our proposed optimisation method to them. Line 7 compares the best objectives across each group of randomly drawn parameters, and line 8 keeps the parameters that achieve the best objectives according to Algorithm 1. In this way, lines 11–13 find the best direction for our global multi-objective optimisation, and line 15 returns it.
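Algorithm 2 likewise appears only as a figure, so the sketch below is our reconstruction of its random restart structure; the names objectives, n_restarts, n_iters and step are our assumptions, and find_direction comes from the Algorithm 1 sketch above:

```python
def global_direction(objectives, init_points, n_restarts=10, n_iters=100,
                     step=0.01, seed=None):
    """Sketch of Algorithm 2: restart from randomly perturbed division/partitioning
    points, run gradient-based multi-objective steps, keep the best parameters.
    `objectives(params)` is assumed to return (objective values, gradient matrix A),
    with A full row rank so find_direction yields a non-zero direction."""
    rng = np.random.default_rng(seed)
    best_params, best_score = init_points, -np.inf
    for _ in range(n_restarts):                      # random restarts (lines 5-14)
        params = init_points + rng.normal(0.0, 0.05, size=init_points.shape)
        for _ in range(n_iters):                     # inner multi-objective loop
            _, A = objectives(params)
            eta = find_direction(A)                  # direction from Algorithm 1
            params = params + step * eta / np.linalg.norm(eta)
        values, _ = objectives(params)
        if values.sum() > best_score:                # keep the best objectives (lines 7-8)
            best_score, best_params = values.sum(), params
    return best_params                               # best parameters found (line 15)
```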

According to our previous work [25], division sets construct the fuzzy sets for fuzzy association rules, and five division sets can generate three fuzzy sets. As shown in Figure 1, there are five division sets, and the two partitioning points of the three fuzzy sets are related to the second and the fourth division sets, respectively. Therefore, the two partitioning points of the fuzzy sets can be adjusted by changing the four partitioning points of the division sets.

The most related work, the DOFARM method, adjusts and optimises the partitioning points of fuzzy sets to achieve high performance of fuzzy association rules. In contrast, our proposed MOOFARM optimises the partitioning points of both fuzzy sets and division sets. In this way, MOOFARM is more robust and achieves better optimisation results, mining fuzzy association rules of higher quality and quantity.
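To make the relation between division sets and fuzzy sets concrete, here is a minimal sketch (our own simplified illustration; the exact membership shapes used in [25] may differ) that builds three triangular fuzzy sets from five sorted division points, where moving d[1] and d[3] adjusts the partitioning points of the fuzzy sets:

```python
import numpy as np

def triangular_fuzzy_sets(d):
    """Three triangular membership functions from five strictly increasing
    division points d[0] < d[1] < d[2] < d[3] < d[4]."""
    def tri(a, b, c):  # triangle with feet a, c and peak b
        return lambda x: float(np.maximum(
            np.minimum((x - a) / (b - a), (c - x) / (c - b)), 0.0))
    return tri(d[0], d[1], d[2]), tri(d[1], d[2], d[3]), tri(d[2], d[3], d[4])

low, medium, high = triangular_fuzzy_sets([0.0, 2.0, 4.0, 6.0, 8.0])
print(low(1.0), medium(4.0), high(7.0))  # 0.5 1.0 0.5
```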

4 Experimental results

This section describes our experimental study of the proposed multi-objective optimisation based fuzzy association rule mining (MOOFARM) method. To show its performance, we compare it with the most related work: a general fuzzy association rule mining (FARM) method [24] and the DOFARM method from our previous work [25]. The ‘pima-indians-diabetes’ dataset [30] from the UCI Machine Learning Repository is used in our experiments.

4.1 Comparison with the number of frequent itemsets and fuzzy association rules

This subsection compares our MOOFARM method with the existing FARM and DOFARM methods in terms of the number of frequent itemsets and the number of fuzzy association rules.

Fig. 2 Comparison of the number of frequent itemsets. a Confidence = 0.5; b Confidence = 0.6; c Confidence = 0.7

Fig. 3 Comparison of the number of fuzzy association rules. a Confidence = 0.5; b Confidence = 0.6; c Confidence = 0.7

Figure 2 illustrates the number of frequent itemsets (Nfreq) for our proposed MOOFARM method, the DOFARM method and the FARM method. The performance at three different minimum confidences (0.5, 0.6, 0.7) is shown in sub-figures (a), (b) and (c) of Figure 2, respectively. Our proposed MOOFARM method produces more frequent itemsets than both the FARM and DOFARM methods; for example, at minimum support = 0.1 and minimum confidence = 0.7, our MOOFARM (≈ 1300) exceeds both DOFARM (≈ 400) and FARM (≈ 400) by (1300 − 400)/400 = 2.25 times (225%).

Figure 3 compares the number of fuzzy association rules (Nrules) for our proposed MOOFARM method, the DOFARM method and the FARM method. For each minimum confidence in {0.5, 0.6, 0.7}, our proposed MOOFARM method mines more fuzzy association rules than both the FARM and DOFARM methods. Similarly, taking minimum support = 0.1 and minimum confidence = 0.7 as an example, our MOOFARM (≈ 600) exceeds DOFARM (≈ 100) and FARM (≈ 50) by 5 times and 11 times, respectively.

4.2 Comparison with the quality of fuzzy association rules

This subsection compares our MOOFARM method with the existing FARM and DOFARM methods in terms of the quality of fuzzy association rules. In this paper, quality is measured by φ(r) as defined in formula (1).

Fig. 4 Comparison of the best fuzzy association rule. a Confidence = 0.5; b Confidence = 0.6; c Confidence = 0.7

Fig. 5 Comparison of the best three fuzzy association rules. a Confidence = 0.5; b Confidence = 0.6; c Confidence = 0.7

Fig. 6 Comparison of the best five fuzzy association rules. a Confidence = 0.5; b Confidence = 0.6; c Confidence = 0.7

Fig. 7 Comparison of the best ten fuzzy association rules. a Confidence = 0.5; b Confidence = 0.6; c Confidence = 0.7

Fig. 8 Comparison of the best half of the fuzzy association rules. a Confidence = 0.5; b Confidence = 0.6; c Confidence = 0.7

Figure 4 compares the quality φ(r) of the best fuzzy association rule (Best), as defined in formula (1), for our proposed MOOFARM method, the DOFARM method and the FARM method. At each minimum confidence in {0.5, 0.6, 0.7} in Figure 4, our proposed MOOFARM method always achieves a better φ(r) than both the FARM and DOFARM methods. For example, at minimum support = 0.1 and minimum confidence = 0.7, our MOOFARM (≈ 0.68) is higher than DOFARM (≈ 0.52) and FARM (≈ 0.42) by 30.77% and 61.90%, respectively.

As shown in Figure 5, the quality φ(r) of the best three fuzzy association rules (BestThree) mined by our proposed MOOFARM method is better than that of the DOFARM and FARM methods at each minimum confidence in {0.5, 0.6, 0.7}. At minimum support = 0.1 and minimum confidence = 0.7, for example, our MOOFARM (≈ 0.67) is higher than DOFARM (≈ 0.51) and FARM (≈ 0.42) by 31.37% and 59.52%, respectively.

Figure 6 shows that the quality φ(r) of the best five fuzzy association rules (BestFive) mined by our proposed MOOFARM method is higher than that of the DOFARM and FARM methods at each minimum confidence in {0.5, 0.6, 0.7}. For instance, at minimum support = 0.1 and minimum confidence = 0.7, our MOOFARM (≈ 0.66) is higher than DOFARM (≈ 0.5) and FARM (≈ 0.4) by 32% and 65%, respectively.

Figure 7 illustrates the quality φ(r) of the best ten fuzzy association rules (BestTen) for our proposed MOOFARM method, the DOFARM method and the FARM method at each minimum confidence in {0.5, 0.6, 0.7}. Our proposed MOOFARM method’s φ(r) is always higher than that of FARM and DOFARM. Taking minimum support = 0.1 and minimum confidence = 0.7 as an example, our MOOFARM (≈ 0.65) is higher than DOFARM (≈ 0.45) and FARM (≈ 0.38) by 44.44% and 71.05%, respectively.

Figure 8 uses the same minimum confidences (0.5, 0.6, 0.7) to present the quality of the best half of the rules (BestHalf) for all three methods. Our proposed MOOFARM method’s φ(r) is much higher than that of both the FARM and DOFARM methods. For example, at minimum support = 0.1 and minimum confidence = 0.7, the proposed MOOFARM method (≈ 0.46) improves the quality of fuzzy association rules by 64.28% compared with both the DOFARM method (≈ 0.28) and the FARM method (≈ 0.28).

According to Figures 2 and 3, our proposed MOOFARM method mines more frequent itemsets and more fuzzy association rules than the FARM and DOFARM methods. Also, Figures 4, 5, 6, 7 and 8 make it evident that the quality of the fuzzy association rules mined by our proposed MOOFARM method is higher than that of its two counterparts. Specifically, at minimum support = 0.1 and minimum confidence = 0.7, the proposed MOOFARM method increases the quantity of fuzzy association rules by approximately 2.25 to 5 times over DOFARM and 2.25 to 11 times over FARM, while its quality improves by around 30.77% to 64.28% over DOFARM and 59.52% to 71.05% over FARM.

5 Conclusion

This paper first proposes a general multi-objective optimisation method that improves all objectives simultaneously in each optimisation step. It is then applied to fuzzy association rule mining to build our multi-objective optimisation-based fuzzy association rule mining (MOOFARM) model. In addition, we employ multiple metrics as optimisation objectives to improve both the quality and the quantity of association rules. Moreover, our MOOFARM method achieves better performance because it optimises partitioning points in two rounds: for the different fuzzy sets and for the dynamic division sets. Experiments also show that our MOOFARM method outperforms its two counterparts. Specifically, at minimum support = 0.1 and minimum confidence = 0.7, our MOOFARM increases the quantity metrics by up to 11 times, and the quality of fuzzy association rules by up to 71.05%.