Abstract
Decision-theoretic rough set (DTRS) model, proposed by Yao in the early 1990’s, introduces Bayesian decision procedure and loss function in rough set theory. Considering utility function in decision processing, utility-based decision-theoretic rough set model (UDTRS) is given in this paper. The utility of the positive region, the boundary region and the negative region are obtained respectively. We provide a reduction definition which can obtain the maximal utility in decisions. A heuristic reduction algorithm with respect to the definition is proposed. Finally, experimental results show the proposed algorithm is effective.
Access provided by CONRICYT-eBooks. Download conference paper PDF
Similar content being viewed by others
Keywords
1 Introduction
Decision-theoretic rough set (DTRS) model was firstly introduced by Yao et al. [1] in the early 1990’s. As a probabilistic rough set model, it has been successfully used in many research areas, such as knowledge presentation [2,3,4], data mining [5], machine learning [6], artificial intelligence [7, 8] and pattern recognition.
Attribute reduction [9,10,11,12,13,14] aims to remove the unnecessary attributes from the information system while keeping the particular property, and becomes one of the hottest issues in rough set theory. Yao and Zhao [9] studied attribute reduction in decision-theoretic rough set models with respect to the different classification properties, confidence, coverage, decision-monotocity, generality and cost, they also gave a general definition of probabilistic attribute reduction. Jia et al. [10] provided a minimum cost attribute reduction in decision-theoretic rough set model, and decision cost induced from the reduct is minimum. Dou et al. proposed a parameterized decision-theoretic rough set model in the paper [11]. In the proposed model, the smallest possible cost and the largest possible cost are computed respectively. Li et al. [12] introduced a non-monotonic attribute reduction for decision-theoretic rough set model. The expanded positive region can be kept by the non-monotonic attribute reduction in an information system. To extend classical indiscernibility relation in Yao’s decision-theoretic rough sets, Ju et al. [13] gave the δ-cut decision-theoretic rough set. In the proposed decision-theoretic rough set model, attribute reduction of the decision-monotonicity criterion and the cost minimum criterion are proposed respectively in the paper. By constructing variants of conditional entropy in decision-theoretic rough set model, Ma et al. [14] proposed solutions to the attribute reduction based on decision region preservation.
The remaining parts of this paper are arranged as follows. Some basic notions with respect to utility-based decision-theoretic rough set (UDTRS) model are briefly recalled in Sect. 2. Definition of attribute reduction in UDTRS and relative heuristic reduction algorithm are investigated respectively in Sect. 3. We give the experimental analysis in Sect. 4. The paper is summarized in Sect. 5.
2 Utility-Based Decision-Theoretic Rough Sets
By considering the subjective factors in risk decision, Zhang et al. [15] proposed utility-based decision-theoretic rough set (UDTRS) model based on Yao’s decision-theoretic rough set model [1]. In this section, we briefly recall some basic notions about utility-based decision-theoretic rough set model. Detailed information about UTRS can be found in the paper [15].
A decision system is defined as the 3-tuple \( S = (U,AT = C{ \cup }D,V_{a} ) \).Universe \( U \) is the finite set of the objects; \( AT \) is a nonempty set of the attributes, such that for all \( a\,{ \in }\,AT \); \( C \) is the set of conditional attribute and \( D \) is the set of decision attribute; \( V_{a} \) is the domain of attribute \( a \).
For each nonempty subset \( A \subseteq AT \), the indiscernibility relation \( IND(A) \) is defined as: \( IND(A) = \{ (x,y) \in U^{2} ,a(x) = a(y),{\forall }a \in A\} \). Two objects in \( U \) satisfy \( IND(A) \) if and only if they have the same value in \( \forall a \in A \). \( U \) is divided into a family of disjoint subsets \( U/IND(A) \) defined a quotient set of \( U \) as \( U/IND(A) = \{ [x]_{A} :x \in U\} \), where \( [x]_{A} = \{ y \in U:(x,y)\,{ \in }\,IND(A)\} \) denotes the equivalence class determined by \( x \) with respect to \( A \). The set of states is given by \( \Omega = U /D = \{ X,X^{c} \} \) indicating that an object is in state \( X \) or \( X^{c} \).
Utility is an important economic concept, and it reflects degree of one’s satisfaction related to the cost or profit in decision procedure. For \( \forall x \in U \) and \( [x] \in U /\pi \), \( u(\lambda ) \) is utility function, \( \lambda \) denotes the cost of taking action. The expected utilities for different actions can be expressed as:
According to maximal utility in Bayesian procedures, we have the following as:
If \( P(X|[x]) = P \) then \( P(X^{c} |[x]) = 1 - P \), then we derived the following decision rules:
where
Since \( u(\lambda_{PP} ) \ge u(\lambda_{BP} ) > u(\lambda_{NP} ),\,u(\lambda_{NN} ) \ge u(\lambda_{BN} ) > u(\lambda_{PN} ),\,\alpha_{u} \in (0,1],\,\beta_{u} \in [0,1) \) Then, we can obtain
For \( \forall X \subseteq U,\,\,(\alpha_{u} ,\beta_{u} ) \) -upper and lower approximations in utility-based decision-theoretic rough set model are presented as:
Based on the definition of rough approximations in UDTRS, the positive, boundary and negative regions are defined as
3 Attribute Reduction in UDTRS
In this section, we will give the definition of attribute reduction based on maximal utility in UDTRS. By attribute reduction, the maximal utility will be obtain in decisions. According to the proposed definition of reduction, a heuristic algorithm with respect to the maximal utility will be investigated in this section.
Similar to the Bayesian expected cost [10] in decision-theoretic rough set model, the Bayesian expected utility [15] of each rule is expressed as:
From above, we can easily get the Bayesian expected utility of decision rules:
Utility of positive rules:
Utility of boundary rules:
Utility of negative rule:
For any subset \( A \subseteq AT \), the whole utility is composed of three parts: utility of positive region, utility of boundary region and utility of negative region. Then, we have the whole utility of all decision rules in decision systems as follows [15]:
In real applications, it is better to obtain more utility in decision procedures. Thus, according to “non-decreasing” principle, we define attribute reduction in utility-based decision-theoretic rough set model as follows:
Definition 1.
A decision system \( S = (U,C \cup D,V_{a} ),\,R \subseteq C \) is a reduct of \( C \) with respect to \( D \) if it satisfies the following two conditions:
-
(1)
\( Utility_{R} \ge Utility_{C} ; \)
-
(2)
\( \forall Rh^{{\prime }} \subset R,\,Utility_{{R^{{\prime }} }} < Utility_{R} . \)
From Definition 1, the decision utility will be increased or unchanged by the reduction. Condition (1) is the jointly sufficient condition and condition (2) is the individual necessary condition. Condition (1) guarantees that the utility induced from the reduct is maximal, and condition (2) guarantees the reduct is minimal.
The fitness function, which shows the significance of an attribute, is usually used to construct a heuristic algorithm in rough set theory. In UTRS model, the fitness function is defined as:
Definition 2.
A decision system \( S = \{ U,C \cup D,V_{a} \} ,\,A \subseteq C \) The utility fitness function of attribute \( a_{i} \in A \) is defined as:
The three strategies in heuristic algorithm is summarized in paper [9]. In this paper, we take deletion strategy to give an algorithm in UDTRS. The heuristic algorithm (The algorithm of maximal-utility attribute reduction, AMUAR) based on the utility fitness function is described as follows:
The fitness function shows the significance of an attribute. In the processing of deleting attributes, if \( Utility_{B} \ge Utility_{C} \), the algorithm will stop the deleting procedure and output reduct of decision systems.
4 Experimental Analysis
In this section, we will verify effectiveness of the algorithm AMUAR and the monotonicity of utility with attributions by experiments. All the experiments have been carried out on a personal computer with Windows 7, Intel (R) Pentium (R) CPU G640 (2.8 GHz) and 6.00 GB memory. The programming language is Matlab 2010b.
We take \( u(\lambda ) = a( - \lambda + c)^{b} \) as the utility function. If \( 0 < b < 1 \), then UDTRS model is risk aversion; If \( b = 1 \), UDTRS model is risk neutrality; If \( b > 1 \), UDTRS model is risk loving; Six data sets from the UCI Machine Learning Repository are used. For each data set, the utility functions are randomly generated in interval value [100, 1000]. Their values meet the following constraint conditions: \( u(\lambda_{BP} ) > u(\lambda_{NP} ),\,u(\lambda_{BN} ) > u(\lambda_{PN} ),\,u(\lambda_{PP} ) = u(\lambda_{NN} ) = 1 \). 10 different groups of utility functions are randomly generated. Table 1 shows the average length of the derived reduct with different data sets.
To validate the monotonicity of utility with attributes, utility is calculated with the increasing number of attributes from 1 to the total attribute number in each data set. In Fig. 1, the x-coordinate represents the number of attributes, and the y-coordinate represents the utility of three models. Figure 1 shows the utility of three models do not strictly increase with the increasing of attribute numbers. For example, the utility decrease with adding an attribute in data set credit_a, forestfires and vote. The utility with the number of attributes increasing do not present monotonicity strictly.
5 Conclusions
Utility-based decision-theoretic rough set model is introduced in this paper. The utility of the positive region, the boundary region and the negative region are given respectively. We provide a definition of reduction which aims to obtain the maximal utility in decisions. A heuristic reduction algorithm with respect to the definition is proposed. Finally, experimental results show the proposed algorithm is effective.
References
Yao, Y.Y., Wong, S.K.M., Lingras, P.: A decision-theoretic rough set model. In: Ras, Z.W., Zemankova., M., and Emrichm M.L. (eds.) Proceedings of the 5th International Symposium on Methodologies for Intelligent Systems, 25–27 October 1990. North-Holland, New York (1990)
Liu, D., Liang, D.C., Wang, C.C.: A novel three-way decision model based on incomplete information system. Knowl.-Based Syst. 91, 32–45 (2016)
Deng, X.F., Yao, Y.Y.: Decision-theoretic three-way approximations of fuzzy sets. Inf. Sci. 279, 702–715 (2014)
Herbert, J.P., Yao, J.T.: Game-theoretic rough sets. Fundamenta Informaticae 108, 267–286 (2011)
Yu, H., Liu, Z.G., Wang, G.Y.: An automatic method to determine the number of clusters using decision-theoretic rough set. Int. J. Approx. Reason. 55, 101–115 (2014)
Zhang, H.R., Min, F.: Three-way recommender systems based on random forests. Knowl.-Based Syst. 91, 275–286 (2016)
Zhou, B., Yao, Y.Y., Luo, J.G.: Cost-sensitive three-way email spam filtering. J. Intell. Inf. Syst. 42, 19–45 (2014)
Liang, D.C., Pedrycz, W., Liu, D., Hu, P.: Three-way decisions based on decision-theoretic rough sets under linguistic assessment with the aid of group decision making. Appl. Soft Comput. 29, 256–269 (2015)
Yao, Y.Y., Zhao, Y.: Attribute reduction in decision-theoretic rough set model. Inf. Sci. 178, 3356–3373 (2008)
Jia, X.Y., Liao, W.H., Tang, Z.H., Shang, L.: Minimum cost attribute reduction in decision-theoretic rough set models. Inf. Sci. 219, 151–167 (2013)
Dou, H.L., Yang, X.B., Song, X.N., et al.: Decision-theoretic rough set: a multicost strategy. Knowl.-Based Syst. 91, 71–83 (2016)
Li, H.X., Zhou, X.Z., Zhao, J.B.: Non-monotonic attribute reduction in decision-theoretic rough sets. Fundamenta Informaticae 126, 415–432 (2013)
Ju, H.R., Dou, H.L., Qi, Y., Yu, H.L., Yu, D.J., Yang, J.Y.: δ-cut decision-theoretic rough set approach: model and attribute reductions. Sci. World J. 2014, 1–12 (2014)
Ma, X.A., Wang, G.Y., Yu, H., Li, T.R.: Decision region distribution preservation reduction in decision-theoretic rough set model. Inf. Sci. 278, 614–640 (2014)
Zhang, N., Jiang, L.L., Yue, X.D., Zhou, J.: Utility-based three-way decisions model. CAAI Trans. Intell. Syst. 11, 459–468 (2016)
Acknowledgements
This work was partially supported by the National Natural Science Foundation of China (Nos. 61403329, 61572418, 61663002, 61502410, 61572419), the Natural Science Foundation of Shandong Province (No. ZR2013 FQ020).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Zhang, N., Jiang, L., Liu, C. (2017). Attribute Reduction in Utility-Based Decision-Theoretic Rough Set Models. In: Polkowski, L., et al. Rough Sets. IJCRS 2017. Lecture Notes in Computer Science(), vol 10314. Springer, Cham. https://doi.org/10.1007/978-3-319-60840-2_27
Download citation
DOI: https://doi.org/10.1007/978-3-319-60840-2_27
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-60839-6
Online ISBN: 978-3-319-60840-2
eBook Packages: Computer ScienceComputer Science (R0)