Abstract
Association rule mining is an important data analysis method that can discover associations within data. There are numerous previous studies that focus on finding fuzzy association rules from precise and certain data. Unfortunately, real-world data tends to be uncertain due to human errors, instrument errors, recording errors, and so on. Therefore, a question arising immediately is how we can mine fuzzy association rules from uncertain data. To this end, this paper proposes a representation scheme to represent uncertain data. This representation is based on possibility distributions because the possibility theory establishes a close connection between the concepts of similarity and uncertainty, providing an excellent framework for handling uncertain data. Then, we develop an algorithm to mine fuzzy association rules from uncertain data represented by possibility distributions. Experimental results from the survey data show that the proposed approach can discover interesting and valuable patterns with high certainty.
Article PDF
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
References
Agrawal R, Imielinski T, Swami A (1993) Mining association rules between sets of items in large databases. In: Proceedings of ACM SIGMOD, Washington, DC, pp 207–216
Agrawal R, Srikant R (1994) Fast algorithms for mining association rules. In: Proceedings of the 20th international conference on very large data bases. Santiago, Chile, pp 487–499
Alcala-Fdez J, Alcala R, Gacto MJ, Herrera F (2009) Learning the membership function contexts for mining fuzzy association rules by using genetic algorithms. Fuzzy Set Syst 160(7): 905–921
Arslan A, Kaya M (2001) Determination of fuzzy logic membership functions using genetic algorithms. Fuzzy Set Syst 118(2): 297–306
Berry M, Linoff G (1997) Data mining techniques: for marketing, sales, and customer support. Wiley, New York
Berti-Equille L (2007) Data quality awareness: a case study for cost-optimal association rule mining. Knowl Inf Syst 11(2): 191–215
Chen YL, Ho CY (2005) A sampling-based method for mining frequent patterns from databases. Lect Notes Artif Int 3614: 536–545
Chen YL, Huang TCK (2005) Discovering fuzzy time-interval sequential patterns in sequence databases. IEEE Trans Syst Man Cybern B 35(5): 959–972
Chen YL, Huang TCK (2006) A new approach for discovering fuzzy quantitative sequential patterns in sequence databases. Fuzzy Set Syst 157(12): 1641–1661
Chen YL, Shen CC (2005) Mining generalized knowledge from ordered data through attribute-oriented induction techniques. Eur J Oper Res 166(1): 221–245
Chen YL, Tang K, Shen RJ, Hu YH (2005) Market basket analysis in a multiple store environment. Decis Support Syst 40(2): 339–354
Chen YL, Weng CH (2008) Mining association rules from imprecise ordinal data. Fuzzy Set Syst 159(4): 60–474
Cheng J, Ke Y, Ng W (2008) A survey on algorithms for mining frequent itemsets over data streams. Knowl Inf Syst 16(1): 1–27
Cheung DW, Ng VT, Fu AW, Fu YJ (1996) Efficient mining of association rules in distributed databases. IEEE Trans Knowl Data Eng 8(6): 911–922
Chui C, Kao B, Hung E (2007) Mining frequent itemsets from uncertain data. Lect Notes Comput Sci 4426: 47–58
Chung SM, Luo C (2008) Efficient mining of maximal frequent itemsets from databases on a cluster of workstations. Knowl Inf Syst 16(3): 359–391
Conci A, Castro EMMM (2002) Image mining by content. Expert Syst Appl 23(4): 377–383
De SK, Krishna PR (2004) Clustering web transactions using rough approximation. Fuzzy Set Syst 148(1): 131–138
Delgado M, Marin N, Sanchez D, Vila MA (2003) Fuzzy association rules: general model and applications. IEEE Trans Fuzzy Syst 11(2): 214–225
Dempster AP (1967) Upper and lower probabilities induced by a multivalued mapping. Anuals Math Stat 38: 325–339
Denguir-Rekik A, Mauris G, Montmain J (2006) Propagation of uncertainty by the possibility theory in Choquet integral-based decision making: application to an E-commerce website choice support. IEEE Trans Instrum Meas 55(3): 721–728
Djouadi Y, Redaoui S, Amroun K (2007) Mining fuzzy association rules from uncertain data. In: IEEE international fuzzy systems conference, London, pp 1-6
Dubois D (2006) Possibility theory and statistical reasoning. Comput Stat Data Anal 51(1): 47–69
Dubois D, Prade H (1988) Representation and combination of uncertainty with belief functions and possibility measures. Comput Intell 4(3): 244–264
Eick CF, Rouhana A, Bagherjeiran A, Vilalta R (2006) Using clustering to learn distance functions for supervised similarity assessment. Eng Appl Artif Intel 19(4): 395–401
Han J, Fu Y (1995) Discovery of multiple-level association rules from large databases. In: Proceedings of 1995 international conference on very large databases. Zurmh, Switzerland, pp 420–431
Han J, Kamber WM (2001) Data mining: concepts and techniques. Morgan Kaufmann, San Francisco
Holt JD, Chung SM (2002) Mining association rules using inverted hashing and pruning. Inform Process Lett 83(4): 211–220
Hong TP, Chen JB (1999) Finding relevant attributes and membership functions. Fuzzy Set Syst 103(3): 389–404
Hong TP, Kuo CS, Wang SL (2004) A fuzzy AprioriTid mining algorithm with reduced computational time. Appl Soft Comput 5(1): 1–10
Hong TP, Lee CY (1996) Induction of fuzzy rules and membership functions from training examples. Fuzzy Set Syst 84(1): 33–47
Hong TP, Lin KY, Wang SL (2003) Fuzzy data mining for interesting generalized association rules. Fuzzy Set Syst 138(2): 255–269
Hsu HM, Wang WP (2001) Possibilistic programming in production planning of assemble-to-order environments. Fuzzy Set Syst 119(1): 59–70
Hsu PY, Chen YL, Ling CC (2004) Algorithms for mining association rules in bag databases. Inf Sci 166(1): 31–47
Hu YC, Chen RS, Tzeng GH (2003) Discovering fuzzy association rules using fuzzy partition methods. Knowl Based Syst 16(3): 137–147
Hu YC, Tzeng GH (2003) Elicitation of classification rules by fuzzy data mining. Eng Appl Artif Intel 16(7-8): 709–716
Hüllermeier E (2003) Possibilistic instance-based learning. Artif Intell 148((1-2): 335–383
Ke Y, Cheng J, Ng W (2008) An information-theoretic approach to quantitative association rule mining. Knowl Inf Syst 16(2): 245–258
Lee G, Lee KL, Chen ALP (2001) Efficient graph-Based algorithms for discovering and maintaining association rules in large databases. Knowl Inf Syst 3(3): 338–355
Leung CW, Chan SC, Chung F (2006) A collaborative filtering framework based on fuzzy association rules and multiple-level similarity. Knowl Inf Syst 10(3): 357–381
Loiseau Y, Prade H, Boughanem M (2004) Qualitative pattern matching with linguistic terms. AI Commun 17(1): 25–34
Ordonez C, Ezquerra N, Santana CA (2006) Constraining and summarizing association rules in medical data. Knowl Inf Syst 9(3): 259–283
Oussalah M, Maaref H, Barret C (2001) New fusion methodology approach and application to mobile robotics: investigation in the framework of possibility theory. Inf Fusion 2(1): 31–48
Pedrycz W (1998) Fuzzy set technology in knowledge discovery. Fuzzy Set Syst 98(3): 279–290
Prade H, Testemale C (1984) Generalizing database relational algebra for the treatment of incomplete or uncertain information and vague queries. Inf Sci 34(22): 115–143
Rastogi R, Shim K (2001) Mining optimized support rules for numeric attributes. Inf Syst 26(6): 425–444
Shafer G. (1976) A Mathematical Theory of Evidence. Princeton University Press, Princeton
Shu JY, Tsang ECC, Yeung DS (2001) Query fuzzy association rules in relational database. In: IFSA World Congress and 20th NAFIPS international conference. Vancouver, BC, Canada, pp 2989-2993
Shyu ML, Haruechaiyasak C, Chen SC, Premaratne K, Mining association rules with uncertain item relationships, in [http://www.cs.fiu.edu/~chens/PDF/SCI02.pdf]
Wu X, Zhang C, Zhang S (2005) Database classification for multi-database mining. Inform Syst 30(1): 71–88
Yun H, Ha D, Hwang B, Ryu K Ho (2003) Mining association rules on significant rare data using relative support. J Syst Software 67(3): 181–191
Zadeh LA (1965) Fuzzy sets. Inf Control 8: 338–353
Zadeh LA (1978) Fuzzy sets as a basis for a theory of possibility. Fuzzy Set Syst 1: 3–28
Zhang S, Wu X, Zhang C, Lu J (2008) Computing the minimum-support for mining frequent patterns. Knowl Inf Syst 15(2): 233–257
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Weng, CH., Chen, YL. Mining fuzzy association rules from uncertain data. Knowl Inf Syst 23, 129–152 (2010). https://doi.org/10.1007/s10115-009-0223-1
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10115-009-0223-1