Abstract
Today’s era is an era of data. Hence, the most attentive field of research and study is to collect and select the data rapidly as per the need nowadays. Data mining is the field which helps industries to overcome the problem of data extraction. Association rule mining (ARM) is one of the major techniques of data mining which identifies the itemsets appear frequently in the dataset known as frequent itemset and generates the association rules. This helps in decision making. The extension of traditional association rule mining has come up with the concept of utility which should be considered while mining; hence, utility mining aims to identify the itemsets which not only have the frequent occurrences but also have considered the utility of the itemset. High utility itemsets mining can be used as an efficient method to discover interesting patterns. Rare items are items that appear fewer frequently in a database. High utility itemsets can be frequent or rare. Even rare itemsets can also be of high or low utility. In this paper, a literature survey of various research works has been discussed on High Utility Rare Itemset Mining. We have thoroughly surveyed High Utility Rare Itemset Mining methods and applications here. Also, we have focused on some open research issues to represent future challenges in this domain.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Abaya SA (2012) Association rule mining based on Apriori algorithm in minimizing candidate generation. Int J Sci Eng Res 3(7):1–4
Adda M, Wu L, Feng Y (2007) Rare itemset mining. In: IEEE sixth international conference on machine learning and applications, ICMLA 2007, pp 73–80
Adda M, Wu L, White S, Feng Y (2007) Pattern detection with rare item-set mining. arXiv preprint. arXiv:1209.3089
Agrawal R, Mannila H, Srikant R, Toivonen H, Verkamo (2007) AI Fast discovery of association rules. Adv Knowl Discov Data Min 12(1):307–328
Agrawal R, Srikant R (1994) Fast algorithms for mining association rules. In: Proceedings of 20th international conference very large data bases VLDB, vol 1215, pp 487–499
Ashrafi MZ, Taniar D, Smith K (1994) A new approach of eliminating redundant association rules. In: International conference on database and expert systems applications. Springer, Berlin, Heidelberg, pp 465–474S
Ashrafi MZ, Taniar D, Smith K (2007) Redundant association rules reduction techniques. Int J Bus Intell Data Min 2(1):29–63
Cagliero L, Garza P (2014) Infrequent weighted itemset mining using frequent pattern growth. IEEE Trans Knowl Data Eng 26(4):903–915
Chui CK, Kao B, Hung E (2007) Mining frequent itemsets from uncertain data. In: Pacific-Asia conference on knowledge discovery and data mining. Springer, Berlin, Heidelberg, pp 47–58
Dimitrijevic M, Bosnjak Z, Subotica S (2010) Discovering interesting association rules in the web log usage data. Interdiscip J Inf Knowl Manag 5:191–207
Duong QH, Liao B, Fournier-Viger P, Dam TL (2016) An efficient algorithm for mining the top-k high utility itemsets using novel threshold raising and pruning strategies. Knowl Based Syst 104:106–122
Erwin A, Gopalan RP, Achuthan NR (2007) A bottom-up projection based algorithm for mining high utility itemsets. In: Proceedings of the 2nd international workshop on integrating artificial intelligence and data mining Australian computer society Inc., vol 84, pp 3–11
Fayyad U, Piatetsky-Shapiro G, Smyth P (1996) From data mining to knowledge discovery in databases. AI Mag 173:37
Fournier-Viger P, Lin JC, Vo B, Chi TT, Zhang J, Le HB (2017) A survey of itemset mining. Wiley Interdiscip Rev: Data Min Knowl Discov 7(4):e1207
Fournier-Viger P, Wu CW, Zida S, Tseng VS (2014) FHM: faster high-utility itemset mining using estimated utility co-occurrence pruning. In: International symposium on methodologies for intelligent systems. Springer, Cham, pp 83–92
Fournier-Viger P and Zida S (2015) FOSHU: faster on-shelf high utility itemset mining with or without negative unit profit. In: Proceedings of the 30th annual ACM symposium on applied computing, pp 857–864
Goyal V, Dawar S, Sureka A (2015) High utility rare itemset mining over transaction databases. In: International workshop on databases in networked information systems. Springer, Cham, pp 27–40
Grahne G, Zhu J (2005) Fast algorithms for frequent itemset mining using fp-trees. IEEE Trans Knowl Data Eng 17(10):1347–1362
Gyenesei A (2000) Mining weighted association rules for fuzzy quantitative items. In: European conference on principles of data mining and knowledge discovery. Springer, Berlin, Heidelberg, pp 416–423
Han J, Pei J, Kamber M (2011) Data mining: concepts and techniques. Elsevier
Han J, Pei J, Yin Y (2000) Mining frequent patterns without candidate generation. ACM Sigmod Rec ACM 29(2):1–12
Hu YH, Chen YL (2006) Mining association rules with multiple minimum supports: a new mining algorithm and a support tuning mechanism. Decis Support Syst 42(1):1–24
Kantardzic M (2011) Data mining: concepts, models, methods, and algorithms. Wiley
Kiran RU, Reddy PK (2011) Novel techniques to reduce search space in multiple minimum supports-based frequent pattern mining algorithms. In: Proceedings of the 14th international conference on extending database technology ACM, pp 11–20
Koh YS, Rountree N (2005) Finding sporadic rules using Apriori-inverse. In: Pacific-Asia conference on knowledge discovery and data mining. Springer, Berlin, Heidelberg, pp 97–106
Leung CK, Khan QI, Li Z, Hoque T (2007) CanTree: a canonical-order tree for incremental frequent-pattern mining. Knowl Inf Syst 11(3):287–311
Li Y (2011) Data mining: concepts, background and methods of integrating uncertainty. In data mining
Lin CW, Hong TP, Lu WH (2011) An effective tree structure for mining high utility itemsets. Expert Syst Appl 38(6):7419–7424
Lin CW, Hong TP, Lu WH (2009) The Pre-FUFP algorithm for incremental mining. Expert Syst Appl 36(5):9498–9505
Lin JC, Gan W, Fournier-Viger P, Hong TP, Tseng VS (2016) Fast algorithms for mining high-utility itemsets with various discount strategies. Adv Eng Inform 30(2):109–126
Lin JC, Gan W, Fournier-Viger P, Hong TP, Tseng VS (2015) Mining potential high-utility itemsets over uncertain databases. In: Proceedings of the ASE bigdata and social informatics 2015 ACM
Lin WY, Tseng MC (2006) Automated support specification for efficient mining of interesting association rules. J Inf Sci 32(3):238–250
Lin YC, Wu CW, Tseng VS (2015) Mining high utility itemsets in big data. In: Pacific-Asia conference on knowledge discovery and data mining. Springer, Cham, pp 649–661
Liu B, Hsu W, Ma Y (1999) Mining association rules with multiple minimum supports. In: Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining ACM, pp 337–341
Liu M, Qu J (2012) Mining high utility itemsets without candidate generation. In: Proceedings of the 21st ACM international conference on information and knowledge management, ACM, pp 55–64, 29 Oct 2012
Liu Y, Liao WK, Choudhary A (2005) A fast high utility itemsets mining algorithm. In: Proceedings of the 1st international workshop on Utility-based data mining ACM, pp 90–99
Liu Y, Liao WK, Choudhary A (2005) A two-phase algorithm for fast discovery of high utility itemsets. In: Pacific-Asia conference on knowledge discovery and data mining. Springer, Berlin, Heidelberg, pp 689–695
Mahgoub H, Rosner D (2006) Mining association rules from unstructured documents. In: Proceedings of 3rd international conference on knowledge mining, pp 167–172
Ninoria SZ, Thakur SS (2017) Study of high utility itemset mining. Int J Comput Appl 175(4):43–50
Patel AM, Bhalodiya D (2014) A survey on frequent itemset mining techniques using GPU. Int J Innov Res Technol 1(5)
PhridviRaj MS, GuruRao CV (2014) Data mining–past, present and future–a typical survey on data streams. Procedia Technol 12:255–263
Pillai J, Vyas OP, Muyeba M (2013) Huri–a novel algorithm for mining high utility rare itemsets. In: Advances in computing and information technology. Springer, Berlin, Heidelberg, pp 531–540
Pillai J, Vyas OP (2011) High utility rare item set mining (HURI): an approach for extracting high utility rare item sets. J Futur Eng Technol 7(1)
Pillai J, Vyas OP (2010) Overview of itemset utility mining and its applications. Int J Comput Appl 5(11):9–13
Pillai J, Vyas OP (2013) Transaction profitability using HURI algorithm [tphuri]. Int J Bus Inf Syst 2(1)
Pillai J, Vyas OP (2011) User centric approach to itemset utility mining in market basket analysis. Int J Comput Sci Eng 3(1):393–400
Poovammal E, Ponnavaikko M (2009) Utility independent privacy preserving data mining on vertically partitioned data 1. Int J Comput Sci 5:666–673
Savasere A, Omiecinski ER, Navathe SB (1995) An efficient algorithm for mining association rules in large databases. Georgia Institute of Technology 502
Shankar S, Babu N, Purusothaman T, Jayanthi S (2009) A fast algorithm for mining high utility itemsets. In: Advance computing conference, IEEE international, pp 1459–1464
Shie BE, Tseng VS, Yu PS (2010) Online mining of temporal maximal utility itemsets from data streams. In: Proceedings of the 2010 ACM symposium on applied computing ACM, pp 1622 –1626
Shridhar M, Parmar M (2017) Survey on association rule mining and its approaches, pp 129–135
Szathmary L, Napoli A, Valtchev P (2007) Towards rare itemset mining. In: Tools with artificial intelligence 19th IEEE international conference, vol 1, pp 305–312
Tang P, Turkia MP (2006) Parallelizing frequent itemset mining with FP-trees. In: Computers and their applications, pp 30–35
Thakur SS, Ninoria SZ (2017) An improved progressive sampling based approach for association rule mining. Int J Comput Appl 165:7
Toivonen H (1996) Sampling large databases for association rules. VLDB, vol 96, pp 134–145
Tseng V, Wu CW, Fournier-Viger P, Philip SY (2016) Efficient algorithms for mining top-k high utility itemsets. IEEE Trans Knowl Data Eng (1)
Tseng VS, Shie BE, Wu CW, Philip SY (2013) Efficient algorithms for mining high utility itemsets from transactional databases. IEEE Trans Knowl Data Eng 25(8):1772–1786
Yao H, Hamilton HJ, Geng L (2006) A unified framework for utility-based measures for mining itemsets. In: Proceedings of ACM SIGKDD 2nd Workshop on Utility-Based Data Mining, pp 28–37
Yun H, Ha D, Hwang B, Ryu KH (2003) Mining association rules on significant rare data using relative support. J Syst Softw 67(3):181–191
Yun U, Leggett JJ (2005) WFIM: weighted frequent itemset mining with a weight range and a minimum weight. In: Proceedings of the 2005 SIAM international conference on data mining, pp 636–640
Yun U, Ryang H, Ryu KH (2014) High utility itemset mining with techniques for reducing overestimated utilities and pruning candidates. Expert Syst Appl 41(8):3861–3878
Yun U (2007) Efficient mining of weighted interesting patterns with a strong weight and/or support affinity. Inf Sci 177(17):477–99
Yun U (2009) On pushing weight constraints deeply into frequent itemset mining. Intell Data Anal 13(2):359–383
Zida S, Fournier-Viger P, Lin JC, Wu CW, Tseng VS (2015) EFIM: a highly efficient algorithm for high—utility itemset mining. In: Mexican international conference on artificial intelligence. Springer, pp 530–546
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Ninoria, S.Z., Thakur, S.S. (2020). Review on High Utility Rare Itemset Mining. In: Shukla, R., Agrawal, J., Sharma, S., Chaudhari, N., Shukla, K. (eds) Social Networking and Computational Intelligence. Lecture Notes in Networks and Systems, vol 100. Springer, Singapore. https://doi.org/10.1007/978-981-15-2071-6_31
Download citation
DOI: https://doi.org/10.1007/978-981-15-2071-6_31
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-2070-9
Online ISBN: 978-981-15-2071-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)