Skip to main content

Correlated High Average-Utility Itemset Mining

  • Conference paper
  • First Online:
Evolution in Computational Intelligence

Abstract

High average-utility itemset (HAUI) mining is an advancement over high utility itemset mining, where average-utility is used instead of utility measure to discover meaningful patterns. It has been discussed in several past studies that significance of utility-based patterns can be amplified if items in the patterns are correlated. In this paper, we propose a HAUI mining algorithm named correlated high average-utility itemset (CoHAI) miner which provides highly profitable and non-redundant correlated patterns. CoHAI miner is a one-phase algorithm which uses a vertical list structure to store utility and correlation information to apply pruning and candidate generation. Moreover, use of vertical list avoids repetitive dataset scans to calculate the utility of candidate itemsets. We performed rigorous experiments to compare the CoHAI miner with existing algorithm. The results show that the CoHAI miner performs efficiently and produces more meaningful patterns than existing algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Agrawal, R., Imieliński, T., Swami, A.: Mining association rules between sets of items in large databases. In: ACM Sigmod Record, vol. 22, pp. 207–216. ACM (1993)

    Google Scholar 

  2. Ahmed, C.F., Tanbeer, S.K., Jeong, B.-S., Choi, H.-J.: A framework for mining interesting high utility patterns with a strong frequency affinity. Inf. Sci. 181(21), 4878–4894 (2011)

    Google Scholar 

  3. Chan, R., Yang, Q., Shen, Y.-D.: Mining high utility itemsets. In: Third IEEE International Conference on Data Mining, 2003. ICDM 2003, pp. 19–26. IEEE (2003)

    Google Scholar 

  4. Fournier-Viger, P., Gomariz, A., Gueniche, T., Soltani, A., Wu, C.-W., Tseng, V.S.: SPMF: a Java open-source pattern mining library. J. Mach. Learn. Res. 15(1), 3389–3393 (2014)

    Google Scholar 

  5. Fournier-Viger, P., Wu, C.-W., Zida, S., Tseng, V.S.: FHM: faster high-utility itemset mining using estimated utility co-occurrence pruning. In: International Symposium on Methodologies for Intelligent Systems, pp. 83–92. Springer (2014)

    Google Scholar 

  6. Gan, W., Lin, J.C.-W., Chao, H.-C., Fujita, H., Yu, P.S.: Correlated utility-based pattern mining. Inf. Sci. 504, 470–486 (2019)

    Google Scholar 

  7. Gan, W., Lin, J.C.-W., Fournier-Viger, P., Chao, H.-C., Fujita, H.: Extracting non-redundant correlated purchase behaviors by utility measure. Knowl.-Based Syst. 143, 30–41 (2018)

    Google Scholar 

  8. Geng, L., Hamilton, H.J.: Interestingness measures for data mining: a survey. ACM Comput. Surv. (CSUR) 38(3), 9 (2006)

    Google Scholar 

  9. Hong, T.-P., Lee, C.-H., Wang, S.-L.: Effective utility mining with the measure of average utility. Expert Syst. Appl. 38(7), 8259–8265 (2011)

    Article  Google Scholar 

  10. Krishnamoorthy, Srikumar: Pruning strategies for mining high utility itemsets. Expert Syst. Appl. 42(5), 2371–2381 (2015)

    Article  Google Scholar 

  11. Krishnamoorthy, Srikumar: HMiner: efficiently mining high utility itemsets. Expert Syst. Appl. 90, 168–183 (2017)

    Article  Google Scholar 

  12. Li, Y.-C., Yeh, J.-S., Chang, C.-C.: Isolated items discarding strategy for discovering high utility itemsets. Data Knowl. Eng. 64(1), 198–217 (2008)

    Article  Google Scholar 

  13. Lin, J.C.-W., Gan, W., Fournier-Viger, P., Hong, T.-P., Chao, H.-C.: FDHUP: fast algorithm for mining discriminative high utility patterns. Knowl. Inf. Syst. 51(3), 873–909 (2017)

    Google Scholar 

  14. Lin, J.C.-W., Li, T., Fournier-Viger, P., Hong, T.-P., Zhan, J., Voznak, M.: An efficient algorithm to mine high average-utility itemsets. Adv. Eng. Inform. 30(2), 233–243 (2016)

    Google Scholar 

  15. Lin, J.C.-W., Ren, S., Fournier-Viger, P., Hong, T.-P.: EHAUPM: efficient high average-utility pattern mining with tighter upper bounds. IEEE Access 5, 12927–12940 (2017)

    Google Scholar 

  16. Liu, M., Qu, J.: Mining high utility itemsets without candidate generation. In: Proceedings of the 21st ACM International Conference on Information and Knowledge Management, pp. 55–64. ACM (2012)

    Google Scholar 

  17. Liu, Y., Liao, W.-k., Choudhary, A.: A two-phase algorithm for fast discovery of high utility itemsets. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 689–695. Springer (2005)

    Google Scholar 

  18. Omiecinski, E.R.: Alternative interest measures for mining associations in databases. IEEE Trans. Knowl. Data Eng. 15(1), 57–69 (2003)

    Google Scholar 

  19. Tseng, V.S., Wu, C.-W., Shie, B.-E., Yu, P.S.: Up-growth: an efficient algorithm for high utility itemset mining. In: Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 253–262. ACM (2010)

    Google Scholar 

  20. Tianyi, W., Chen, Y., Han, J.: Re-examination of interestingness measures in pattern mining: a unified framework. Data Min. Knowl. Discovery 21(3), 371–397 (2010)

    Article  MathSciNet  Google Scholar 

  21. Yun, U., Kim, D.: Mining of high average-utility itemsets using novel list structure and pruning strategy. Future Gener. Comput. Syst. 68, 346–360 (2017)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Dharavath Ramesh .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sethi, K.K., Ramesh, D. (2021). Correlated High Average-Utility Itemset Mining. In: Bhateja, V., Peng, SL., Satapathy, S.C., Zhang, YD. (eds) Evolution in Computational Intelligence. Advances in Intelligent Systems and Computing, vol 1176. Springer, Singapore. https://doi.org/10.1007/978-981-15-5788-0_47

Download citation

Publish with us

Policies and ethics