Skip to main content

Framework of EcomTDMA for Transactional Data Mining Using Frequent Item Set for E-Commerce Application

  • Conference paper
  • First Online:
Data Management, Analytics and Innovation

Abstract

In data mining, comprehending out the common item set is an indispensable job. In statements such as participation rule mining and co-relationships, these conventional item sets are beneficial. These systems use particular algorithms to secure numerous item sets, but during unnecessary data come opposite; these are unproductive in communicating and coordinating the load. With these algorithms, automated parallelization is not feasible either. There is an inadequacy to build an algorithm to determine these difficulties with modern algorithms that will help the lacking characteristics, such as automatic parallelization, balancing and fair data configuration. We are using a new technique in this paper to discover frequent item sets by using MapReduce. With the HDFS framework, the modified Apriori algorithm is used, which is called the EcomTDMA technique. By using the disintegrate method, the MapReduce approach can manipulate individually and simultaneously in this system. The consequence of this approach to the reducers and the redactors will show the produce. Three various algorithms like base Apriori, FP-growth and our advanced development Apriori have practiced in the experiment, and the process was performed in both standalone computer and assigned environment and showed the consequences how the recommended algorithm is more substantial than standard algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Stoenescu LV (2013) Social analytics role in high-tech business. Diss. 2013

    Google Scholar 

  2. Martelli A (2009) System dynamics modeling and data mining analyses: a possible integration

    Google Scholar 

  3. Nowduri S (2011) Management information systems and business decision making: review, analysis, and recommendations. J Manage Marketing Res 7:1

    Google Scholar 

  4. Dinsmore PC, Cooke-Davies TJ (2005) Right projects done right: from business strategy to successful project implementation. Wiley, New York

    Google Scholar 

  5. Han J, Pei J, Kamber M (2011) Data mining: concepts and techniques. Elsevier, Amsterdam

    Google Scholar 

  6. Zhu J et al (2018) Review and big data perspectives on robust data mining approaches for industrial process modeling with outliers and missing data. Ann Rev Control 46:107–133

    Google Scholar 

  7. Gan W et al (2017) Data mining in distributed environment: a survey. Wiley Interdiscip Rev Data Mining Knowl Discov 7(6):e1216

    Google Scholar 

  8. Han JW, Pei J, Yin YW (2000) Mining frequent patterns without candidate generation. In: International conference on management of data, vol 29(2), pp 1–12

    Google Scholar 

  9. Nisbet R, Elder J, Miner G (2009) Handbook of statistical analysis and data mining applications. Academic, London

    Google Scholar 

  10. Li H, Wang Y, Zhang D, Zhang M, Chang E (2008) PFP: parallel FP-growth for query recommendation. In: Proceedings of the 2008 ACM conference on recommender systems, pp 107–114

    Google Scholar 

  11. Waghere SS, Rajarajeswari P (2017) Parallel frequent dataset mining and feature subset selection for high dimensional data on hadoop using map-reduce. Int J Appl Eng Res 12(18):7783–7789

    Google Scholar 

  12. Menon S, Sarkar S (2016) Privacy and big data: scalable approaches to sanitize large transactional databases for sharing. MIS Quart 40(4)

    Google Scholar 

  13. Wang YR, Madnick SE (1989) The inter-database instance identification problem in integrating autonomous systems. In: ICDE

    Google Scholar 

  14. Bajcsy P et al (2013) Terabyte-sized image computations on hadoop cluster platforms. In: 2013 IEEE international conference on big data. IEEE

    Google Scholar 

  15. Shah AA (2019) Performance optimization of big data processing using heterogeneous distributed computing. Dissertation, Maharaja Sayajirao University of Baroda (India)

    Google Scholar 

  16. Ahmad R (2013) Engineering machine translation for deployment on cloud. Dissertation, International Institute of Information Technology Hyderabad, India

    Google Scholar 

  17. Zhang Z, Ji G, Tang M (2013) MREclat: an algorithm for parallel mining frequent itemsets. In: 2013 international conference on advanced cloud and big data

    Google Scholar 

  18. Huang K-Y, Chang C-H (2008) Efficient mining of frequent episodes from complex sequences. Inf Syst 33(1):96–114

    Article  Google Scholar 

  19. Mining, What Is Data. Data mining: Concepts and techniques. Morgan Kaufinann 10, pp 559–569

    Google Scholar 

  20. Buehrer GT (2008) Scalable mining on emerging architectures. Dissertation, The Ohio State University

    Google Scholar 

  21. Gole S, Tidke B (2012) frequent itemset mining for big data in social media using ClustBigFIM algorithm. In: International conference on pervasive computing

    Google Scholar 

  22. Allemang D, Hendler J (2011) Semantic web for the working ontologist: effective modeling in RDFS and OWL. Elsevier, Amsterdam

    Google Scholar 

  23. Helbing D (2015) The automation of society is next: How to survive the digital revolution. Available at SSRN 2694312

    Google Scholar 

  24. Gahar RM et al (2017) ParallelCharMax: An effective maximal frequent itemset mining algorithm based on mapreduce framework. In: 2017 IEEE/ACS 14th international conference on computer systems and applications (AICCSA). IEEE

    Google Scholar 

  25. Chen H, Lin TY, Zhang Z, Zhong J (2013) Parallel mining frequent patterns over big transactional data in extended MapReduce. In: 2013 IEEE international conference on granular computing

    Google Scholar 

  26. Mazumder S (2016) Big data tools and platforms. In: Big data concepts, theories, and applications. Springer, Cham, pp 29–128

    Google Scholar 

  27. Zhou X, Huang Y (2014) An improved parallel association rules algorithm based on MapReduce framework for big data. In: 2014 11th International conference on fuzzy systems and knowledge discovery

    Google Scholar 

  28. Yang XY, Liu Z, Fu Y (2010) MapReduce as a programming model for association rules algorithm on Hadoop. The 3rd international conference on information sciences and interaction sciences. IEEE

    Google Scholar 

  29. Liao J, Zhao Y, Long S (2014) MRPrePost—a parallel algorithm adapted for mining big data. In: 2014 IEEE workshop on electronics, computer and applications. IEEE

    Google Scholar 

  30. SheelaGole, Bharat Tidke, ― Frequent Item set Mining for Big Data in social media using ClustBigFIM algorithm‖, International Conference on Pervasive Computing.

    Google Scholar 

  31. Ambavane PK, Zaware N (2021) Data mining using hadoop distriuted file system (Hdfs) For E-Commerce marketing strategy

    Google Scholar 

  32. Siddique Ibrahim SP, Priyanka R (2015) A survey on infrequent weighted item set mining approaches. In: 2015, IJARCET, vol 4, pp 199–203

    Google Scholar 

  33. Zuo Y, Yada K, Shawkat Ali ABM (2016) Prediction of consumer purchasing in a grocery store using machine learning techniques. In: 2016 3rd Asia-Pacific World Congress on Computer Science and Engineering (APWC on CSE). IEEE

    Google Scholar 

  34. Lee I, Shin YJ (2020) Machine learning for enterprises: Applications, algorithm selection, and challenges. Business Horizons 63(2):157–170

    Google Scholar 

  35. Ma L, Sun B (2020) Machine learning and AI in marketing–Connecting computing power to human insights. Int J Res Mark 37(3):481–504

    Article  Google Scholar 

  36. Bing L, Yuliang S (2016) Prediction of user's purchase intention based on machine learning. In: 2016 3rd International conference on soft computing & machine intelligence (ISCMI), 23 Nov 2016. IEEE, pp 99–103

    Google Scholar 

  37. Baderiya MH, Chawan PM ?(2018) Customer online buying prediction using frequent item set mining

    Google Scholar 

  38. Ibrahim S et. Al (2015) A survey on infrequent weighted itemset mining approaches. IJARCET 4:199–203

    Google Scholar 

  39. Fayyad U, Haussler D, Stolorz P (1996) Mining scientific data. Commun ACM 39(11):51–57

    Article  Google Scholar 

  40. Natarajan S, Sehar S (2013) Distributed FP-ARMH algorithm in hadoop map reduce framework for IEEE

    Google Scholar 

  41. Lu Z et al (2018) IoTDeM: an IoT Big Data-oriented MapReduce performance prediction extended model in multiple edge clouds. J Parallel Distrib Comput 118:316–327

    Google Scholar 

  42. Wei X et al (2014) Incremental FP-growth mining strategy for dynamic threshold value and database based on MapReduce. In: Proceedings of the 18th IEEE international conference on computer supported cooperative work in design

    Google Scholar 

  43. Sakr S et al (2011) A survey of large scale data management approaches in cloud environments. IEEE Commun Surv Tutor 13(3):311–336

    Google Scholar 

  44. Li X, Wang Y, Li D (2019) Medical data stream distribution pattern association rule mining algorithm based on density estimation. IEEE Access 7:141319–141329

    Article  Google Scholar 

  45. Braun P et al (2019) Pattern mining from big IoT data with fog computing: models, issues, and research perspectives. In: 2019 19th IEEE/ACM international symposium on cluster, cloud and grid computing (CCGRID). IEEE

    Google Scholar 

  46. Peter B et al (2018) An innovative framework for supporting frequent pattern mining problems in IoT environments. In: International conference on computational science and its applications. Springer, Cham

    Google Scholar 

  47. Leung CK-S, MacKinnon RK, Jiang F (2017) Finding efficiencies in frequent pattern mining from big uncertain data. World Wide Web 20(3):571–594.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Ambavane, P., Zaware, S., Zaware, N. (2022). Framework of EcomTDMA for Transactional Data Mining Using Frequent Item Set for E-Commerce Application. In: Sharma, N., Chakrabarti, A., Balas, V.E., Bruckstein, A.M. (eds) Data Management, Analytics and Innovation. Lecture Notes on Data Engineering and Communications Technologies, vol 71. Springer, Singapore. https://doi.org/10.1007/978-981-16-2937-2_21

Download citation

Publish with us

Policies and ethics