Abstract
Several data mining techniques applied in Web usage mining applications for discovering user access pattern from web log data. To understand and provide better services it will require Web-based applications. Web usage mining is one of the types of Web mining. Web mining is the technique to extract knowledge from web content, structure and usage. It is the collection of technologies to accomplish the possible of extracting valuable knowledge from the World Wide Web and its usage pattern. Web mining enables to find out relevant result from Web data including web document, hyperlink between documents, usage log of website etc. There are three main areas of web mining research –content, structure and usage. This paper provide an overview of previous and existing work in all three areas, and also define an overview of data preprocessing process like Data Cleaning, User Identification, Session Identification, Transaction Identification, Path Completion used in Web usage mining.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
References
Mohd Helmy Abd Wahab, Mohd Norzali Haji Mohd, Hafizul Fahri Hanafi, Mohamad Farhan and Mohamad Mohsin “Data Pre-processing on Web Server Logs for Generalized Association Rules Mining Algorithm” World Academy of Science, Engineering and Technology 2008.
Ms.Dipa Dixit and Ms. M. Kiruthika”Preprocessing of Web Logs” (IJCSE) International Journal on Computer Science and Engineering Volume 02, 2010.
Arshi Shamsi, Rahul Nayak, Pankaj Pratap Singh and Mahesh Kumar Tiwari “Web Usage Mining by Data Preprocessing” IJCST Volume 3, Jan. - March 2012.
T. Revathi, M. Mohana Rao and Ch. S. Sasanka “An Enhanced Pre-Processing Research Framework for Web Log Data” IJARCSSE Volume 2, March 2012.
M. Malarvizhi and S. A. Sahaay.” Preprocessing of Educational Institution Web Log Data for Finding Frequent Patterns using Weighted Association Rule Mining Technique”, 2012.
Suneetha K.R and R. Krishnamoorthi” Data Preprocessing and Easy Access Retrieval of Data through Data Ware House” WCECS, Volume 1, October 2009.
Khasawneh N. And Chan C.”Active user-based and ontology-based web log data preprocessing for web usage mining” IEEE/WIC/ACM International Conference on Web Intelligence, December 2006.
Khasawneh N.,Shatnawi M.,Fraiwan M. “Converting Web Applications into Standard XML Web Services” The Tenth International Conference on Intelligent System Design and Applications, Dec 2010.
R. Cooley, B. Mobasher, J. Srivastava,”Grouping web page references into transactions for mining world wide web browsing patterns”, University of Minnesota, Dept. of Computer Science, Minneapolis, 1997.
R. Kosala, H. Blockeel. “Web Mining Research: A Survey,” In SIGKDD Explorations, ACM press, 2000.
Han, J. and M. Kamber “Data Mining: Concepts and Techniques”. A. Stephan. San Francisco, Morgan Kaufmann Publishers is an imprint of Elsevier, 2006.
Raju. G. T. and Satyanarayana. P. S., “Knowledge Discovery from Web Usage Data: Complete Preprocessing Methodology”, IJCSNS International Journal of Computer Science and Network Security, Volume8, January 2008.
Suneetha, K. R. and D. R. Krishnamoorthi “Identifying User Behavior by Analyzing Web Server Access Log File” IJCSNS International Journal of Computer Science and Network Security, Volume 9, April 2009.
Etminani, K., Delui, A.R., Yanehsari, N.R. and Rouhani, “Web Usage Mining: Discovery of the Users’ Navigational Patterns Using SOM”, First International Conference on Networked Digital Technologies, 2009.
Ramya C and Kavitha G, “An Efficient Preprocessing Methodology for Discovering Patterns and Clustering of Web Users using a Dynamic ART1 Neural Network”, Fifth International Conference on Information Processing, Springer, 2011.
Renata Ivancsy, and Sandor Juhasz, “Analysis of Web User Identification Methods”, World Academy of Science, Engineering and Technology, Volume 34, 2007.
Ling Zheng, Hui Gui and Feng Li, “Optimized Preprocessing Technology for Web Log Mining”, International Conference on Computer Design and Applications, Volume1, 2010.
Li Chaofeng,“Research and Development of Data Preprocessing in Web Usage Mining”, International Journal of computer applications, 2011.
Shaimaa Ezzat Salama, Mohamed I. Marie, “Web Server Logs preprocessing for Web Intrusion Detection”,Computer and Information Science, Volume 4, 2011.
Liang Wei and Zhao Shu-hai,“A Hybrid Recommender System Combining Web Page Clustering with Web Usage Mining”, International Conference on Computational Intelligence and software Engineering, 2009.
Yan Li, Boqin Feng and Qinjiao Mao, “Research on Path Completion Technique in Web Usage Mining”, International Symposium on Computer Science and Computational Technology, Volume 1, 2008.
Jian Chen, Jian Yin, Tung, A.K.H. and Bin Liu, “Discovering Web usage patterns by mining cross-transaction association rules”, International Conference on Machine Learning and Cybernetics, Volume 5, 2004.
Yi Dong, Huiying Zhang and Linnan Jiao, “Research on Application of User Navigation Pattern Mining Recommendation”, Proceeding of the 6th World Congress on Intelligent Control and Automation, IEEE,China, June 21 – 23, 2006.
Tasawar Hussain, Sohail Asghar and Nayyer Masood “Web Usage Mining: A Survey on Preprocessing of Web Log File” Center of Research in Data Engineering (CORDE) Department of Computer Science, 2010.
Sanjay Bapu Thakare,Sangram and Z. Gawali “A Effective and Complete Preprocessing for Web Usage Mining” (IJCSE) International Journal on Computer Science and engineering, Volume 2, 2010.
D.S. Rajput, R.S. Thakur and G.S. Thakur “Rule Generation from Textual Data by using Graph based Approach” International Journal of Computer Applications New York, USA, Nov. 2011.
Aditi Shrivastava, Nitin Shukla “Extracting Knowledge from User Access Logs” International Journal of Scientific and Research Publications, Volume 2, Issue 4, April 2012.
Liu Wenyun, Bao Lingyun, “Application of Web Mining in E-Commerce Enterprises Knowledge Management”, International Conference on E-Business and E-Government, IEEE, 2010.
Zhang Haiyang, “The Research of Web Mining in E-commerce”, IEEE, 2011.
Vijayashri Losarwar, Dr. Madhuri Joshi, “Data Preprocessing in Web Usage Mining”, International Conference on Artificial Intelligence and Embedded Systems (ICAIES’2012) Singapore, July 15-16, 2012.
R. Agrawal, R. Srikant, “Fast Algorithm for Mining Association Rule”, International Conference on Very Large Databases, Santiago, Chile, September 1994.
R. Agrawal, R. Srikant,”Mining sequential patterns”,11th International conference,IEEE Computer Society Press, Taiwan,1995.
Jaideep Srivastav, Robert Cooley, Mukund Deshpande, Pang-Ning Tan,” Web Usage Mining: Discovery and Applications of Usage Patterns from Web Data”, ACM SIGKDD, Volume 1, Issue 2, Jan 2000.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer India
About this paper
Cite this paper
Bakariya, B., Mohbey, K.K., Thakur, G.S. (2013). “An Inclusive Survey on Data Preprocessing Methods Used in Web Usage Mining”. In: Bansal, J., Singh, P., Deep, K., Pant, M., Nagar, A. (eds) Proceedings of Seventh International Conference on Bio-Inspired Computing: Theories and Applications (BIC-TA 2012). Advances in Intelligent Systems and Computing, vol 202. Springer, India. https://doi.org/10.1007/978-81-322-1041-2_35
Download citation
DOI: https://doi.org/10.1007/978-81-322-1041-2_35
Published:
Publisher Name: Springer, India
Print ISBN: 978-81-322-1040-5
Online ISBN: 978-81-322-1041-2
eBook Packages: EngineeringEngineering (R0)