Customer Purchase Behavior Prediction in E-commerce: A Conceptual Framework and Research Agenda

Cirqueira, Douglas; Hofer, Markus; Nedbal, Dietmar; Helfert, Markus; Bezbradica, Marija

doi:10.1007/978-3-030-48861-1_8

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11948))

Included in the following conference series:

International Workshop on New Frontiers in Mining Complex Patterns

2171 Accesses
10 Citations

Abstract

Digital retailers are experiencing an increasing number of transactions coming from their consumers online, a consequence of the convenience in buying goods via E-commerce platforms. Such interactions compose complex behavioral patterns which can be analyzed through predictive analytics to enable businesses to understand consumer needs. In this abundance of big data and possible tools to analyze them, a systematic review of the literature is missing. Therefore, this paper presents a systematic literature review of recent research dealing with customer purchase prediction in the E-commerce context. The main contributions are a novel analytical framework and a research agenda in the field. The framework reveals three main tasks in this review, namely, the prediction of customer intents, buying sessions, and purchase decisions. Those are followed by their employed predictive methodologies and are analyzed from three perspectives. Finally, the research agenda provides major existing issues for further research in the field of purchase behavior prediction online.

This research was supported by the European Union Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grant agreement No. 765395; the industry partner Raiffeisenlandesbank Oberösterreich AG; and supported, in part, by Science Foundation Ireland grant 13/RC/2094.

Access provided by Autonomous University of Puebla. Download conference paper PDF

Predicting customer purchase behavior in the e-commerce context

Article 30 June 2015

Towards early purchase intention prediction in online session based retailing systems

Article Open access 19 December 2020

A practical model to predict the repeat purchasing pattern of consumers in the C2C e-commerce

Article 14 September 2015

Keywords

1 Introduction

Daily online activities generate plenty of opportunities for businesses to understand their consumer behavior in E-commerce platforms [1]. Indeed, consumers around the globe purchased $2.86 trillion on the web in 2018, which represented an 18% growth^{Footnote 1} in online sales compared to the $2.43 trillion sold in 2017. According to predictions of the purchasing behavior of customers, companies aim to anticipate their needs and provide personalized services [2, 3].

However, consumer behavior itself is well known as a complex pattern among the data mining community [4]. Aiming to predict the likelihood of such patterns, researchers were applying multiple probabilistic and machine learning (ML) statistical models to historical online customer’s data, resulting in somewhat reliable probabilities to predict the next customer’s steps [5, 6]. That has also increased the complexity of analyzing this literature, given the multiple approaches and datasets available. Previous reviews and surveys related to this topic have usually focused on the specific literature of recommendation systems [7,8,9,10]. On the other hand, our focus is on reducing complexity for understanding the step before recommendations, which is the prediction of customer’s next purchases, and in visualizing research opportunities in the field.

Therefore, these paper contributions are a novel conceptual framework for analysis and a research agenda. The framework systematically maps this literature regarding datasets adopted, predictive methods, and tasks with their applications. Specifically, the framework reveals three main tasks, namely, prediction of buying sessions, purchase decisions, and customer intents. Next, it provides eight applications enabled by each task. Finally, it illustrates three perspectives on predictive methodologies, and a research agenda with future work opportunities in the field.

The rest of this paper is organized as follows: Sect. 2 describes the research methodology of the literature review; Sect. 3 presents results and the main contributions, followed by final remarks in Sect. 4.

2 Research Methodology

To provide the framework and research agenda proposed, we performed a literature review following systematic guidelines from Watson (2002) [11] and Kitchenham et al. (2009) [12]. Inspired by [13], two research questions and a search query were developed to collect comprehensive literature within the research scope of purchase prediction in E-commerce. Then, the search query was applied in the following scientific databases, well known for containing literature in the field of behavior analytics: Scopus, Web of Science, Science Direct, EBSCO Host (Business Source Complete and Academic Search Complete), Emerald, IEEE Xplore, Association of Information Systems (AIS) library and ACM Digital Library.

Search Query: “(consumer or customer) AND (purchas* OR buy* OR sale* OR shop* OR behavi*) AND (predict* OR forecast*)”

The searches were performed in the abstract field, except for the Web of Knowledge (abstract title and keywords were used) and AIS libraries (full text was used), due to the characteristics of their search engines. The search period has covered papers from 2014 to 2019, only in the English language, which has provided a total of 9824 exported proposals. The next step removed duplicates and had an inclusion filter only to retrieve papers focused on the problem of consumer purchase behavior prediction. That has provided a total of 429 papers.

Next, the exclusion criteria were applied to remove papers not focused on the E-commerce context. At this stage, the total of papers kept was 35. Based on those proposals, backward and forward searches were conducted via Google Scholar, adding 18 and 10 studies, respectively. The final number of papers for extraction and mapping steps was 63. All those results are available at a Github repository (https://github.com/dougcirqueira/literature-review-purchase-prediction).

3 Results

Tables 1 and 2 provide non-exhaustive lists of the proposals selected for this literature review. Table 1 brings single task proposals (prediction of one outcome), while Table 2 provides multi-task proposals (prediction of multiple outcomes).

Table 1. Selected proposals in single task settings (A: Aggregation; R: Rule; P: Personalized Function; L: Learning; CDM: Classical Data Mining; PC: Probabilistic Classifier; DLC: Deep Learning Classifier; CF: Collaborative Filtering)

Full size table

Table 2. Selected proposals in multi-task settings (A: Aggregation; R: Rule; P: Personalized Function; L: Learning; CDM: Classical Data Mining; PC: Probabilistic Classifier; DLC: Deep Learning Classifier; CF: Collaborative Filtering)

Full size table

A Conceptual Framework of Analysis for Customer Purchase Prediction in E-commerce

A conceptual framework of analysis aims to optimize the understanding of a complex topic by breaking it down into smaller and comprehensive components [48]. We adopted a systematic literature review approach to developing the conceptual framework of analysis proposed and illustrated in Fig. 1.

The framework has six components. Component 1 defines the dataset types adopted in this literature. Component 2 classifies in dimensions the input data present in those datasets. Component 3 shows the methodologies adopted for constructing features out of the input data, illustrating how consumer behavior is modeled to predictive analytics. Component 4 introduces the predictive methods summarized into four categories. Component 5 shows which tasks enable what applications from component 6, as identified in Subsect. 3.1. Details on each component will be given under the research questions developed in the literature review.

The two research questions developed to conduct the systematic literature review were the guidance for scoping our findings. The results will be presented, reflecting those questions in Subsects. 3.1 and 3.2.

3.1 RQ 1. What Tasks and Applications Have Been Addressed in the Problem of Consumer Purchase Behavior Prediction in E-Commerce?

This research question addresses components 5 and 6 of the proposed framework. It reveals the literature targeting three main tasks within the online purchase prediction problem. Every task has a different prediction outcome, described as follows:

Predict Customer Intent (PCI): Predict the intention of customer visits online. Examples of intention types reported in the literature are purchase oriented or general [35], browsing, searching, purchasing, and bouncing [37]. This task is essential for identifying similar groups of customers, and for applications in which customer segmentation is needed.
Predict Buying Session (PBS): Predict if a current user online session will end up with a purchase or not. This task is interesting for applications that need to capture the general likelihood of the user conversion during his visit online, without details regarding preferences for specific products.
Predict Purchase Decisions (PPD): Predict customers purchase behavior concerning their buying decisions. For instance, to foresee what product or category a customer will buy; to predict the time or period likely to witness a purchase; to predict the next amount customers are likely to spend in their purchases. PPD is the most complex task, as the aim is to predict fine-grained decisions. That is the ideal task for recommending specific products or services to customers.

Those three identified tasks enable a variety of business intelligence applications for online retailers, such as: A) Product Recommendations [29]; B) Targeted Marketing [16, 42]; C) Layout Personalization of E-commerce Landing Pages [17]; D) Load balance Optimization to Prioritize Quality of Service for Likely Buyers [14]; E) Stock Management Optimization of Products [28, 32]; F) Real-time Customer Service [49]; G) Purchase Trends Discovery [15]; H) Offers Awareness Based on the Detected Intention of Consumers [35].

3.2 RQ 2. What Methodologies Have Been Adopted to Predict Consumer Purchase Behavior Online?

This research question addresses the components from 1 to 4 of the framework proposed. It provides three perspectives in the predictive methodologies adopted in this literature.

Online Customer Behavior Datasets and their Features.

Customer behavior in E-commerce is captured through datasets of past online sessions and shopping logs, which are described in Table 3:

Table 3. Dataset types identified in the literature

Full size table

The input data is further classified in the data layer, inspired by [2], in dimensions, which have specific input data features. Every dimension and its features support in explaining and predicting customer behavior from different perspectives, which bring some benefits for predictive tasks on that data, as illustrated in Table 4.

Table 4. Classification of E-commerce data in dimensions and its benefits

Full size table

Feature Construction for Purchase Prediction.

In this Subsection, we use a formal notation to explain the feature construction process. The input data features $ feat_{in} $ described previously serve as the basis for feature construction, from which is derived new descriptive features $ feat_{eng\_out} $ to capture historical patterns, which can indicate the probability of purchase. Two methodologies are adopted to create descriptive features. The first is Feature Engineering, where domain expertise is used to think of a function or rule $ f_{eng} $ to apply on input data features $ feat_{in} $ present in a dataset, which are related to a current customer transaction $ Ti $. This process can be shaped by conditions $ cond_{n} $ to capture relationships between multiple input data features. The Feature Engineering process can be described in Eq. 1.

$$ feat_{eng\_out} \, = \,f_{eng} \left( {D, \,Ti, feat_{in} ,\,cond_{1} , \,cond_{n} } \right) $$

(1)

The second methodology for feature construction is Feature Learning, in which a function $ f_{learn} $ to create new features is an unsupervised ML model, which automatically derives new explanatory features. For instance, researchers extract Latent Representations, or hidden layer weights $ feat_{learn\_out} $ learned during training time of a Recurrent Neural Network or Autoencoder model, carrying hidden correlations and relationships between variables. This learning process is conditioned by the target outcome $ targ_{out} $ and a cost function $ cost_{f} $, which represent the desired outcome of the learned representation, and how the weights of the hidden layer will be learned. The desired outcome is, for instance, a binary label for predicting buying sessions, or a multi-category label for predicting purchase decisions regarding products. The Feature Learning process is described in Eq. 2.

$$ feat_{learn\_out} \, = \,f_{learn} \left( {D, \,Ti,\, feat_{in} ,\,targ_{out} ,\, cost_{f} } \right) $$

(2)

Table 5 illustrates examples of those methodologies in action.

Table 5. Methodologies for Customer behavior Feature Construction

Full size table

Predictive Methods.

Researchers have been working with ML and probabilistic methods to predict the complex customer purchase behavior online [5]. Based on the conceptual framework, we summarize the predictive models adopted into four categories, with their advantages and disadvantages. It is provided examples of particular methods within each category, specifically for purchase prediction in E-commerce. We illustrate in Table 6 how those models compare concerning their characteristics and suitability for tasks identified in Subsect. 3.1.

The characteristics analyzed are a) Suitability for Real-Time: concerning usual time required for training, if any, and for providing predictions in production settings; b) Interpretability: concerning the capacity of providing explanations for why a predicted outcome is given by the model; c) Sequential Modeling: it illustrates if a predictive method is able to model the customer activities sequentially. That is important when researchers want to explicitly analyze the influence of past purchases in current customer actions; d) Feature Construction Function: reveals what methodology and function are usually adopted for feature construction when applying the predictive method analyzed.

Table 6. Predictive methodologies

Full size table

Details regarding each predictive methodology are provided as follows.

Probabilistic Classifier: A model that uses probability theory to model the uncertainty in the data. Advantage: Usually, it requires a few numbers of engineered features, which makes them a feasible choice for real-time settings, as well as the natural capacity of sequentially modeling short-term patterns in events. Disadvantage: It is difficult to capture the effects of long-term patterns in customer behavior. However, this capacity can be achieved in the cost of increasing model complexity and processing time.

Bayesian Classifier: Estimates conditional probability distributions based on the influence of given features to output a specific prediction. In [42], authors predict purchase decisions by analyzing the influence of sequential purchases, number, and duration of visits to compute probabilities for the customer choice of a specific product or time of purchase.
Hidden Markov Model: A generalization of a probabilistic mixture model, where the probability of an event, such as a purchase, depends on the occurrence of hidden variables through a sequential Markov process modeling a previous customer action [24].

Classical Data Mining Classifiers: Those models work by learning similarities between feature vectors of buying sessions, intents, and purchase decisions. Advantage: Most of the approaches in this category perform well even with small or medium dataset sizes, which makes some of them suitable for real-time settings. Disadvantage: Authors adopting this methodology usually need to perform extensive Feature Engineering to achieve good prediction results, also for detecting sequential patterns.

Unsupervised Clustering: Unlabeled sessions and purchase transactions are input to a model which will discover patterns in similar instances and group them for providing predictions. For example, [37, 38] adopt the K-means algorithm to segment customers based on variables regarding their clickstream behavior.
Association Rules: Enables the discovery of associations between features, which can reveal rules with high confidence to indicate probabilities of sessions ending up with a purchase [16].
Instance-Based: Model which classify new data instances based on similar cases and their features. In [22], authors employ K-Nearest Neighbor to predict buying sessions according to previous examples of sessions, with similar features, which ended up with a purchase.
Linear ML: Machine learning models which assume a linear decision boundary between buying and non-buying sessions, or feature vectors representing purchase decisions of customers. However, the kernel trick can be adopted to detect non-linear relationships between features [50], or Feature Engineering to create combinations between multiple features [27].
Ensemble Learning: stacking of various weak predictive models together to build up a robust model for providing predictions [20].

Deep Learning Classifiers: ML models which can naturally learn complex and non-linear decision boundaries and relationships in the dataset. Advantage: These models can be powerful in modeling long-term influences of past customer events on current decisions [25], and do not require extensive Feature Engineering, as they have Feature Learning built-in. Disadvantage: This method usually requires massive amounts of data, which makes it hard for usage with new customers and a few purchases [40, 41]. The interpretability of predictions is also an issue.
Collaborative Filtering: Classical model applied in recommendation systems. This approach models customers and products in a utility matrix based on their clicks, views, reviews, or purchases, which is then factorized to provide latent factors representing the likelihood of customers choosing similar products [29, 30, 44]. That is the favorite model adopted by researchers focusing on predicting purchase decisions, but it is also utilized in predicting buying sessions [14]. Advantage: One of the most flexible approaches for multiple types of features in different E-commerce platforms. It also scales well with more customers and products being added in a dataset. Disadvantage: The utility matrix is usually sparse, as most of the customers have not viewed many of the products available in an E-commerce platform. Therefore, it is challenging to predict purchases for new customers, and it is important to think of Feature Engineering for creating features that can overcome such issues.

3.3 State-of-the-Art Performance

To have a fair comparison between the identified predictive methodologies, for every specific task, we grouped the existing proposals by the predictive methodology adopted. We evaluated only the F1 score and Area Under Receiver Operating Characteristic Curve (AUC) reported by those. Our choice for those metrics considers the fact that datasets in this literature are usually unbalanced, with few occurrences of purchases, and it is well known that F1 and AUC scores are the ideal metrics in unbalanced scenarios [51]. Table 7 illustrates the average results obtained from predictive methodologies for suitable tasks where they can be applied. It is not reported performance for predicting customer intent as the authors did not adopt the mentioned metrics.

Table 7. State of the Art Results for Predicting Buying Sessions and Purchase Decisions

Full size table

Classical Data Mining Classifiers are the current state-of-the-art for Predicting Buying Sessions, specifically Ensemble learners [20] and Support Vector Machines [19]. Those are followed by Deep Learning classifiers. It is interesting to observe the drop in performance when going to the task of Predicting Purchase Decision, which proves it is the most complex task due to the fine-grained predictions aimed at it. Concerning performance, the classical Collaborative Filtering approach is the most robust, comprised of a Latent Factor Model [30] and Matrix Factorization [31]. Those are followed by Classical Data Mining and Deep Learning classifiers.

3.4 Research Agenda

We derive a research agenda based on the targeted research gaps and findings of this review, containing the following directions:

Sequential Learning: Few proposals have explored sequential ML models in this literature. Examples are recurrent neural networks, which are only adopted in three studies [25, 33, 40]. Such models are indicated to learn the evolving consumer behavior over time, and sequential patterns such as “She is buying a phone case after purchasing a smartphone”.
Interpretability: It is noticed the majority of authors reporting higher performance as those applying Classical Data Mining and Deep Learning classifiers, which also have a black-box nature. Indeed, interpretability seems not to be the focus of this recent literature.
Customer Data and General Data Protection Regulation (GDPR): Given the rise of privacy policies with GDPR in Europe, it is needed more research on the trade-off between the amount of data required and protection of customers’ privacy, regarding the performance of purchase prediction tasks.
Dataset for benchmarking: There is no clear consensus on datasets for state-of-the-art comparison in this literature, as many studies have used private data. However, we observed a significant adoption of the Recsys 2015 challenge data [17, 25, 31, 39, 40, 42], which suggests this dataset as a candidate in this regard.
Evaluation in Multiple E-commerce Platforms: Most researchers evaluate their proposed predictive methods in a single dataset, or focus on specific E-commerce settings. Therefore it is hard to argue their methodologies are general for multiple E-commerce platforms, such as general-purpose and specialized marketplaces.
Feature Engineering and Feature Learning: It was noticed that the well-performing proposals adopting Classical ML models had been heavily investing in Feature Engineering. However, more investigation in the field of Feature Learning is recommended in this area, or the combination of those two methodologies in purchase prediction online.
Creation Process of Personalized Feature Engineering Functions: Some researchers explore the creation of personalized functions in Feature Engineering, such as the popularity of a product [17], the diversity of customer behavior [18, 35] and graph metrics [21]. It could be relevant to map this creation process, and help other researchers in establishing such novel features for customer behavior online.
A Framework for Purchase Prediction Tasks in E-commerce: Existing proposals focus on one of the three tasks identified, but there is a lack of a view into how those tasks can work together. Therefore, further research could be taken to provide a framework which aligns the identified tasks in this review.

4 Final Remarks

This study presents a systematic literature review of recent proposals in consumer purchase prediction in E-commerce. A novel conceptual framework provides lenses in the state-of-the-art of this field. It is noticed that, despite the broad literature, there is still a need for an in-depth investigation of specific directions. Therefore, a research agenda is provided, illustrating potential future work demands.

A next step would be to adopt a benchmark dataset, and evaluate predictive methodologies in multi-task settings, such as to forecast the next product, purchase time, or amount a customer will likely buy. Therefore, it is relevant to investigate the construction of a framework for purchase prediction, which considers the combination of three tasks identified in this review.

Notes

1.
Digital Commerce 360, Global E-commerce Sales 2019. https://www.digitalcommerce360.com/article/global-ecommerce-sales/.

References

Agnihotri, R., Dingus, R., Hu, M.Y., Krush, M.T.: Social media: influencing customer satisfaction in B2B sales. Ind. Mark. Manage. 53, 172–180 (2016)
Article Google Scholar
Bradlow, E.T., Gangwar, M., Kopalle, P., Voleti, S.: The role of big data and predictive analytics in retailing. J. Retail. 93(1), 79–95 (2017)
Article Google Scholar
Le, D.-T., Fang, Y., Lauw, H.W.: Modeling sequential preferences with dynamic user and context factors. In: Frasconi, P., Landwehr, N., Manco, G., Vreeken, J. (eds.) ECML PKDD 2016. LNCS (LNAI), vol. 9852, pp. 145–161. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46227-1_10
Chapter Google Scholar
Erevelles, S., Fukawa, N., Swayne, L.: Big data consumer analytics and the transformation of marketing. J. Bus. Res. 69(2), 897–904 (2016)
Article Google Scholar
Shmueli, G., et al.: To explain or to predict? Stat. Sci. 25(3), 289–310 (2010)
Article MathSciNet MATH Google Scholar
Martens, D., Provost, F., Clark, J., de Fortuny, E.J.: Mining massive fine-grained behavior data to improve predictive analytics. MIS Q. 40(4), 869–888 (2016)
Google Scholar
Ricci, F., Rokach, L., Shapira, B.: Introduction to recommender systems handbook. In: Ricci, F., Rokach, L., Shapira, B., Kantor, P. (eds.) recommender systems handbook, pp. 1–35. Springer, Boston (2011). https://doi.org/10.1007/978-0-387-85820-3_1
Chapter MATH Google Scholar
Bobadilla, J., et al.: Recommender systems survey. Knowl.-Based Syst. 46 109–132 (2013)
Google Scholar
Lu, J., et al.: Recommender system application developments: a survey. Decis. Support Syst. 74, 12–32 (2015)
Article Google Scholar
Isinkaye, F.O., Folajimi, Y.O., Ojokoh, B.A.: Recommendation systems: principles, methods and evaluation. Egypt. Inf. J. 16(3), 261–273 (2015)
Google Scholar
Webster, J., Watson, R.T.: Analyzing the past to prepare for the future: writing a literature review. MIS Q. 26, xiii–xxiii (2002)
Google Scholar
Kitchenham, B., Brereton, O.P., Budgen, D., Turner, M., Bailey, J., Linkman, S.: Systematic literature reviews in software engineering–a systematic literature review. Inf. Softw. Technol. 51(1), 7–15 (2009)
Article Google Scholar
Akter, S., Wamba, S.F.: Big data analytics in e-commerce: a systematic review and agenda for future research. Electron. Mark. 26(2), 173–194 (2016)
Article Google Scholar
Zeng, M., Cao, H., Chen, M., Li, Y.: User behaviour modeling, recommendations, and purchase prediction during shopping festivals. Electron. Mark. 29(2), 1–12 (2018)
Google Scholar
Jia, R., Li, R., Yu, M., Wang, S.: E-commerce purchase prediction approach by user behavior data. In: 2017 International Conference on Computer, Information and Telecommunication Systems (CITS), pp. 1–5. IEEE (2017)
Google Scholar
Suchacka, G., Chodak, G.: Using association rules to assess purchase probability in online stores. Inf. Syst. e-Bus. Manag. 15(3), 751–780 (2017)
Article Google Scholar
Chen, C., Xiao, J., Hou, C., Yuan, X.: Improving purchase behavior prediction with most popular items. IEICE Trans. Inf. Syst. 100(2), 367–370 (2017)
Article Google Scholar
Niu, X., Li, C., Yu, X.: Predictive analytics of e-commerce search behavior for conversion. In: Twenty-Third Americas Conference on Information Systems (2017)
Google Scholar
Lee, M., Ha, T., Han, J., Rha, J.Y., Kwon, T.T.: Online footsteps to purchase: exploring consumer behaviors on online shopping sites. In: 2015 Proceedings of the ACM Web Science Conference. ACM (2015)
Google Scholar
Boroujerdi, E.G., et al.: A study on prediction of user’s tendency toward purchases in websites based on behavior models. In: 2014 6th Conference on Information and Knowledge Technology (IKT), pp. 61–66. IEEE (2014)
Google Scholar
Baumann, A., Haupt, J., Gebert, F., Lessmann, S.: Changing perspectives: using graph metrics to predict purchase probabilities. Expert Syst. Appl. 94, 137–148 (2018)
Article Google Scholar
Suchacka, G., Skolimowska-Kulig, M., Potempa, A.: A k-nearest neighbors method for classifying user sessions in e-commerce scenario. J. Telecommun. Inf. Technol. 3, 64–69 (2015)
Google Scholar
Lin, W., Milic-Frayling, N., Zhou, K., Ch’ng, E.: Predicting outcomes of active sessions using multi-action motifs. In: IEEE/WIC/ACM International Conference on Web Intelligence, pp. 9–17, October 2019
Google Scholar
Park, C.H., Park, Y.H.: Investigating purchase conversion by uncovering online visit patterns. Mark. Sci. 35(6), 894–914 (2016)
Article Google Scholar
Sheil, H., Rana, O., Reilly, R.: Predicting purchasing intent: automatic feature learning using recurrent neural networks (2018). arXiv preprint arXiv:1807.08207
Sakar, C.O., Polat, S.O., Katircioglu, M., Kastro, Y.: Real-time prediction of online shoppers’ purchasing intention using multilayer perceptron and LSTM recurrent neural networks. Neural Comput. Appl. 31(10), 6893–6908 (2019)
Article Google Scholar
Li, Q., Gu, M., Zhou, K., Sun, X.: Multi-classes feature engineering with sliding window for purchase prediction in mobile commerce. In: 2015 IEEE International Conference on Data Mining Workshop (ICDMW), pp. 1048–1054. IEEE (2015)
Google Scholar
Iwanaga, J., Nishimura, N., Sukegawa, N., Takano, Y.: Estimating product-choice probabilities from recency and frequency of page views. Knowl.-Based Syst. 99, 157–167 (2016)
Article Google Scholar
He, T., Yin, H., Chen, Z., Zhou, X., Luo, B.: Predicting users’ purchasing behaviors using their browsing history. In: Sharaf, Mohamed A., Cheema, M.A., Qi, J. (eds.) ADC 2015. LNCS, vol. 9093, pp. 129–141. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-19548-3_11
Chapter Google Scholar
Jia, R., Li, R.: Modeling user purchase preference based on implicit feedback. In: CSCWD, pp. 832–836. IEEE (2018)
Google Scholar
Park, C., Kim, D., Yang, M.C., Lee, J.T., Yu, H.: Your click knows it: predicting user purchase through improved user-item pairwise relationship (2017). arXiv preprint arXiv:1706.06716
Nishimura, N., Sukegawa, N., Takano, Y., Iwanaga, J.: A latent-class model for estimating product-choice probabilities from clickstream data. Inf. Sci. 429, 406–420 (2018)
Article Google Scholar
Singhal, R., et al.: Fast online ‘next best offers’ using deep learning. In: Proceedings of the ACM India Joint International Conference on Data Science and Management of Data. CoDS-COMAD 2019, pp. 217–223. ACM, New York (2019)
Google Scholar
Bai, J., et al.: Personalized bundle list recommendation. In: The World Wide Web Conference. ACM (2019)
Google Scholar
Zheng, B., Liu, B.: A scalable purchase intention prediction system using extreme gradient boosting machines with browsing content entropy. In: 2018 IEEE International Conference on Consumer Electronics (ICCE), pp. 1–4. IEEE (2018)
Google Scholar
Minjing, P., Xinglin, L., Ximing, L., Mingliang, Z., Xianyong, Z., Xiangming, D., Mingfen, W.: Recognizing intentions of e-commerce consumers based on ant colony optimization simulation. J. Intell. Fuzzy Syst. 33(5), 2687–2697 (2017)
Article Google Scholar
Schellong, D., Kemper, J., Brettel, M.: Generating consumer insights from big data click-stream information and the link with transaction-related shopping behavior. In: Proceedings of the 25th European Conference on Information Systems (ECIS) (2017)
Google Scholar
Schellong, D., Kemper, J., Brettel, M.: Clickstream data as a source to uncover consumer shopping types in a large-scale online setting. In: ECIS. Research Paper 1 (2016)
Google Scholar
Romov, P., Sokolov, E.: Recsys challenge 2015: ensemble learning with categorical features. In: Proceedings of the 2015 International ACM Recommender Systems Challenge, vol. 1. ACM (2015)
Google Scholar
Wu, Z., Tan, B.H., Duan, R., Liu, Y., Mong Goh, R.S.: Neural modeling of buying behaviour for e-commerce from clicking patterns. In: Proceedings of the 2015 International ACM Recommender Systems Challenge, vol. 12. ACM (2015
Google Scholar
Vieira, A.: Predicting online user behaviour using deep learning algorithms. arXiv preprint arXiv:1511.06247 (2015)
Yeo, J., Kim, S., Koh, E., Hwang, S.w., Lipka, N.: Predicting online purchase conversion for retargeting. In: Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, pp. 591–600. ACM (2017)
Google Scholar
Li, D., Zhao, G., Wang, Z., Ma, W., Liu, Y.: A method of purchase prediction based on user behavior log. In: 2015 IEEE International Conference on Data Mining Workshop (ICDMW), pp. 1031–1039. IEEE (2015)
Google Scholar
Liu, G., et al.: Repeat buyer prediction for e-commerce. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 155–164. ACM (2016)
Google Scholar
Guo, L., Hua, L., Jia, R., Zhao, B., Wang, X., Cui, B.: Buying or browsing?: predicting real-time purchasing intent using attention-based deep network with multiple behavior. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 1984–1992, July 2019
Google Scholar
Kooti, F., Lerman, K., Aiello, L.M., Grbovic, M., Djuric, N., Radosavljevic, V.: Portrait of an online shopper: understanding and predicting consumer behavior. In: Proceedings of the Ninth ACM International Conference on Web Search and Data Mining, pp. 205–214. ACM (2016)
Google Scholar
Panagiotelis, A., Smith, M.S., Danaher, P.J.: From amazon to apple: modeling online retail sales, purchase incidence, and visit behavior. J. Bus. Econ. Stat. 32(1), 14–29 (2014)
Article MathSciNet Google Scholar
Green, H.E.: Use of theoretical and conceptual frameworks in qualitative research. Nurse Res. 21, 6 (2014)
Article Google Scholar
Tang, L., Wang, A., Xu, Z., Li, J.: Online-purchasing behavior forecasting with a firefly algorithm-based SVM model considering shopping cart use. Eurasia J. Math. Sci. Technol. Educ. 13(12), 7967–7983 (2017)
Google Scholar
Schölkopf, B.: The kernel trick for distances. In Advances in Neural Information Processing Systems, pp. 301–307 (2001)
Google Scholar
Jeni, L.A., Cohn, J.F., De La Torre, F.: Facing imbalanced data–recommendations for the use of performance metrics. In 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction, pp. 245–251. IEEE, September 2013
Google Scholar

Download references

Author information

Authors and Affiliations

Dublin City University, Dublin, Ireland
Douglas Cirqueira & Marija Bezbradica
Raiffeisenlandesbank Oberösterreich, Linz, Austria
Markus Hofer
University of Applied Sciences Upper Austria, Steyr, Austria
Dietmar Nedbal
Maynooth University, Maynooth, Ireland
Markus Helfert

Authors

Douglas Cirqueira
View author publications
You can also search for this author in PubMed Google Scholar
Markus Hofer
View author publications
You can also search for this author in PubMed Google Scholar
Dietmar Nedbal
View author publications
You can also search for this author in PubMed Google Scholar
Markus Helfert
View author publications
You can also search for this author in PubMed Google Scholar
Marija Bezbradica
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Douglas Cirqueira .

Editor information

Editors and Affiliations

University of Bari Aldo Moro, Bari, Italy
Michelangelo Ceci
University of Bari Aldo Moro, Bari, Italy
Corrado Loglisci
CNR-ICAR, Rende, Italy
Giuseppe Manco
Federico II University, Naples, Italy
Elio Masciari
University of North Carolina, Charlotte, NC, USA
Zbigniew Ras

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Cirqueira, D., Hofer, M., Nedbal, D., Helfert, M., Bezbradica, M. (2020). Customer Purchase Behavior Prediction in E-commerce: A Conceptual Framework and Research Agenda. In: Ceci, M., Loglisci, C., Manco, G., Masciari, E., Ras, Z. (eds) New Frontiers in Mining Complex Patterns. NFMCP 2019. Lecture Notes in Computer Science(), vol 11948. Springer, Cham. https://doi.org/10.1007/978-3-030-48861-1_8

Download citation

DOI: https://doi.org/10.1007/978-3-030-48861-1_8
Published: 14 May 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-48860-4
Online ISBN: 978-3-030-48861-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

the ECML PKDD community (opens in a new tab)

Customer Purchase Behavior Prediction in E-commerce: A Conceptual Framework and Research Agenda

Abstract

Similar content being viewed by others

Predicting customer purchase behavior in the e-commerce context

Towards early purchase intention prediction in online session based retailing systems

A practical model to predict the repeat purchasing pattern of consumers in the C2C e-commerce

Keywords

1 Introduction

2 Research Methodology

3 Results

A Conceptual Framework of Analysis for Customer Purchase Prediction in E-commerce

3.1 RQ 1. What Tasks and Applications Have Been Addressed in the Problem of Consumer Purchase Behavior Prediction in E-Commerce?

3.2 RQ 2. What Methodologies Have Been Adopted to Predict Consumer Purchase Behavior Online?

Online Customer Behavior Datasets and their Features.

Feature Construction for Purchase Prediction.

Predictive Methods.

3.3 State-of-the-Art Performance

3.4 Research Agenda

4 Final Remarks

Notes

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Societies and partnerships

Navigation

Customer Purchase Behavior Prediction in E-commerce: A Conceptual Framework and Research Agenda

Abstract

Similar content being viewed by others

Predicting customer purchase behavior in the e-commerce context

Towards early purchase intention prediction in online session based retailing systems

A practical model to predict the repeat purchasing pattern of consumers in the C2C e-commerce

Keywords

1 Introduction

2 Research Methodology

3 Results

A Conceptual Framework of Analysis for Customer Purchase Prediction in E-commerce

3.1 RQ 1. What Tasks and Applications Have Been Addressed in the Problem of Consumer Purchase Behavior Prediction in E-Commerce?

3.2 RQ 2. What Methodologies Have Been Adopted to Predict Consumer Purchase Behavior Online?

Online Customer Behavior Datasets and their Features.

Feature Construction for Purchase Prediction.

Predictive Methods.

3.3 State-of-the-Art Performance

3.4 Research Agenda

4 Final Remarks

Notes

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Societies and partnerships

Search

Navigation