Food Demand Forecast for Online Food Delivery Service Using CatBoost Model

Pujara, Ansh; Pattabiraman, V.; Parvathi, R.

doi:10.1007/978-3-030-78750-9_9

Ansh Pujara⁶,
V. Pattabiraman⁶ &
R. Parvathi⁶

Part of the book series: EAI/Springer Innovations in Communication and Computing ((EAISICC))

454 Accesses
2 Citations

Abstract

Online food ordering has been proven a great source for businesses from a wide range of sectors. By using an online food ordering system, you can get your food to be delivered to your door without consuming much time. For businesses operating in the food industry including restaurants, agriculture, and many others, accurate forecast is of crucial importance because of the unpredictable demand pattern. In several studies, the choice of an appropriate forecasting model remains a concerning point. In this context, this research aims to analyse the performance of the CatBoost Gradient boosting model for the prediction of the amount of raw materials required for a meal delivery company that operates in multiple cities having multiple centres.

Access provided by Autonomous University of Puebla. Download conference paper PDF

Application of XGBoost Algorithm for Sales Forecasting Using Walmart Dataset

An Extreme Gradient Boosting Model Optimized with Genetic Algorithm for Sales Forecasting of Retail Stores

A Comparative Study for Machine Learning Models in Retail Demand Forecasting

Keywords

1 Introduction

Demand forecasting is an integral component of any rising online business. Without proper demand forecasting techniques in place, it can be nearly impossible to have the required amount of stock available at any given time [1]. A major facet of businesses today is the notion of supply chain integration, whereby resources are combined to provide value to the end consumer and where all the upstream firms realize the importance of integration.

An online food delivery service has to face several difficulties like dealing with a lot of perishable raw materials, which makes it perhaps more essential for such a company to accurately forecast daily and weekly demand [2]. The task is to predict the demand of raw materials for 10 weeks in future using the historical data for a product-centre combination of the past 145 weeks.

Prediction of the type of raw material required at a particular delivery centre/branch depends upon numerous factors such as the type of meal ordered at the centre the most, the type of raw materials available nearest to a centre, etc. This chapter will mainly focus on one factor, that is, the type of meal ordered from a particular centre. The results will determine the demand for raw materials for the centre-meal combination [2, 3].

Many forecasting models have been developed in recent years, and almost all of them share the same basic idea, but follow paradigms from different fields [1]. But the present chapter aims at the performance of the CatBoost Regressor. In the next section, the theoretical structure will be discussed in order to support the model proposed in this study.

2 Literature Survey

Online food ordering is the way of ordering food through an Internet site or a mobile phone application, generally handled by a particular restaurant or a mediator service (such as Zomato, Swiggy, Dunzo, etc.). The product may either be a ready-to-eat food product (delivered directly from a licenced home kitchen, restaurant, or a ghost kitchen) or food not explicitly prepared for direct consumption (like vegetables directly from a farm, frozen meats, etc) [3, 4]. The history dates back to 1994, when the famous pizza chain, Pizza Hut started accepting orders online. Since then, restaurants, hotels, groceries, etc. are majorly accepting orders through online mode. In online food orders placed at restaurants/chains, the restaurants prefer to use their own website/ app, or opt to employ a delivery vendor. Usually, food delivery orders are on demand and meant to be consumed immediately, including hot meals already prepared. Ordering food for delivery usually includes contacting a local restaurant or chain online or by telephone. For deliveries in large cities, it is not possible to deliver the meal on time and fresh [5]. Thus, restaurants have their own chains/franchises in major landmarks, which helps them to attract customers from the whole city.

According to a 2019 market analysis on food delivery services, it was found that the global online ordered food delivery market was valued at ₹7.1755 trillion and is expected to rise approximately 9% a year, approaching ₹10.267 trillion by 2023 [5]. The study defined the market as follows:

1.
Meals ordered online that the restaurants deliver directly, either ordered via a portal (e.g. Zomato) or through its website.
2.
Online meal orders and deliveries all driven by a platform (like Swiggy).
3.
Online orders for takeaway in the restaurant by the customer. Excluding orders made via phone call (Fig. 1).

3 Methodology

3.1 CatBoost Regressor

CatBoost is based on the Gradient Boosting library. Gradient boosting is an effective machine-learning technique that is extensively used in various types of business challenges like fraud detection, recommendations, demand/ supply forecast, and it achieves state-of-the-art results. Also, decision trees have been found as the base predictors when it comes to the implementation of gradient boosting algorithms. While dealing with pure numerical data, it is suitable to use decision trees, but during implementation many datasets contain categorical features that sometimes act as most important features for prediction [6].

A standard solution for dealing with categorical features that are not necessarily equivalent to one another is to substitute each feature with a number at the time of pre-processing [6, 7].

Gradient Boosting Algorithms like XGBoost, LightGBM, etc. deals with categorical data through one-hot encoding of such features before training the model. Unlike other algorithms, CatBoost does not require one-hot encoding of categorical features during model building.

CatBoost, similar to the various Gradient Boosting strategies, fabricates new trees to surmise the angle of the present tree. CatBoost utilizes a more efficient procedure that reduces overfitting and permits to utilize the entire dataset for preparation [6]. To be specific, a random permutation of the dataset in use is carried out, and an average label value for each example is computed. CatBoost comprises two boosting models, Ordered and Plain. Here, Ordered boosting is used [7].

In CatBoost, oblivious trees are used as base predictors. In such trees, the same criterion of splitting is used over a whole level of the tree point. Such trees are balanced and less likely to undergo overfitting [6]. CatBoost model evaluator makes use of these facts: It first converts all used one-hot encoded features, float features and statistics into binary and then uses those binary features to compute model predictions [6, 7].

Assuming a dataset having $ {\left\{\left({x}_k,{y}_k\right)\right\}}_{k=1}^n, $ , where x _k is a vector of features chosen randomly and y _k is a target, initially CatBoost generates s + 1 independent random permutations of the dataset (σ ₀, . . , σ _s) [6, 7]. Now, permutations σ ₁, . . , σ _s will be used for evaluation of splits, while σ ₀ will be used for choosing the leaf values of the trees obtained [7]. In ordered boosting, we have several supporting models M ₁, . . M _nsuch that each M _i model will act as the current prediction for i the sample into the permutation σ _r [7]. In order to prevent “prediction shift”, CatBoost makes use of permutations such that σ ₁ = σ ₂. This technique prevents the use of target for training of model for “Target Statistic” calculation or “gradient estimation”.

How CatBoost works?
1. Initially, one categorical feature is chosen to start with, let us say x.
2. Next, randomly a row is chosen from the training set, and a random level of this categorical feature is exchanged with a number.
3. The number chosen is generally based on the target variable depending on the category level. To put it another way, the target number is based on the estimated output variable.
4. A splitting function is used to construct two different sets of training data: One set with all the features will have a greater target variable, and the other set with smaller target variables.

4 Implementation

Here, Python3 is used to implement the model available in the CatBoost library by Yandex. To implement the CatBoost regressor, Online Food Delivery System’s dataset is used (available online). The dataset contains historical data from 77 distinct delivery centres each of either Type A, Type B or Type C. The time frame of the data provided is 145 weeks. It contains several categorical features that have a direct impact on the prediction like operating area, type of cuisine, category of cuisine, etc. The aim is to perform regression analysis on the number of orders for the upcoming 10 weeks, which will tell about the required raw materials. Figure 2 is a glimpse of the dataset used in this study.

An exploratory analysis of the dataset was done and is shown below to observe the relationship between the variables (Figs. 3, 4, 5, 6, 7 and 8).

CatBoost offers a versatile interface for parameter selection and tuning. The parameters are selected after observing different values through grid search, which builds a model for every combination of hyperparameters specified and evaluates each model. The considered values for the parameters were: learning rate = [0.02, 0.03, 0.1]; tree depth = [4, 6, 8]; l2 regularization = [1, 3, 6, 9, 10]. The below table shows the best combinations where the resulting RMSE values were best (Table 1 and Fig. 9).

Table 1 RMSE values for different combinations of hyperparameters

Full size table

As RMSE (Root Mean Squared Error) is the loss function used in this analysis, the data needs to be converted into log values. Some of the features were even transformed for better understanding of the model. In this dataset, almost all the features were equally important for the prediction, but to achieve a better result, some extra features were also added like discount on base price, discount ratio, difference between price, certain lag features and ewm (exponentially weighted mean) features.

Below are shown some of the features participating in the forecast (Fig. 10).

Root Mean Square Error (RMSE) is the standard deviation of the residuals (prediction errors). It is the square root of MSE (Mean Squared Error) [8, 10]. In case of unbiased estimators, RMSE is just the square root of variance, which is actually Standard Deviation [2, 8]. In case of RMSLE (Root Mean Squared Logarithmic Error), the log of the predictions and actual values are taken. So basically, changes occur in the variance that is being measured [2].

$$ RMSE=\sqrt{\frac{1}{n}\sum_{i=0}^n{\left({y}_i-{\hat{y}}_i\right)}^2} $$

(1)

$$ RMSLE=\sqrt{\frac{1\ }{n\ }{\sum}_{i=0}^n{\left(\log\ \left({p}_i+1\right)-\log \left({a}_i+1\right)\right)}^2\ } $$

(2)

Once the features are decided, the data is split into train, validation and test. But here, the train data is divided for validation using K-Fold Cross Validation. And finally, the regression model is applied to the test dataset resulting in the number of orders for the next 10 weeks data.

5 Analyses and Results

This research on the demand forecasting of an Online Food Delivery service has brought some insightful points to notice. The evaluation metric, Root Mean Squared Logarithmic Error value obtained in this study was 0.0887.

When compared to other studies, the following results we obtained (Table 2).

Table 2 Performance of different models on the food demand dataset

Full size table

The fit of the model can be observed in the below figure (Fig. 11), which depicts the sensitivity analysis of the model.

For a precise output of a forecasting model, adjustments must be made to the parameters for the technique applied. CatBoost, just like other machine learning packages, provides with a way to compare the feature (variable) importance [6]. This can help in building a better model, and allows for easier interpretation of results. Figure 12 is a plot showing the Feature Importance.

It is observed that avg_orders (average orders) have the highest importance among all variables. Evidently, CatBoost has shown some state-of-the-art results with most of the popular datasets containing categorical features. In this chapter, it shows more than expected results.

Also, applications for the Internet of Things (IoT) have evolved immensely, producing a vast amount of data needed for intelligent data processing. To optimize asset utilization as well as customer value, IoT supply chain management tools mine valuable data inside and outside the business [9].

IoT devices communicate with each other to collect and share data [9]. The results obtained through the study can provide insightful results for the data collected through the IoT devices and help in forecasting the demand of raw materials as well as automate the overall process of supply chain management (Fig.13).

6 Conclusion

From the results obtained from this analysis, the following conclusions can be drawn:

1.
The CatBoost model applies shift and ewm (exponentially weighted mean) features to the dataset as the data used contains historical data values.
2.
CatBoost proves to offer improved performance as compared to other regression models that also deals with categorical data.
3.
The historical data has a great and major impact on predicting the future values.
4.
For any retailer, manufacturer or supplier, forecasting product demand is essential for its business’ growth. Demand forecasting helps determine the quantities that should be purchased, produced and shipped.

In general, accurate demand forecasts help the manufacturers/suppliers obtain a correct idea of the raw materials requirements in future, which leads to high levels of customer services. On the other hand, erroneous forecasts can lead to high-cost operations and a poor customer service.

References

Zhang, Y., Xu, H., & Zheng, Y. (2009). Chinese residents cold chain logistics demand forecasting based on GM (1, 1) model. African Journal of Business Management, 6(14), 5136–5141.
Google Scholar
Chai, T., & Draxler, R. R. (2014). Root mean square error (RMSE) or mean absolute error (MAE)? –arguments against avoiding RMSE in the literature. Geoscientific Model Development, 7(3), 1247–1250.
Article Google Scholar
Pigatto, G., Machado, J. G. D. C. F., dos Santos Negreti, A., & Machado, L. M. (2017). Have you chosen your request? Analysis of online food delivery companies in Brazil. British Food Journal, 119, 639.
Article Google Scholar
Keeble, M., Adams, J., Sacks, G., Vanderlee, L., White, C. M., Hammond, D., & Burgoine, T. (2020). Use of online food delivery services to order food prepared away-from-home and associated sociodemographic characteristics: A cross-sectional, multi-country analysis. International Journal of Environmental Research and Public Health, 17(14), 5190.
Article Google Scholar
Zulkarnain, K., Ahasanul, H., & Selim, A. (2015). Key success factors of online food ordering services: An empirical study. Malaysian Institute of Management, 50(2), 19–36.
Google Scholar
Dorogush, A.V., Ershov, V. & Gulin, A. (2018). CatBoost: gradient boosting with categorical features support. arXiv preprint arXiv:1810.11363.
Google Scholar
Prokhorenkova, L., Gusev, G., Vorobev, A., Dorogush, A. V., & Gulin, A. (2018). CatBoost: Unbiased boosting with categorical features. In Advances in neural information processing systems (pp. 6638–6648).
Google Scholar
Hastie, T., Tibshirani, R., & Friedman, J. (2009). The elements of statistical learning: Data mining, inference, and prediction. Springer Science & Business Media.
Book Google Scholar
Adi, E., Anwar, A., Baig, Z., & Zeadally, S. (2020). Machine learning and data analytics for the IoT. In Neural computing and applications (pp. 1–29).
Google Scholar
Da Veiga, C. P., Da Veiga, C. R. P., Catapan, A., Tortato, U., & Da Silva, W. V. (2014). Demand forecasting in food retail: A comparison between the Holt-winters and ARIMA models. WSEAS Transactions on Business and Economics, 11(1), 608–614.
Google Scholar

Download references

Author information

Authors and Affiliations

Vellore Institute of Technology, Chennai, Tamil Nadu, India
Ansh Pujara, V. Pattabiraman & R. Parvathi

Authors

Ansh Pujara
View author publications
You can also search for this author in PubMed Google Scholar
V. Pattabiraman
View author publications
You can also search for this author in PubMed Google Scholar
R. Parvathi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to R. Parvathi .

Editor information

Editors and Affiliations

Professor (Associate) and Research Head, Department of Computer Science and Engineering, Sri Eshwar College of Engineering, Coimbatore, TN, India
Anandakumar Haldorai
Professor, Department of Computer Science and Engineering, Presidency University, Bengaluru, KA, India
Arulmurugan Ramu
Principal, Sri Eshwar College of Engineering, Coimbatore, TN, India
Sudha Mohanram
Professor, Department of Computer Science, School of Computing and Engineering, Centre for Planning, Autonomy and Representation of Knowledge, University of Huddersfield, Huddersfield, UK
Joan Lu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Pujara, A., Pattabiraman, V., Parvathi, R. (2022). Food Demand Forecast for Online Food Delivery Service Using CatBoost Model. In: Haldorai, A., Ramu, A., Mohanram, S., Lu, J. (eds) 3rd EAI International Conference on Big Data Innovation for Sustainable Cognitive Computing. EAI/Springer Innovations in Communication and Computing. Springer, Cham. https://doi.org/10.1007/978-3-030-78750-9_9

Download citation

DOI: https://doi.org/10.1007/978-3-030-78750-9_9
Published: 01 January 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-78749-3
Online ISBN: 978-3-030-78750-9
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics