3.1 Introduction

Taxes are the most common source of government incomes, which has influence in achieving socio-economic, political, and macroeconomic objectives of countries and other types of territorial divisions. Taxes are a legal instrument for increasing resources into the government to enhance its economic development. In the work of [25], the author highlights that tax payments are a major source of income of governments and are considered a fiscal instrument for regulating and resolving economic and social policies. At the same time, taxes are considered a mechanism for enhancing economic growth. In the case of this work, the tax payment historical behavior in Ecuador is analyzed. In Ecuador the tax payments are fundamental to financing the government national. Unfortunately, Ecuadorians’ tax payment culture can be consider as low with high levels of evasion in different sectors of the economy [3, 31]. However, in spite of historical citizens’ behaviors, nowadays; cultural conditions seems to be changing. According to statistics reports of the Internal Rents Service (from Spanish, Servicio de Rentas InternasFootnote 1 (SRI)), in the last years (2005–2016), the incomes ratio in Ecuador has been increased by 29%. This growth rate is due to public policies in taxes matter in Ecuador. Several of these policies are related to applying information and communications technologies (ICT) tools within taxes processes.

According to the European Commission[19], eGovernment is defined as a way to provide a wide variety of benefits including efficiency and savings for governments and businesses, transparency, and improvement of citizens’ participation in political issues. One of the main challenges for the development of eGovernment is related to the implementation of ICT. However, eGovernment is much more than just ICT, it also involves rethinking organizations and processes and changing the behavior of public administrators so that public services can be delivered more efficiently to citizens.

eCommerce emerged due to the influence of the ICT revolution. Together with eCommerce, RSs were introduced as a mechanism to increase business incomes [35]. In the same way, several researchers have pointed out the benefits of applying the best practices of RSs to eGovernment solutions [26, 27, 38]. In spite of the efforts presented, the use of RSs as way to increase citizens’ awareness towards improving tax payments is an application that needs more attention.

The Municipality of Quito (from Spanish, Distrito Metrolopitano de Quio (DMQ)) faces problems with a past due portfolio that includes a big group of citizens. According to report DAI-AI-0026-2017 presented by the Comptroller General of Ecuador, the past due portfolio until December 2015 was 31,284.41092 USD [10]. This trend continues generating delays in the execution of public services. On the other hand, the Municipality of Quito does not have processes and systems to setup discounts focused on citizens. Discount methods are mainly designed regarding payment deadlines. This has become in a big problem for the city. Moreover, every 2 years the most important taxes (urban and rural) are constantly changing. This process is developed and focuses on construction but it is not applied to citizens. This model request several legal reforms to change the discount formula. The constant modifications in the laws have an negative impact in the trust of citizens and it reduces their engagement level with the municipality [13, 14, 17].

These issues pushed tax authorities inside the Municipality of Quito to apply alternative and creatives approaches to reduce and mitigate the effect of a past due portfolio and motivate citizens towards improving their payment behavior [15, 16, 33]. Applying personalized discounts centered on citizen payment behavior with the use of RSs within the Municipality of Quito, could help to improve the issues mentioned above. The proposed methods in this chapter could present the following benefits:

  • Improving income taxes.

  • Reducing the due portfolio.

  • Supporting decision-making processes to create/update taxes.

  • Increasing citizens’ engagement.

  • Fostering the usefulness of RSs as a way to develop eGovernment.

Next sections are structured as follows. First, Sect. 3.2 gives a brief introduction and description of tax payments, RSs for eCommerce and eGovernment, and fuzzy logic, which are the basis of the model proposed. Then, Sect. 3.3 presents the methods used by the recommendation system model proposed. Additionally, in Sect. 3.4 a simulation model is presented. Finally, the authors give their concluding remarks and the future outlook in Sect. 3.5.

3.2 Background

3.2.1 Tax Payments in Ecuador

According to the Organization for Economic Cooperation and Development (OECD), the term taxes should be understood as compulsory, unrequited payments to general government, which means that the benefits provided by governments to taxpayers are not normally in proportion to their payments [30].

The current tax payment system in Ecuador is based on the following principles: legality, generality, equality, proportionality, and non-retroactivity [36]. The Ecuadorian tax system is based on indirect and direct taxes. In 2015, the fiscal pressure in Ecuador was around 21.7% [9]. On average, taxes represented 76% of the Ecuadorian GDP in 2016 [7] and are administrated by the SRI. On the other hand, external taxes are administrated by the National Customs Service of Ecuador (from Spanish, Servicio Nacional de Aduana del EcuadorFootnote 2 (SENAE)), which also has nationwide jurisdiction. Local and provincial taxes are created and administered by each local government as municipalities and provincial councils, respectively.

Municipalities in Ecuador use different discount methods as way to persuading citizens to pay their taxes. The most common method is to offer discounts by early payment [5].

3.2.2 RSs for eCommerce and eGovernment

RS techniques are applied in several domains. In the same way, Schafer et al. [35] show evidence that RSs can increase businesses’ incomes. Additionally, the survey developed by Lu et al. [27] presents the state of the art in recommender system applications. A list of domains were evidenced with positive outcomes, which includes fields such as: eGovernment, eCommerce, eLibrary, eLearning, eBusiness, eTourism, eResources, and eGroups, among others.

In the academic literature, the most used mechanisms to develop RS applications are: collaborative filtering (CF), content-based (CB), and hybrid [8]. The main problem in the CF approach is related to the so-called sparsity problem, when the number of items rated is small compared to the total number of items [38]. On the other hand, CB faces the problem of overspecialized recommendations [27]. To solve these problems, many advanced recommendation approaches have been proposed including social network-based RSs, fuzzy-based RSs, context awareness RSs, and group RSs [27].

In the work of Yager [41], the author points out that an important component of CF that is the calculation of similarity of interest based on correlations between individuals for predictions and recommendations. Several issues have been identified when using CF approaches, based on this issue. For that reason, a new approach so-called reclusive approach, was proposed. It is based upon finding a similarity between objects while the CF approach is based on similarity between people. This approach considers an RS as a collection of objects as, O = {o 1, …, o n} and it recommends to a user objects of O that could interest him. In [41], the proposed method considers five issues: RS as object collections, object representation, user preferences modeling, user profiles, and environment.

The basis of this approach is an object and its membership degree. According to Zadeh [43], an object do not have a defined membership criteria. For instance, in the case of the class animal, it has a clear definition (e.g., dog, cat, horse, etc.). However, the class people’s ages (e.g., baby, boy, young, and old), an imprecise decision emerges. In this arena, where the decision-making process is not obvious, fuzzy sets and fuzzy logic appears as a solution.

3.2.3 Fuzzy Logic Overview

In the work of Zadeh [43], the author pointed out that fuzzy sets provide a natural way of dealing with problems in which the source is imprecision. On the other hand, in sharp logic, the values obtained after executing a sentence are binary (true, false). These results are represented by crisp sets; however, several solutions in real life need values in the interval [0, 1]. In this case, fuzzy logic is similar to human decision-making with its ability to work with approximated data to find precise solutions [1].

Fuzzy set theory was introduced by Zadeh in 1965 [43]. It described the concept of membership degree. Crisp sets behaviors are denoted by: If x belongs to a set A then x ∈ A, otherwise xA. Therefore, for each x of the set A there are only two responses: either x is an element of A or it is not. Using the concept of membership degree, each element can be represented by a function that defines the values allowed. The values allowed are drawn as linguistic values. A linguistic value refers to a label representing knowledge that has meaning determined by its degree of membership function. For instance, x 1 = old with the degree μ = 0.8 means that the variable x 1 has a linguistic value represented by the label old, whose meaning is determined by the degree 0.8.

To illustrate this concept in the case presented in this chapter, a crisp set for the linguistic variable discount as a function of the payment date of a citizen is used. It tries to resolve the following problem: A local government institution wants to create a discount over one of its taxes. The discount strategy expects that citizens pay as soon as possible to maximize their annual budget. In represents that if one citizen pays between the discount period he gets a discount, but, if he pays in the first days he gets a gradual discount.

The class discount storage the discount periods in days. It is defined as follows: discount = {1, .., 180}. The characteristic function presented in Eq. (3.1) has only two values: true (ten) or false (zero).

$$\displaystyle \begin{aligned} X_{discount}(x) = \left\lbrace \begin{array}{ll} \mbox{if } 1\leq x \leq 180 & 1 \\ \mbox{if } 180<x<366 & 0 \end{array} \right. {} \end{aligned} $$
(3.1)

The function presented in Eq. (3.1) is not useful for the calculation required by the local government institution, since it does not allow setting a real discount value according to payment date.

This issue can be resolved using fuzzy sets with membership functions. Zadeh gives a number for each value inside the universe set; this number is the degree in which the element is inside the set. For instance, in the case of the set discount, there are some values that represent the citizen payment date; therefore, different degrees of discount could be represented.

Zadeh defines fuzzy sets as A in X is characterized by a membership function f A(x) that associates each element in X with a real number in the interval [0, 1], with the value of f A(x) at x representing the grade of membership of x in A. Equation (3.2) shows the membership function for the fuzzy class discount. The equation presented in Eq. (3.2) responds to a gamma inverse fuzzy function definition.

$$\displaystyle \begin{aligned} X_{discount}(x) = \left\lbrace \begin{array}{ll} \mbox{if } 180<x<366 & 0 \\ \mbox{if } x \in (1,180) & 1-((x-1)/(180-1)) \\ \mbox{if } x = 0 & 10 \\ \end{array} \right. {} \end{aligned} $$
(3.2)

The level of granularity used in Eq. (3.2) allows the local government institution to apply a real discount related on payment date. The real values after and before could be represented in linguistic variables by a collection of quantifiers such as: near, close, approximately, etc. [44]. Finally, fuzzy set theory uses a large volume of operations and properties that can be considered as tools to apply in different scenarios.

3.2.4 Fuzzy Logic Applied to Marketing

In the work of Donzé and Meier [18], the authors define marketing as way to identify and pick up the customers’ needs. They also point out the use of customer relationship management (CRM) as a strategy for building customer equity and improving financial revenue. In this sense, fuzzy logic can be applied to marketing in a wide set of tasks.

eGovernment is defined as a subset of the exchange relationships of eCommerce. Therefore, eMarketing is a way to improve the relations between citizens and government institutions through ICT tools. In the work of Meier and Stormer [28], authors propose to add eMarketing as an element into the value chain of eBusiness.

Several researchers highlighted the importance of fuzzy logic in both eCommerce and eGovernment as an effective means of decision support, not only from a business perspective but also from the clients or citizens perspective. Nowadays, fuzzy classification models are used in the implementation of RSs. In the work of Lu [27], the author highlighted a number of RS approaches and the techniques implemented. A summary is presented in Table 3.1.

Table 3.1 RSs techniques applied, adapted from [27]

Additionally, other authors have presented positive outcomes on the use of RSs in eCommerce and eGovernment [11, 12, 22, 23, 26, 27, 39, 42]; however, the application of RSs as a way to increase the citizens’ awareness towards tax payments is poorly explored.

3.3 Fuzzy-Based RS Model

The model presented in this chapter is centered on citizens and their payments behaviour. Therefore, the reclusive approach for recommendation presented by Yager [41] is applied.

Based on the targeted marketing definition as results of RSs in eCommerce field, the approach proposed uses the definition of discount as a way to persuading citizen to paying taxes. In this way, it creates personalized discounts to each citizen. The model is set by two types of discounts centred on citizens, general and specific, defined as follows.

  • General discount—It is applied in relation to the historical payment behaviour.

  • Specific discount—It is related to specific issues that e-government institutions could be interested (e.g., dead line, risk relevance, etc.)

Both, general and specific discounts have been implemented using fuzzy sets definitions. Such definitions were mixed in a Euclidean space to set specific discount for specific citizens, where each discount percentage is unlikely. In the next section, the fuzzy sets used and its details are shown.

3.3.1 Fuzzy Sets

This section aims is to shows every fuzzy definition used to implement the discount methods but also the recommendation notification policies.

3.3.1.1 General Discount

To implement the general discount, the proposed model uses the historical citizens behaviors considering the number of times that the citizen was included in some payment condition as it is show in Table 3.2.

Table 3.2 Condition of payment by taxes

Payment conditions are applied regarding a group of taxes in the past. The values are computed using the fuzzy classification sets presented in Fig. 3.1. The combination presented in Fig. 3.1 allows us to set a citizens classification, as well as a membership degree, defined as citizen behavior ranking (CBR).

Fig. 3.1
figure 1

Fuzzy sets for citizens behavior

Five fuzzy sets were drawn (Fig. 3.1) over the linguistic variable citizens behavior. The linguistic values are related to the citizens payment behavior (i.e., bad, regular, average, good, and excellent). Each linguistic values have a corresponding discount by crisp group as shown in Table 3.3. The ranking values shown in Table 3.3 have a corresponding fuzzy set (refer to Fig. 3.1); however, each ranking value is overlapped by other value. Therefore, the discount percentage does not belong to an element in one category only. In this scenario, the final discount will be done by membership degree on each group of fuzzy sets and the range of discounts set as shown in Table 3.3. This result is named citizen behavior discount percentage (CBDP) as shown in Eq. (3.3).

$$\displaystyle \begin{aligned} CBDP=\frac{\sum_{i=1}^{n} \left\lbrace \left\lvert {(RF-(RI-1))} \right\rvert \times \mu_i\right\rbrace}{n} {} \end{aligned} $$
(3.3)

where RF is the final value on the column discount in % as shown in Table 3.3. RI is the initial value on the column discount in % in Table 3.3. μ i is the membership degree of the fuzzy set i. Finally, n numbers of fuzzy sets overlapped.

Table 3.3 Discount by rating

CBDP computes the discount percentage by every group of taxes to compute the global percentage discount by citizen (GCBDP). Each group of taxes is assigned a weight (W) as shown in Eq. (3.4).

$$\displaystyle \begin{aligned} GCBDP=\sum_{I=1}^{N} W*CBDP {} \end{aligned} $$
(3.4)

where, N is the total number of group of taxes by citizen. W is the ratio between the value to paying of tax group and the total debt for the citizen. As an example, citizen A has to pay three taxes: urban, patent, and operation. His total debt is 100 and the values to pay by each group of taxes are 80, 10, and 10. Therefore, the value of W assigned to each group of taxes are: urban = 0.8; patent = 0.1, and operation = 0.1. Finally, CBDP is the individual discount percentage by group of taxes.

3.3.1.2 Specific Discount

The general discount presented in the previous section is the key element in the citizens payment behaviour. It could be mixed with specific discounts related with other factors (e.g., dead line, risk relevance, etc.). Specific discounts percentages (SDP) are computed as ratio between a point inside of Euclidean space and the maximum distance value (MDV ) that can take negatives or positives values of the scale. It is shown in Eq. (3.5).

$$\displaystyle \begin{aligned} SDP=\frac{\sqrt{\sum_{i=1}^{n} (x_i-y_i)^2})}{MDV} {} \end{aligned} $$
(3.5)

Both general and specific discounts are added for getting the final discount (FD) by group of taxes. It is shown in Eq. (3.6).

$$\displaystyle \begin{aligned} FD=\frac{(CBDP+ SDP)}{2} {} \end{aligned} $$
(3.6)

According to previous definitions, the model proposes two specific discounts. The first one is related to deadline payment, and the other with the risk relevance. Both discounts are combined with CBR and plotted on an Euclidean space as shown in Fig. 3.2. To compute the discounts according to Fig. 3.2, consider the X axis in region dead line represents time, and risk relevance is the ratio between citizens debt and the citizens with the most debt inside each group of taxes; moreover, points labeled as X1 − Y 1 (as shown in Fig. 3.2) are the initial points to compute the Euclidean distance.

Fig. 3.2
figure 2

Discounts combined with CBR

3.3.1.3 Recommendation Notification Policies

The fuzzy sets shown in Figs. 3.3 and 3.4 allow to setup the membership degree to define different recommendation policies, for example, if the membership function is 1 in both scenarios the recommendation should be sent to citizen every standard time.

Fig. 3.3
figure 3

Fuzzy sets by payment deadline

Fig. 3.4
figure 4

Fuzzy sets by risk relevance

Citizens behavior fuzzy sets allow establishing recommendations centered on citizen. Therefore, the proposed model combines every fuzzy set (payment deadline and risk relevance) in an Euclidean space to compute the centroid defined as fuzzy profile (FP). Figure 3.5 presents the fuzzy sets of the citizens behavior and risk relevance combined in an Euclidean space, it is computed using Eq. (3.7). The FP is computed by setting up the initial centers as (0, 0). The X axis represents the relevance (or deadline) and the Y  axis is the point generated by CBR as shown in Fig. 3.5.

$$\displaystyle \begin{aligned} FP=\sqrt{\sum_{i=1}^{n} (x_i-y_i)^2} {} \end{aligned} $$
(3.7)
Fig. 3.5
figure 5

Fuzzy profile (FP)

The FP presented in Fig. 3.5 generates a recommendation point with value 0.87. The risk relevance is calculated considering the value five for the citizen with the highest tax debt; therefore, the relevance for a specific citizen is the ratio between his or her debt and the citizen with the highest debt.

This information changes in real time for each citizens payment. On the other hand, the deadline for payment relevance is calculated considering the value two (non-discounted payment) as the user is moving towards the risk zone. Therefore, in this case notifications will be send. The notifications regarding citizens payments will depend on their behaviors. Both risk relevance and deadline payment generate a recommendations matrix to save recommendations messages.

3.3.2 System Architecture

The system architecture is show in Fig. 3.6. It considers three components: message handler (MH), request validation (RV ), and fuzzy recommender system (FRS).

Fig. 3.6
figure 6

Fuzzy-based recommender system architecture

3.3.2.1 Message Handler

The message handler (MH) implements several functions to send recommendations available to either citizens or government agencies in different ways, for example, e-mail, SMS, voice mail, etc. The recommendation message is splitting according to the fuzzy sets by payment deadline, risk relevance, and citizen behavior. For each citizen C i, a set of recommendation R i will be send. Each R i could be responded by the citizen; therefore, the ranking of accuracy of recommendations is computed by the ratio of response sent for each recommendation. Responses are received by a request validation (RV ) component.

3.3.2.2 Request Validation

The RV  implements a reactive autonomous agent as a real time supervisor a shown in Fig. 3.7. Its main function is to listen the environment, activate the process corresponding and increase the knowledge base. Figure 3.7 presents the inputs and output from this component.

Fig. 3.7
figure 7

Request validation design

The RV  component is spitted by six inputs, three outputs, and six processes to control the states and one data base (refer to Table 3.4). Each input is considered as a sensor or listener, its waiting triggers or shift in the environment are designed to fire an action. The outputs are responsible to connect the message or order from the RV  towards its target. The processes have to comply every rule or restriction until they get a successfully output requested. Finally, the database records each interaction and its results.

Table 3.4 Elements of the request validation component

3.3.2.3 Fuzzy Recommender System

The fuzzy recommender system is divided by three sub components, the central data repository (CDR), the constraint-based engine (CBE), and the fuzzy recommendation engine (FRE).

Central Data Repository

Institutional data warehouses are widely applied in e-government solutions [2, 4, 29]. Therefore, Central Data Repository CDR implements an institutional data warehouse (DWH). Its main function is to integrate the information from large number of homogeneous and/or heterogeneous sources into an unique data storage. Moreover, the DWH has different levels of details, one of this is defined as data mart [24]. Data flows into the DWH from the environment which needs processes to load and transform the data, these processes are named extraction transformation and load (ETL). As shown in Fig. 3.7, CDR contains ETL’ processes to process data from citizens and services. It’s more important features are summarized in Table 3.5.

Table 3.5 Central data repository features

The CPD stores the profiles of involved citizens. A citizen profile (CP i) is associated with a citizen C i. It consists of a tuple (CPDF i, CPDP i, CPDT i), where:

  1. 1.

    CPDF i stores the identification, social, and demographics information such as, name, date of born, gender, province, zone and so on;

  2. 2.

    CPDP i infers the rules from the number of times that the services (taxes) were used in the past;

  3. 3.

    CPDT i stores information of citizens account about pays and debts (statement of account). It includes information such as, payment date, value of pay, agency, discount.

The SPD stores the profiles of involved eServices. A services profile SP i, associated with an service S i consists of a tuple (SPDF i, SPDB i, SPDC i), where:

  1. 1.

    SPDF i stores the identification, descriptions, and demographics information such as, name, date of deliver, target, zone and so on;

  2. 2.

    SPDB i stores the information of service budget including the cost of services without an electronic service, the cost using electronic service, and quantity citizens save using electronic services, among others;

  3. 3.

    SPDC i stores the information of constraints in the service such as, age, yearly income, nationality, permanent residence, physical disability, mental disability, job injury, qualification, academic degree, marital status, number of children, and relevance, among others.

The column “update mode frequency” presents the time executed by the ETL processes. Every data repository (CPD) or (SPD) are executed one time per day or according to requests made from external agents.

Constraint-Based Engine

Constraint-based Engine CBE was adapted from the constraint-based recommender systems approach in [20], and the framework proposed in [6]. Constraint-based recommender systems approach recommends services according to citizens needs [20]. In order to adapt the approach previously mentioned in the case of taxation on eGovernment services, a set of variables related to citizens profiles, services profiles, and a set of constraints are used. They define the relation between them, this set of constraints are named “Constrains-builder”. Its content is presented as follows:

  1. 1.

    Incompatibility (IC) restricts the services according to citizens requests (e.g., age, incomes, etc.)

  2. 2.

    Service (SC) owns restrictions of the service (e.g., availability, date launched, budget, etc.) )

  3. 3.

    Filter (FC) relates between citizens request and services (e.g., business taxes, urban taxes, etc.)

  4. 4.

    Request (RC) constraints related citizens requirements and services assigned (e.g., citizen address).

Information of relation between Citizen Profile and Service Profile in the matter of taxation is saved in Citizen Taxes Records data source (CTR). Data available on (CTR) contain information such as: Tax , citizen , payment date, value to pay, value paid.

Fuzzy-Based Recommender Engine

Fuzzy-based Recommender Engine (FRE) processes the data available on the citizen tax records data source (CTR) and prepares the recommendation list. The recommendation engine components presented in Fig. 3.6 implements the rules mentioned in Sect. 3.3.1. Recommendation lists are prepared using a set of steps defined in the fuzzy recommender builder presented as follows.

  1. 1.

    Citizen behaviour. It computes the citizens behavior by rating them from historical payment data. Every payment is assigned with a rating according to Table 3.2. The final rating is the average of the data set processed.

  2. 2.

    General Discount. With the citizens behaviour rating, the general discount percentage is computed as a membership degree by each group of fuzzy sets multiply discount range as presented in Table 3.3 and Eq. (3.3). As an example, a user rated with 3.2, his discount will be computed as membership degree of fuzzy sets good (μ G(3.2)) and average (μ A(3.2)) with the range of percentage settings in Table 3.3.

  3. 3.

    Computed classification by deadline. It computes the membership degree in the fuzzy set as shown in Fig. 3.3. Citizens with membership degree greater than 0 should be sent a paying recommendation.

  4. 4.

    Computed classification by relevance. It computes the membership degree in the fuzzy set as shown in Fig. 3.4. Citizens with membership degree greater than 0 should be sent a paying recommendation.

  5. 5.

    Computed specific discounts. It is used to compute specific discounts. The fuzzy recommender-builder implements the rules mentioned in Sect. 3.3.1.2.

  6. 6.

    Ranking. It represents a combination of sort processes. It implements three rankings, by discount values, risk relevance, and citizens behavior.

  7. 7.

    List of recommendations. The list of recommendations are performed after the classification calculation by relevance and deadline. Its responsibility is sending details by every recommendation towards the request validation (RV ) component and updates the knowledge base with the new information generated.

  8. 8.

    Feedback. It updates the knowledge base with information responded by the recommendation sent.

3.3.3 Recommendation Computation Process

In this section the case of the citizen of Quito city is used to follow the recommendation process. As an example, the case of the citizen Juan is used. He is 28 years old, lives in Quito in his suite, in the last 5 years he paid his municipal taxes as follows, three times with discount of 8% and two times with punish of 3%. In the present year Juan needs more information about the best way to save money. Juan makes this request on March 5, 2018 and he has a debt of USD 125.20. Juan has an account in municipal website PAM.Footnote 3 The maximum debt inside of the group of tax when Juan sent the request was USD 2125.00. The municipality has set in 2018 as max discount of 10%. From January until June, the municipality offers a discount by early payment.

According to the fuzzy-based recommender system architecture presented in Fig. 3.6, the steps to follow are:

  1. 1.

    Request is processed by (RV ) as “Citizen” and executes “Citizen rule” process (Fig. 3.7), then a request is accepted and sent to the FRS given that Juan has an account in the municipal website.

  2. 2.

    Juan has data from previous year; therefore, the ETL process is skipped.

  3. 3.

    A set of restrictions are applied by the “Constraint-builder”. Those restrictions are evaluated and the request approved as shown in Table 3.6. The data is not update in the “Citizen’ taxes records”, since their were not changed.

    Table 3.6 Land taxes restrictions
  4. 4.

    Juan historical payment behaviour is computed as rating of 4.2; therefore, Juan is considered as citizen between Good and Excellent (refer to Fig. 3.1). The rating assigned to Juan allows him to get a general discount of 37.5% (refer to Eq. (3.3)) over the max allowed by the municipality (10%). That’s means a real general discount of 3.75%.

  5. 5.

    Juan does not pay yet; therefore, he will be notified by deadline. On the other hand, he will not be notified by risk relevance, since his rating of behavior is 4.2 and his ratio of debt is 5.89%. They are not considered as relevant for early payment.

  6. 6.

    Juan percentage specific discounts are: deadline = 13.79%, risk relevance = 29.70%.

  7. 7.

    Juan will be informed with the suggested list to paying as shown in Fig. 3.8.

    Fig. 3.8
    figure 8

    Screenshot citizen recommendation message

3.4 Model Simulation

This section evaluates the outcomes of the proposed model using a sample from the Municipality of Quito data set. The model simulation aims are:

  1. 1.

    To evaluate citizens historical payment behavior

  2. 2.

    To verifying the data set behavior with the discount model proposed

  3. 3.

    To summarize the outcomes over the data simulated

This section is organized as follow: in the first part the data set structure is analyzed, after that the simulation design is presented, and finally, some outcomes are presented.

3.4.1 Dataset Acquisition

In order to prepare the simulation design, the data sets from municipality taxation systems were collected, prepared, and analyzed. The dataset used has the structure presented in the Table 3.7.

Table 3.7 Data set structure for taxes payment

The structure shown in Table 3.7 was filled up from citizen profiles (CP) and service profiles (SP) using an ETL processes of central data repository (CDR) (refer to Fig. 3.6). Data processed is summarized in Table 3.8.

Table 3.8 Municipal taxes data set

Data sets acquisition was applied to Urban Taxes, this group considered a 194.971 citizens that had to pay urban taxes in some one of the categories presented in Table 3.9.

Table 3.9 Kind of municipal urban taxes processed

3.4.2 Simulation Design

In order to prepare scenarios of simulation models, several steps were completed, as so also tools of support were built (refer to Fig. 3.9).

Fig. 3.9
figure 9

Simulation design process

3.4.3 Outcomes

  • The average rating for citizens was 3.44/5 stars. This sample was split according to pay behaviour: discount payment 9%, non discounted payment 53%, penalty payment 35%, and payment after judicial notice 3%.

  • The citizens that paid with discount their mean of discount was 5.01% and their deviation 2.91; therefore, the discounts gotten by citizens were between 2% to 8%.

  • The discount approach proposed presented a mean of discount of 7.68% and it deviation 0.064, therefore, the tax discount distribution is more close between citizens.

  • The citizens that paid with discount in the past could get at least 4.9% of discount in future payments.

  • Three scenarios were simulated to specific discount. In the worst case scenario citizens could get at least 5.98% of discount in future payments.

  • The citizens that never paid with discount in the past could get at least 5.7% of discount in future payments.

Outcomes presented show evidence that the discount strategic used by the municipality was not adequate given that percentages of taxpayers that paid with discount are low. On the other hand, the discount tax model applied gives insights about its opportunities to improve municipally incomes using this strategy.

3.5 Conclusions, Limitations, and Future Work

In this work, a fuzzy-based recommender system was introduced. It was applied to notifying citizens regarding public tax payments and also discount opportunities. The proposed model has been tested using a simulation from the Municipality of Quito dataset. In this work, the model presented behaviors with a positive impact in citizens tax awareness.

This hypothesis is supported in the highlighted results of the simulations. For instance, 91% of citizens are below the level of relevance two; however, their payment behavior rating was presented inside the average. On the other hand, the rest of the citizens payment behavior ratings were consider as regular. The proposed model shows an effective way to identify and recommend this group of citizens.

The discount approach proposed in the model, show effectiveness in the way for compute taxes, therefore, that is an initial point in order to the municipality rethinking the strategies of discounts.

Another conclusion is that fuzzy logic approach applied as a classification method improves the discount balance between citizens and it gives broad options towards new discounts methods.

The recommendation approach presented in this work is centered on citizens behavior as strategy to recommender payment options to compute discounts scenarios by citizen.

Future work should focus on the design machine learning approach to improve the real time supervisor; combining both CF and reclusive approaches to propose recommendations using group preferences by finding other citizens who could have similar behaviors. The next stage for the proposed model is to test it with the citizens from the Municipality of Quito. Data gathered with citizens interactions will be used for impact evaluation and redesign of the proposed RS approach.