1 Introduction

Selecting the best alternative among a set of different propositions is often a complex decision-making problem. Methods with the purpose of aiding this process have been proposed with a strong emphasis on deterministic approaches, including the evaluation of economic and/or operational aspects of the alternatives under consideration (Padalkar & Gopinath, 2016). Recently, the use of Multi-criteria Decision-Making (MCDM) methods has gained popularity, since they provide means for modeling and solving complex decision-making problems under multiple and conflictive criteria, by adopting the concept of a satisficing solution (Garrido-Baserba, et al., 2016; Pohekar & Ramachandran, 2004; Simon, 1979). However, MCDM methods are mostly designed for the use of a single agent and are not appropriate, a priori, to be applied in the scenario of multi-agent decision making processes, which is the scenario of most of the current decision-making processes within organizations.

In the literature, propositions can be found for the use of MCDM methods within a multi-agent scenario through the aggregation of different agents’ preferences into a unity, usually using arithmetic or geometric mean.Footnote 1 It is noteworthy to cite one of the first uses of such procedure, which can be seen in the paper of Lockett et al. (1987), that used the arithmetic mean for presenting the discrepancies of the Analytical Hierarchy Process (AHP) between individual and group scenarios. Due to the possibility of losing information in scenarios with high heterogeneous preferences (Basak, 1988; Leoneti, 2016), other approaches suggest the adaptation of the classical MCDM methods for their suitable operation within a multi-agent scenario. In this direction, the contributions of Dyer & Forman (1992), Basak & Saaty (1993), Chen (2000), Lai et al. (2002), Shih et al. (2007), Escobar & Moreno-Jimenéz (2007), Arnette et al. (2010), Altuzarra et al. (2010), Sanayei et al. (2010), Dong et al. (2010), Hatami-Marbini and Tavana (2011), Huang et al. (2013), Wu et al. (2018) can be cited. Conversely, although avoiding the use of aggregation of agents’ preferences, such adapted MCDM methods still contemplate some aggregation procedures, and therefore, do not allow to fully model the heterogeneity of the agents’ preferences for finding a satisficing solution.

In contrast, game theory is a technique designed for modeling the strategic iteration of multiple agents with heterogeneous preferences and to support their choice. The choice that agents can reach through a game theory structure is usually based on the concept of equilibrium solutions (Hipel et al., 2011). According to Hipel and Fang (2005), in the presence of multiple agents with divergent objectives or value systems, game theory would be the most appropriate decision-making technique for better modeling the conflicting aspirations of agents in a fair and sustainable manner.Footnote 2 Nevertheless, most game theory models assume a scalar to represent the outcomes (payoffs) of the strategic interactions of the agents. In consequence, the outcomes are analyzed on a one-dimensional basis, which disregards the potential benefits that could be provided by multi-criteria analysis.

An initial attempt to model multi-criteria problems using the structure of games was introduced by Shapley (1959) and Blackwell (1956). The so-called vector payoff games (or multi-criteria games) contemplates heterogeneous agents and their multiple decision criteria that are modeled by means of vector utility functions (Sasaki, 2018). An example of the application of vector payoff games to a complex environmental problem can be seen in Lejano & Ingram (2012) for the tragedy of commons. Nevertheless, an exchange ratio between each component of the vector should be known by the designer of the game, which makes this approach very difficult to be applied, since the analyst may lack the knowledge or resources to elicit such utility functions, especially when there are numerous criteria involved (Voorneveld et al., 2000). It should be noted that this is the same type of difficulty generally associated to MCDM methods that depends on the elicitation of utility functions, such as Multi Attribute Utility Theory (MAUT). Given that difficulty, the vector payoff games approach had not gained strong and continuous efforts for its applications and for further developments.

Other approaches focused on hybrid methodologies that linked multi-criteria decision-making techniques and game theory, aimed to favor the analysis of different objectives within a game theoretical structure where the heterogeneous preferences of agents could be fully considered without amalgamation. Towards that integration, a remote example can be seen in Szidarovszky and Duckstein (1984) that proposed to use the principles of game theory for solving a multi-objective programming model in which different viewpoints were represented by objective functions. It is noteworthy that the authors offered means for finding a satisficing solution to group multi-criteria problems, rather than seeking an overall optimal solution, as in the multi-objective programming approach. This direction gained notoriety in the literature, from which other examples can be seen, including Chen (Chen, 2000), Madani and Lund (2011), Ke et al. (2012), Wibowo and Deng (2013), Aplak and Sogut (2013), Deng et al. (2014), and Jing et al. (2018). Nevertheless, the modeling of the games in such approaches still depends on specific elicitation procedures that need to be conducted by the analyst, which represents the similar difficulties from approaches such as vector payoff games or MAUT. Furthermore, in the presence of a non-singular solution, i.e. multiple equilibria solutions, a criterion for equilibrium selection is not addressed in the mentioned approaches, which mostly used Nash equilibrium (Nash, 1951) as the equilibrium solution concept for solving the games.

Following the evident relationship of complementarities between multi-criteria and game theory, the aim of this paper is to present a framework for modeling a multi-agent and multi-criteria decision-making problem as a non-cooperative game for considering the strategic interaction among agents based on their distinctive preferences. These preferences are related to the agents’ evaluation of the multiple criteria, such as those that pursue environmental, social, and economic aspects of sustainability. In this scenario, the framework allows to find alternatives that satisfy equilibrium conditions for solving the game. Also, in the presence of more than one equilibrium, a refinement to the solution is proposed by the application of a social welfare function. This study contributes to the literature by presenting a framework that, based on the use of utility functions, provides means for the modeling of multi-criteria and multi-agent complex decision-making problems as a non-cooperative game from which a solution that is based on equilibrium concepts can be reached as the social choice. This framework can be used to support the process of selecting alternatives in the presence of multiple heterogeneous agents through the resolution of the problem without the need for amalgamating their preferences. Consequently, the framework makes possible to fully consider each agent with their set of different objectives and value systems, which are represented by a set of criteria and their respective preferences.

The mechanism of the framework is built on the theoretical assumption of expected agent’s behavior when trading alternatives within a strategic interaction environment, which is the tendency to reject proposals when realizing that the proposal significantly either favors or disfavors their counterparts (Blount & Bazerman, 1996; Camerer, 2003). The necessary conditions to assume this rational behavior of the agents under this scenario is based on the assumption of Binmore (1988), namely: (i) the fact that it involves a finite and known number of agents, alternatives, and criteria; and (ii) the agents have already dealt with similar situations several times and have got the ability to learn from it. Therefore, the rational behavior assumed is based on the assumption that agents are familiar with: (i) choosing alternatives taking into account the choices of their counterparts; and, if necessary, (ii) trading for alternatives that are not very different between them.

For illustrating the application of the framework herein proposed, the scenario of the choice of a Wastewater Treatment Plant (WWTP) for a municipality has been chosen. Garrido-Baserba, et al. (2016) and Madani and Lund (2011) advocate that engineering problems related to water resources are complex problems that usually involve the interaction of multi-agents with multiple conflictive objectives. In this sense, choosing a WWTP for a municipality can be considered a good illustration for the application of the framework to a complex scenario with multiple agents, which is very common in current societal and technological systems, as described by Hipel and Fang (2005). It should be stressed, however, that the framework can be applied to any other kind of problem involving multiple criteria and multiple agents, which is here also illustrated by means of a numerical example.

Section 2 presents the background of the framework, Sect. 3 presents each step of the framework in a detailed level with their respective references of Sect. 2, Sect. 4 presents a numerical example, and Sect. 5 presents a simulated example of complex multi-criteria group decision-making problem, which is the choice of a WWTP to a municipality. Finally, the comparison of the presented framework with the state of art is presented in Sect. 6 and the final remarks are part of Sect. 7.

2 Technical backgrounds

2.1 The General Structure of a Multi-Criteria and Multi-Agent method

The general structure of multi-criteria methods includes two mathematical objects, namely: (i) a matrix, usually called decision matrix; and (ii) a vector, usually called weighting vector, which is obtained by the application of an elicitation procedure to the agent involved in the decision-making process. The decision matrix \(\left\{ {x_{ij} } \right\}\) is composed by \(i = 1,2, \ldots ,m\) alternatives and \(j = 1,2, \ldots ,n\) discrete and/or continuous criteria (attribute or variable) of cost and/or benefit type, which is the basis of the structure of an MCDM problem. For the scenario of multi-agent decision making, the number of weighting vectors is necessarily more than one (equal to the number of the agents) and the number of matrixes might eventually be more than one, depending on the set of criteria that each agent is taking into account (Kilgour & Hipel, 2005; Keeney, 2013). Consequently, in a multi-agent environment, the decision matrixes \(\left\{ {x_{ij} } \right\}_{k}\) are conventionally composed by \(i = 1,2, \ldots ,m\) alternatives and \(j = 1,2, \ldots ,n_{k} \) criteria, which correspond to the set of criteria of each \(k\)-th agent.

Franco & Montibeller (2010) state that the creation of such decision matrixes involves several steps, in the following order: (i) the identification of the agent’s fundamental values and objectives; (ii) the choice of the criteria for measuring each identified value or objective; and (iii) the characterization of the alternatives, which comprises the performance measurement for each evaluation criteria. For assisting in the proposition of such decision matrixes, Marttunen et al. (2017) present a review of Problem Structure Modeling (PSM) techniques, which includes: (i) Soft Systems Methodology (SSM); (ii) Strategic Assumptions Surfacing and Testing (SAST); and (iii) Strategic Options Development and Analysis (SODA). Other widely known PSM techniques are the Strategic Choice Approach (SCA) and Value-Focused Thinking (VFT).

After the modeling of the decision matrix, it is eventually necessary to apply a standardization process,Footnote 3 since criteria generally have different units of measurement and can be either of benefit or cost type (Kosareva et al., 2018; Vafaei et al., 2016). Yoon and Hwang (1995) present two functions for standardizing the decision matrix: (i) linear standardization, and (ii) vector standardization, both keeping the ordinal property from the original criterion. These functions are the most common, others being the logarithmic standardization technique and fuzzification (Kosareva et al., 2018; Vafaei et al., 2016; Leoneti & Gomes, 2021).

Subsequently, the process of measuring the relative preference of each agent over the criteria of the decision matrix, named elicitation, is performed for providing the weight vectors that are used to weight the standardized decision matrix. According to Jia et al. (1998), there are different elicitation techniques created for different purposes. When the ratio scale properties of the agent’s judgments need to be preserved, there are techniques such as the Swing and the Trade-off techniques. Techniques that preserve only the ordinal properties of agents are classified as rank-order techniques, including the Rank-Sum (RS), the Rank-Reciprocal (RR), and the Rank-Ordered Centroid (ROC) techniques. Ahn (2011) complements by describing subjective techniques that determine the weights solely according to the preferential judgments of the decision-maker, which includes the Simple Multi-attribute Rating Technique by Swing (SMART), the revised Simos’ procedure, the AHP, and objective techniques in which the weighting vectors are defined by solving mathematical models without any consideration of the agents’ preferences, for example, the entropy method.

The standardized decision matrix and the weighting vectors are then operated through the application of an algebraic operator, which can be represented by an MCDM method, for providing different possible outcomes, including: (i) ranking; (ii) classification; or (iii) selection of the alternatives. The general structure of such algebraic operators is that based on pairwise comparison of alternatives,Footnote 4 which occurs by the performance comparison of each alternative per criterion. When the performance of a criterion is better for a given alternative (bigger for benefit criteria and lower for cost criteria) than another, the former must be better scored by the pairwise procedure. Some of the most widely used MCDM methods are: (i) AHP; (ii) Elimination et Choice Traduisant la Realité (ELECTRE); (iii) Preference Ranking Organization Method for Enrichment Evaluations (PROMETHEE); (iv) Technique for Order Preference by Similarity to Ideal Solution (TOPSIS); (v) Measuring Attractiveness by a Categorical Based Evaluation TecHnique (MACBETH); and (vi) Multi-attribute Utility Theory (MAUT) (Leoneti & Pires, 2017).

Here, the standardized decision matrix and the weighting vectors are used as input for the creation of a non-cooperative game by an algebraic operator represented by a utility function, of which the details of its application are described in the next section.

2.2 The General Structure of a Non-Cooperative Game and its Solution Based on Equilibrium Solution Concepts

The non-cooperative game is the branch of game theory that models situations where agents have no guarantee that their counterparts would implement agreements made, which is more adherent to group decision-making scenarios. The general structure of a non-cooperative game can be summarized by the tuple <\(K,A_{k} , \prec_{k}\)>, where \(K\) is the number of the agents, \(A_{k}\) is the set of \(m_{k}\) alternatives for each agent \(k = 1, \ldots ,K\), and \(\prec_{k}\) is the preferences’ relations of each agent \(k = 1, \ldots ,K\) on all alternatives of the set \(A_{k}\). The values for the operator \(\prec_{k}\) are given by a utility function \(\pi_{k} :{\mathbb{R}}^{K} \to \left[ {0,1} \right]\), which provides outcomes (payoffs) for each possible strategy that an agent may chose in response to a particular counterpart’s action.

The rational principles of such utility functions are those summarized by von Neumann and Morgenstern (1944). The authors extended the interpretation of Bernoulli’s classic utility theory by adding the idea of pairwise comparison of alternatives in a strategic interaction environment. According to the rational principles of classic utility theory, agents make their choice aiming to improve the expected utility of goods subjected to the Weber and Fechner Law, which states that the agent’s perception is subjective and proportional to a logarithm intensity (Lengwiler, 2009; Stevens, 1986). Von Neumann and Morgenstern (1944), complementarily, suggested that this measure should be relative and subjected to the relationship of each pair of strategies.Footnote 5 In this sense, considering two hypothetical alternatives \(x\) and \(y\), the operator \(\prec_{k}\) must assume the state \(x \prec_{k} y\), when the alternative \(y\) is strictly preferable to the alternative \(x\); \(x{ \preccurlyeq }_{k} y\), when the alternative \(y\) is preferable to the alternative \(x\); \(x \approx_{k} y\), when the alternative \(x\) is indifferent to the alternative \(y\); \(x{ \succcurlyeq }_{k} y\), when the alternative \(x\) is preferable to the alternative \(y\); and \(x \succ_{k} y\), when the alternative \(x\) is strictly preferable to the alternative \(y\). It follows that in the case \(x \prec_{k} y\), then \(\pi_{k} \left( x \right) < \pi_{k} \left( y \right)\); case \(x{ \preccurlyeq }_{k} y\), \(\pi_{k} \left( x \right) \le \pi_{k} \left( y \right)\); case \(x \approx_{k} y\), \(\pi_{k} \left( x \right) = \pi_{k} \left( y \right)\), and so on.

When the chronological order of agents’ choices is not considered, the structure for presenting the values of the utility function is a matrix. The matrix form is usually called the strategic form, where no information is provided to any agent with relation to their counterparts’ previous choices, which is the scenario of games with incomplete information (Osborne, 1994). The payoff of each agent is given by the arrangement of agent’s choice together with their counterparts’ choices. Therefore, in the particular case of non-cooperative games in the strategic form, the order of the multi-dimensional matrix is given by the number of arrangements \(A_{1} \times A_{2} \times \ldots \times A_{K}\). Particularly to the case of multi-agent decision making where it is mandatory that agents share the same set of alternatives (a unique set \(A\) containing \(m\) elements), the number of arrangements is given by \(m^{K}\), where \(m\) is the number of the alternatives of the set \(A\), and \(K\) is the number of agents. Consequently, within the elicitation process for generating the utility functions \(\pi_{k}\) for each of the \(K\) agents, each one of those possible arrangements should be contemplated. These values provide the payoff tables for each agent that composes the non-cooperative game.

Leoneti (2016) proposed a utility function that allows calculating those payoffs for each arrangement in \(A_{1} \times A_{2} \times \ldots \times A_{K}\) through the pairwise comparison of vector alternatives. This utility function can be seen in the equation

$$ \pi_{k} \left( {x_{k} ,Y} \right) = \varphi_{k} \left( {x_{k} ,IA_{k} } \right)\mathop \prod \limits_{k \ne p,p = 1}^{K} \varphi_{k} \left( {x_{k} ,y_{p} } \right).\varphi_{k} \left( {y_{p} ,IA_{k} } \right) $$
(1)

where \(\pi_{k}\) is the payoff that the \(k\)-th agent would obtain in the arrangement of its alternative \(x_{k}\) with the subset of alternatives \(Y(y_{p \ne k} )\) proposed by all other \(K {-} 1\) agents jointly with \(p = 1, \ldots ,p \ne k, \ldots ,K\), being each alternative represented by vectors constituted by the \(n_{k}\) multiple independent benefit criteria, and \(IA_{k}\) (Ideal Alternative) is the utopian alternative that is composed by the best value of each criteria that the agent \(k\) could obtain. The pairwise comparison function \(\varphi :\) \({\mathbb{R} }_{ + }^{{n_{k} \times 2}}\) → [0,1], inspired by Deng (2007), gives the value for the pairwise comparison between these alternatives according to the equation

$$\varphi _{k} \left( {x_{k} ,x_{p} } \right) = \left[ {\frac{{\alpha _{{x_{k} x_{p} }} }}{{\left\| {x_{p} } \right\|}}} \right]^{\delta } .\cos \theta _{{x_{k} x_{p} }} ,{\text{and}}\;\delta = \left\{ {\begin{array}{*{20}l} {1,\;if\alpha _{{x_{k} x_{p} }} \le \left\| {x_{p} } \right\|} \hfill \\ { - 1,\;{\text{otherwise}}} \hfill \\ \end{array} } \right. $$
(2)

where \(\alpha_{{x_{k} x_{p} }} = \left\| {x_{k} } \right\|cos\theta_{{x_{k} x_{p} }}\) is the scalar projection of the vector representing the alternative \(x_{k}\) onto the vector representing the alternative \(x_{p}\), and \(\left\| {x_{p} } \right\| = \sqrt {\left( {x_{p}^{1} } \right)^{2} + \left( {x_{p}^{2} } \right)^{2} + \ldots + \left( {x_{p}^{{n_{k} }} } \right)^{2} }\) is the norm of the respective vector with \(n_{k}\) components.

For the solution of the non-cooperative game, the application of an equilibrium solution concept to the payoff tables is necessary. Hipel et al. (2011) summarized possible equilibrium solution concepts for non-cooperative games, including: (i) general meta-rationality; (ii) symmetric meta-rationality; (iii) sequential stability; (iv) non-myopic stability; (v) limited-move stability; and (vi) Nash equilibrium. Among them, the most known and used equilibrium solution concept for solving non-cooperative games is the one proposed by Nash (1951). According to Osborne and Rubinstein (1994), an arrangement is a Nash equilibrium if, and only if, \(\pi_{k} \left( {s_{1}^{*} ,s_{2}^{*} , \ldots ,s_{k}^{*} , \ldots ,s_{K - 1}^{*} ,s_{K}^{*} } \right) \ge \pi_{k} \left( {s_{1}^{*} ,s_{2}^{*} , \ldots ,s_{k} , \ldots ,s_{K - 1}^{*} ,s_{K}^{*} } \right)\), for all \(k = 1,2, \ldots ,K\) agents, meaning that no agent’s payoff can be increased if they individually choose to move from that arrangement. However, according to Binmore (2007), the Nash equilibrium could not be considered, a priori, a good outcome for a social choice, since it may not be unique and/or Pareto efficient. When more than one equilibrium is found in a game, it is necessary to eliminate undesirable equilibria (Maskin, 1999). This process is known as the equilibrium selection problem (Harsanyi & Selten, 1988), which is addressed in the next section.

2.3 The General Structure of the Equilibrium Selection Problem and the Social Welfare Functions

The general structure of an equilibrium selection problem involves a finite set of \(K\) agents, a finite set of alternatives \(A^{\prime}\) (with at least three elements), which can be a subset of an original set of alternatives \(A\) after a previous filtering phase for eliminating unfeasible social solutions, and a social welfare function \(\phi :{\mathbb{R}}^{K} \to {\mathbb{R}}\), which is used for ranking the set of alternatives \(A^{\prime}\). Thus, the best ranked equilibrium can be selected as a solution to the social choice. However, Maskin (1999) demonstrated that social welfare functions should be designed under the assumption of monotonicity and no-veto power. It should be noted, therefore, that there is no unique formal approach for equilibrium selection (Mailath, 1998). Consequently, the choice of such social welfare functions, rather than be determined by axiomatic guidance, should be focused on meeting the philosophical and anthropological aspects of the social dilemma (Kamaga, 2018).

A well-known social welfare function is the MaxMin social welfare function, which aims to maximize the minimal utility among the equilibria found. In other words, it aims to find the worst outcome among the equilibria found, and then to use this set of minimal outcomes as a reference for selecting the maximal outcome among them. The MaxMin social welfare function is related to the concept of egalitarian solutions according to Rawls’s theory of justice (1971), which provides that the agent with the lowest utility in the group would be in the best possible scenario. It is considered a pessimistic and risk averse approach. Another famous social welfare function is the MaxMax social welfare function (also known as pure utilitarianism), which aims to sum the agents’ outcomes per each equilibrium found, and then to use this set of aggregated values as a reference for selecting the equilibrium with the maximum summation among them. The MaxMax social welfare function is commonly related to the utilitarian principle of Bentham’s theory of justice (1977), which states that social welfare is achieved when the sum of the group’s utility is maximized. In Arrow et al. (2010), it is discussed very common social welfare functions, including: (i) LexiMin; (ii) weighted utilitarianism; (iii) Gini index; and (iv) Nash’s bargaining solution. Other developed welfare functions to be applied to the equilibrium selection problem include those proposed by Matsui and Matsuyama (1995) and Kim (1996).

Finally, after the selection of an equilibrium, a consultive procedure can be performed with the agents in order for the group to accept or decline the solution proposed. Such procedure can be performed by the use of qualitative techniques, such as negotiation, or quantitative techniques, such as voting procedures.

3 The Proposed Framework

This section presents the framework to assist multi-agents in evaluating multiple criteria for selecting alternatives by modeling their strategic interaction as a non-cooperative game, in order to find alternatives that satisfy equilibrium solution concepts. In the presence of more than one equilibrium, the framework contemplates the adoption of a social welfare function for the refinement of the solution found.

Let \(k = 1,2, \ldots ,K\) denote the \(K\) agents involved in a decision-making process and let \(i = 1,2, \ldots ,m\) be the alternatives to be chosen. Consider that each agent \(k\) will use multiple criteria for their evaluation of \(m\) alternatives,Footnote 6 hence \(j = 1,2, \ldots , \; n_{k}\) are the criteria used by the \(k\)-th agent. The following steps summarizes the implementation of the framework to support multi-criteria and multi-agent decision making processes.

Step 1 Each agent presents their own decision matrix containing the alternatives (the same as the other agents) and criteria (not necessarily the same as the other agents) that will be used within the process of choice.

Step 2 An elicitation procedure is applied to each agent in order to obtain their weighting vectors.

Step 3 Each decision matrix is standardized using a function for standardizing decision matrixes (each cost criterion is also converted to benefit criterion in this process), and weighted, by using the weighting vectors produced by the elicitation procedure, which will generate a standardized and weighted decision matrix for each agent.

Step 4 For each agent of the set of \(K\) agents the payoffs are calculated by means of a utility function applied to each of the \(m^{K}\) arrangements formed by the agent’s initial alternative and the alternative proposed by each specific counterpart from the sub-set of \(K {-} 1\) agents. Finally, the non-cooperative game is composed by \(K\) column vectors (one for each of the agents \(k = 1, 2, ..., K\)) with \(m^{K}\) components, which are the agents’ payoff for each of the possible arrangements . The results of this process are the payoff tables.Footnote 7

Step 5 An equilibrium solution concept is applied to the payoff tables for finding the equilibria solutions to the game.

Step 6 If more than one equilibrium is found, a social welfare function is applied for selecting a social choice among the equilibria.

Step 7 Additionally, a negotiation phase or a voting procedure can be carried out for accepting or declining the equilibrium selected.

In order to the better identification of the different techniques that can give support within the framework, the available techniques described in the Sect. 2 are summarized in Table 1.

Table 1 Available techniques for supporting in each respective framework’s steps

For illustrating the application of the framework, a numerical example is firstly presented. Subsequently, the choice of a WWTP to a municipality is demonstrated with the participation of five experts.

4 A numerical example

Let us consider a hypothetical group decision situation where three agents \(\left\{ {P1,P2,P3} \right\}\) are going to decide from a set of non-dominated alternatives \(\left\{ {A1,A2,A3} \right\}\) the one to be selected as the social choice. Each agent has their own set of criteria, being: \(P1:\left\{ {C1,C2} \right\}\), \(P2:\left\{ {C3,C4,C5} \right\}\), and \(P3:\left\{ {C2,C6} \right\}\). For the purpose of making the calculations easier, all criteria were set in \({\mathbb{N}}^{*}\) and were of the benefit type (the more the better). It is noteworthy that one criterion, the criterion \(C2\), is shared by two agents, \(P1\) and \(P3\). The next paragraphs illustrate the framework application.

Step 1 Each agent presents their own decision matrix containing the alternatives and criteria. The hypothetical decision matrixes can be seen in Table 2.

Table 2 Decision matrixes of the hypothetical situation

Step 2 It is assumed arbitrarily that as the result of an elicitation procedure, the following weighting vectors were obtained: \(P1:\left\{ {0.3,0.7} \right\}\), \(P2:\left\{ {0.6,0.2,0.2} \right\}\), \(P3:\left\{ {0.8,0.2} \right\}\).

Step 3 Since all criteria are of the benefit type, the conversion of any cost criteria was not necessary. Therefore, all criteria were standardized using a standardization technique, e.g. the vector standardization technique, as can be seen in the equation

$$ \left\{ {s_{ij} } \right\}_{k} = \frac{{\left\{ {x_{ij} } \right\}_{k} }}{{\sqrt {\mathop \sum \nolimits_{j = 1}^{{n_{k} }} \left\{ {x_{ij} } \right\}_{k}^{2} } }} $$
(3)

where \(\left\{ {x_{ij} } \right\}_{k}\) is the performance of alternative \(i\) for the criterion \(j\) in the view of agent \(k\), and \(\left\{ {s_{ij} } \right\}_{k}\) is the correspondent standardized value. The application of the equation generated the standardization decision matrixes presented in Table 3.

Table 3 Standardized decision matrixes

Subsequently, the standardized decision matrixes were weighted by using the weighting vectors obtained from the elicitation procedure, which generated the standardized and weighted decision matrixes that can be seen in Table 4.

Table 4 Standardized and weighted decision matrixes

Step 4 A utility function must be applied to each standardized and weighted decision matrix for generating the payoffs tables which contains the 27 (\(m^{K}\)) possible arrangements associated with the strategic choice of the three agents. It was used the utility function presented in Leoneti (2016) for generating such payoff tables for the agents, which can be seen in Table 5.

Table 5 Payoff tables

Step 5 The Nash equilibrium solution concept was applied to the generated payoff tables. For searching Nash equilibria, the add-in for Microsoft Excel® named Nash Equilibrium Finder was employed, (Sugiyama and Leoneti, 2021), which uses an exhaustive search algorithm for searching all pure Nash equilibria within a non-cooperative game. Table 6 presents the two Nash equilibria found.

Table 6 Nash equilibria found and their respective payoffs

Step 6 It can be noticed that a set of non-singular solution was found for the game. In this case, the selection of one equilibrium to be adopted as the social choice should be performed by using a social welfare function. Here, the solution provided by both well-known social welfare functions, MinMax and the MaxMax, converged to the equilibrium NE1, which involves a consensus solution on the alternative A1. This solution is the one provided by the application of the framework.

Step 7 Given the nature of the numerical example, here this step is not addressed.

5 A Simulation: The Choice of a WWTP to a Municipality

Five Brazilians experts in the design of WWTP projects for municipalities, all of them civil engineers, were invited to participate in a simulation of the choice of a WWTP for a hypothetical Brazilian municipality with the following characteristics: (i) population of 40,000 inhabitants; (ii) effluent Biochemical Oxygen Demand (BOD) concentration of 305 mg/L; (iii) effluent nitrogen concentration of 30 mg/L; and (iv) effluent flow of 7 × 103 m3/d. An on-line questionnaire was made available and during the period between December 2019 and January 2020 the experts were invited to participate by means of an email that contained an access link. The next paragraphs detail the steps of the simulation.

Step 1 The decision matrix of the simulated problem was created by using six sustainability indicators for WWTP recurrently found in the literature, namely: (i) implementation cost (IC); (ii) operation and maintenance cost (O&MC) (iii) residual BOD; (iv) residual Nitrogen (N) (v) the required area for implementation (RA); and (vi) sludge production (SP) (Leoneti et al. 2013). Six commonly used alternatives by Brazilian municipality populations of 40 thousand inhabitants were used, according to data from the Brazilian National Sanitation Research (IBGE 2008), namelly: (i) Upflow Anaerobic Sludge Blanket Digestion Reactor (UASB), followed by Aerobic Activated Sludge (WWTP A); (ii) UASB followed by Facultative Lagoon (WWTP B); (iii) UASB followed by High Load Biological Filter (WWTP C); (iv) UASB followed by Aerated Lagoon (WWTP D); (v) UASB followed by Facultative Aerated Lagoon (WWTP E); and (vi) Anaerobic Lagoon followed by Facultative Lagoon (WWTP F). The performances of the alternatives for each criterion were valued by the functions represented in the Appendix section, for which the inputs were the data from the hypothetical municipality. The decision matrix containing the WWTP considered to the hypothetical Brazilian municipality was presented to the experts by means of the on-line questionnaire and can be seen in Table 7.

Table 7 Decision Matrix with the WWTP considered for the hypothetical Brazilian municipality

Step 2 With basis on the decision matrix, the experts were asked to provide a ranking of its criteria and alternatives. The rankings to the criteria that each expert provided were subsequently used for translating them into weighting vectors by using the procedures of the ROC technique. According to Barron and Barrett (1996) and Ahn (2011) the ROC technique is more accurate than other rank-order techniques and is commonly used within multi-agent applications. The application of the ROC is performed by the means of the equation

$$ w\left( j \right) = \frac{1}{n}\mathop \sum \limits_{l = j}^{n} \frac{1}{l} $$
(4)

where \(n\) is the number of criteria, and \(w\left( j \right)\) calculates the value for the weight of the \(j\)-th criterion. Table 8 shows the criteria rankings and weighing vectors of the experts.

Table 8 Weight vectors of each expert

Step 3 All cost criteria were converted into benefit criteria by using the formula 1\(/x_{ij}\), making them all of benefit type. The vector standardization function was then used for standardizing all the criteria. Subsequently, the decision matrixes of each expert were generated by weighting the standardized decision matrix using each of the weighting vectors obtained through the elicitation process. Table 9 shows the standardized and weighted decision matrix of each expert.

Table 9 Standardized and weighted decision matrixes of each expert

Step 4 The utility function presented in Leoneti (2016) was applied to each standardized and weighted decision matrix for generating the payoffs to be associated to each of the 216 (\(m^{K}\)) possible arrangements that can result from the strategic interaction of the five experts. The use of the utility function generates the payoff tables for the experts.

Step 5 The Nash equilibrium solution concept was applied to the generated payoff tables. Table 10 presents the pure Nash equilibria found though the search algorithm provided by NEFinder (Sugiyama and Leoneti, 2021) and the respective agent’s payoff for each equilibrium found.

Table 10 Pure Nash equilibria found and their respective payoffs

Step 6 Four Nash equilibria were found using pure strategies, all of them composed by consensus solutions. It should be noted that following the Bentham utilitarianism principle, the social outcome would be WWTP “B”, since it led the agents to the highest sum of payoffs. However, in this equilibrium a significant unbalanced distribution of payoffs is noticed, which corresponds to the different preferences of the experts. In this sense, the MaxMin criterion was also considered, since whatever the chosen equilibrium is, there will eventually be an unsatisfied expert, which could render problems with alternative implementation given the expected rational behavior of rejecting proposals that either significantly favors or disfavors counterparts. Therefore, following Rawls’s theory of justice through the MaxMin criterion, the WWTP “E” would be the one that maximizes the minimum of the payoffs among the equilibria found, being the recommended social outcome.

Step 7 The next section provides the experts’ qualitative discussions regarding the result of the simulated case.

6 Discussions

The experts were requested to indicate which alternatives they would eventually accept and which they would suggest as the best alternative to be selected as the WWTP for the hypothetical Brazilian municipality. It is possible to see in Table 11 that only WWTP “E” would be accepted by all experts, which is aligned to the choice provided by the framework. In regards to this alternative, Expert 3 stated that “small municipalities do not have qualified technical staff for the proper operation of UASB reactors […] and the Facultative Aerated Lagoon does not require much operation and could mitigate operational failures that occur at UASB”. One can notice that if a plurality voting approach was used, which is a common practice for solving group decision making problems within committees, WWTP “F” would be the winner alternative. However, it would not be an acceptable solution for Expert 4 who stated that “the required area for implementation is an important criterion here”. Furthermore, it should be noted that there is no Nash equilibrium formed by this alternative, which could indicate lower chances of commitment to its implementation (Ziotti & Leoneti, 2020).

Table 11 Acceptable and preferable alternatives suggested by the experts

The experts also measured the importance of the different dimensions of sustainability covered by the selected criteria, namely: (i) social dimension, by the criterion RA; (ii) environmental dimension, by the criteria residual BOD, residual N, and SP; and (iii) economic dimension, by the criteria IC and O&MC. The ranking that the experts indicated with regards to the priorities of the sustainability dimensions in the process of choosing a WWTP can be seen in Table 12.

Table 12 Priorities of experts in the choice of a WWTP

Considering the heterogeneous preferences of the agents involved in the selection process of the WWTP to the municipality, it can be assumed that each agent analyzes the alternatives emphasizing one of the three main sustainability dimensions of the problem. In this sense, it reinforces that “WWTP E” could be considered a satisficing solution to this social choice problem, since this alternative would simultaneously meet economic, environmental, and social criteria of sustainability, from the experts’ point of view.

From the results it is plausible to assume that the framework aided the group decision process with transparency and high accuracy. However, the shortcoming of a non-singular solution should be emphasized, which is due to the use of equilibrium solution concepts from the theory of non-cooperative games as the pathway for solving the social dilemma. Nash equilibrium is a complex computational problem and would require the support of a software for being applied, especially when the number of arrangements increase. Furthermore, according to Binmore (2007), the Nash equilibrium may not be unique and/or Pareto efficient. In this sense, a post refinement process for excluding undesirable equilibria might be necessary. Therefore, on one hand, the framework has demonstrated to be effective, since it successfully found an satisficing outcome as the social choice, which originated from the solution of an equilibrium selection problem. It should be emphasized that it overcomes the limitation on the hybrid multi-criteria and game theory approaches found in the literature (Dyer & Forman, 1992; Basak & Saaty, 1993; Chen, 2000; Shih et al., 2007; Escobar & Moreno-Jimenéz, 2007; Arnette et al., 2010; Altuzarra et al., 2010; Sanayei et al., 2010; Dong et al., 2010; Hatami-Marbini & Tavana, 2011; Huang et al., 2013; Wu et al., 2018; Lai et al. 2002; Chen, 2000; Madani & Lund, 2011; Ke et al., 2012; Wibowo & Deng, 2013; Aplak & Sogut, 2013; Deng et al., 2014; and Jing et al., 2018). In contrast, it might be not efficient when the number of agents or alternatives increases significantly.

A possibility for circumventing this shortcoming was investigated by Ziotti and Leoneti (2020) who enlarged the analysis of the non-cooperative game in the light of the Nash Program. From the perspective of the Nash Program, the solution of a non-cooperative game would converge to the same solution of a cooperative game if the former is modeled as an enlarged game that would contain the pre-play interaction phase that occurs preliminarily of a cooperative game (Binmore, 1994). By assuming that all possible arrangements within a non-cooperative game that is modeled from a multi-criteria perspective by a utility function would contain all the possible outcomes of a pre-play interaction, the authors proposed to use the Nash bargaining social welfare function for solving the non-cooperative game. The Nash bargaining social welfare function provides a unique rational solution to the bargaining problem, which would be given by the maximization of the equation

$$ N\left( {\pi_{1} ,\pi_{2} , \ldots ,{\uppi }_{{\text{K}}} } \right) = \left( {\pi_{1} - \xi_{1} } \right)\left( {\pi_{2} - \xi_{2} } \right) \ldots \left( {\pi_{K} - \xi_{K} } \right) $$
(5)

where \(\xi_{k}\) are the status quo points that represent the outcome that each agent would receive if the negotiation would fail, in this case zero, and \(\pi_{k}\) are the respective outcomes calculated by the application of the utility function.

Here, the application of the Nash bargaining social welfare function also indicates that “WWTP E” would be the winning alternative, which is in accordance with the solution provided by the framework. In fact, Ziotti and Leoneti (2020) have found that the convergence of the Nash bargaining solution to the Nash equilibrium was high, having converged in about 87% of the simulated scenarios that were evaluated.

7 Conclusions

The Multi-Criteria and Multi-Agent Framework proposed in this paper allows to model the decision process involving multiple conflicting objectives within group decision making, aiming for the searching of satisficing solutions to be understood as a recommendation for action towards meeting different goals and values. The numerical example and the simulation presented show how the framework can support complex decision-making processes, providing the possibility to model and solve the problem where individuals interact their conflictive preferences strategically. By stablishing a link between MCDM and game theory, the solutions of the framework aim to provide the means to see the problem of selecting alternatives from different points of view and to assess different scenarios and agents’ preferences, thus contributing to extend the possibilities to perform strategically in the group decision making. This approach could render more efficient decision-making processes for choosing a social outcome that matches the sustainability goals when involving heterogeneous agents with different preferences over the criteria under consideration (environmental, social and economic).

While in MCDM methods the alternatives are simply the labels that represent a set of performance values to each criterion in consideration, in game theory the focus is precisely on the alternatives, which can be assumed as an amalgamation of all diverse types of objectives within a scalar payoff value. The integration proposed here is based on the use of the multi-criteria approach through the pairwise comparison of criteria to generate the payoffs that will be used within the game theory approach. This feature allows the agent to assess eventual concessions to be made in the process of trading alternatives when seeking for a consensus solution in group decision-making, which is in accordance with the expected behavior of rational agents defined in this research (Blount and Bazerman, 1996; Camerer, 2003; Binmore, 1988). Furthermore, the proposed framework contemplates an equilibrium selection phase based on social welfare functions in which the analyst can chose the mathematical formulation that better address different philosophical and anthropological principles.

By means of using utility functions that are generated based on the preferences of agents within pairwise comparisons, it should be noted that the integration proposed here does not require the arbitration by the analyst for defining the payoffs to the agents, as occurs in vector payoff games. For instance, compared to other methods that propose the use of game theory for solving multi-criteria problems, the method proposed in Madani and Lund (2011) requires the creation of a transition matrix, the method presented in Wibowo and Deng (2013) requires intuitionistic assessments for each alternative in a form of an intuitionistic preference relation, and the method of Deng et al. (2014) requires the construction of the utility function by the arbitration given by the analyst based on the evaluation of linguistic terms that range from “very poor” to “very good”. Yet, compared to other MCDM methods adapted for a multi-agent environment, such as the methods proposed in Sanayei et al. (2010) and Dong et al. (2010), which have a good treatment of conflicts, but require the use of linguistic variables and thresholds limits, the framework presented here is cognitively less demanding. Consequently, an advantage of the proposed framework compared to other approaches previously reported is that it only requires the elicitation of the weighting vectors in order to calculate the payoffs, which makes the level of effort required very low while not losing accuracy. Therefore, the proposed framework allows to structure larger games than those with two agents and two alternatives, which are very common in most game theory applications.

Notwithstanding that the use of game theory in group decision-making has been previously reported, the existing literature does not provide a general framework to model group decision making problems that: (i) considers multiple criteria and agents without preferences aggregation; (ii) presents a solution for the interpersonal comparison of the utility among agents, since each payoff table is generated by the individual evaluation of trade-offs between alternatives; (iii) refines the non-singular solution by means of a social welfare function; and (iv) allows shortening the negotiation process. In contrast, the main limitation of the framework is, paradoxically, the use of an equilibrium solution concept as the pathway for finding the solutions for the game. A feature of an equilibrium solution is that it either may not exist, or be more than one, yielding a new problem that is the choice of the best equilibrium for the game solution. Finding equilibrium is a very demanding computational task, although for the case presented here the computing time was very low and other solutions concepts better tractable were presented. Future research could explore the search for other kinds of equilibrium solutions with a lower demand for computational time.