Incentive Compatibility of Pay Per Last N Shares in Bitcoin Mining Pools

Zolotavkin, Yevhen; García, Julian; Rudolph, Carsten

doi:10.1007/978-3-319-68711-7_2

Yevhen Zolotavkin¹⁸,
Julian García¹⁸ &
Carsten Rudolph¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNSC,volume 10575))

Included in the following conference series:

International Conference on Decision and Game Theory for Security

2354 Accesses
10 Citations
1 Altmetric

Abstract

Pay per last N shares (PPLNS) is a popular pool mining reward mechanism on a number of cryptocurrencies, including Bitcoin. In PPLNS pools, miners may stand to benefit by delaying reports of found shares. This attack may entail unfair or inefficient outcomes. We propose a simple but general game theoretical model of delays in PPLNS. We derive conditions for incentive compatible rewards, showing that the power of the most powerful miner determines whether incentives are compatible or not. An efficient algorithm to find Nash equilibria is put forward, and used to show how fairness and efficiency deteriorate with inside-pool inequality. In pools where all players have comparable computational power incentives to deviate from protocol are minor, but gains may be considerable in pools where miner’s resources are unequal. We explore how our findings can be applied to ameliorate delay attacks by fitting real-world parameters to our model.

Access provided by CONRICYT-eBooks. Download conference paper PDF

Yet Another Note on Block Withholding Attack on Bitcoin Mining Pools

Discouraging pool block withholding attacks in Bitcoin

Article 29 July 2021

Incentive Compatibility of Bitcoin Mining Pool Reward Functions

1 Introduction

Blockchain is a distributed ledger technology with demonstrated potential to revolutionize industry and commerce [10]. A number of popular cryptocurrencies based on blockchains have been launched in recent years to unprecedented adoption. These include Bitcoin (BTC) [11], Litecoin (LTC) and Zcash (ZEC) [13], among others [4]. The main technological innovation behind this drive is the proof-of-work consensus mechanism [7], which allows for the ledger integrity to be maintained in a distributed fashion. To achieve this level of decentralization, the system relies on miners who are incentivized to verify transactions. When incentives are compatible, rational players will find it in their best interest to stick to protocol. This paper uses game theory to derive conditions under which a popular mining reward mechanism, Pay per last N shares (PPLNS), is incentive compatible.

The Blockchain is a public ledger that keeps transaction information in a sequence of transaction blocks. Each block contains a hash of the previous block, and the chain grows as new transactions are verified and added to the chain. Any agent can add a block to the chain, so the approach relies on cryptographic puzzles, known as proofs of work, in order to reach consensus. The longest chain, as measured by computational effort exerted, is assumed to be the consensus chain. The agents solving the cryptographic puzzles are known as miners, and they exchange their computational power for new currency and transaction fees. The puzzle is randomized in such a way that each miner has a probability of discovering the next block proportional to their share of computational power in the network [7].

The mining market is very competitive. Individual miners face large variances in income. Consequently, most miners pool their computational resources, sharing the rewards of the pool amongst all members in proportion to the computational effort invested in mining [9]. Through pooling, miners ensure a more stable income flow. Mining pools are managed by an administrator who will often collect fees from miners, distributing the rewards when blocks are discovered in the pool. Miners prove their work on behalf of the pool by discovering “shares”, which are partial proofs of work. It is assumed that every share requires equal computational effort. In addition to satisfying the requirement for partial proof of work, every computed share may in addition qualify as full proof of work. In the latter case, the pool is rewarded by the Bitcoin network, which issues new coins and transfers them to the pool’s account. The reward obtained by the pool is then distributed to the members of the pool, according to its reward scheme and the submission behaviour of all the pool members. Reward mechanisms serve to aggregate shares reported in the pool, so as to perform a fair distribution according to work.

Early reward mechanisms often rewarded miners in proportion to the amount of shares submitted by a miner in each round [14]. However, since the distribution of rewards is exponential, under this scheme, miners may increase their reward expectation by changing pools frequently. This attack is known as pool hopping, and discourages honest mining to unsustainable rates [3]. Pay per last N shares (PPLNS) addresses this issue.

In PPLNS, each miner gets a reward that is proportional to the effort exerted during the last shares preceding a submitted solution. Since solutions are not predictable, this reward scheme discourages hoppers who risk losing shares outside the range given by N. A simplified scheme of PPLNS is shown in Fig. 1.

In Fig. 1, time flows from left to right, so that the right-most share is the most recent. A discovered block is marked with a $ sign, and not counted as a share in PPLNS. In this simple example, we consider only two miners forming the pool with power $\alpha _1 = 0.6$ and $\alpha _2 = 0.4$ for Miner 1 and Miner 2, respectively. The length of the window N is 8 shares.

PPLNS is used by many Bitcoin pools, such as Kano [8], P2Pool [12], AntPool, BCMonster [2], among others. While this reward scheme is resilient to pool hopping attacks, other vulnerabilities are hypothesized to encourage dishonest mining [5]. In other words, the incentive compatibility of PPLNS is questionable [15].

We investigate a new type of attack for PPLNS pools. The idea is that miners can dishonestly increase their revenue by delaying reports of some of the shares that were obtained during a round. Instead of submitting share(s) to the pool manager when these are discovered, an attacker submits them at the end of the mining round, which will happen only if she finds a full solution. The purpose of this paper is to model the strategic incentives behind this kind of attack, as well as to estimate how damaging it can be to the pool. To do so, we formulate a simple game capturing the incentives of pool mining, and solve for Nash equilibria. A PPLNS scheme is incentive compatible if there are Nash equilibria in which miners do not delay their reports.

The rest of this paper is organized as follows: Sect. 2 contains detailed description of the attack and model that can be used to find equilibria. Conditions for incentive compatibility are discussed in Sect. 3, followed by Sect. 4, which addresses how severe attacks may be in pools that are not incentive compatible. We discuss our results and their implications in Sect. 5.

2 Model

Our model starts by computing the expected revenue of a pool member, given the pool composition, pool parameters as well as the rest of Bitcoin network. We consider the puzzle difficulty, D, to be pre-set at the network level. The PPLNS window size, N, is set by the pool manager. We also assume a given distribution of mining power $\tau _i$ for i in $1,\dots , m$, where m is the size of the pool.

Each miner has two actions upon mining every single share: delay or report. For every miner i, we compute how the expected monetary reward changes given these options. The marginal profit for every share depends on the previous decisions made by the miner as well as the strategies of other miners in the pool.

For an attacking miner, there are two separate phases during every round. During the first phase, a miner collects shares for delay (does not report any single share). During the second phase, she reports every newly mined share immediately. For every miner, there is an individual turning point between these phases, which depends on the marginal profit of the two actions (delay or not). The turning point corresponds to the condition when the marginal profit for both actions is equal, or, when the strategy of the miner reaches its natural limit. The rationale behind these limits dictates that the number of delayed shares cannot be less than 0 and cannot exceed N. As soon as the individual turning condition is satisfied, the miner is in the second phase. In terms of time flow, equilibrium arises when every miner is beyond their turning point. Throughout the paper, we assume that rounds end some time after this turning point (the validity of this assumption is addressed in Sect. 4). For simplicity, we define an honest miner as one who is always in the second phase (delays 0). Likewise, attacking miners are those who delay at least 1 share in the first phase.

We also assume the following order for the submission of the delayed shares: if an attacker discovers a full solution of the Bitcoin puzzle, she reports all her delayed shares first, and reveals the full solution immediately after that. In our model, reporting shares collected during the first phase happens without time delays in revealing the full solution.

For an honest miner, the expected reward depends on N and D. Parameter D is the complexity of finding a full solution and can be expressed as the average number of shares that need to be mined to discover a full block. Every miner submits a share that he/she has mined and expects that a number of payments will be received for that share during the period in which the next N shares are sent by the pool members (a share will carry no value after this period). The expectation for that number of payments is $\frac{N}{D}$ and the value of a single payment is $Rew*\frac{1}{N}$, where Rew is a standard monetary reward for discovering a block. For simplicity, we omit the constant Rew. Therefore, every miner expects that every submitted share is worth $\frac{N}{ND}$.

These honest expectations for share payments change under delay attacks. A player j can delay an amount of $x_j \in \mathbb {N}$ shares. The effective window size is then $\hat{N}$ instead of N; and the effective expected number of shares submitted between two full solutions, found by the pool, is $\hat{D}$ instead of D. The reasons causing this are illustrated on Figs. 2 and 3. There are several immediate observations: (1) if an attacker is successful in finding a full solution she will report her delayed shares first; (2) due to delaying, the majority of the attackers will lose all the shares collected during the first phase.

Every reported share will be rewarded in a form of monetary payoff from the pool manager within the next $\hat{N}$ subsequent steps. Observation (1) above, implies that the expected number of steps when a potential reward can be received will be reduced (Fig. 3). This quantity can be computed as follows:

$$\begin{aligned} \hat{N}\,=\,\sum _{j\,=\,1} ^ {m} (N - x_j) \tau _j\,+\,(1 - p^*) N\,\le \,N, \end{aligned}$$

(1)

here, $p^*$ is the probability that the solution is discovered by someone inside the miner’s pool, i.e., $p^*=\sum _{j\,=\,1} ^ {m} x_j$. Expression (1) can be explained as follows: The first term, $\sum _{j\,=\,1} ^ {m} (N - x_j)\tau _j$, accounts for the probability $\tau _j$, that miner j finds the full solution and will reduce the effective period for payment to $N-x_j$. The second term, $(1 - p^*)N$, accounts for the probability of finding the full solution outside the pool, $(1-p^*)$. In this case, all the attacking miners lose their delayed shares and the effective period for payment is N.

Because the majority of the attackers will lose all the shares collected during the first phase, we can conclude that the amount of shares submitted between the nearest two full solutions is less than D. This is reflected in expression (2), which specifies the effective expected number $\hat{D}$ of shares submitted in the pool between the full solutions.

$$\begin{aligned} \hat{D}\,=\,\sum _{j\,=\,1} ^ {m} (x_j - \sum _{k\,=\,1} ^ {m} x_k\,+\,D) \tau _j\,+\,(1-p^*)(D - \sum _{j\,=\,1} ^ {m} x_j) = D - \sum _{j\,=\,1} ^ {m} x_j\,+\,\sum _{j\,=\,1} ^ {m} x_j \tau _j . \end{aligned}$$

(2)

Expression (2) can be explained as follows. Miner j will be able to publish her delayed shares with probability $\tau _j$. In this case, all shares delayed by other attackers will be lost, and, expected number of shares (submitted in the pool since the last full solution was reported) is $(x_j - \sum _{k\,=\,1} ^ {m} x_k\,+\,D)$. Summing up such expectation for all the miners in the pool, we obtain $\sum _{j\,=\,1} ^ {m} (x_j - \sum _{k\,=\,1} ^ {m} x_k\,+\,D) \tau _j$. In addition, with probability $1-p^*$ all delayed shares in the pool will be lost (because the full solution is found by miners outside the pool). This is expressed via term $(1-p^*)(D - \sum _{j\,=\,1} ^ {m} x_j)$. From (1) and (2) it can be noted that when $x_j\,=\,0, \forall {j}$, then $\hat{D}\,=\,D$ and $\hat{N}\,=\,N$.

Previously, it has been stated that if everybody in the pool is honest, the expected revenue from reporting a share is $\frac{N}{ND}$. In contrast, when delaying is possible any miner expects to be paid $\frac{\hat{N}}{N\hat{D}}$ by sending her share to the pool.

Nonetheless, for the share obtained during the first phase (and retained until the end of the mining round) the expectation of the revenue is different. A player j delaying $x_j-1$ shares, expects the following reward from delaying one more share:

$$\frac{\tau _j}{N} \left( 1 + \frac{N - x_j}{\hat{D}} \right) .$$

This expression balances the expectation $\frac{\tau _j}{N}$ to be paid once for a share, when j finds a full solution (with probability $\tau _j$). If that happens, she will also be paid $\frac{N- x_j}{\hat{D}}$ times in the subsequent rounds.

Now, we can sum up: some of the miners may never delay because it is not profitable for them to delay a single share; some can delay every mined share until they collect N; and, some will collect a number between 0 and N. Thus, a situation in which miners have no incentive to deviate is found by solving:

$$\begin{aligned} \frac{\hat{N}}{N \hat{D}} = {\left\{ \begin{array}{ll} \frac{\tau _i}{N} \left( 1 + \frac{N- x_i}{\hat{D}}\right) , &{} \textit{if } 0 \le x_i < N,\\ &{} \\ \frac{\tau _i}{N} \left( 1 + \frac{N- x_i}{\hat{D}}\right) + C_i, &{} \textit{if } x_i =0,\\ &{} \\ \frac{\tau _i}{N} \left( 1 + \frac{N- x_i}{\hat{D}}\right) - C_i, &{} \textit{if } x_i =N,\\ \end{array}\right. } \forall {i} \left( C_i \ge 0 \right) . \end{aligned}$$

(3)

This equation can be explained by the following constraints: (i) $x_i$ cannot be negative – it is impossible to delay a negative number of shares; (ii) $x_i$ cannot exceed N because under PPLNS, only the most recent N shares preceding the full solution (found by that pool) can be paid. The parameter $C_i$ here compensates unequal profitability of delaying versus honest reporting. One can see that at $x_i=0$, reporting may be more profitable for the i-th miner. On the other hand, at $x_i = N$, delaying can be more profitable than reporting.

The symbols listed in Table 1 will be used to define incentive compatibility and to estimate changes in parameters of PPLNS in case the pool is not incentive compatible (Sects. 3 and 4, respectively).

Table 1. Notation and parameters

Full size table

3 Incentive Compatibility

In this section, we will investigate a condition that guarantees honest mining. From Eq. (3), the only kind of incentive compatible equilibrium is described as $\frac{\hat{N}}{N \hat{D}} = \frac{\tau _i}{N} \left( 1 + \frac{N - x_i}{\hat{D}}\right) + C_i$, $ \forall {i} \left( x_i=0, C_i \ge 0 \right) $ which is equivalent to the following inequality:

$$\begin{aligned} \frac{\hat{N}}{N \hat{D}} \ge \frac{\tau _i}{N} \left( 1 + \frac{N- x_i}{\hat{D}}\right) , \forall i, x_i=0. \end{aligned}$$

(4)

Inability to satisfy expression (4) for a single i, would mean that the pool will not mine honestly. For a pool of size m, there are $2^m -1$ possible types of deviations from the mining protocol (each miner can either delay or always report). This yields a brute force search unfeasible for large values of m. Nonetheless, we will show that in order to verify incentive compatibility, we do not require exhaustive search. Instead, we derive a condition that can be checked in a linear time.

To derive conditions for incentive compatibility, it is useful to observe the following:

1.
The set of all deviations needs to be reduced to a set $\mathcal {F}$, $|\mathcal {F}| \le m$, of the deviations which (and only which) may produce an equilibrium (based on Lemma 1)
2.
We show that if there is an incentive compatible equilibrium as described by (4), this equilibrium is unique (Lemma 2).
3.
A single condition is sufficient and necessary to guarantee (4) (Lemma 3).

We start discussing cases that differ from (4). It will be demonstrated that there are only m other profiles that can be equilibria. We point to the fact that a delay attack requires that at least one miner delays a positive number of shares. Further, we show that an equilibrium where for a miner with power $\tau _i$ delays are only possible when all other miners with $\tau _k \ge \tau _i$ delay too.

Lemma 1

If there is an equilibrium and a set $\mathcal {M}$ of delaying miners with power $\tau _i$, $i \in \mathcal {M}$, delaying positive number of shares, then a miner with power $\tau _k$ is also delaying if $\exists k \notin \mathcal {M}, \tau _k \ge \tau _i$ .

(see Appendix for the proof).

As result, a miner with power $\tau _k$ should also be added to the set $\mathcal {M}$ of delaying miners. In the rest of the paper, we assume that miners are assigned indices according to their mining power sorted in descending order, e.g. $\tau _i \ge \tau _{i+1}$. This allows us to label an equilibrium compactly – specifying the index of the least powerful miner who can delay profitably. Since there are only m miners, we have at most m types of equilibria that differ from (4). The result from Lemma 1, showing that $x_i \ge x_{i+1}$ will be used in Lemmas 2 and 3.

Lemma 2

The conditions that support incentive compatibility are inconsistent with any other kind of deviation represented by $\mathcal {F}$.

For delaying miners included in set $\mathcal {M}$ information about other delaying miners may be incomplete. Lemma 2 implies that: under certain conditions, a miner with power $\tau _i$ will delay a non-negative number of shares irrespectively of its inclusion in the set of delaying miners $\mathcal {M}$; expressions (8) and (9) (Appendix) can be used to calculate directly the number of shares delayed by miner i.

For incentive compatibility, it is necessary that for the most powerful miner (with power $\tau _1$) the delay is not profitable. Using Lemmas 1 and 2, we will show that a sufficient and necessary condition for incentive compatibility can be expressed in terms of $\tau _1$.

Lemma 3

For incentive compatible mining under PPLNS it is sufficient and necessary that $\tau _1 \le \frac{N}{N+D}$.

In other words, an incentive compatible pool requires a bound on the computational power of the most powerful miner. This condition for honest mining is important, but even if pools are not incentive compatible the incentives to deviate may be small. The next section explores how these incentives change when we instantiate our model with realistic parameters.

4 Severity of Delay Attacks in the Real World

We propose an algorithm for equilibrium search, and this allows us to show how the parameters of the pool affect the likelihood of delaying attacks. The precondition for our algorithm is existence of equilibria.

To quantify the effect of incentive compatibility it is important to find equilibrium in the form of (3). The main obstacle here is that (3) represents a system of piece-wise expressions. For every single expression with index i, the choice of one out of three different domains affects all expressions in the system.

We use an iterative approach. Consider the schematic illustration on Fig. 4. Here, pool miners are classified into 3 classes ($x=\{0,(0,N),N\}$) according to the power they have. As it has been discussed previously in Lemma 1, miners with more power can profitably delay a greater number of shares, which cannot exceed the size of reward window N. Also, the number of shares cannot be negative. According to (3), to make $C_i$ non-negative, for separate cases $x_i=0$ and $x_i=N$ the mining power should be $\tau _i \le \frac{\hat{N}}{\hat{D}+N}$ and $\tau _i \ge \frac{\hat{N}}{\hat{D}} $, respectively. However, both $\hat{N}$ and, $\hat{D}$ depend on the selection of points a and b (see Fig. 4).

As soon as a, b are known, values of x for the domain (0, N) can be calculated by solving a system of linear equations:

$$N-\sum _{j\,=\,1} ^ {m}x_j \tau {_j}\,=\,\tau {_i}\left( D-\sum _{j\,=\,1} ^ {m}x_j\,+\,{\sum _{j\,=\,1} ^ {m}x_j \tau _j}\,+\,N-x_i\right) , \forall i, \tau _i \in (a, b),$$

where one should first substitute $x=0$ and $x=N$ for corresponding indices.

The size of the window (a, b) can potentially change from 0 to m. Therefore, the left endpoint a can be placed in any position between 1 and $m-l$, $l=\mathrm {length}[(a,b)]$. This requires $\sum _{l\,=\,0} ^ {m}({m-l})$ iterations with each requiring at most 2 computations (at the endpoints) to check validity of the assumption about a and b for that iteration. If the assumption is correct, the other $l-2$ roots inside the window should be calculated. In terms of computation complexity, the whole procedure requires $O(m^2)$.

In our experiments, we used synthetic as well as real-world data for mining power distributions. In particular, we consider uniform and normal distributions. For real-world data, we collected distributions of mining power from Kanopool and F2pool (see Fig. 5).

In the first part of experiment, we compared the number of miners, who delay exactly N shares. In most cases of delay attacks it turns out that $a=b$ which means that miners are either delaying N shares or not delaying at all. The number of delaying miners is plotted for the left ordinate versus parameter k, where $N=kD$. In addition, the right ordinate scale was used to represent dependency of parameter $\frac{\hat{D}}{D}$ from k (Figs. 6 and 7).

Nonetheless, the question of cumulative extra profit (for the group of attackers) is, perhaps, the most important for honest miners. Because pool mining is a zero-sum game, extra profit for one group cause loses for another group of honest miners in that pool. There are several important differences with the concept of marginal profit for a share that has been used to find equilibrium [6]. In order to calculate cumulative extra profit one should consider: (a) extra profit is collected from those rounds where the full solution is submitted by honest miners of that pool; (b) an assumption about the duration of mining round is important and its validity is expressed with certain level of confidence (Fig. 8).

Extra profit is examined for the case when every attacker delays exactly N shares to the end of a round. Since extra profit is discussed in the context of successful solving of a puzzle by the pool, for each miner i we will refer to the power $\alpha _i$ in relation to the pool (not the whole Bitcoin network).

If one considers only the circumstances when attackers win a round, their expected profit is proportional to their power and is equal to what they can earn in fair mining. This is due to the fact that every miner submits N shares before releasing a full solution. Such reward distribution is equivalent to solo mining when a miner collects all the revenue in the case of success.

However, if one considers circumstances when honest miners win, it is clear that each attacker collects a fraction of the reward which is proportional to her power in that pool. This can be seen as an additional profit (because they have already collected their fair portion). Such model of extra profit has one limitation: we assume that every attacker manages to collect her N shares (for the delay attack), and, after that, submits no less than $\alpha _i N$ shares to the pool. Therefore, a round should last the time which exceeds that estimation. For a subgroup of attackers, this happens with a probability determined by the least powerful miner in that subgroup (because collecting N shares for the attack takes her the most time). Hence, collective extra payment of any subgroup of attackers can be obtained with certain level of confidence.

It is assumed that a subgroup of attackers of size l includes all miners with power greater or equal than $\alpha _l$ (see Lemma 1 for support of this assumption). For every integer $l \in [1, m']$ ($m'$ is the number of attackers in the pool) we will calculate: (a) collective extra profit $E_l$; (b) the conditional probability for a round to last longer than it takes for the l-th miner (time $t_l$) to mine $N + \alpha _l N$ shares, given that the round is won by that pool (i.e., probability $p(t_l | p^*)$). In Fig. 8, for every value of N we calculated maximum extra profit $E_l$ where conditional probability $p(t_l | p^*)$ is greater than or equal to the corresponding confidence level C.

The subgroup of attackers exploits honest miners, who earn $Rew \sum _{m' + 1} ^{m} \tau _i$, where Rew is the current reward for discovering a full solution in the network (consisting of 12.5 BTC and transaction fees of up to 13.9 BTC on average). For the subgroup (size l) of attackers whose total power is $\sum _{1}^{l} \alpha _i$, the expected collective extra profit $E_l$ is

$$ E_l = Rew \left( \sum _{m' + 1}^{m} \tau _i \times \sum _{1}^{l} \alpha _i \right) . $$

The value of $p(t_l | p^*)$ is calculated as follows:

$$ p(t_l | p^*) = 1 - \int _{0}^{t_l} f(t | p^*) dt, $$

where $f(t | p^*) = \frac{1}{D p^*} e^{-\frac{t}{D p^*}}$ is the conditional pdf for finding a full solution. The time $t_l$, necessary for l-th miner to collect $N + \alpha _l N$ shares is specified as $t_l = \frac{N + \alpha _l N}{\alpha _l}$. Hence, $p(t_l| p^*) = e^{-N \frac{1 + \alpha _l}{\alpha _l D p^*}}$, and, requiring that $p(t_l| p^*) \ge C$ we arrive to $\alpha _l \left( p^* \ln {C} + k \right) \le - k$, $N = kD$. Considering that $\alpha _l$ is positive, there is an additional requirement $k < -p^* \ln {C}$ (it can be seen from Fig. 8 that blue and green plots are rising from zero level only for $k < 5 \times 10^{-3}$). If the latter is satisfied, we further require that $\alpha _l \ge - \frac{k}{p^* \ln {C} + k}$, $N = kD$.

For every k and corresponding C, we find l, such that $\max _{\left( \alpha _l \ge \frac{-k}{p^* \ln {C} + k}\right) } [l] \le m'$, and compute $E_l$ (other attackers with indices $\le l$ also pass the test and form the subgroup that has C-confident cumulative extra profit).

As one can see from the graphs, the extra profit of attackers can be quite substantial in terms of BTC. Remarkably, real-world power distributions (e.g., from Kano pool) lead to sufficiently higher levels of vulnerability to the attack, when compared with a benchmark uniform distribution of power.

5 Discussion

Incentive compatibility is an easily verifiable condition. It only requires information about the computational power of the most powerful miner in the pool. This verification can thus happen in linear time O(m).

It should be stressed that known PPLNS pools comply with the requirement of incentive compatibility. For the existing majority of the pools, k varies between 1 and 5. Nonetheless, this parameter is under the sole control of the pool administrator who may decide to reduce it in order to satisfy requests from the majority of the miners.

Looking at pool miner forums, one can easily observe that a substantial number of miners would like to collect their payments faster. That aspect is especially important for pools that are not very large and infrequently discover complete solutions. Miners who join such pools during the winning round often find themselves in unfair and underpaid situations. The only way to satisfy their expectations fairly is reducing N, which increases the odds for delay attacks.

In pools that are not incentive compatible, our experimental results show that the fraction of delaying miners decreases with k, regardless of power distribution. Also, the shapes of the plots for the pools of different size (but same power distribution), e.g. 0.1%, 1%, 10% of total network power, are similar. However, a comparison between different pools reveals that for the same value of k, known real-world pools may have a higher proportion of attackers compared to artificially simulated data. This is due to the greater inequality in mining power distribution in real-world pools such as Kano. For instance, the most powerful member of a pool can sometimes account for up to a quarter of the pools total power. This may also be a significant obstacle in satisfying the condition for incentive compatibility, $\tau _1 \le \frac{k}{k+1}$, in large pools with relatively small k.

Interestingly, $\hat{D}$ is non-monotonic on k. Obviously, $\hat{D}$ cannot be greater than D, however, the position of its minimum reflects differences in distribution of mining power in different pools. In addition, greater pool size (e.g. 10% vs 0.1% of network power) allows for attacks with greater k and that causes a greater decline in $\hat{D}$. The non-monotonic behaviour is due to the following property. For very small k, the changes in $\hat{D}$ (compared to D) are insignificant because the amount of shares that are delayed by every miner is negligibly small. For k close to the maximum, changes in $\hat{D}$ are also small due to the fact the number of attackers is small. Interestingly, the position of minimum in $\hat{D}$ for Kano pool (modelling 10% of network power) corresponds to the attack when only two most powerful miners delay. In contrast to that, for simulated data the same effect is achieved only when a majority of pool miners attack. Drops in the number of submitted shares (around 5% for large pools) can serve as a flag feature for pool administrators, who might detect the anomaly even before the attackers collect their first extra revenue.

Our plots for cumulative extra profit for a subgroup of attackers are also non-monotonic. That is because attackers exploit honest miners: when honest miners earn most the fraction of attackers is small; when fraction of attackers is large, honest miners earn little. It should be noted that the red plot (for the both types of power distribution) stands for maximum collective revenue of attackers when the whole group of attackers can exploit honest miners. That may happen only if a round is unlimited in time. Comparing extra profit in real pools with synthetic data one can notice that for high confidence of estimation, uniform distribution produces insignificant incentives for dishonest miners (even though the pool is large, 10%). On the other hand, incentives for dishonest miners may be quite substantial (up to 0.17 BTC) for a pool with power a distribution that is like that of the Kano pool.

A PPLNS variant that is adopted in several large pools uses the concept of sharechain [12]. This assumes that every share is included in a simplified version of the main Blockchain, making delay attacks impossible by protocol. On the other hand, it may also cause a negative effect on honest miners. If for some reason (e.g., network latency) a share is out of sync, it is lost. Dead on Arrival rates can reach up to 15% of all submitted shares with this scheme. This is a disadvantage for miners whose network connection is unreliable. In that sense, traditional PPLNS has an advantage and is unlikely to be replaced in the near future. Hence, aspects of traditional PPLNS scheme should be analysed with greater attention. Our model shows, in summary, that equitable pools and smaller pools are more resilient. This in sharp contrast to the state of the Bitcoin network.

The analysis of incentive compatibility and related strategic models provide an opportunity to better understand reward functions in the Blockchain. The mechanism design of reward functions is a nascent and promising application of non-cooperative game theory. These models are also useful to evaluate implementation trade-offs. For example, the so-called Block Withholding Attack [1], may become less attractive for an attacker who can benefit from delaying. An adversary delaying shares until the end of the round would be unwilling to discard complete solutions. Also note, for example, that the average number of shares submitted per discovered block, $\hat{D}$, decreases with positive delays. This reduction may be significant from the perspective of computational and network load on pool administrators.

References

Bag, S., Ruj, S., Sakurai, K.: Bitcoin block withholding attack: analysis and mitigation. IEEE Trans. Inf. Forensics Secur. 12(8), 1967–1978 (2017)
Article Google Scholar
BCmonster: Mining statistics (2017). http://www.bcmonster.com/. Accessed 22 Mar 2017
Chávez, J.J.G., da Silva Rodrigues, C.K.: Automatic hopping among pools and distributed applications in the bitcoin network. In: 2016 XXI Symposium on Signal Processing, Images and Artificial Vision (STSIVA), pp. 1–7, August 2016
Google Scholar
Dziembowski, S.: Introduction to cryptocurrencies. In: Proceedings of the 22nd ACM SIGSAC Conference on Computer and Communications Security, CCS 2015, pp. 1700–1701, NY, USA (2015). http://doi.acm.org/10.1145/2810103.2812704
Fisch, B.A., Pass, R., Shelat, A.: Socially optimal mining pools. ArXiv e-prints March 2017
Google Scholar
Fudenberg, D., Tirole, J.: Game Theory, 11th edn. The MIT Press, Cambridge (1991)
MATH Google Scholar
Gervais, A., Karame, G.O., Wüst, K., Glykantzis, V., Ritzdorf, H., Capkun, S.: On the security and performance of proof of work blockchains. In: Proceedings of the 2016 ACM SIGSAC Conference on Computer and Communications Security, CCS 2016, pp. 3–16, NY, USA (2016). http://doi.acm.org/10.1145/2976749.2978341
Kano pool: Pool payout (2017). https://kano.is/index.php?k=payout. Accessed Mar 23 2017
Lewenberg, Y., Bachrach, Y., Sompolinsky, Y., Zohar, A., Rosenschein, J.S.: Bitcoin mining pools: A cooperative game theoretic analysis. In: Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2015, pp. 919–927, International Foundation for Autonomous Agents and Multiagent Systems, Richland, SC (2015). http://dl.acm.org/citation.cfm?id=2772879.2773270
Morabito, V.: Business Innovation Through Blockchain, vol. 1. Springer International Publishing AG, Heidelberg (2017)
Book Google Scholar
Nakamoto, S.: Bitcoin: A peer-to-peer electronic cash system (2008). https://bitcoin.org/bitcoin.pdf. Accessed 29 Jan 2016
P2Pool: P2Pool bitcoin mining pool global statistics (2017). http://p2pool.org/stats/index.php. Accessed 19 Mar 2017
Peck, M.: A blockchain currency that beat s bitcoin on privacy [news]. IEEE Spectr. 53(12), 11–13 (2016)
Article Google Scholar
Rosenfeld, M.: Analysis of bitcoin pooled mining reward systems. arXiv preprint (2011). arXiv:1112.4980
Schrijvers, O., Bonneau, J., Boneh, D., Roughgarden, T.: Incentive compatibility of bitcoin mining pool reward functions. In: Grossklags, J., Preneel, B. (eds.) FC 2016. LNCS, vol. 9603, pp. 477–498. Springer, Heidelberg (2017). doi:10.1007/978-3-662-54970-4_28
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of IT, Monash University, Clayton, Australia
Yevhen Zolotavkin, Julian García & Carsten Rudolph

Authors

Yevhen Zolotavkin
View author publications
You can also search for this author in PubMed Google Scholar
Julian García
View author publications
You can also search for this author in PubMed Google Scholar
Carsten Rudolph
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Julian García .

Editor information

Editors and Affiliations

Universität Klagenfurt, Klagenfurt, Austria
Stefan Rass
Nanyang Technological University, Singapore, Singapore
Bo An
University of Texas at El Paso, El Paso, Texas, USA
Christopher Kiekintveld
Carnegie Mellon University, Pittsburgh, Pennsylvania, USA
Fei Fang
AIT Austrian Institute of Technology GmbH, Klagenfurt, Austria
Stefan Schauer

A Appendix

1.1 A.1 Remarks

In the proofs, several aspects related to the concept of incentive compatibility are discussed. For that purpose, it is important to show that:

(1) for the current proofs, we will distinguish only two cases (instead of 3 in Eq. 3) $ 0<x \le N $ and $ x=0 $. That can be explained by the fact that pool mining is either entirely honest or not (incentive compatibility questions only that aspect). The state of incentive compatibility when nobody delays can be derived from Eq. 3, $ x_{i}=0, \forall i $:

$$ \frac{\hat{N}}{N \hat{D}}\,=\,\frac{ \tau _{i}}{N} \left( 1\,+\,\frac{N-x_{i}}{\hat{D}} \right) \,+\,C_{i},~C_{i} \ge 0 . $$

That is equivalent to

$$ \hat{N} \ge \tau _{i} \left( \hat{D}\,+\,N-x_{i} \right) ,$$

or, this is equivalent to the requirement

$$\begin{aligned} \hat{N}\,=\,\tau _{i} \left( \hat{D}\,+\,N-x_{i} \right) , ~ x_{i} \le 0,~ \forall i . \end{aligned}$$

(5)

The latter notation will be used as it allows to analyse conditions for incentive compatibility using the roots of a system of linear equations.

(2) One should distinguish between two different situations: a miner i may have incentives to delay a positive number of shares even if $ i\notin \mathcal {M} $; or, a miner i is included in $ \mathcal {M} $ and definitely has an incentive to delay. It is assumed that miners in $\mathcal {M} $ do not have information about other delaying miners from outside $\mathcal { M}$. As a result of inclusion (or not inclusion) in the group of delaying miners $ \mathcal {M} $, the incentive may be different. That is easy to see on the following example: the amounts of the shares delayed by miners in $\mathcal {M}$ depend on their information about $\mathcal {M}$, but, for i-th miner who is not in $\mathcal {M}$ the amount of delayed shares depends on the information about himself ($\tau _{i}$) and the information about the number of shares that are delayed by miners in $\mathcal {M}$. However, in case i is the only miner in $\mathcal {M}$, e.g. $\mathcal {M}=\{ i \} $ the incentive of the miner i is the same as if $\mathcal {M}= \varnothing $.

According to the definition, incentive compatibility is an equilibrium when $\mathcal {M}=\varnothing $ and nobody has an incentive to delay. Nonetheless, it is not clear if a pool with incentive compatible conditions can be in a state of another equilibrium when $\mathcal {M} \ne \varnothing $, $ \vert M \vert >1 $. Information about $\mathcal {M}$ may be incomplete, and, answer to the question about other (delaying) equilibrium may require certain assumption about $\mathcal {M}$. In order to resolve that obstacle, we will produce some intermediate results in Lemmas 1 and 2.