Keywords

1 Introduction

Stablecoins are being increasingly adopted as bridge to trade traditional cryptocurrencies [3, 8, 38, 45], marketed as medium-of-exchange for decentralized finance and smart contracts [39, 40] and have recently been approved by US regulators as payment method for federally chartered banks [1]. Increasing practical relevance fosters the demand for understanding economic properties of such instruments. The majority of projects simply tokenize the asset their tokens are stabilized against (e.g. the USD, EUR or gold) or store a third asset in the respective amount [43]. Assuming that traders trust governance, collateral and technology, and their trust is justified, there is little room for disagreement on the token’s fundamental values. If there were any structured deviation, efficient-market theorists might argue that market participants would seize the resulting opportunity of arbitrage, closing the gap [26].Footnote 1 Stablecoin prices would then merely reflect the value of the collateral and random noise. If, however, price adjustments are restricted by incomplete arbitrage, interesting patterns might emerge that reflect the trader’s decisions [10, 22]. This paper explores and quantifies such dynamics.

My empiric approach is based on a recent theoretical approach from asset pricing literature proposed first in [18] and refined by [14] which opens up asset pricing to dynamic systems modeling known from thermodynamic physics. In the following, the latter will be called the “Caginalp and Balenovic (1999) asset flow” (CBAF) approach. In a nutshell, the authors model price determination by abstracting trading decisions as flows from asset-to-cash and vice versa. As such, traders are abstracted as being driven by price trends and the deviation of the asset’s market price from its fundamental value. Exploring theoretic approaches to the instability of price processes, a first application to the field of cryptocurrencies has been attempted by [12]. Applying the CBAF model to stablecoin arbitrage promises to offer a convincing model for the interplay of trend following and peg deviations. In contrast to most models for cryptocurrency pricing, the above model requires few assumptions.Footnote 2 Exploring early trading data for 11 fully collateralized stablecoins, this study evaluates the appropriateness of the intuitive trader abstraction adopted by the CBAF approach and offers insights into stablecoin price dynamics in general. My empirical setup couples variables approximating trend-following with a measure for peg deviations in a dynamic coin-fixed-effects (coin-FE) panel data regression. The study engages in rigorous robustness checking by testing the models for the influence of seasonal dummies, interaction terms for the direction of trends, sign of peg deviations and different parameters in data preprocessing. I find a striking difference between results being based on data with—and without outlier treatment even for merely truncating price changes exceeding 5 standard deviations. For data including extreme price changes, the CBAF model approximates price formation well. Deviations from the peg and trend following are strong determinants of coming price changes. For outlier-free data, however, the effects seem to blend in with other price-determinants (the stablecoin’s token supply and Bitcoin’s price volatility). This result indicates that apart from very few occasions arbitrage activity is weak. This poses the questions, whether costs of arbitrage might be prohibitive high except for extreme but rare market situations. It might be considered to model price formation by switching between two regimes: a first one offering profits for large trades towards the peg, and a second one being characterized by limited levels of arbitrage activity.

The paper is organized as follows: Sect. 2 outlines related work, Sect. 3 introduces the theoretic backgrounds of arbitrage and the CBAF model, Sect. 4 discusses data and econometric approach. Sect. 5 describes the results and Sect. 6 concludes the study.

2 Related Work

Studies investigating the asset class and its manifestations include [19, 23, 41, 43] and [11]. The user perception in the adoption of stablecoins has been studied in [32], while [5] analyzed the suitability of stablecoins as save-haven investment. The relation between cryptocurrency prices and stablecoin trading has been focused on by [3, 8, 45] and [27]. Adopting a risk-oriented approach, [33, 34] and [35] suggest theoretic models to study stability and resilience of stablecoins. While the latter focuses on extreme events, [38] not only offers a theoretic model for day-to-day arbitrage but also provides, to my knowledge, the first extensive empirical analysis of the drivers of stablecoin prices. This paper differs in perspective and econometric approach: Instead of peg deviations, this paper analyzes price processes and tests the CBAF model for applicability. While [38] relies on an auto-regressive distributed lag model with a rich database, however, just for Tether, this paper adopts a dynamic coin-FE panel data regression on market data for 11 stablecoins.

3 Stablecoin Arbitrage and the CBAF Model

Stablecoin arbitrage uses primary and secondary markets [38]. On the primary market, coins are created and redeemed against collateral with the issuer. On the secondary market, stablecoins are traded against fiat and cryptocurrencies. If market prices deviate from the peg, arbitrage traders might decide to either trade against peg deviations directly on the secondary market or involve the stablecoin issuer. In the first case, arbitrage traders simply trade towards the peg. They would buy when prices are below—and sell when prices are above the peg. In the second case, abstracting from technical details,Footnote 3 arbitrage traders would purchase coins from the markets and redeem them with the issuer when prices are below the peg. When prices are above the peg, traders would first create coins with the issuer and subsequently sell them on secondary markets.

The CBAF approach as presented in [14] models traders as switching from cash to asset or vice versa with a certain probability \(k\). Variable \(k\) is modeled to include motivations based on past price changes \(\zeta _{1}\) and the market discount relative to the asset’s fundamental value \(\zeta _{2}\). These components are modeled using the trader’s memory with respect to price trends and deviations from fundamental values (\(c_{1}\) and \(c_{2}\)) and their focus on these two respective components (\(q_{1}\) and \(q_{2}\)). Core component of \(\zeta _{1}\) is the relative price trend \(f(\tau ) = \frac{1}{P(\tau )} \frac{dP(\tau )}{d\tau }\) over period \(\tau \) on time scale \(\tau _{0}\). \(\zeta _{2}\) is constructed around deviations \(g(\tau ) = \frac{P_{a}(\tau )-P(\tau )}{P(\tau )}\) of market price \(P(\tau )\) from fundamental value \(P_{a}\). Thus, the two components can be expressed as

figure a

Limiting \(k\) to values between 0 and 1, \(k\) is constructed as \(k = \frac{1}{2} + \frac{tanh(\zeta _{1}+\zeta _{2})}{2}\). For \(\zeta _{1}\) and \(\zeta _{2}\) equal to zero, the probability of flows from cash to asset and vice versa are thus equally likely. Demand \(D\) and supply \(S\) of the asset are modeled using \(k\) and the fraction already invested into the asset \(B\). Their relation can be expressed as \( D = k(1-B) \text { and } S = (1-k)B \text { and thus } \frac{D}{S} = \frac{k}{1-k}\frac{1-B}{B}\). Prices \(P\) are assumed to change logarithmic with excess demand, leading to

$$\begin{aligned} \frac{1}{P}\frac{dP}{dt} = \delta \cdot log \Bigl ( \frac{D}{S}-1 \Bigr ) = \delta \cdot log \Bigl ( \frac{k}{1-k}\frac{1-B}{B} \Bigr ) \end{aligned}$$
(3)

with \(\delta \) representing an amplitude that scales with time. For a deeper discussion of the model and its applications see [14, 16, 17, 44] and [12]. While this paper will not be offering estimates for the individual parameters, the study gathers evidence supporting the basic intuition of the model: Under the efficient market hypothesis, fluctuations of \(P(t)\) around the peg, assumed to equal \(P_{a}\), would merely be random.Footnote 4 Obviously, in this case, neither of the two components ought to be reflected in market data of stablecoins. If, however, evidence of \(\zeta _{1}\) or \(\zeta _{2}\) is present, it is of considerable interest to understand how the two components are jointly driving stablecoin prices.

4 Data and Econometric Approach

The following paragraphs will discuss data preprocessing and the economic approach for testing the applicability of the CBAF model.

Data was gathered from www.coingecko.com. Similar to www.coinmarketcap.de the data provider currently crawls 382 cryptocurrency exchanges but offers hourly data over a well-documented API for free.Footnote 5 The full dataset comprises 19 cryptocurrencies. The sample includes projects that are listed at www.coingecko.com, promise stability of their exchange rates in their whitepapers and collateralize their tokens to at least 100%. The study considers designs that use the asset-pegged-to as collateral but also includes tokens that use a third asset, often a crypto-asset, in a quantity reflecting at least full collateralization. Some of the 19 cryptocurrencies are quite young and immature. Shallow markets with low volumes and few trades per measurement period might in part be driven by market microstructure effects which could lead to biased regressions. To reduce noise, this study thus excludes months for that the stablecoin shows a market capitalization of under USD 10 m or daily trading volumes of under USD 1 m. Coins have been dropped completely if the remaining dataset included less then \(24 \cdot 31\) hourly observations. This restricted the dataset to 11 stablecoins with 767 to 16970 trading hours leading to 101,243 observations in total. Table 6 of Appendix 7.3 in the full paper shows that the results hold as well if market capitalization thresholds of USD 100 m and USD 5 m—and thresholds for daily trading volumes of USD 50 m and USD 50k are chosen. To understand the effect of extreme values on the estimated models, price changes diverging over 5 standard deviations (SD) from the mean have been truncated. Table 1 in Appendix B gives an overview over cutoff points and the relative and absolute number of classified outliers. The threshold was chosen deliberately high to correct only the most extreme values. Such might have a disproportionate effect on ordinary-least-squares (OLS) regression results [6]. To make the estimated coefficients comparable, all variables apart from the dummies have been standardized based on Z-scores \(X^{stand} = \frac{X-\mu (X)}{\sigma (X)},\) with mean \(\mu (\cdot )\) and standard deviation \(\sigma (\cdot )\). To verify the applicability of the CBAF model, a linear panel regression in conjunction with squared and cubed trend variables to model nonlinear relationships has been applied to exchange-traded funds [17] and stocks [15]. Now this framework is applied to stablecoins by adjusting it slightly to the characteristics of the dataset and adding a variety of robustness checks. Traditional linear modeling might fail to pick up the complex relations in price formation. In contrast to linear models, polynomials allow for very strong negative (positive) past returns to induce positive (negative) bounce-back effects. Mixing up relations between strong and weak price changes, simple auto-regressive price regressions abstaining from the above step might miss potential information on prices. As a remedy, following [15] and [17], this study includes variables to the power into a fixed-effects panel data approach.Footnote 6 To capture price trends, in accordance to [15], this paper uses a simple weighted aggregation of past price changes. The prior might be expressed using prices \(P\) and smoothing factor \(s\) over a look-back window \(d\) as

$$\begin{aligned} T_{t}=\frac{1}{\sum _{k=1}^{d}e^{sk}}\sum _{k=1}^{d} \Bigl ( \frac{P_{t-k+1}-P_{t-k}}{P_{t-k}} \cdot e^{sk} \Bigr ) . \end{aligned}$$
(4)

The smoothing term ensures that older observations of variable changes are entering the sum with lower weights (\(s=-0.25\) and \(d=10\)). Trends in prices have been shown to explain variation in future returns for cryptocurrencies in [20, 30] and [28]. To calculate the distance \(D\) of market price \(P\) from the peg \(P_{a}\), simply \(D_{t}= P_{t}- P_{a,t}\) is formed. As control variables the token supply and Bitcoin’s price-volatility are used. Controlling for the volatility of the second and third largest traditional cryptocurrencies by trading volume (Ether and Ripple) or for the Ethereum Gas price turned out to yield insignificant coefficients. Regressions are formed denominating cryptocurrency price changes as \(\varDelta P\), the trend as \(T\) and peg deviations as \(D\). Moreover \(\varDelta S\) and \(\varDelta V^{\mathtt {BTC}}\) are the first differences of token supply and Bitcoin’s price-volatility respectively. Dummy variables include seasonal ones (\(Z^{\mathtt {hour}}_{t}\), \(Z^{\mathtt {day}}_{t}\), \(Z^{\mathtt {month}}_{t}\)) and others that account for the sign of the peg deviation (\(Z^{D>0}_{t}\)) and the direction of the trend (\(Z^{T>0}_{t}\)). The unobserved coin effect is denominated as \(a_{i}\) while \(b_{t}\) gives the unobserved time effect and the remaining residual errors are given as \(u_{it}\). A parsimonious base-line regression can thus be based on unobserved-effects equation

$$\begin{aligned} \varDelta P_{i,t+1} =&\beta _{2} T_{i,t} + \beta _{3} T_{i,t}^{2} + \beta _{4} T_{i,t}^{3} + \beta _{5} D_{i,t} + \beta _{3} D_{i,t}^{2} + \beta _{4} D_{i,t}^{3} \nonumber \\&+ \beta _{5}\varDelta S_{i,t} + \beta _{6}\varDelta V^{\mathtt {BTC}} + a_{i} + b_{t} + u_{it} , \end{aligned}$$
(5)

while the full general regression is based on

$$\begin{aligned} \varDelta P_{i,t+1} =&\beta _{2} T_{i,t} + \beta _{3} T_{i,t}^{2} + \beta _{4} T_{i,t}^{3} + \beta _{5} D_{i,t} + \beta _{6} D_{i,t}^{2} + \beta _{7} D_{i,t}^{3} \nonumber \\&+ \beta _{8}\varDelta S_{i,t} + \beta _{9}\varDelta V^{\mathtt {BTC}}_{i,t} + Z^{\mathtt {hour}}_{t} + Z^{\mathtt {day}}_{t} + Z^{\mathtt {month}}_{t}+ \beta _{10} T_{i,t} \cdot Z^{T>0}_{t} \\ + \nonumber&\beta _{11} T_{i,t} \cdot Z^{D>0}_{t}+ \beta _{12} D_{i,t} \cdot Z^{D>0}_{t}+ \beta _{13} D_{i,t} \cdot Z^{T>0}_{t}+ a_{i} + b_{t} + u_{it} \nonumber , \end{aligned}$$
(6)

where \(t\) is the time- and \(i\) the coin index.

While two-way fixed-effects regression are applied to eliminate time and entity effects in the original framework [15, 17], recent research indicates that this approach might lead to mostly uninterpretable coefficients [29, 36] and biased inferences in most general applications [31]. This study thus settles on eliminating \(a_{i}\) by coin-fixed-effects but models common time effects by including seasonal dummies and control variables. Treatment of the Nickel bias and other issues related to panel data regressions with long time series dimensions (e.g. heteroskedasticity, non-stationarity and serial correlation) are treated in line with state-of-the-art approaches. For more information refer to Appendix A.

Table 1. Coin-FE regression based on Eq. 4.
Fig. 1.
figure 1

Coefficient interplay for untreated data

Fig. 2.
figure 2

Coefficient interplay for outlier-free data

5 Results

Table 1 supplies estimates of the coefficients given in Eq. 4. As suggested by the applied asset pricing theory, not only current deviations from the peg but also price trends show significant relations with the coming hour’s price change. Also, the adopted nonlinear regression framework has proven useful. Most of the variables that are raised to the second and third power show high significance and our models explain roughly \(20\)% of the variance in the one-hour-ahead price changes.

Footnote 7 However, coefficient estimates vary with the way outliers are treated. For the model based on data for which no outlier treatment has been implemented (compare Column 1 Table 1), the polynomial of significant coefficients for the estimated price change is \(\widehat{\varDelta P}_{t+1} = -0.160 D_{t}+0.085 D_{t}^{2}-0.417 D_{t}^{3}\). This polynomial is displayed in Fig. 1, which relates peg deviations and trends to the coming hours price change. Even relative small peg deviations are associated with moderate price changes. Large peg deviations, however, precede extreme price changes forcefully driving prices back to the peg. The coefficients related to price trends display nonlinearity as well and form \(\widehat{\varDelta P}_{t+1} = -0.266 D_{t}+ 0.178 D_{t}^{3}\). Weak price trends seem to be reverted, while stronger ones lead to trend following. In other words, series of consecutive strong price changes of the same direction are prolonged. This might be a result of consecutive hours of price jumps after large deviations. In comparison to price trends, but also all other tested variables, the joint influence of the coefficients for peg deviations seems be the largest by far. Their influence is roughly symmetric for positive and negative price deviations. For the model based on data for which price changes in excess of 5 SD have been truncated (Column 2 of Table 1), the joint estimated effect of price trends and peg deviations on coming price changes is displayed in Fig. 2. For peg deviations, the polynomial of significant coefficients is \(\widehat{\varDelta P}_{t+1} = -0.259 D_{t} + 0.046 D_{t}^{2}+0.056 D_{t}^{3}\) with roots roughly at \(-2.6\), \(0\) and \(1.8\). The effect of smaller peg deviations is in line with expectations: The relation of peg deviations and the coming price change is positive for \(-2.6 \le D \le 0\) and negative for \(0 \le D \le 1.8\) and would thus lead to mean-reverting prices. For positive deviations \(D > 1.8\) and negative deviations \(D < -2.6\), however, signs turn and peg deviations are associated with price changes leading away from the peg. The reason for this seemingly counter-intuitive result might be that the majority (95.37%) of observed price changes lie within the \(2\) SD displayed in Figs. 1 and 2. Rather than avoiding rare but large errors from outliers, the coefficients seem to be optimized to fit smaller fluctuations around the peg. Not only the results for peg deviations—but also price trends differ. The latter, estimated based on outlier free data, show a linear, negative relation to coming price changes. In other words, series of price changes of equal sign revert soon. The size of the coefficients for trend and peg deviations are relatively low, though. For instance, a peg deviation of \(2\) SD in negative direction is associated merely with an increase in prices of \(0.1\) SD. A negative price trend of \(1\) SD is associated with an increase of prices of around \(0.2\) SD. Arbitrage is estimated to have a very limited effect in the outlier-free dataset. With coefficients of \(0.394\) and \(-0.308\) respectively, the change in token supply and the change in the Bitcoin’s price volatility seem to be equally important price determinants, at least. This contrast is surprising, given that they are caused by the truncation of merely \(363\) of \(101243\) observations. Table 2 of Appendix B gives coefficients for the same model but complemented by dummy representations accounting for hours, weekdays, months as well as interaction terms for the sign of the peg deviation and the direction of the price trend. The results differ only negligibly.

6 Conclusion

The stark contrast of the importance of peg deviations in the fitted models with—and without outlier treatment is striking. For data including extreme price changes, peg deviations and trend dynamics seem to approximate price formation well. For stablecoins, the intuitive CBAF approach to model trader behavior might thus be seen as a good approximation for the determinants of price changes. For outlier-free data, however, the effects seem to blend in with other price-determinants. The above results pose the question of whether stablecoin markets might not best be modeled as switching between a first regime characterized by limited arbitrage, and a second one setting in when markets promise sufficient profits for large trades towards the peg. While the most obvious candidate for explaining such results might be costs of arbitrage, at least short-term variation of the Ethereum Gas price turned out to yield insignificant coefficients when adopted as control variable. Future research might employ cointegration analysis capable of unvailing also longterm relationships.