1 Introduction

In the modern and contemporary visual art market, a small number of artists earn a very high income and emerge as superstars (Thompson 2010; Beatrice 2012). Consider the example of Lucio Fontana. During the period 1999–2012, Lucio Fontana was traded about 2,500 times in the major international art auctions, with an average price of about \(\EUR \)200,000. Lucio Fontana generated revenues of about \(\EUR \)500 million.

Two theories—developed to analyze the phenomenon of a superstar in general (Baumol and Throsby 2012)—may explain the specific phenomenon of superstars in the modern and contemporary visual art market: (1) Rosen’s (1981) and (2) Adler’s (1985) theories (for a description of Rosen’s and Adler’s theories, see MacDonald 1988; Ginsburgh and Throsby 2006; Towse 2013).

  1. (1)

    Rosen’s (1981) theory implies that a variation in talent leads to a more than proportional variation in income, i.e., income is convex in talent (in our empirical setup, we call this implication of Rosen’s theory ‘Rosen’s hypothesis’).

  2. (2)

    Adler’s (1985) theory builds on Rosen’s to show that income also increases with fame (in our empirical setup, we call this implication of Adler’s theory ‘Adler’s hypothesis’).

Empirical studies testing Rosen’s and/or Adler’s hypotheses have provided mixed results.Footnote 1 Most present evidence for mass markets such as music (Hamlen 1991, 1994; Crain and Tollison 2002; Krueger 2005; Filimon et al. 2011) and sports (Lucifora and Simmons 2003; Franck and Nüesch 2008; Lehmann and Schulze 2008; Franck and Nüesch 2012; Bryson et al. 2014)Footnote 2, while empirical studies on the visual art market are still scarce since testing the theories of superstar formation would require a perfect (or nearly perfect) reproducibility of artworks. Nevertheless, it is possible to test these theories if we assume that modern and contemporary visual art buyers are not interested in artworks per se—which are irreproducible by definition—but rather in artists in se.Footnote 3

To test both Rosen’s and Adler’s hypotheses in the modern and contemporary visual art market, we assume that buyers of visual art are actually purchasing artist signatures, styles, or iconographiesFootnote 4—which are infinitely reproducible at no additional cost. This assumption relies on the concept of artist- fixed effects which is typical in hedonic models (Rengers and Velthuis 2002; Hellmanzik 2009; Canals-Cerdá 2012; Etro and Pagani 2013). According to Hellmanzik (2009), artist-fixed effects ‘account for any individual specific characteristics that might explain prices,’ which implies the price heterogeneity is lower within artist than between artists. Artist signature is certainly not the only important aspect to consider in the art market, but it is important enough to select the artist as the unit of observation in empirical analyses. Excluding masterpieces, the artist’s signature plays a similar role to a brand: While each artwork is inherently a unique item, the signature allows us to recreate by aggregating data at artist level the ideal ‘reproducibility condition’ that is at the heart of theories of superstar formation.

We examine the modern and contemporary visual art market using a unique hand-collected dataset of modern and contemporary visual artists, comprising all artwork sales occurring in Italy between 2006 and 2010 (Castellani et al. 2012). In addition to Lucio Fontana, our dataset contains artists generally recognized as talented and popular including Pablo Picasso, Salvador Dalì, Piero Manzoni, Giorgio Morandi, and Giorgio de Chirico.

As a first step, we propose a model of artist factor income as a function of talent and fame and state Rosen’s and Adler’s hypotheses in terms of talent and fame elasticities. Then, we proxy artist income by the total revenue in the secondary art market, taking advantage of the fact that in Italy (similar to other European countries), when an artwork is resold in the secondary art market by professional intermediaries, artists (or their descendants) are entitled to a royalty arising from the Artist’s Resale Right (ARR hereinafter). A factor analysis allows us to extract our measures of artist talent and fame from a set of observable artist-specific variables related to familiar environment, eclecticism, studies, death event, and the number of years an artist (or her artwork) is present in the visual art market. Then, we estimate talent and fame elasticities of income via regression analysis and find evidence supporting Rosen’s and Adler’s hypotheses for the art sector.

To separate the relative importance of average price and number of sales on artist income, we evaluate the effects of talent and fame on the average price and the number of artwork sales (the product of these two variables is equal to artist income). Findings suggest that artist income is primarily affected by talent and fame through the number of artwork sales. We use copula models to investigate the nature and the strength of the conditional dependence relationship between average prices and number of artwork sales in order to better understand how a shock on prices affects number of sales and vice versa. The results of this analysis show that average price and number of sales exhibit a positive and asymmetric dependence relationship.

Finally, to determine whether different models should be estimated for artists primarily traded in auction houses as opposed to galleries, we perform a poolability test to check whether a single model of artist income applies to all artists. Our results indicate that a single encompassing model of artist income applies to all artists in our dataset and that Rosen’s and Adler’s hypotheses also hold true at a disaggregated level. As an additional analysis, we test whether talent and fame elasticities are uniform across income quantiles and, consistent with Rosen, find that in the lower part of the artist income distribution artists are not superstars (Rosen’s hypothesis holds only beyond a median level of income) and fame does not play a role.

The paper is organized as follows. In Sect. 2, we present our model. In Sect. 3, we describe our dataset. In Sect.  4, we present our empirical analysis. In Sect. 5, we summarize the main results of our analysis.

2 Model

We use a Cobb–Douglas function to model the artist factor income as a function of talent and fame,

$$\begin{aligned} Y_i =A_i T_i ^{\tau }F_i ^{\varphi },\quad i=1,\ldots ,I \end{aligned}$$
(1)

where \(Y_i \), \(T_i \), and \(F_i \) are artist i income, talent, and fame, \(A_i \) captures artist heterogeneity, and \(\tau \) and \(\varphi \) are talent and fame elasticities (all variables are assumed to be positive).

  1. (1)

    Rosen’s hypothesis requires \({\partial Y_i }/{\partial T_i }>0\) and \({\partial ^{2}Y_i }/{\partial T_i^2 }>0\), which in Eq. 1 is equivalent to \(\tau >1\).

  2. (2)

    Adler’s hypothesis requires \({\partial Y_i }/{\partial F_i }>0\), which in Eq. 1 is equivalent to \(\varphi >0\).

In logarithmic form, Eqn. 1 becomes

$$\begin{aligned} y_i =a_i +\tau t_i +\varphi f_i \end{aligned}$$
(2)

where we defined \(y_i \equiv \ln \left( {Y_i } \right) \), and the same for \(a_i \), \(t_i \), and \(f_i \).

In Equation  2, we can decompose \(a_i \) as

$$\begin{aligned} a_i =\alpha +\mathbf{c}_i^ {\prime }{\varvec{\upkappa }}+\varepsilon _i \end{aligned}$$
(3)

where \(\alpha \) is a constant, \(\mathbf{c}_i \) is a column vector of control variables that influence artist heterogeneity, \({{\varvec{\upkappa }} }\) is a column vector of coefficients, and \(\varepsilon _i \) is an error term with a conditional mean of zero.

Combining Eqs. 2 and  3, we test Rosen’s and Adler’s hypotheses using the following empirical model

$$\begin{aligned} y_i =\alpha +\tau t_i +\varphi f_i +\mathbf{c}_i^ {\prime }{\varvec{\upkappa }}+\varepsilon _i \end{aligned}$$
(4)

where coefficients can be estimated by Ordinary Least Squares (OLS).

Since artist income is the product of average price, \(P_i \), and the number of artwork sales, \(N_i \), we can evaluate the effects of talent and fame on \(P_i \) and \(N_i \) separately to understand their relative importance. Indicating the logarithm of \(P_i \) and \(N_i \) with \(p_i \) and \(n_i \), we can split, without loss of generality (since null coefficients are admitted), Eq. 4 into two equations

$$\begin{aligned} p_i= & {} \alpha _p +\tau _p t_i +\varphi _p f_i +\mathbf{c}_i^ {\prime }{\varvec{\upkappa }}_p +\varepsilon _{pi} \end{aligned}$$
(5a)
$$\begin{aligned} n_i= & {} \alpha _n +\tau _n t_i +\varphi _n f_i +\mathbf{c}_i ^ {\prime }{\varvec{\upkappa }}_n +\varepsilon _{ni} \end{aligned}$$
(5b)

Using the product rule of logarithm, \(y_i =p_i +n_i \) and so \(\alpha =\alpha _p +\alpha _n \), \(\tau =\tau _p +\tau _n \), \(\varphi =\varphi _p +\varphi _n \), \({\varvec{\upkappa }}={\varvec{\upkappa }}_p +{\varvec{\upkappa }}_n \), and \(\varepsilon _i =\varepsilon _{pi} +\varepsilon _{ni} \).

If we stack the I equations (one for each artist) into a system, then we can write Eqs. 5a and  5b as

$$\begin{aligned} \mathbf{p}= & {} \alpha _p +\tau _p \mathbf{t}+\varphi _p \mathbf{f}+{\mathbf{C}{\varvec{\upkappa }} }_p +{{\varvec{\upvarepsilon }} }_p =\left[ {{\begin{array}{l@{\quad }l@{\quad }l} \mathbf{t}&{} \mathbf{f}&{} \mathbf{C} \\ \end{array} }} \right] \left[ {{\begin{array}{l} {\tau _p } \\ {\varphi _p } \\ {{\varvec{\upkappa }}_p } \\ \end{array} }} \right] +{\varvec{\upvarepsilon }}_p ={\mathbf{X}{\varvec{\upbeta }} }_p +{\varvec{\upvarepsilon }}_p \end{aligned}$$
(6a)
$$\begin{aligned} \mathbf{n}= & {} \alpha _n +\tau _n \mathbf{t}+\varphi _n \mathbf{f}+{\mathbf{C}{\varvec{\upkappa }} }_n +{{\varvec{\upvarepsilon }} }_n =\left[ {{\begin{array}{l@{\quad }l@{\quad }l} \mathbf{t}&{} \mathbf{f}&{} \mathbf{C} \\ \end{array} }} \right] \left[ {{\begin{array}{l} {\tau _n } \\ {\varphi _n } \\ {{\varvec{\upkappa }}_n } \\ \end{array} }} \right] +{{\varvec{\upvarepsilon }} }_n ={\mathbf{X}{\varvec{\upbeta }} }_n +{\varvec{\upvarepsilon }}_n \end{aligned}$$
(6b)

The final system consisting of the two systems of Eqs.  6a and  6b (for a total of 2I equations) takes the form

$$\begin{aligned} \left[ {{\begin{array}{l} \mathbf{p} \\ \mathbf{n} \\ \end{array} }} \right] =\left[ {{\begin{array}{l@{\quad }l} \mathbf{X}&{} \mathbf{0} \\ \mathbf{0}&{} \mathbf{X} \\ \end{array} }} \right] \left[ {{\begin{array}{l} {{\varvec{\upbeta }}_p } \\ {{\varvec{\upbeta }}_n } \\ \end{array} }} \right] +\left[ {{\begin{array}{l} {{\varvec{\upvarepsilon }}_p } \\ {{\varvec{\upvarepsilon }}_n } \\ \end{array} }} \right] ={\mathbf{Z}{\varvec{\upbeta }} } +{\varvec{\upvarepsilon }} \end{aligned}$$
(7)

We can assume the following properties for \({\varvec{\upvarepsilon }}\)

$$\begin{aligned} \hbox {E}\left( {{\varvec{\upvarepsilon }}|\mathbf{X}} \right) =\mathbf{0},\quad \hbox {Var}\left( {{\varvec{\upvarepsilon }}|\mathbf{X}} \right) =\left[ {{\begin{array}{l@{\quad }l} {\sigma _p^2 }&{} {\sigma _{pn} } \\ {\sigma _{pn} }&{} {\sigma _n^2 } \\ \end{array} }} \right] \otimes \mathbf{I}_I ={{\varvec{\Sigma }} }\otimes \mathbf{I}_I \end{aligned}$$
(8)

where \(\mathbf{I}_I \) is the I-dimensional identity matrix. Thus, for any given artist, this model allows for arbitrary cross-correlation between the \(p_i \) and \(n_i \) equations.

Considering the properties of \({\varvec{\upvarepsilon }}\) in Eq. 8, we can estimate the coefficients of Eq.   7 by Feasible Generalized Least Squares (FGLS) in a SUR model (Zellner 1962)—a generalization of a linear regression model consisting of several (possibly cross-correlated) regression equations. Since Eqs. 6a and  6b contain exactly the same set of regressors, FGLS and equation-by-equation OLS estimates are numerically equivalent (Greene 2011). Even in such a case, however, the SUR model is useful to perform joint tests.

The SUR model takes into account the potential cross-correlation between \(p_i \) and \(n_i \) by providing an estimate of all the parameters in \({{\varvec{\Sigma }} }\), i.e., the conditional variance matrix of the system of equations. However, the system in Eq. 7 does not provide a structural relationship between the two equations (i.e., the dependent variable of one equation never appears among the covariates of the other equation). Equations \(p_i \) and \(n_i \) are still linked together through \(\sigma _{pn} \).

3 Data

The sample, based on a unique hand-collected dataset of artist-specific information on all modern and contemporary visual artists whose artworks were traded in Italy between 2006 and 2010 (Castellani et al. 2012),Footnote 5 consists of 514 professional visual artists who are heterogeneous in terms of income, talent, fame, and other artist-specific characteristics (e.g., gender, nationality, age, etc.). Only sales over \(\EUR \)3,000 by artists who died less than 70 years ago are considered, as these are necessary conditions for royalties to be paid under the ARR.

Examining the ARR archives of the SIAE (Società Italiana degli Autori e degli Editori)—the multipurpose society which handles royalty disbursement for artists—we collected all available information (artist, price, trade) about the 22,921 sales involving professional intermediaries that occurred in Italy in the five-year period 2006–2010. The total number of professional visual artists with available information is 514. For each artist, we collected additional information from art information providers available on the web (artprice.com, artnet.com, arsvalue.com, and artfacts.net) and use secondary market trades to make inferences on the income of these 514 artists.

In Italy (as in most European Countries), when artwork is resold in the secondary art market by auction houses or art galleries, the SIAE is entitled by law to collect and distribute royalties to the artist or her descendants (Candela and Scorcu 2012). For this reason, even if the artist is not directly involved in the transaction, the sum of these royalties, which is proportional to revenue generated by auctions and art gallery sales, is the artist’s ‘actual’ income in the secondary market. We call the total revenue in the secondary market ‘potential’ income to reflect the fact that if the royalty fees were equal to one, revenue would equal the artist’s income.

In principle, to fully explain the phenomenon of superstar formation, both secondary and primary market revenues should be considered. However, since multiple trades per artist are available in the secondary market, an artist’s secondary market income distribution tends to overlap her total income distribution. As a result, the primary market trades become quasi-negligible. This assumption holds true when, in the 5-year period we consider, the primary market revenues are zero. During this period, primary market revenues are zero for “inactive” artists that, by definition, do no create and/or sell new artworks in the primary market. In addition, the primary market revenues are negligible for artists whose trades in the secondary market represent the largest part of their trades (highly traded artists).Footnote 6 Finally, distinguishing between superstars and nonstars in short time period is easier in the secondary market than in the primary market. Moreover, since trades per artist are typically nonfrequent in the primary market, it is challenge to distinguish superstars from nonstars when the observational period is short (like in our analysis).

We use three groups of variables in our models: (1) response variables, (2) variables that could be associated with artist talent and/or fame, and (3) control variables. Note that to avoid a proliferation of notation, we indicate empirical variables using the same notation of theoretical variables. Table  1 shows descriptive statistics for each variable.

Table 1 Descriptive statistics (variables are in levels)

Group 1: Response variables

  • y is the logarithm of each artist’s potential income. Since an artist’s actual income is proportional to her potential income, coefficient estimates in all our models (with the exception of the constants) are unaffected by the choice of using potential income instead of actual income as a dependent variable.

  • p is the logarithm of the average price of artwork sales.

  • n is the logarithm of the number of artwork sales.

Group 2: Variables that could be associated with artist talent and/or fame

  • deceased is a dummy variable equal to 1 if the artist is deceased. We expect this variable could be associated with fame, since an artist’s death is a crucial event that influences both the supply and the demand of that artist’s artworks (Ekelund Jr et al. 2000; Ursprung and Wiermann 2011). In our dataset, there are 26 cases where the year of death is between 2006 and 2010. This fact could in principle affect our results. However, unreported robustness checks reveal that the results are not significantly affected when these observations are excluded from the analysis.

  • descendant is a dummy variable equal to 1 if the artist is of artist descendant. We expect this variable could be associated with talent since artistic environments stimulate creativity.

  • existence is the difference between year 2010 and the artist year of birth. We expect existence to be associated with fame.Footnote 7 Based on the assumption that, on average, artists start producing/selling at almost the same age, this variable is related to the number of years that an artist (or her artwork) is present in the visual art market. While not fully realistic, this is an assumption that we maintain as it considerably simplifies the data-gathering process (information is not readily available on the exact year each artist exhibited/sold her first artwork). In principle, descendant could also be correlated with fame and existence with talent. The factor analysis will shed light on the actual correlation structure between these two variables.

  • muforms is a dummy variable equal to 1 if the artist does not specialize in a single form of art (such as painting), but in more than one form of art. Since muforms could be considered a proxy for an artist’s eclecticism, we expect this variable to be associated with talent.

  • study is a dummy variable equal to 1 if the artist has studied art. This is the observable component of an artist’s human capital which is accumulated by education, and thus, we expect this variable to be associated with talent.

Group 3: Control variables

  • gallery is the number of trades mediated by art galleries (rather than auction houses) over the total number of trades.

  • male is a dummy variable equal to 1 if the artist is male.

  • sculpture is a dummy variable equal to 1 if the artist is considered mainly as a sculptor in our data sources.

  • world is a dummy variable equal to 1 if the artist is not Italian. While all trades took place in Italy, not all artists in our dataset are Italian.

  • period is a set of six control dummies indicating the artistic period of the artist. To assign an artist to a period, we consider the historical period of development for the artistic movement to which she belongs (e.g., all futurists are assigned to the artistic period from 1900 to 1925). In the few cases in which an artist claims not to belong to any artistic movement, we inferred the artistic period by adding 30 years to her date of birth.

4 Empirical analysis

4.1 Talent and fame

In this section, we estimate a linear factor model (Basilevsky 2009), which we use as a statistical technique to determine proxy common factors for talent and fame (to be used in following regression analyses) from deceased,descendant, existence, muforms, and study (Kozbelt 2004; Hagtvedt et al. 2008). Standard factor analysis is based on Pearson’s correlation matrices and assumes that the variables are continuous. Since our dataset contains also dichotomous variables, we perform the factor analysis by means of a polychoric correlation matrix (Olsson 1979; Bonett and Price 2005). Table 2 shows the correlation matrix. Note that all variables are positively correlated, with the strongest correlation observed between deceased and existence.Footnote 8

Table 2 Polychoric correlation matrix

The factor loadings for the promax oblique rotation of the factor axes are shown in Table 3.Footnote 9 Reported factor loadings indicate how each variable is weighted in each factor. To determine the number of factors to be included in the analysis, we used the Kaiser rule (all components with eigenvalues under one have been dropped). The resulting number of factors is two which is coherent with our a priori belief of an underlying structure based on two variables, i.e., talent and fame.

Table 3 Rotated (oblique promax) factor loadings (pattern matrix) and unique variances

Observing the factor loadings, we note that deceased and existence are positively associated with the first factor (the association with the second factor is negligible). For this reason, we propose interpreting this first factor as representing artist fame. Note that we are not proposing that death or the simple passage of time ‘causes’ an increase in fame, but simply that in our dataset several famous artists are deceased and/or their artworks have been present in the market for a long time.

The second factor, descendant, muforms, and study, exhibits positive factor loadings. Since these variables are related to the observable component of human capital and ceteris paribus greater human capital implies greater talent, we propose interpreting the second factor as a representation of artist talent (Filer 1990; Throsby 2006; Towse 2006). The key assumption is ‘ceteris paribus’: If two artists (say A and B) are completely identical in all respects, but A has a higher endowment of human capital than B (say A is more eclectic than B), we can assume that A is more talented than B.

After calculating scoring coefficients for the first and second factors, we obtained t and f and use them as explanatory variables in following regression analyses.

A critical comment on our measures is necessary. Since talent and fame are difficult to measure, defining good proxies for artist talent and fame is also a difficult task. All empirical analyses in our paper are necessarily based on the assumption that, though we cannot perfectly track talent and fame, our measures are at least sufficiently correlated with them. Note that this assumption characterizes most of the studies on Rosen’s and Adler’s theories.

Our measure of talent is in line with most of the previous studies testing Rosen’s hypothesis which also use variables related to human capital to proxy for talent. Even though we are not aware of any example directly pertaining the visual art sector (where studies are still sparse), there are several examples for the music and sport sectors. For example, Hamlen (1991) uses the harmonic measurement of an artist’s voice as a talent proxy and thus employs a dimension of human capital that can be improved with education. Other examples in the sport sectors include Lucifora and Simmons (2003),Franck and Nüesch (2008),Lehmann and Schulze (2008), and Franck and Nüesch (2012) who use ‘outcome’ variables such as number of goals, assists, tackles, and other performance statistics as talent proxies. With respect to these previous studies, our work contributes to the literature by providing a measure of talent that combines several variables through factor analysis instead of using one variable at a time.

On the other hand, we understand our measure of fame could seem nonconventional and raise some concerns. Considering that proxies must be chosen for a specific context and not in absolute terms, it is therefore possible that whether or not our fame proxy is a good general proxy, it is still a good enough fame proxy for our specific sample of artists. Even though we maintain this assumption in the following sections, in the “Robustness checks” Section, we provide additional analyses using ‘more conventional’ proxies for fame.

4.2 Talent, fame, and artist income

A preliminary analysis on our dataset shows that artist income is highly concentrated (the Gini index of artist income is equal to 0.792), i.e., a small number of artists earn a high income and emerge as a superstar. In this section, we estimate the model in Eq. 4 to ascertain if Rosen’s and Adler’s hypotheses can explain this income concentration.

Table 4 reports estimates of Eq. 4. We have estimated two OLS models. Both models include our proxies for talent and fame. While the first model includes all control variables described in Sect. 3 (see Group 3), the second model includes only the control variables with a p value less than 0.2.Footnote 10 The results of the two specifications are very similar.Footnote 11 In particular, our results show that artist income is positively affected by talent and fame. Since the estimated talent elasticity is significantly larger than one, as seen in the reported confidence intervals, we cannot reject Rosen’s hypothesis. Furthermore, since the estimated fame elasticity is positive, we cannot reject Adler’s hypothesis either.

A visual representation of the estimated effects of talent and fame on income for the ‘representative’ (or average) artist is provided in Fig. 1 (Candela and Scorcu 1997). In particular, the plot on the left shows that a variation in talent implies a more than proportional variation in income; the plot on the right shows that a variation in fame implies an (approximately) proportional variation in income. Both these effects are coherent with Rosen’s and Adler’s hypotheses.

Table 4 Regression models

4.3 Average price and number of artwork sales

We evaluate the effects of our proxies for talent and fame on the average price and the number of artwork sales separately to test whether the effect of talent and fame on income passes mainly through price or number of sales. The results may help auction houses and galleries in their artist selection mechanisms and pricing strategies. For example, an interesting result would be an asymmetric effect of talent and/or fame on prices and number of sales, indicating the existence of a main ‘channel’ through which talent and fame influence income.

Fig. 1
figure 1

Effects of talent (left plot) and fame (right plot) on average artist income (mln \(\EUR \))

In Table 5, we present our estimates of Eqn. 7. Specifically, we have estimated a SUR model. The results show that both p and n are positively affected by t and f. Since the coefficients associated with talent in the two equations sum up (by construction) to the estimated elasticity reported in Table 4, we conclude that most of the effect of talent on artist income passes through n (\({\tau _n }/\tau =76.41\,\% )\). A similar result applies to fame (\({\varphi _n }/\varphi =75.87\,\% )\). Therefore, the main channel through which talent and fame influence income is number of sales.

Moreover the conditional correlation between p and n is positive and significant (0.261, p value \(<\)0.001), implying that a positive shock on one equation also positively affects the other equation (and vice versa). This supports our expectation of a dependence relationship between p and n.

We can further investigate the nature and the strength of the conditional dependence relationship between p and n in order to better understand how a shock on p affects n and vice versa. Specifically, we use copula models to investigate the dependence relationship between estimated \(\varepsilon _{pi} \) and \(\varepsilon _{ni} \) (Cherubini et al. 2004; Nelsen 2006; Trivedi and Zimmer 2007).Footnote 12

Table 5 SUR model

We fit several copula models (symmetric, left asymmetric, and right asymmetric models) to our data through the so-called inference for margins method (5) in a semi-parametric fashion.Footnote 13 We find that a Clayton copula with a significant positive dependence parameter \(\theta =\hbox {0.279}\) produces the best fit.Footnote 14 Since the Clayton copula exhibits strong left tail dependence but weak right tail dependence and the parameter is positive, our analysis confirms the results obtained via the SUR model (i.e., a positive relationship between \(\varepsilon _{pi} \) and \(\varepsilon _{ni} )\) and adds that the relationship is a lower tail dependency (i.e., p and n are likely to experience extreme low values together and are not likely to simultaneously realize upper tail values).

4.4 Auction houses versus galleries

The results in Table 4 are coherent with both Rosen’s and Adler’s hypotheses and indicate a single model of artist income is common to all artists in our dataset. However, since auction houses and galleries differ in their artist selection mechanisms and pricing strategies, a distinction could emerge between the models of income determinants for artists prevalently traded in auction houses and artists prevalently traded in galleries, implying different effects of talent and fame on artist income in these two artist subsamples.

In this section, we use a poolability test (Patuelli et al. 2010; Verbeek 2012) with respect to the auction houses/galleries subdivision. This poolability test checks for subsample stability of the estimated coefficients to determine whether a single model (in our case the model in Table 4) applies to all artists in our dataset or if it would be better to specify different models for artists whose works are normally traded in auction houses and artists usually traded in galleries.

Indicating the artists traded in auction houses with the subscript A and the artists traded in galleries with the subscript G, our poolability test applied to Eqn. 4 can produce two results.

  1. (1)

    If \(\left[ {{\begin{array}{l@{\quad }l@{\quad }l} {\tau _A }&{} {\varphi _A }&{} {{\varvec{\upkappa }}^ {\prime }_A } \\ \end{array} }} \right] =\left[ {{\begin{array}{l@{\quad }l@{\quad }l} {\tau _G }&{} {\varphi _G }&{} {{\varvec{\upkappa }}^ {\prime }_G } \\ \end{array} }} \right] \), then a single model of artist income applies to all artists in our dataset.

  2. (2)

    If \(\left[ {{\begin{array}{l@{\quad }l@{\quad }l} {\tau _A }&{} {\varphi _A }&{} {{\varvec{\upkappa }}^ {\prime }_A } \\ \end{array} }} \right] \ne \left[ {{\begin{array}{l@{\quad }l@{\quad }l} {\tau _G }&{} {\varphi _G }&{} {{\varvec{\upkappa }}^ {\prime }_G } \\ \end{array} }} \right] \), then two separated models of artist income need to be estimated for each subsample.

Table 6 shows our estimates of Eqn. 4, using the variable gallery as a threshold variable to split our sample in two subsamples: The first subsample includes artists prevalently traded in auction houses (\(gallery<0.5)\); the second subsample includes artists prevalently traded in galleries (\(gallery \ge 0.5)\). The poolability test reported at the bottom of the table is not significant (p value = 0.359), indicating that the estimated coefficients are stable across the two subsamples, and supports the existence of an encompassing model of artist income determinants.Footnote 15 This result also implies that \(\tau _A =\tau _G \) and \(\varphi _A =\varphi _G \), so the talent and fame elasticities for artists prevalently traded in auction houses are the same as those for artists generally traded in galleries. Furthermore, as the tests on the single coefficients in Table 6 indicate, Rosen’s and Adler’s hypotheses are supported in both subsamples, i.e., \(\tau _A =\tau _G >1\) and \(\varphi _A =\varphi _G >0\).

Table 6 Regression models for artists prevalently traded in auction houses and for artists prevalently traded in galleries

4.5 Low, median, and high artist income

In the previous sections, we modeled the conditional mean of artist income. However, the effects of our proxies for talent and fame in the lower part of the artist income distribution may differ from the effect of these same variables in the upper part of artist income distribution. As Franck and Nüesch (2008) note, Rosen (but a similar reasoning applies to Adler) defines superstars as high-income artists but does not define any explicit income threshold to distinguish between superstars and nonstars. Studying the characteristics of the conditional distribution of artist income, such as its quantiles, could help identify this threshold.

In this section, we estimate a quantile regression model (Koenker and Hallock 2001; Koenker 2005; Kleiber and Zeileis 2008; Chamarbagwala 2010), in which the conditional quantile function (indexed by the quantile q) is given by

$$\begin{aligned} Q_y \left( {\left. q \right| t,f,\mathbf{c}} \right) =\alpha +\tau _q t_i +\varphi _q f_i +\mathbf{c}_i^ {\prime }{\varvec{\upkappa }}_q \end{aligned}$$
(9)

where \(Q_y \left( {\left. q \right| t,f,\mathbf{c}} \right) \) indicates the q-quantile of artist income conditional on talent, fame, and control variables. We estimate Eqn. 9 for quantiles 0.25 (low income, L), 0.5 (median income, M), and 0.75 (high income, H) simultaneously by simultaneous quantile regression.

  1. (1)

    If \(\left[ {{\begin{array}{l@{\quad }l} {\tau _L }&{} {\varphi _L } \\ \end{array} }} \right] =\left[ {{\begin{array}{l@{\quad }l} {\tau _M }&{} {\varphi _M } \\ \end{array} }} \right] =\left[ {{\begin{array}{l@{\quad }l} {\tau _H }&{} {\varphi _H } \\ \end{array} }} \right] \), then talent and fame elasticities are homogenous across artist income quantiles.

  2. (2)

    If \(\left[ {{\begin{array}{l@{\quad }l} {\tau _L }&{} {\varphi _L } \\ \end{array} }} \right] \ne \left[ {{\begin{array}{l@{\quad }l} {\tau _M }&{} {\varphi _M } \\ \end{array} }} \right] \) and/or \(\left[ {{\begin{array}{l@{\quad }l} {\tau _L }&{} {\varphi _L } \\ \end{array} }} \right] \ne \left[ {{\begin{array}{l@{\quad }l} {\tau _H }&{} {\varphi _H } \\ \end{array} }} \right] \), then talent and fame elasticities are heterogeneous across artist income quantiles.

Table 7 shows our estimates for Eqn. 9 with the equality tests for the estimated elasticities at the bottom of the table. Our results indicate that neither talent nor fame elasticities are uniform across quantiles (p value = 0.036 and p value = 0.006), implying that the regression surfaces corresponding to each income quantile are not parallel. Specifically, as the single coefficients in the table show, the effect of talent on artist income is positive for all quantiles with increasing intensity, i.e., \(\tau _L <\tau _M <\tau _H \). A similar result applies to fame (albeit its effect on low-income artists is not significant), i.e., \(\varphi _L <\varphi _M <\varphi _H \). Furthermore, Fig. 2 shows that elasticities in the median- and high-income quantiles (in contrast to low-income quantile) are similar to each other and to the elasticities estimated by OLS in Table 4 (represented by dashed lines). The evidence in Table 7 and Fig. 2 suggests that Rosen’s and Adler’s hypotheses hold in high-median income quantiles but not for the low- income quantile, where talent has a proportional effect on income (\(\tau _L \cong 1)\), but fame has no effect (\(\varphi _L \cong 0)\).

Table 7 Simultaneous quantile regression
  1. (1)

    \(\tau _L \cong 1\) implies that in the low-income quantile artists are not superstars and their income is proportional to their talent. However, there is a threshold of income (median income) beyond which Rosen’s hypothesis holds.

  2. (2)

    \(\varphi _L \cong 0\) implies that fame plays no role in the low-income quantile, so Adler’s hypothesis does not hold.

  3. (3)

    The joint result \(\tau _L \cong 1\) and \(\varphi _L \cong 0\) implies that in the lower part of the income distribution, income and talent distributions are overlapping.

Fig. 2
figure 2

Talent and fame elasticities for varying quantiles (low = 0.25, median = 0.5, high = 0.75). Elasticities estimated by OLS are represented by dashed lines

4.6 Robustness checks

We now present some sensitivity analyses and robustness checks. In Table 8 (columns 1–3), we regress artist income on talent (model 1), fame (model 2), and talent, fame, and their interaction (model 3). The obtained results confirm those in Table 4 and add that the interaction between talent and fame, at least when using our proxies, is negligible. These findings suggest that the relationship between artist income and our measures of talent and fame is robust enough to withstand changes in the specification of the model.

A concern with our analysis is the risk of having chosen a poor fame proxy. For this reason, we perform a robustness check using two new and ‘more conventional’ proxies for fame. The first measure is based on the methodology in Garcia-del-Barrio and Pujol (2007) and creates a measure of fame based on the number of Google hits (fame Google)Footnote 16; the second measure uses the reputation index provided by the artfacts.net website ‘which indicates the amount of attention each particular artist has received from art institutions’ (fame artfacts). The results of the regression models obtained using these additional proxies are presented in Table 8 (columns 4–9). These results suggest that our findings are robust to the choice of the fame proxy and that talent and fame might positively interact in influencing artist income. Although we cannot completely rule out the risk of having chosen a poor fame proxy, the risk does not seem strong enough to invalidate the conclusions of our work.

Table 8 Robustness checks

A further concern with our analysis is the potential problem of endogeneity of our proxies for fame: For example, it could be that fame is determined by income but not vice versa. Since in case of endogeneity, estimates are inconsistent, we perform endogeneity tests for each of our fame proxies following the two-step procedure described in Wooldridge (2010). To perform these endogeneity tests, we first need a variable that is related to fame and can be omitted from the income regression. The variable sculpture is a good candidate since it is reasonable to believe it is related to fame, but in all regression models (regardless of the chosen proxy for fame), it is not significant. The results of the endogeneity tests (one for each proxies for fame) do not reject the null hypothesis that fame is exogenous (fame factor analysis p value = 0.345, fame Google p value = 0.947, fame artfacts p value = 0.515) and suggests that, while we cannot exclude the problem of endogeneity in general terms, we can exclude it in the specific context of our analysis.

5 Conclusions

Using a unique hand-collected dataset, which comprises all artwork sales in Italy between 2006 and 2010, we tested Rosen’s and Adler’s hypotheses using a plethora of econometric models on our dataset of modern and contemporary visual artists. Other empirical studies tested Rosen’s and Adler’s hypotheses for the music and sport sectors, while we test Rosen’s and Adler’s hypotheses on the modern and contemporary visual art market and find that the differences in the level of talent and fame are reflected in artist income.

Our dataset allows us to extract new measures of artist talent and fame by means of a factor analysis: Our measure of artist talent is positively associated with an artistic family environment, eclecticism and artistic studies; our measure of artist fame is found to be positively associated with death event and the number of years an artist (or her artwork) is present in the visual art market.

Using our proxies, we model artist factor income as a Cobb-Douglas function of talent and fame and estimate talent and fame elasticities by OLS. The talent elasticity is larger than one, implying a convex relationship between income and talent. This convex relationship is consistent with Rosen’s hypothesis. The fame elasticity is about one, implying a linear relationship between income and fame. This evidence offers support to Adler’s hypothesis.

Using a SUR model, we evaluated the effects of our proxies for talent and fame on the average price and the number of artwork sales to understand their relative importance in generating artist income. The findings, based on our measures, show that about 3/4 of the effects of talent and fame on an artist income pass through the number of artwork sales. A copula model shows that average prices and number of artwork sales exhibit a positive and asymmetric dependence relationship, implying that they are likely to experience extreme low values together and are not likely to simultaneously realize upper tail values.

We split our sample into two subsamples, including artists prevalently traded in auction houses in the first subsample and artists prevalently traded in galleries in the second subsample. Then, we performed a poolability test. Our results indicate that a single encompassing model of artist income applies to all artists in our dataset and that Rosen’s and Adler’s hypotheses also continue to hold true at a disaggregated level. Thus, even though significant differences exist in the artist selection mechanisms and pricing strategies at auction houses and galleries, these differences do not affect the artist income generating process.

Furthermore, we estimated a simultaneous quantile regression to test whether talent and fame elasticities were uniform across income quantiles. Our results show that (consistent with Rosen) in low- income quantiles artists are not superstars, while Rosen’s hypothesis only holds for artists with an income greater than the median level. In addition, our proxy for fame plays no role in low-income quantiles.

Sensitivity analyses and robustness checks, which evaluate the appropriateness of our proxy for fame and its potential endogeneity, confirm the main findings of our analysis.

Our findings are important to understanding the mechanisms of superstar formation and implementing policies to support the income of talented artists who are not yet famous. Further empirical studies are needed to generalize our conclusions to other art forms and non-Italian markets. In addition, our findings also raise a few unanswered questions, which we cannot handle with our static cross- sectional dataset, and open new avenues for future research: The measure of fame used in this study is time invariant, but fame may indeed change over time. While we believe that in our sample and during the short time period we consider (5 years), fame was sufficiently stable to produce reliable results, further empirical studies based on panel datasets may extend our conclusions considering time-varying determinants of income. In particular, panel datasets would make it possible to test dynamic implications that are only weakly identified in our static setting.