Introduction

There exists a general consensus among researchers about journal articles which are the most direct results of research, especially when journals with a selective process that guarantees quality and originality are considered. Although expert opinion is believed to be the most appropriate method of valuing the contribution of an article to a specific field of knowledge, this system presents some limitations, such as the subjective character and its high cost. In this context, bibliometric indexes represent objective evidences that can be used to complement expert opinion.

It is known that some works of limited success are published by the best journals, and some works of great success are published in journals that are not top ranked according to the impact factor. Therefore, there is some rejection to evaluating the impact of a work by the impact factor of the publishing journal.

Most common indexes used to evaluate researchers are based on counting publications and received citations. The number of publications (N p) is a quantitative indicator that does not value the scientific advance of the contribution. As qualitative indicators able to assess the impact, influence or visibility of a research, the total number of citations (N c) and the average citations per article (n c = N c/N p) are used. However, although these indicators show the success of a scientific career in many cases, sometimes isolated successes accumulate a high percentage of the total number of citations. In addition, important biases are introduced by large collaborations that collect many citations derived from the work of a large number of researchers.

The h-index (Hirsch 2005) tries to solve these limitations. A researcher has an h-index when h of its publications have each received at least h citations, and the rest have h or less citations. The number of important articles pertaining to a researcher is estimated by this index, increasing their requirements at the same time as their value rises. Moreover, a lower bound h 2 of N c is provided. The value N c is generally much greater than h 2 (Hirsch has estimated between three and five times greater). This amount underestimates the citations of the h most cited articles (Hirsch core) and ignores articles with less than h citations. A correlation between the h-index and the success of a researcher appreciated by his peers has been obtained (Hirsch 2005), and the future success of a researcher could be predicted by this value (Hirsch 2007).

The h-index has been extensively studied (see reviews by Bornmann and Daniel 2007; Alonso et al. 2009; and the stochastic model by Burrell 2007a, 2009) and important mathematical properties have been fulfilled (Glänzel 2006). However, limitations have been found, some of which are shown below.

This index depends on the scientific field and the number of collaborations. It is not appropriate, therefore, to compare researchers from different scientific fields, due to different habits of publication, citation, and collaboration. This problem may be corrected, since the maximum h value obtainable in each field strongly correlates with the impact factors of the journals in the field, a reference h-index can be estimated in each scientific field (Imperial and Rodríguez-Navarro 2007). The b-index (Bornmann et al. 2007) is an alternative that indicates the number of articles in the 10% most cited publications in a field, considering ISI-ESI percentiles for example. Multiple authorships and self-citations have been investigated by Schreiber (2008a, b). Concerning the number of collaborators, the h 1-index (Batista et al. 2006) obtained dividing h by the average number of authors of these h articles, can be used.

This index correlates with the number of publications. The index tends to favor, therefore, those with more extensive scientific careers and is less effective among those with a low number of publications (Cronin and Meho 2006; Saad 2006; Van-Raan 2006). To differentiate between active and inactive researchers and compare scientists at different stages of their careers, the growth rate h′(t) has been proposed, being t the number of years since the publication of the first article (Liang 2006; Burrell 2007b; Rousseau and Ye 2008). As an alternative, the h-index can be calculated for a certain period of time, instead of along the professional life of a researcher.

All citations of the most cited articles are not considered in this index. These most cited works contribute to the h-index, but their value is not affected by the number of times these articles are cited, since the tails of the citation distribution are not considered. These tails correspond to those publications that move away from the average impact, either because they have been highly cited (upper tail), or less cited (lower tail). Based on the definition of the Hirsch core, several authors have proposed new indicators. The g-index (Egghe 2006) considers all citations of the g most cited articles, and represents an average citation of these g articles. Once the articles have been sorted in decreasing order of citations, g is the largest value, such that the first g articles have at least g 2 citations. As a matter of fact, the h-index and the g-index are special cases of a family of Hirsch index variants (Schreiber 2010). Similarly, the A-index (average citation) and the AR-index (considering the age of the articles) (Jin et al. 2007) have the particularity of taking into account the citations of the Hirsch core. However, as stated above, a heavy upper tail may correspond to the work of many authors included in large research lines that generate many citations.

This index penalizes selective researchers, that is, those producing a moderate number of high impact articles as opposed to large producers of moderate impact articles. Although this index has proven to be useful in identifying relevant researchers in a field, empirical evidence has shown it does not discriminate among researchers situated at intermediate levels and penalizes selective producers versus large producers (Costas and Bordons 2007). Cases with similar values of h, where citation curves are intersected, are especially questioned due to some researchers presenting higher levels of citations at the beginning of the curve and lower levels at the end. Additionally, this index is not consistent (Waltman and Van-Eck 2009). That is, the effect of incorporating a new paper with a given number of citations may be different between researchers, increasing the value of h in some cases and maintaining its value in others.

Finally, some variants have been published in order to improve the accuracy of the h-index: the tapered h-index (Anderson et al. 2008), the R m-index (Panaretos and Malesios 2009), the w-index (Wohlin 2009), and the e-index (Zhang 2009). The h 2 lower, h 2 center, and h 2 upper as well as the sRM value (Bornmann et al. 2010), represent new approaches providing additional information that increase the accuracy of the h-index.

In this work, a complement to the h-index that increases the consistency of the indicator and favors selective authors against large producers is presented. This approach also increases the accuracy of the h-index giving information about the shape of the citation distribution. The main difference with respect to the variants which have been proposed previously (g-index, A-index, AR-index, h 2 upper, tapered h-index, R m-index, w-index, and e-index) is that all of them are a function of all citations included in a core of most cited papers. By contrast, we establish an upper limit to the maximum number of citations considered for each publication in order to reduce the effect that isolated successes and/or large collaborations may have on the final result (as was pointed out by Hirsh). This upper limit can be modified without further changing the radius of the central index.

Central indexes

Given the published articles of an author in decreasing order of citations, let c i be the number of citations received by the publication i \( (c_{1} \ge c_{2} \ge \cdots \ge c_{{N_{\rm p} }} ), \) and let \( N_{\rm c}^{j} = \sum_{i = 1}^{j} {c_{i} } \) be the aggregated number of citations of the j most highly cited papers.

The citation distribution is obtained plotting the number of citations versus the position of the articles. Connecting these points, the citation curve is obtained. The h-index is the largest integer number that satisfies c h  ≥ h, that is,

$$ h = \max \left\{ {i \in Z\,:\,c_{i} \ge i} \right\}. $$

Graphically, the integer part of the intersection point between citation curve and the first quadrant bisector is h. This is indicated in Fig. 1.

Fig. 1
figure 1

Two citation curves with the same h-index but different average citations per article

H = h 2 is a lower bound for the number of citations of those papers in the Hirsch core. The upper tail U is the excess citations received by the Hirsch core over the lower bound. The lower tail L is the number of citations received by those papers outside the Hirsch core. The following relationships are satisfied:

$$ \begin{aligned} N_{\rm c} &= H + U + L,\\ U &= N_{\rm c}^{h} - H, \\ L &= N_{\rm c} - N_{\rm c}^{h}. \end{aligned} $$

The relative weight of the citation distribution tails is given by N c/H. According to Hirsch (2005) estimations, if N c/H < 3 the tails of the distribution are light, while if N c/H > 5 the tails are heavy. The h-index penalizes those researchers who present heavy tails, especially those with a great tail ratio U/L.

An example with the citation curves of two researchers is shown in Fig. 1. The first researcher presents higher citation levels at the beginning and lower levels at the end of the curve. Therefore, two different profiles of researchers are appreciated, one more selective and another more massive in the production of papers. However, both scientists have the same h-index. A researcher may present less h-index than another, although it does not necessarily indicate the former presents a less successful career than the latter. The problem of discriminating between two distributions with similar h-index but significantly different distribution tail ratios is presented in Fig. 1. As can be appreciated, the higher the rate between tails is presented, the better average citations per article is obtained.

In the above cases, it seems reasonable to measure part of U and L in order to complement the h-index with the area around H. Thus, the discrimination capacity is increased. This idea allows us to introduce the following index.

Central area index

Let E (F) be the upper (lower) area next to H, that is, the part of the upper tail U (lower tail L) in the citation distribution closest to H. The lower area corresponds to those articles that will likely contribute to increasing the value of h in the future, since they are closer to the Hirsch core. The upper area includes those citations that will form part of H at the time the h-index increases its value. Therefore, it seems reasonable to include this area and increase, in this way, the discrimination capacity of the index.

The central area index of radius j is defined as the citations of the h + j most cited papers limited to the number of citations of paper h − j. That is, the citations of those papers in the Hirsch core, restricted by the citations of paper h − j, jointly with the citations of papers from h + 1 to h + j. The geometrical representation is showed in Fig. 2.

Fig. 2
figure 2

Central area index (a) and central interval index (b) of radius \( j = \left\lfloor \frac{h}{2} \right\rfloor \)

The arithmetic definition of the index of radius j is the following:

$$ A_{j} = (h - j) \cdot c_{h - j} + \sum\limits_{i = h - j + 1}^{h + j} {c_{i} } , \quad j = 1, \ldots ,h - 1. $$

Note that \( A_{h - 1} = N_{\rm c}^{2h - 1} \) includes the total upper tail U. Although the radius could be defined for j ≥ h, in this case, it would only be adding part of the lower tail.

Figure 3 shows the central area index of radius \( j = \left\lfloor {h/2} \right\rfloor \) (integer part) for two citation distributions. In the case of distribution D i the central area index is \( A_{j}^{i} = H + E_{i} + F_{i} , \quad i = 1,2. \)

Fig. 3
figure 3

Comparison between the central area index (a) and the central interval index (b) of radius \( j = \left\lfloor \frac{h}{2} \right\rfloor, \) for two citation curves with the same h-index

As mentioned above, authors whose citation distributions have heavy tails are penalized by the h-index. However, the central area index of these authors grows faster than those with less heavy tails, increasing its capacity of discrimination.

Selective researchers are also penalized by the h-index. However, the central area index solves this problem. For example, suppose a researcher has 10 publications, the least of which have 20 citations. Then h = 10, which represents only 100 citations. However, A 1 ≥ 200, that is, twice the number of represented citations.

Central interval index

The central interval index of radius j is defined as the aggregated citations of the articles from h − j to h + j:

$$ I_{j} = \sum\limits_{i = h - j}^{h + j} {c_{i} } , \quad j = 1, \ldots ,h - 1. $$

That is, the citations at the interval [h – j, h + j]. Note that \( I_{h - 1} = A_{h - 1} = N_{\rm c}^{2h - 1}. \) The geometrical representation is shown in Fig. 2.

The central interval index of radius \( j = \left\lfloor {h/2} \right\rfloor \) (integer part) for two citation distributions is shown in Fig. 3. In the case of distribution D i the central interval index is \( I_{j}^{i} = G_{i} + F_{i} , \quad i = 1,2. \)

Comparison among central indexes

Both indexes have the same area in the lower tail. However, significant differences between both indexes are appreciated in the upper tail. Thus while central interval indexes add citations of articles to the left of h, the central area indexes add zones of variable size in the upper tail.

Lets now look at differences between both indexes for two authors with the same h, one more selective than the other. Increasing the radius in one unit and reducing the comparison to the upper tail, where differences exist, the central interval index adds to the selective author the height of the rectangle R, as shown Fig. 3. However, the central area index also adds to the selective author the area R. For this reason, the central area index is more beneficial for selective authors.

As an author is more selective, the height of R increases and the area of R also increases in a greater proportion.

In the following section, an empirical application determines an optimal radius of the central indexes, obtaining the value of j that best describes the central shape of the citation distribution.

Empirical application

The behavior of central indexes for researchers who have received the Price Medal is analyzed in this section. Data about these scientists was obtained from the ISI Web of Science database in February 2010. To estimate the predictive capacity of indexes for 5 and 10 years ahead, and their comparison with the h-index, the cited articles and the number of citations obtained in 1999, 2004, and 2009, have been considered. In order not to distort further analysis, especially regression analysis, only the 15 existing and currently productive scientists were considered.

The objective consists in obtaining A j , I j , j = 1, …, h − 1, at instant t and estimating the value of j (optimal radius) most correlated with A k , I k , k = j, …, h − 1 at instant t + 1, t = 1, 2 and t + 2, t = 1, that is, the future indexes.

Table 1 shows, for each author, the year of the first article published in the database, the total number of cited articles and the total citations in 1999, 2004, and 2009. This table also shows the evolution of the h-index.

Table 1 The production and impact of the researchers

Figure 4 shows the citation curves of four researchers. Three curves are shown, the closest to the origin corresponds to 1999, followed by 2004 and the farthest to 2009. The value 100 has been taken as maximum only for clarity. This plot allows us to observe the evolution of the h-index, and also to distinguish between selective and large producer researchers. As an example, McCain and Small show a more selective behavior than Egghe and Garfield, respectively.

Fig. 4
figure 4

Citation curves for some researchers in 1999, 2004, and 2009

Production-impact scatter plots are presented in Fig. 5. As shown, linear correlation between the number of articles and the number of citations exists. Authors located above the regression line show a more selective behavior than those below this line. Thus, the more selective authors of the sample are Small and Garfield, respectively.

Fig. 5
figure 5

Production-impact scatter plot

Table 2 shows the central indexes of the years considered. These indexes can be obtained up to a radius of 24 for some authors, but as the radius increases the number of data in each column of the table is reduced. Data has been shown until radius 10 to ensure that later the correlation coefficient is calculated with more than half of the sample data (at least nine out of 15). This table is useful in estimating future success. Lets see some examples using radius 7 as reference, approximately half the average h-index for the first period. It will be shown later that this indicator provides good estimations for 5 year predictions. As can be seen in the case of Leydesdorff, the area index varies from A 1 = 97 to A 7 = 171 in 1999, a significant increment that reveals the evolution of the h-index in following periods (2004 and 2009). Indeed, this author has an h-index of 9, 13, and 21, respectively. Something similar can also be seen in the case of McCain, among others. These examples suggest the area indexes obtained in a period, predict the increase in the following period.

Table 2 Central indexes up to radius 10

Lets now see a comparison between two authors with the same h. McCain and Vlachy have h 1999 = 11. A 6 in the case of McCain (because A 7 is not defined) is greater than the case of Vlachy, which estimates a higher future h-index; which holds true (15 vs. 11) in Table 1. Something similar happens with Ingwersen and Vinkler, for which h 1999 = 7. A 7 in the case of Ingwersen is greater than in the case of Vinkler, which estimates a higher future h-index; which also holds true (12 vs. 10) in Table 1. The same conclusion can also be observed for a period of 10 years. Although area indexes have been taken as reference, something similar occurs in the case of interval indexes.

Now, as an example, lets consider a case where the discrimination capacity of the central indexes compared to the h-index is appreciated. Braun has h 1999 greater than Small. However, the central indexes from a certain radius are higher for the later author. Attending to these indicators, the second author seems more selective, which is true according to total citations and the production-impact scatter plot.

Since the central index is an aggregation of citations, its representation with respect to the radius is an increasing function, as can be seen in Fig. 6. The first plot shows that McCain’s area indexes are higher than those of Egghe, indicating the first author is more selective than the second. Something similar can be seen with the interval index in the second plot.

Fig. 6
figure 6

Comparison of central indexes for two authors

Table 3 shows the linear correlation coefficients among indexes for 5 and 10 years. Matrices of order 10 to ensure the correlation coefficient is calculated with more than half of the researchers (at least nine out of 15) are shown. As can be seen, the area indexes for 1999 are strongly correlated with those for 2004, so they look like good estimators for 5 years. In all cases, correlations are higher than 0.94. The strongest correlations are located close to the main diagonal of the matrix. From the fifth element, all coefficients on the diagonal are greater than the correlation between h-indexes corr(h 1999, h 2004) = 0.977. As can be seen, all elements in column 7 are also higher than this. Therefore, A 7 seems a good estimator for 5 years and the radius is about half the average h-index of the sample.

Table 3 Indexes correlations

Area indexes for 1999 also show high correlations with 2009, although slightly lower than those mentioned in the previous paragraph, making them also good estimators for 10 years. All of the coefficients are greater than the correlation between the h-indexes corr(h 1999, h 2009) = 0.812. As can be seen, all elements in column 7 are higher than 0.9.

Finally, the area indexes for 2004 also present correlations with the year 2009. Most of the elements (including all of them in column 7) are higher than the correlation between h-indexes corr(h 2004, h 2009) = 0.889.

With respect to interval indexes, something very similar occurs. Correlations are also high in all cases. In order to better appreciate what indicators provide the best correlations, the differences between correlations for central indexes are also shown in Table 3. As can be seen, most of the elements of these matrices are positive, which means the correlations for the area index are greater than for the interval index (only 10 out of 165 items are negative).

Conclusions

The h-index is a bibliometric indicator that attempts to measure the success of a researcher with just a part of the total amount of publications and citations. Due to not considering all production and impact, this index corrects biases of mass collaborations and punctual successes, which may not be significant in the researcher’s career as a whole. However, different citation distributions, like those of a selective researcher and a large producer, may cause similar h-indexes, and in these cases, it is not possible to distinguish between these researchers using the h-index exclusively.

In this paper two complements to the h-index, the area and the interval indexes, have been proposed with the aim of increasing the capacity of discrimination among researchers with similar h, and improving the prediction of future successes. These indicators consider some areas that are larger for selective authors than for large producers. Thus, a problem described in the literature about the h-index, which penalizes selective researchers compared to large producers, is corrected.

Both central indexes are good estimators and correlations are generally higher for the area index than for the interval index. Moreover, a radius that well describes the shape of the citation distribution has been estimated empirically. This radius is about half the average h-index of researchers being evaluated.

Finally, we would like to point out that the area index is not considered a substitute, but a complement to the h-index, especially in an evaluation process where doubts among researchers might exist.

Materials and methods

Data collection

The underlying bibliometric data (publications and citations) in the empirical application were obtained from the Web of Science® database (published by Thomson Reuters—ISI, Philadelphia, PA, USA). Searches were conducted in the first week of February 2010. The data gathering about the scientists was as follows. We selected all journal papers of the Price Medallists who are still active (published to December 2009). For this purpose, the “Advanced Search” function to restrict results by author, document type, and timeframe, was used. In order to debug results the “View Distinct Author Sets” tool of the Web of Science database was utilized. This discovery function uses cross-citation data to show sets of papers likely written by the same person. A citations report from the databases Science Citation Index Expanded and Social Sciences Citation Index, was obtained. Then, output records with documents and citations were saved on a text file.

Data analysis

The text file with documents and citations was imported using Microsoft Office Excel. To estimate the predictive capacity of indexes for 5 and 10 years ahead, three different periods were filtered using this software, 1999, 2004, and 2009. Only the documents with any citation in each period were considered. Tables and figures were obtained using Excel. Finally, indexes were calculated using C++ Programming Language.