Introduction

This paper offers three axiomatic characterizations of the Hirsch (2005) index; see Wikipedia (2008) for a discussion of advantages and criticisms of the Hirsch index. The three differ from Woeginger’s (2008a) characterization in requiring fewer axioms (three instead of five) and in dispensing with the axiom on which Woeginger’s result hinges conceptually: monotonicity (more citations or papers do not lower the index).

Definitions and axioms

Let \(\mathbb{N}\) be the set of non-negative integers and R the set of non-negative real numbers. Members of \(\mathbb{N}\) represent both the number of papers of a given researcher and the number of citations that a paper can receive. Define X to be the set of all vectors x = (x 1, x 2,…,x n ) such that n\(\mathbb{N}\)\{0} and x 1 ≥ x 2 ≥\(\ldots\)≥ x n . For xX: (i) d x is the number of components of vector x (the dimension or size of x); (ii) c x is the number of components of vector x different from 0; (iii) for i ∈ {1,…,d x }, x i is the ith component of vector x and stands for the total number of citations of paper i; and (iv) \( x^{\Sigma } = x_1 + x_2 + \cdots + x_{{d_{n} }} \) is the sum of the d x components of x (the weight of x). With ∅ designating the empty vector (the no paper case), a researcher’s output will be represented by a member of D = X ∪ {∅}. For x = ∅ the convention is that \( c_{x} = d_{x} = min {\left\{ {x_{1} , \ldots ,x_{{d_{n} }} } \right\}} = 0. \)

For xX and yX: (i) the distance δ(x, y) between xX and yX is defined as δ(x, y) = max{x Σ, y Σ} − min{x Σ, y Σ}; and (ii) x ≥ y holds if, and only if, d x  ≥ d y and, for all i ∈ {1,…,d y }, x i  ≥ y i . With respect to the empty vector ∅: (i) for all xX, δ(x, ∅) = δ(∅, x) = x Σ; and (ii) for all xX, x ≥ ∅. Define D 0 = {xD: d x  = 0} = {∅} and, for n\(\mathbb{N}\)\{0}, D n  = {xD: d x  = n}.

Definition 1

A research output index (or index, for short) is a mapping f: D → R.

Woeginger (2008a, p. 225) defines an (impact) index as a mapping f: D → \(\mathbb{N}\) satisfying the monotonicity property MON and such that, for all xX with c x  = 0, f(x) = 0.

MON. For all xD and yD, x ≥ y implies f(x) ≥ f(y).

The definition of an index as an integer-valued mapping is restrictive because it excludes reasonable indices like the average citation index. In addition, assuming f(x) = 0 when c x  = 0 and d x  ≥ 1 is also restrictive because an index need not always be interpreted as an impact index: viewed as a research output index, it is not unreasonable to attribute value to the production of papers and make f(0,…,0) ≠ 0. Finally, Woeginger (2008a, p. 227) stresses that his axioms should be interpreted within the context of MON. Though it is difficult to question MON as a desirable property of an index, it may be worth approaching the characterization of the Hirsch index without constraining the choice of axioms by their connection with MON.

Definition 2

The Hirsch index is the research output index h such that f(∅) = 0 and, for all xX, h(x) = max{n ∈ {0, 1,…,c x }: x n  ≥ n}.

A1. For all xX, if c x  = d x then \( min {\left\{ {min {\left\{ {x_{1} , {\ldots}, x_{{d_{x} }} } \right\}, } { d_{x}}} \right\}} \leq f(x) \leq d_{x}.\)

A1 sets upper and lower bounds to the index in the case in which all the papers are cited: on the one hand, the index cannot be greater than the number d x of papers; and, on the other, the index is, as long as this is consistent with the previous constraint, not smaller than the smallest number of citations. A1 establishes that the index is bounded above by size and bounded below by the smallest magnitude between size and the minimum contribution to the weight of the output.

A2. For all n\(\mathbb{N}\), xD n and yD n+1, if y ≥ x and \( f(y) > f(x) = max {\left\{ {f(z)} \right\}}_{{z \in D_{n} }} \) then δ(x, y) > c x .

Suppose x is an output with size n reaching the maximum index that size n allows and that x is subsequently expanded by gaining weight (the number of citations of existing papers) or size (by adding another paper, possibly receiving some citation). Suppose this output expansion generates an increase of the index. By A2, the weight necessary to achieve this must be higher than the number c x of cited papers in x; that is, if the maximum index reachable in D n requires all papers to be cited, the new output y must have more than n citations more than x. Roughly speaking, if more citations and one more paper rise the index of an output already achieving the maximum index in the domain of outputs with n papers then more than n citations must have been necessary. This suggests that, once the maximum index in a size category has been reached, a further increase in the index by jumping to the next size category demands adding at least the equivalent to one citation to each cited paper.

It may appear that A2 brings an index very close to the Hirsch index. Nonetheless, A2 does not imply MON: the index f(x) = 1/(1 + h(x)) satisfies A2 but not MON.

A2 can be generalized to a family of axioms of the sort “if y ≥ x and f(y) > f(x) then δ(x, y) > c(x, y)”, for any given c:D × D → R. For instance, the use of the constant function c(x, y) = 0 seems to point to indices in which each citation counts, as occurs, for instance, with the index generating the average number of citations.

A21. For all n\(\mathbb{N}\), xD n and yD n+1, if y ≥ x and \(max {\left\{ {f(z)} \right\}}_{{z \in D_{{n + 1}} }} = f(y) > f(x)\) then δ(x, y) > c x .

A21 is a version of A2 in which it is not the initial output x that is required to reach the highest index within the set of outputs of its size but the final output y.

A22. For all n\(\mathbb{N}\), k\(\mathbb{N}\)\{0}, xD n and yD n+k, if y ≥ x and \( max {\left\{ {f(z)} \right\}}_{{z \in D_{{n + k}} }} = f(y) > f(x) = max {\left\{ {f(z)} \right\}}_{{z \in D_{n} }}\) then δ(x, y) > kc x .

A22 is less general than A2 in forcing both inputs to reach the maximum index in their respective category sizes but is more general in relating several sizes. In this respect, A22 is, in a way, a transitive version of A2: if, under the given constraints, going from size n to size n + 1 takes more than n citations, then going from size n to n + k must take more than kn citations. The results presented next give an impression that, to a certain extent, A2, A21 and A22 are exchangeable conditions, with A21 and A22 being closer substitutes for each other than A2. For n\(\mathbb{N}\)\{0} and xD n , x n  = (x 1,…,xn−1) is the member of D n−1 obtained from x by deleting the last component x n of x.

A3. For all n\(\mathbb{N}\)\{0} and xD n , if \( f(x) \ne max {\{ {f(y)} \}}_{{y \in D_{n} }} \) then f(x) = f(x n ).

By A3, if an output without minimum size is not achieving the maximum index corresponding to its size then losing one paper should not affect the index. A3 can be viewed as a weak version of paper monotonicity, because it identifies a situation in which having one paper more does not lower the index: when the addition of another paper does not make the resulting output attain the maximum index associated with its size, then the paper is worthless in the sense that its presence or absence does not modify the index. Even seen as a monotonicity property, A3 is weaker than MON, that expresses both paper and citation monotonicity.

A4. For all xX, and letting n = d x , if f(x) = f(x 1,…, x n−1) then, for all k such that 0 ≤ k ≤ x n , f(x 1,…,x n−1) = f(x 1,…,x n−1, k) and f(x) = f(x 1,…,x n , k).

A4 is a sort of independence condition: if adding a paper with r citations does not alter the index, then adding another paper with r or fewer citations produces the same effect in both the initial output and in the one obtained after including the paper with r citations. In consequence, if a certain change does not affect a given output then a smaller change never affects a larger output.

Results

Remark 3

The Hirsch index satisfies A1, A2, A21, A22, A3 and A4.

A1 is an immediate implication of the definition of the Hirsch index. Notice that, for all n\(\mathbb{N}\), \(max \{h(y)\} _{{y \in D_{n} }} = n.\) Concerning A2, if n\(\mathbb{N}\), xD n , yD n+1, y ≥ x and \(h(y) > h(x) = max \{ h(z)\} _{{z \in D_{n} }}\) then h(x) = n and h(y) ≥ n + 1, so paper n + 1 must receive at least n + 1 citations in y, which implies δ(x, y) > n = c x . As for A21, if n\(\mathbb{N}\), xD n , yD n+1, y ≥ x and \( max \{ h(z)\} _{{z \in D_{n+1} }} = h(y) > h(x)\) then h(y) = n + 1 and h(x) ≤ n, so paper n + 1 must receive at least n + 1 citations in y, which implies δ(x, y) > n ≥ c x . With respect to A22, if n\(\mathbb{N}\), xD n , yD n+k, y ≥ x and \( max \{ h(z)\} _{{z \in D_{{n + k}} }} = h(y) > h(x) = max \{ h(z)\} _{{z \in D_{n} }}\) then h(y) = n + k and h(x) = n, so papers n + 1,…,n + k must each receive at least n + k citations in y. Therefore, δ(x, y) ≥ k(n + k) > kn ≥ kc x . As regards A3, it follows from \( f(x) \ne max \{ h(y)\} _{{y \in D_{n} }}\) that x n  < n. This makes the number x n of citations of the last paper irrelevant to compute h(x) and, accordingly, h(x) = h(x n ). With respect to A4, h(x) = h(x 1,…,x n−1) means that x n  ≤ h(x 1,…,x n−1). Hence, adding to both (x 1,…,x n−1) and x another paper having at most x n citations cannot increase the Hirsch index.

Proposition 4

With α ∈ {1, 2}, an index f satisfies A1, A2α and A3 if, and only if, f is the Hirsch index.

Proof

“⇐”Remark 3. “⇒”With α ∈ {1, 2}, let f be an index satisfying A1, A2α and A3. Step 1: f agrees with the Hirsch index on D 0. Since the only member of D 0 is x = ∅ and since \( d_{x} = min \{ x_{1} , \ldots ,x_{{d_{x} }} \} = 0,\) by A1, f(∅) = 0 = h(∅).

Step 2: f agrees with the Hirsch index on D 1. Let xD 1. Case 1: x 1 ≥ 1. By A1, f(x) = 1. Case 2: x 1 = 0. Case 2a: \( f(x) \ne max \{ f(z)\} _{{z \in D_{1} }}. \) Since xD 1, x −1 = ∅. By A3, f(x) = f(x −1) = 0 = h(x). Case 2b: \( f(x) = max \{ f(z)\} _{{z \in D_{1} }}.\) Let y = ∅. By step 1, \( f(y) = max \{ f(z)\} _{{z \in D_{0} }}=0. \) Case 2b1: A21 holds. Then yD 0, xD 1, \( max \{ f(z)\} _{{z \in D_{1} }} = f(x) \), x ≥ y and δ(y, x) = 0 ≤ c y  = 0. By A21, f(x) ≤ f(y) = 0. Since f(x) ≥ 0 by definition of index, f(x) = 0 = h(x). Case 2b2: A22 holds. Then yD 0, xD 1, \( f(x) = max \{ f(z)\} _{{z \in D_{1} }} ,f(y) = max \{ f(x)\} _{{z \in D_{0} }} , \) x ≥ y and δ(y, x) = 0 ≤ c y  = 0. By A22 when k = 1, f(z) ≤ f(y) = 0. Hence, f(x) = 0 = h(x).

Step 3: for n\(\mathbb{N}\)\{0, 1}, f agrees with the Hirsch index on D n . Choose n\(\mathbb{N}\)\{0, 1} and, by steps 1 and 2, suppose that, for all k ∈ {0, 1,…,n - 1}, f agrees with the Hirsch index on D k . To prove that f agrees with the Hirsch index on D n , choose xD n . Let h = h(x). Case 1: h = n. This means that, for all i ∈ {1,…,n}, x i  ≥ n. Hence, c x  = d x  = n and, by A1, f(x) = d x  = n = h. Case 2: h < n. By the induction hypothesis, f(x n ) = h(x n ). As h(x) = h < n, it follows that x n  ≤ h and, thus, h(x n ) = h(x). In sum, f(x n ) = h.

Case 2a: \(f(x) \ne max \{ f(z)\} _{{z \in D_{n} }}\). By A3, f(x) = f(x n ) = h = h(x). Case 2b: \(f(x) = max \{ f(z)\} _{{z \in D_{n} }}\). Let k ∈ {2,…,n} and yD k satisfy, for all i ∈ {1,…,k}, y i  ≥ k. By A1, f(y) ≥ min{min{y 1,…,y k }, k} = k. The Hirsch index is such that, for all r\(\mathbb{N}\), \(max \{ h(z)\} _{{z \in D_{r} }} = r.\) Given f(y) ≥ k, by the induction hypothesis, \(f(v) = max \{ f(z)\} _{{z \in D_{k} }}\)implies f(v) = k. As a consequence, for all k ∈ {2,…,n},

$$ max\left\{ {f\left( z \right)} \right\}_{{z \in D_{k} }} = k. $$
(1)

Case 2b1: α = 1. By (1), \( max \{ f(z)\} _{{z \in D_{n} }} = f(x)\) implies f(x) > f(x n ). As a result, x n D n−1, xD n , x ≥ x n and max{f(z)}zDn = f(x) > f(x n ) imply, by A21, \( {\delta}(x_{-n}, x) > c_{x_{-n}} \ge h.\) But δ(x n , x) = x n and, since h(x n ) = h, x n  ≤ h: contradiction.

Case 2b2: α = 2. Let vD h satisfy, for all i ∈ {1,…,h}, v i  = x i . By A1, f(v) = h. By (1), \(f(v) = max \{ f(z)\} _{{z \in D_{h} }}.\) Let r = nh. For t ∈ {1,…,r}, let x tD h+t satisfy, for all i ∈ {1,…,h + t}, \( x_{i}^{t} = x_{i} . \) It follows from h(x) = h that, for all i ∈ {1,…,h + t}, x i  ≤ h = c v . Given this, the fact that x r = x implies δ(v, x) ≤ rh ≤ rc v . Summarizing, vD h and xD h+r are such that x ≥ v, and δ(v, x) ≤ rc x . By A22, f(x) ≤ f(v). Hence, f(x) ≤ f(v) = h < n, which contradicts f(x) = max{f(z)}zDn = n.□

Remark 5

Neither A21 nor A22 can be replaced by A2 in Proposition 4: an index f satisfying A1, A2 and A3 need not be the Hirsch index, as Example 6 proves.

Example 6

Let f be the index such that f(3, 1, 1) = 3 and, for all xD\{(3, 1, 1)}, f(x) = h(x). Whereas f satisfies A1, A2 and A3, it is not the Hirsch index.

Proposition 7

An index f satisfies A1, A2 and A4 if, and only if, f is the Hirsch index.

Proof

“⇐”Remark 3. “⇒” Let f be an index satisfying A1, A2 and A4. Step 1: f agrees with the Hirsch index on D 0. Since the only member of D 0 is x = ∅ and since \( c_{x} = d_{x} = min \{ x_{1} , \ldots ,x_{{d_{x} }} \} = 0 \), by A1, f(∅) = 0 = h(∅). Step 2: f agrees with the Hirsch index on D 1. Let xD 1. By A1, min{x 1, 1} ≤ f(x) ≤ 1. Thus, x 1 ≥ 1 implies f(x) = 1 = h(x). If x 1 = 0 then let y = ∅. By step 1, \( f(y) = max \{ f(z)\} _{{z \in D_{0} }} = 0.\) In addition, x ≥ y and δ(y, x) = 0 < c y  = 0. By A2, f(x) ≤ f(y) = 0. By definition of index, f(x) ≥ 0. In sum, f(x) = 0 = h(x).

Step 3: for n\(\mathbb{N}\)\{0, 1}, f agrees with the Hirsch index on D n . Choose n\(\mathbb{N}\)\{0, 1} and, by steps 1 and 2, suppose that, for all k ∈ {0, 1,…,n − 1}, f agrees with the Hirsch index on D k . To prove that f agrees with the Hirsch index on D n , choose xD n . Let h = h(x). Case 1: h = n. This means that, for all i ∈ {1,…,n}, x i  ≥ n. Hence, c x  = d x  = n and, by A1, f(x) = d x  = n = h. Case 2: h < n. Let vD h satisfy, for all i ∈ {1,…,h}, v i  = x i . By A1, f(v) = h. The Hirsch index is such that, for all r\(\mathbb{N}\), \(max \{ h(z)\} _{{z \in D_{r} }} = r.\) By A1, the induction hypothesis and \( f(v) = h,max \{ f(z)\} _{{z \in D_{h} }} = h.\) Let r = n − h. For t ∈ {1,…,r}, let x tD h+t satisfy, for all i ∈ {1,…,h + t}, \( x_{i}^{t} = x_{i} . \) It follows from h(x) = h that, for all i ∈ {h + 1,…,n}, x i  ≤ h. Define w to be the member of D h+1 such that w h+1 = h and, for all i ∈ {1,…,h}, w i  = v i . Then vD h , wD h+1, w ≥ v, \( f(v) = max \{ f(z)\} _{{z \in D_{h} }} \) and δ(w, v) = h = c v . Therefore, by A2, f(w) ≤ f(v) = h. By A1, f(w) ≥ h. Consequently, f(w) = h = f(v). Given this, by A4, f(v) = f(x 1). This result, by A4, yields f(x 1) = f(x 2). By repeated application of A4, for all t ∈ {1,…, r − 1}, f(x t) = f(x t+1). Summing up, h = f(v) = f(x 1) = ··· = f(x r) = f(x).□

Remark 8

Examples 9, 10 and 11 prove that no axiom in Propositions 4 and 7 is redundant.

Example 9

Let f be the index such that, for all xD, f(x) = 1 + h(x). Then f satisfies A2, A21, A22, A3 and A4; does not satisfy A1; and is not the Hirsch index.

Example 10

Let f be the index such that, for all xD, f(x) = d x . Then f satisfies A1, A3 and A4; satisfies neither of A2, A21 and A22; and is not the Hirsch index.

Example 11

Let f be the index such that, for all xD, f(x) = h(x) − 1 if \(min \{ x_{1} , {\ldots} , x_{d_{x}} \} < h(x) < d_x \) and f(x) = h(x) otherwise. Then f satisfies A1, A2, A21 and A22; satisfies neither A3 nor A4; and is not the Hirsch index.

Concluding comments

Woeginger (2008b, p. 301) provides another characterization of the Hirsch index, on the domain of integer-valued indices, in which monotonicity is still assumed and an interesting symmetry axiom is postulated. For x = (x 1,…,x n ) ∈ D, Woeginger defines the reflection R(x) of x to be the vector (y 1,…,y k ) such that k = x 1 and y i is the number of components in x whose value is not smaller than i. For instance, if x = (7, 2, 2, 1, 0) then R(x) = (4, 3, 1, 1, 1, 1, 1). The symmetry axiom holds that the value of the index should be preserved under reflections: f(x) = f(R(x)). As a result, papers and citations are exchangeable variables through reflection.

One of the referees recommends mentioning Quesada (2008) as another characterization of the Hirsch index relying as well on monotonicity. This paper axiomatizes the Hirsch index, on the domain of real-valued indices, using monotonicity and another two axioms (Woeginger 2008b assumes six). The first axiom strengthens A1 by requiring that \( min \{ min \{ x_{1} , {\ldots} ,x_{{d_{x} }} \} ,c_{x} \} \leq f(x) \leq min \{ max \{ x_{1} , {\ldots} ,x_{{d_{x} }} \} ,d_{x} \}. \) The second axiom can be viewed as another monotonicity-type property and bears some resemblance to A4: if f(x 1,…,x n ) = f(y 1,…,y m ) and f(x 1,…,x n , a) > f(x 1,…,x n ) then f(y 1,…,y m , a) > f(y 1,…,y m ), provided that (y 1,…,y m , a) is a well-defined output. This says that if the index does not distinguish between two outputs and the addition of another paper to one output causes an increase in the index then the same qualitative effect should arise from the addition of the same paper to the second output.

The resulting characterization seems to indicate that the Hirsch index can be obtained by postulating sufficiently strong monotonicity requirements and by imposing appropriate bounds to that monotonicity. Propositions 4 and 7 can be seen as obtained from the strategy of weakening monotonicity and, in exchange, adopting independence conditions stating when the index should remain unaltered: whereas A1 is the axiom setting the bounds, the A2 axioms express a necessary condition for the index to be monotonic in a particular case and A3 and A4 are independence axioms identifying changes in a research output that should not affect the index.