More axiomatics for the Hirsch index

Quesada, Antonio

doi:10.1007/s11192-009-0026-x

More axiomatics for the Hirsch index

Published: 12 June 2009

Volume 82, pages 413–418, (2010)
Cite this article

Download PDF

Access provided by CONRICYT-eBooks

Scientometrics Aims and scope Submit manuscript

More axiomatics for the Hirsch index

Download PDF

Antonio Quesada¹

313 Accesses
28 Citations
Explore all metrics

Abstract

The Hirsch index is a number that synthesizes a researcher’s output. It is defined as the maximum number h such that the researcher has h papers with at least h citations each. Woeginger (Math Soc Sci 56: 224–232, 2008a; J Informetr 2: 298–303, 2008b) suggests two axiomatic characterizations of the Hirsch index using monotonicity as one of the axioms. This note suggests three characterizations without adopting the monotonicity axiom.

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

This paper offers three axiomatic characterizations of the Hirsch (2005) index; see Wikipedia (2008) for a discussion of advantages and criticisms of the Hirsch index. The three differ from Woeginger’s (2008a) characterization in requiring fewer axioms (three instead of five) and in dispensing with the axiom on which Woeginger’s result hinges conceptually: monotonicity (more citations or papers do not lower the index).

Definitions and axioms

Let $\mathbb{N}$ be the set of non-negative integers and R the set of non-negative real numbers. Members of $\mathbb{N}$ represent both the number of papers of a given researcher and the number of citations that a paper can receive. Define X to be the set of all vectors x = (x ₁, x ₂,…,x _n) such that n ∈ $\mathbb{N}$\{0} and x ₁ ≥ x ₂ ≥$\ldots$≥ x _n. For x ∈ X: (i) d _x is the number of components of vector x (the dimension or size of x); (ii) c _x is the number of components of vector x different from 0; (iii) for i ∈ {1,…,d _x}, x _i is the ith component of vector x and stands for the total number of citations of paper i; and (iv) $ x^{\Sigma } = x_1 + x_2 + \cdots + x_{{d_{n} }} $ is the sum of the d _x components of x (the weight of x). With ∅ designating the empty vector (the no paper case), a researcher’s output will be represented by a member of D = X ∪ {∅}. For x = ∅ the convention is that $ c_{x} = d_{x} = min {\left\{ {x_{1} , \ldots ,x_{{d_{n} }} } \right\}} = 0. $

For x ∈ X and y ∈ X: (i) the distance δ(x, y) between x ∈ X and y ∈ X is defined as δ(x, y) = max{x ^Σ, y ^Σ} − min{x ^Σ, y ^Σ}; and (ii) x ≥ y holds if, and only if, d _x ≥ d _y and, for all i ∈ {1,…,d _y}, x _i ≥ y _i. With respect to the empty vector ∅: (i) for all x ∈ X, δ(x, ∅) = δ(∅, x) = x ^Σ; and (ii) for all x ∈ X, x ≥ ∅. Define D ₀ = {x ∈ D: d _x = 0} = {∅} and, for n ∈ $\mathbb{N}$\{0}, D _n = {x ∈ D: d _x = n}.

Definition 1

A research output index (or index, for short) is a mapping f: D → R.

Woeginger (2008a, p. 225) defines an (impact) index as a mapping f: D → $\mathbb{N}$ satisfying the monotonicity property MON and such that, for all x ∈ X with c _x = 0, f(x) = 0.

MON. For all x ∈ D and y ∈ D, x ≥ y implies f(x) ≥ f(y).

The definition of an index as an integer-valued mapping is restrictive because it excludes reasonable indices like the average citation index. In addition, assuming f(x) = 0 when c _x = 0 and d _x ≥ 1 is also restrictive because an index need not always be interpreted as an impact index: viewed as a research output index, it is not unreasonable to attribute value to the production of papers and make f(0,…,0) ≠ 0. Finally, Woeginger (2008a, p. 227) stresses that his axioms should be interpreted within the context of MON. Though it is difficult to question MON as a desirable property of an index, it may be worth approaching the characterization of the Hirsch index without constraining the choice of axioms by their connection with MON.

Definition 2

The Hirsch index is the research output index h such that f(∅) = 0 and, for all x ∈ X, h(x) = max{n ∈ {0, 1,…,c _x}: x _n ≥ n}.

A1. For all x ∈ X, if c _x = d _x then $ min {\left\{ {min {\left\{ {x_{1} , {\ldots}, x_{{d_{x} }} } \right\}, } { d_{x}}} \right\}} \leq f(x) \leq d_{x}.$

A1 sets upper and lower bounds to the index in the case in which all the papers are cited: on the one hand, the index cannot be greater than the number d _x of papers; and, on the other, the index is, as long as this is consistent with the previous constraint, not smaller than the smallest number of citations. A1 establishes that the index is bounded above by size and bounded below by the smallest magnitude between size and the minimum contribution to the weight of the output.

A2. For all n ∈ $\mathbb{N}$, x ∈ D _n and y ∈ D _n+1, if y ≥ x and $ f(y) > f(x) = max {\left\{ {f(z)} \right\}}_{{z \in D_{n} }} $ then δ(x, y) > c _x.

Suppose x is an output with size n reaching the maximum index that size n allows and that x is subsequently expanded by gaining weight (the number of citations of existing papers) or size (by adding another paper, possibly receiving some citation). Suppose this output expansion generates an increase of the index. By A2, the weight necessary to achieve this must be higher than the number c _x of cited papers in x; that is, if the maximum index reachable in D _n requires all papers to be cited, the new output y must have more than n citations more than x. Roughly speaking, if more citations and one more paper rise the index of an output already achieving the maximum index in the domain of outputs with n papers then more than n citations must have been necessary. This suggests that, once the maximum index in a size category has been reached, a further increase in the index by jumping to the next size category demands adding at least the equivalent to one citation to each cited paper.

It may appear that A2 brings an index very close to the Hirsch index. Nonetheless, A2 does not imply MON: the index f(x) = 1/(1 + h(x)) satisfies A2 but not MON.

A2 can be generalized to a family of axioms of the sort “if y ≥ x and f(y) > f(x) then δ(x, y) > c(x, y)”, for any given c:D × D → R. For instance, the use of the constant function c(x, y) = 0 seems to point to indices in which each citation counts, as occurs, for instance, with the index generating the average number of citations.

A2₁. For all n ∈ $\mathbb{N}$, x ∈ D _n and y ∈ D _n+1, if y ≥ x and $max {\left\{ {f(z)} \right\}}_{{z \in D_{{n + 1}} }} = f(y) > f(x)$ then δ(x, y) > c _x.

A2₁ is a version of A2 in which it is not the initial output x that is required to reach the highest index within the set of outputs of its size but the final output y.

A2₂. For all n ∈ $\mathbb{N}$, k ∈ $\mathbb{N}$\{0}, x ∈ D _n and y ∈ D _n+k, if y ≥ x and $ max {\left\{ {f(z)} \right\}}_{{z \in D_{{n + k}} }} = f(y) > f(x) = max {\left\{ {f(z)} \right\}}_{{z \in D_{n} }}$ then δ(x, y) > kc _x.

A2₂ is less general than A2 in forcing both inputs to reach the maximum index in their respective category sizes but is more general in relating several sizes. In this respect, A2₂ is, in a way, a transitive version of A2: if, under the given constraints, going from size n to size n + 1 takes more than n citations, then going from size n to n + k must take more than kn citations. The results presented next give an impression that, to a certain extent, A2, A2₁ and A2₂ are exchangeable conditions, with A2₁ and A2₂ being closer substitutes for each other than A2. For n ∈ $\mathbb{N}$\{0} and x ∈ D _n, x _−n = (x ₁,…,x_n−1) is the member of D _n−1 obtained from x by deleting the last component x _n of x.

A3. For all n ∈ $\mathbb{N}$\{0} and x ∈ D _n, if $ f(x) \ne max {\{ {f(y)} \}}_{{y \in D_{n} }} $ then f(x) = f(x _−n).

By A3, if an output without minimum size is not achieving the maximum index corresponding to its size then losing one paper should not affect the index. A3 can be viewed as a weak version of paper monotonicity, because it identifies a situation in which having one paper more does not lower the index: when the addition of another paper does not make the resulting output attain the maximum index associated with its size, then the paper is worthless in the sense that its presence or absence does not modify the index. Even seen as a monotonicity property, A3 is weaker than MON, that expresses both paper and citation monotonicity.

A4. For all x ∈ X, and letting n = d _x, if f(x) = f(x ₁,…, x _n−1) then, for all k such that 0 ≤ k ≤ x _n, f(x ₁,…,x _n−1) = f(x ₁,…,x _n−1, k) and f(x) = f(x ₁,…,x _n, k).

A4 is a sort of independence condition: if adding a paper with r citations does not alter the index, then adding another paper with r or fewer citations produces the same effect in both the initial output and in the one obtained after including the paper with r citations. In consequence, if a certain change does not affect a given output then a smaller change never affects a larger output.

Results

Remark 3

The Hirsch index satisfies A1, A2, A2₁, A2₂, A3 and A4.

A1 is an immediate implication of the definition of the Hirsch index. Notice that, for all n ∈ $\mathbb{N}$, $max \{h(y)\} _{{y \in D_{n} }} = n.$ Concerning A2, if n ∈ $\mathbb{N}$, x ∈ D _n, y ∈ D _n+1, y ≥ x and $h(y) > h(x) = max \{ h(z)\} _{{z \in D_{n} }}$ then h(x) = n and h(y) ≥ n + 1, so paper n + 1 must receive at least n + 1 citations in y, which implies δ(x, y) > n = c _x. As for A2₁, if n ∈ $\mathbb{N}$, x ∈ D _n, y ∈ D _n+1, y ≥ x and $ max \{ h(z)\} _{{z \in D_{n+1} }} = h(y) > h(x)$ then h(y) = n + 1 and h(x) ≤ n, so paper n + 1 must receive at least n + 1 citations in y, which implies δ(x, y) > n ≥ c _x. With respect to A2₂, if n ∈ $\mathbb{N}$, x ∈ D _n, y ∈ D _n+k, y ≥ x and $ max \{ h(z)\} _{{z \in D_{{n + k}} }} = h(y) > h(x) = max \{ h(z)\} _{{z \in D_{n} }}$ then h(y) = n + k and h(x) = n, so papers n + 1,…,n + k must each receive at least n + k citations in y. Therefore, δ(x, y) ≥ k(n + k) > kn ≥ kc _x. As regards A3, it follows from $ f(x) \ne max \{ h(y)\} _{{y \in D_{n} }}$ that x _n < n. This makes the number x _n of citations of the last paper irrelevant to compute h(x) and, accordingly, h(x) = h(x _−n). With respect to A4, h(x) = h(x ₁,…,x _n−1) means that x _n ≤ h(x ₁,…,x _n−1). Hence, adding to both (x ₁,…,x _n−1) and x another paper having at most x _n citations cannot increase the Hirsch index.

Proposition 4

With α ∈ {1, 2}, an index f satisfies A1, A2_α and A3 if, and only if, f is the Hirsch index.

Proof

“⇐”Remark 3. “⇒”With α ∈ {1, 2}, let f be an index satisfying A1, A2_α and A3. Step 1: f agrees with the Hirsch index on D ₀. Since the only member of D ₀ is x = ∅ and since $ d_{x} = min \{ x_{1} , \ldots ,x_{{d_{x} }} \} = 0,$ by A1, f(∅) = 0 = h(∅).

Step 2: f agrees with the Hirsch index on D ₁. Let x ∈ D ₁. Case 1: x ₁ ≥ 1. By A1, f(x) = 1. Case 2: x ₁ = 0. Case 2a: $ f(x) \ne max \{ f(z)\} _{{z \in D_{1} }}. $ Since x ∈ D ₁, x ₋₁ = ∅. By A3, f(x) = f(x ₋₁) = 0 = h(x). Case 2b: $ f(x) = max \{ f(z)\} _{{z \in D_{1} }}.$ Let y = ∅. By step 1, $ f(y) = max \{ f(z)\} _{{z \in D_{0} }}=0. $ Case 2b1: A2₁ holds. Then y ∈ D ₀, x ∈ D ₁, $ max \{ f(z)\} _{{z \in D_{1} }} = f(x) $, x ≥ y and δ(y, x) = 0 ≤ c _y = 0. By A2₁, f(x) ≤ f(y) = 0. Since f(x) ≥ 0 by definition of index, f(x) = 0 = h(x). Case 2b2: A2₂ holds. Then y ∈ D ₀, x ∈ D ₁, $ f(x) = max \{ f(z)\} _{{z \in D_{1} }} ,f(y) = max \{ f(x)\} _{{z \in D_{0} }} , $ x ≥ y and δ(y, x) = 0 ≤ c _y = 0. By A2₂ when k = 1, f(z) ≤ f(y) = 0. Hence, f(x) = 0 = h(x).

Step 3: for n ∈ $\mathbb{N}$\{0, 1}, f agrees with the Hirsch index on D _n. Choose n ∈ $\mathbb{N}$\{0, 1} and, by steps 1 and 2, suppose that, for all k ∈ {0, 1,…,n - 1}, f agrees with the Hirsch index on D _k. To prove that f agrees with the Hirsch index on D _n, choose x ∈ D _n. Let h = h(x). Case 1: h = n. This means that, for all i ∈ {1,…,n}, x _i ≥ n. Hence, c _x = d _x = n and, by A1, f(x) = d _x = n = h. Case 2: h < n. By the induction hypothesis, f(x _−n) = h(x _−n). As h(x) = h < n, it follows that x _n ≤ h and, thus, h(x _−n) = h(x). In sum, f(x _−n) = h.

Case 2a: $f(x) \ne max \{ f(z)\} _{{z \in D_{n} }}$. By A3, f(x) = f(x _−n) = h = h(x). Case 2b: $f(x) = max \{ f(z)\} _{{z \in D_{n} }}$. Let k ∈ {2,…,n} and y ∈ D _k satisfy, for all i ∈ {1,…,k}, y _i ≥ k. By A1, f(y) ≥ min{min{y ₁,…,y _k}, k} = k. The Hirsch index is such that, for all r ∈ $\mathbb{N}$, $max \{ h(z)\} _{{z \in D_{r} }} = r.$ Given f(y) ≥ k, by the induction hypothesis, $f(v) = max \{ f(z)\} _{{z \in D_{k} }}$implies f(v) = k. As a consequence, for all k ∈ {2,…,n},

$$ max\left\{ {f\left( z \right)} \right\}_{{z \in D_{k} }} = k. $$

(1)

Case 2b1: α = 1. By (1), $ max \{ f(z)\} _{{z \in D_{n} }} = f(x)$ implies f(x) > f(x _−n). As a result, x _−n ∈ D _n−1, x ∈ D _n, x ≥ x _−n and max{f(z)}_z∈Dn = f(x) > f(x _−n) imply, by A2₁, $ {\delta}(x_{-n}, x) > c_{x_{-n}} \ge h.$ But δ(x _−n, x) = x _n and, since h(x _−n) = h, x _n ≤ h: contradiction.

Case 2b2: α = 2. Let v ∈ D _h satisfy, for all i ∈ {1,…,h}, v _i = x _i. By A1, f(v) = h. By (1), $f(v) = max \{ f(z)\} _{{z \in D_{h} }}.$ Let r = n − h. For t ∈ {1,…,r}, let x ^t ∈ D _h+t satisfy, for all i ∈ {1,…,h + t}, $ x_{i}^{t} = x_{i} . $ It follows from h(x) = h that, for all i ∈ {1,…,h + t}, x _i ≤ h = c _v. Given this, the fact that x ^r = x implies δ(v, x) ≤ rh ≤ rc _v. Summarizing, v ∈ D _h and x ∈ D _h+r are such that x ≥ v, and δ(v, x) ≤ rc _x. By A2₂, f(x) ≤ f(v). Hence, f(x) ≤ f(v) = h < n, which contradicts f(x) = max{f(z)}_z∈Dn = n.□

Remark 5

Neither A2₁ nor A2₂ can be replaced by A2 in Proposition 4: an index f satisfying A1, A2 and A3 need not be the Hirsch index, as Example 6 proves.

Example 6

Let f be the index such that f(3, 1, 1) = 3 and, for all x ∈ D\{(3, 1, 1)}, f(x) = h(x). Whereas f satisfies A1, A2 and A3, it is not the Hirsch index.

Proposition 7

An index f satisfies A1, A2 and A4 if, and only if, f is the Hirsch index.

Proof

“⇐”Remark 3. “⇒” Let f be an index satisfying A1, A2 and A4. Step 1: f agrees with the Hirsch index on D ₀. Since the only member of D ₀ is x = ∅ and since $ c_{x} = d_{x} = min \{ x_{1} , \ldots ,x_{{d_{x} }} \} = 0 $, by A1, f(∅) = 0 = h(∅). Step 2: f agrees with the Hirsch index on D ₁. Let x ∈ D ₁. By A1, min{x ₁, 1} ≤ f(x) ≤ 1. Thus, x ₁ ≥ 1 implies f(x) = 1 = h(x). If x ₁ = 0 then let y = ∅. By step 1, $ f(y) = max \{ f(z)\} _{{z \in D_{0} }} = 0.$ In addition, x ≥ y and δ(y, x) = 0 < c _y = 0. By A2, f(x) ≤ f(y) = 0. By definition of index, f(x) ≥ 0. In sum, f(x) = 0 = h(x).

Step 3: for n ∈ $\mathbb{N}$\{0, 1}, f agrees with the Hirsch index on D _n. Choose n ∈ $\mathbb{N}$\{0, 1} and, by steps 1 and 2, suppose that, for all k ∈ {0, 1,…,n − 1}, f agrees with the Hirsch index on D _k. To prove that f agrees with the Hirsch index on D _n, choose x ∈ D _n. Let h = h(x). Case 1: h = n. This means that, for all i ∈ {1,…,n}, x _i ≥ n. Hence, c _x = d _x = n and, by A1, f(x) = d _x = n = h. Case 2: h < n. Let v ∈ D _h satisfy, for all i ∈ {1,…,h}, v _i = x _i. By A1, f(v) = h. The Hirsch index is such that, for all r ∈ $\mathbb{N}$, $max \{ h(z)\} _{{z \in D_{r} }} = r.$ By A1, the induction hypothesis and $ f(v) = h,max \{ f(z)\} _{{z \in D_{h} }} = h.$ Let r = n − h. For t ∈ {1,…,r}, let x ^t ∈ D _h+t satisfy, for all i ∈ {1,…,h + t}, $ x_{i}^{t} = x_{i} . $ It follows from h(x) = h that, for all i ∈ {h + 1,…,n}, x _i ≤ h. Define w to be the member of D _h+1 such that w _h+1 = h and, for all i ∈ {1,…,h}, w _i = v _i. Then v ∈ D _h, w ∈ D _h+1, w ≥ v, $ f(v) = max \{ f(z)\} _{{z \in D_{h} }} $ and δ(w, v) = h = c _v. Therefore, by A2, f(w) ≤ f(v) = h. By A1, f(w) ≥ h. Consequently, f(w) = h = f(v). Given this, by A4, f(v) = f(x ¹). This result, by A4, yields f(x ¹) = f(x ²). By repeated application of A4, for all t ∈ {1,…, r − 1}, f(x ^t) = f(x ^t+1). Summing up, h = f(v) = f(x ¹) = ··· = f(x ^r) = f(x).□

Remark 8

Examples 9, 10 and 11 prove that no axiom in Propositions 4 and 7 is redundant.

Example 9

Let f be the index such that, for all x ∈ D, f(x) = 1 + h(x). Then f satisfies A2, A2₁, A2₂, A3 and A4; does not satisfy A1; and is not the Hirsch index.

Example 10

Let f be the index such that, for all x ∈ D, f(x) = d _x. Then f satisfies A1, A3 and A4; satisfies neither of A2, A2₁ and A2₂; and is not the Hirsch index.

Example 11

Let f be the index such that, for all x ∈ D, f(x) = h(x) − 1 if $min \{ x_{1} , {\ldots} , x_{d_{x}} \} < h(x) < d_x $ and f(x) = h(x) otherwise. Then f satisfies A1, A2, A2₁ and A2₂; satisfies neither A3 nor A4; and is not the Hirsch index.

Concluding comments

Woeginger (2008b, p. 301) provides another characterization of the Hirsch index, on the domain of integer-valued indices, in which monotonicity is still assumed and an interesting symmetry axiom is postulated. For x = (x ₁,…,x _n) ∈ D, Woeginger defines the reflection R(x) of x to be the vector (y ₁,…,y _k) such that k = x ₁ and y _i is the number of components in x whose value is not smaller than i. For instance, if x = (7, 2, 2, 1, 0) then R(x) = (4, 3, 1, 1, 1, 1, 1). The symmetry axiom holds that the value of the index should be preserved under reflections: f(x) = f(R(x)). As a result, papers and citations are exchangeable variables through reflection.

One of the referees recommends mentioning Quesada (2008) as another characterization of the Hirsch index relying as well on monotonicity. This paper axiomatizes the Hirsch index, on the domain of real-valued indices, using monotonicity and another two axioms (Woeginger 2008b assumes six). The first axiom strengthens A1 by requiring that $ min \{ min \{ x_{1} , {\ldots} ,x_{{d_{x} }} \} ,c_{x} \} \leq f(x) \leq min \{ max \{ x_{1} , {\ldots} ,x_{{d_{x} }} \} ,d_{x} \}. $ The second axiom can be viewed as another monotonicity-type property and bears some resemblance to A4: if f(x ₁,…,x _n) = f(y ₁,…,y _m) and f(x ₁,…,x _n, a) > f(x ₁,…,x _n) then f(y ₁,…,y _m, a) > f(y ₁,…,y _m), provided that (y ₁,…,y _m, a) is a well-defined output. This says that if the index does not distinguish between two outputs and the addition of another paper to one output causes an increase in the index then the same qualitative effect should arise from the addition of the same paper to the second output.

The resulting characterization seems to indicate that the Hirsch index can be obtained by postulating sufficiently strong monotonicity requirements and by imposing appropriate bounds to that monotonicity. Propositions 4 and 7 can be seen as obtained from the strategy of weakening monotonicity and, in exchange, adopting independence conditions stating when the index should remain unaltered: whereas A1 is the axiom setting the bounds, the A2 axioms express a necessary condition for the index to be monotonic in a particular case and A3 and A4 are independence axioms identifying changes in a research output that should not affect the index.

References

Hirsch, J. E. (2005). An index to quantify an individual’s scientific research output. Proceedings of the National Academy of Sciences, 102(46), 16569–16572.
Article Google Scholar
Quesada, A. (2008), Monotonicity and the Hirsch index. Journal of Informetrics (to appear).
Wikipedia (2008), h-index, http://en.wikipedia.org/wiki/Hirsch_index, accessed the 9th of December, 2008.
Woeginger, G. J. (2008a). An axiomatic characterization of the Hirsch-index. Mathematical Social Sciences, 56(2), 224–232.
Article MATH MathSciNet Google Scholar
Woeginger, G. J. (2008b). A symmetry axiom for scientific impact indices. Journal of Informetrics, 2(3), 298–303.
Article Google Scholar

Download references

Acknowledgements

Financial support from the Spanish Ministerio de Educación y Ciencia under research project SEJ2007-67580-C02-01 and from the Departament d’Universitats, Recerca i Societat de la Informació (Generalitat de Catalunya) under research project 2005SGR-00949 is gratefully acknowledged. Many thanks to the two reviewers of this paper and to the Editor in Chief, Tibor Braun.

Author information

Authors and Affiliations

Departament d’Economia, Universitat Rovira i Virgili, Avinguda de la Universitat 1, 43204, Reus, Spain
Antonio Quesada

Authors

Antonio Quesada
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Antonio Quesada.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Quesada, A. More axiomatics for the Hirsch index. Scientometrics 82, 413–418 (2010). https://doi.org/10.1007/s11192-009-0026-x

Download citation

Received: 10 November 2008
Accepted: 30 December 2008
Published: 12 June 2009
Issue Date: February 2010
DOI: https://doi.org/10.1007/s11192-009-0026-x

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

More axiomatics for the Hirsch index

Abstract

Introduction

Definitions and axioms

Definition 1

Definition 2

Results

Remark 3

Proposition 4

Proof

Remark 5

Example 6

Proposition 7

Proof

Remark 8

Example 9

Example 10

Example 11

Concluding comments

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation