Introduction

The study of acknowledgments in research publications is gaining ground in science. At present, acknowledgments can be considered an important element in the scholarly communication process, forming part of the so-called “reward triangle” together with authorship and citations (Cronin and Weaver 1995). Acknowledgments may contain expressions of gratitude to individuals, institutions or research infrastructures that have contributed to the work being reported, as well as formal recognition to research funders. Accordingly, the study of acknowledgments is of interest not only to better understand communication practices in science, but also for research policy purposes (Paul-Hus et al. 2016; Rigby 2011).

The collection of funding acknowledgments merits special attention because it may facilitate a wide range of studies. The aims of such efforts include tracking the research output supported by funding bodies and specific grant and research programs (e.g. Boyack and Jordan 2011; Lewison 1994; Lyubarova et al. 2009), exploring the relationship between research impact and economic support (e.g. Harter and Hooten 1992; Lewison and Dawson 1998; Rigby 2013; Zhao 2010), identifying main research funders by country (e.g. Wang et al. 2012), investigating the strategic scope of a funding body (e.g. Wang and Shapira 2011), identifying funders of the most cited papers (Giles and Councill 2004) or studying interactions between the public and private sectors (Morillo 2016), among other topics.

Since 2008, when funding acknowledgments were first included in Web of Science (WoS), the number of bibliometric studies focused on this source of information has increased. From studies at the micro level which analyze small units such as journals (see, for example, Cronin and Shaw 1999; Salager-Meyer et al. 2011; Zhao 2010), research has moved to studies of disciplines (see for example, Díaz-Faes and Bordons 2014; Wang and Shapira 2011) or countries (see, for example, Wang et al. 2012), and to macro-level studies which focus on worldwide scientific output (e.g. Costas and Van Leeuwen 2012). Regardless of the level of analysis, however, the results of these studies may be affected by technical limitations in how databases process information, as well as by shortcomings or limitations in how authors provide their acknowledgments. Concerning the latter, it has been noted that authors do not always acknowledge their funders (Costas and Yegros-Yegros 2013; Grassano et al. 2017; Koier and Horlings 2015), and that cultural and political issues may have an influence on the way in which authors acknowledge their support (Rigby 2011). It has been also shown that some acknowledgments retrieved from publications may be unknown to the corresponding funders, probably because some articles are published after finalization of the project and not included in the final report submitted to the funder (Costas and Yegros-Yegros 2013). This potential source of uncertainty supports the usefulness of publication-based studies.

The analysis of funding data is a difficult task because the data are usually included in the article’s acknowledgments, which is a non-structured section with heterogeneous content comprising technical, ethics-related or intellectual support in addition to financial assistance (Cronin 1995). Early studies focused mainly on personal acknowledgments and usually relied on manual procedures for data analysis (e.g. Cronin et al. 1993). More recently, the inclusion of acknowledgments in databases such as WoS (since August 2008) or Scopus (since June 2013)Footnote 1 has made automatic extraction of data possible, but the heterogeneous nature of the field remains an obstacle to their systematic exploitation. Nevertheless, due to funding body requirements, funding acknowledgments are increasingly included in publications and additional efforts are needed to assess the reliability of the data collected from this source, which is crucial for the trustworthiness of subsequent analytical studies.

In the WoS database, funding acknowledgments are captured if they meet the following criteria.Footnote 2

  • Document type: only for articles and reviews.

  • Indexing: included in Science Citation Index Expanded since 2008 and in the Social Sciences Citation Index since 2015.

  • Language: funding statement information is in English.

The paragraph containing funding information in the original publication is recorded in a field named “funding text” (FX). Moreover, the funding organizations and grant numbers are extracted and recorded in two separate subfields: “funding agency” (FA) and “grant number” (GN). An illustrative example is shown in Fig. 1.

Fig. 1
figure 1

Example of the funding acknowledgment field (FX) and subfields FA and GN in a WoS article

Note “FA–GN entry” is our own term to designate the pair of FA and GN subfields corresponding to a specific source of financial support

In principle, the extraction of funders from the full text is a valuable improvement which may enhance funding acknowledgment studies. However, a number of factors remain that constitute an obstacle to the reliability of studies for science policy purposes. The lack of standardization of funder names, which can appear in different languages, the coexistence of long, abridged and abbreviated forms, and even spelling errors, makes it difficult to obtain reliable data on the presence of funders in the acknowledgment field (Sirtes 2013). Moreover, WoS errors in the collection and processing of FA data can be an important potential hindrance to FA analyses. Since the reliability of acknowledgment-based studies depends to a great extent on the truthfulness and accuracy of the collected data, better knowledge of how funding information is collected and processed by Clarivate Analytics is needed to assess the validity of WoS-based analyses and their usefulness for research policy studies. In this connection, a few recent studies analyzed the scope and coverage of the funding acknowledgment data collected by WoS (Paul-Hus et al. 2016; Tang et al. 2017). Moreover, the recall and precision of WoS in extracting funding acknowledgments from publications has been analyzed in specific areas such as climate change (Koier and Horlings 2015) and cancer research (Grassano et al. 2017). According to Koier and Horlings, some loss of information by WoS in the recognition of acknowledgments in the original publications was observed in 24% of the articles, while inaccuracies in the identification of individual funders appeared in another 12%. The study by Grassano et al. (2017) reported recall and precision values above 90%, although the authors suggest that differences among research domains may exist.

In this context, our study seeks to improve our knowledge about different aspects of the “quality” of funding data collected by WoS through the analysis of a sample of publications from different research domains. Our sample is selected from the scientific output of Spain, since the study is part of a research project funded by the Spanish Ministry of Economy and Competitiveness aimed at analyzing funding presence in Spanish scientific publications. Some preliminary results of this research were presented elsewhere (Álvarez-Bornstein et al. 2016).

Objectives

The main objectives addressed in this article are the following:

  • To explore WoS completeness in extracting data from the “funding text” (FX) field as well as the accuracy of the distribution of these data by FA and GN subfield.

  • To analyze the extent to which funding information is complete regarding two important aspects: whether both FA and GN for a given source of support are included, and whether the country of origin of the funding is mentioned.

Our ultimate objectives are to propose recommendations intended to improve the way in which authors and databases include funding information, and in the long term, to enhance the reliability and usefulness of studies based on funding acknowledgments.

Methodology

This study focuses on the scientific output of Spain covered by WoS in 2014, which is the most recent year analyzed in the aforementioned project. Ten broad thematic areas were considered by aggregation of WoS categoriesFootnote 3: Agriculture, Biology and Environment; Biomedicine; Clinical Medicine; Chemistry; Engineering and Technology; Humanities; Mathematics; Multidisciplinary; Physics; and Social Sciences. Articles not written in English and those published in Social Sciences and Humanities journals were excluded from the study because their funding acknowledgments were not captured by WoS in 2014. Regarding the remaining areas, the scientific output of Spain-based researchers in 2014 accounted for 38,409 articlesFootnote 4 published in English with funding acknowledgments (76% of the total). A stratified random sampling method was used to ensure that the eight broad thematic areas were well represented. A random sample of articles from each area was obtained in a number proportional to the area’s size compared to the population (total Spanish output), that is, a proportional allocation design was used, with a confidence level of 95% and a sampling error of 3% (Fig. 2). A sample of 1045 articles was selected and used for this analysis.

Fig. 2
figure 2

Distribution of the sample of articles by broad thematic area (WoS 2014) (N = 1045 articles with funding acknowledgments)

The following aspects were analyzed:

  1. 1.

    Completeness of WoS in extracting funding data from the FX field.

    Manual review of the data collected by WoS in the FA and GN subfields was carried out and compared to the data included in the full FX field to obtain

    • The percentage of articles in which all funders and grants mentioned by the authors in FX were included by WoS in FA and/or GN.

    • The percentage of articles with any lost information, specifying whether it concerned funders, grants, or both.

  2. 2.

    Accuracy in the distribution of funding data in subfields FA and GN

    In theory, each grant is allocated by the database to an FA–GN entry, including the name of the funder in the FA subfield and grant identification in the GN subfield. Accordingly, in “ordinary” cases (one funder = one grant), one would expect to find as many FA–GN entries as research grants in a given article, so the number of FA–GN entries might be a good proxy for the number of grants. However, we observed that some GN subfields included more than one grant, usually because they were supported by the same funder, whereas in some FA subfields more than one funder was recorded, frequently as a result of co-funded grants. Accordingly, we aimed to address the question of how the information was distributed in these “special” cases (co-funded grants and multiple grants from the same funder). For example, does WoS always include co-funded grants in a single FA–GN entry, or might they be distributed across different entries? The same question applies for multiple grants from a given funder. In order to answer this question the following measures were obtained:

    • For the set of articles with co-funded grants (see the example in Fig. 3), the proportion of articles with co-funders in a single FA–GN entry (e.g. article A in Fig. 3), as well as the proportion of articles with co-funders in different entries (e.g. article B in Fig. 3) was calculated.

      Fig. 3
      figure 3

      Two examples of extraction of funding data from the FX field to the FA and GN subfields in the case of co-funded grants

    • For the set of articles with more than one grant from the same funder (see the example in Fig. 4), the proportion of articles in which these grants were included in a single FA–GN entry (e.g. article A in Fig. 4), as well as the proportion of articles in which these grants appeared in different entries (e.g. article B in Fig. 4) was calculated.

      Fig. 4
      figure 4

      Two examples of extraction of funding data from the FX field to the FA and GN subfields in the case of multiple grants from the same funder

    Two different goals were pursued: to determine whether the number of FA-GN entries is a good proxy for the number of grants, and to explore the consistency of WoS in their coverage of funding data in “special cases” (multiple grants from the same funder and co-funded grants).

  3. 3.

    Completeness of the data in FA-GN entries.

    For the purposes of this study, we consider “completely identified funding grants” as those which were identified by both a funder name and a grant number. If only one of these elements was provided, we considered them as “incompletely identified funding grants”. To investigate this issue, the following measures were obtained:

    • The percentage of FA–GN entries which showed only FA data (GN was empty).

    • The percentage FA-GN entries which showed only GN data (FA was empty).

    • The percentage of FA-GN entries with both FA and GN data.

  4. 4.

    Identification of the funders’ country of origin.

    Although we considered a grant to be “completely identified” if data were provided for both the FA and GN subfields, the country of origin of the funder(s) can be extremely important to avoid confusion between funders with the same name in different countries (e.g. there is a National Science Foundation in USA, but also in other countries such as Switzerland). The fact that authors very often tend to translate the original name of their funding bodies into English in articles written and published in English increases the likelihood of confusion. Moreover, the chance of misunderstanding increases if only an acronym is mentioned. Accordingly, we calculated the percentage of FA in which the country of origin of the funding body was clearly stated as a country name or an adjectival form. Thus, examples of clear identification include “Swiss National Science Foundation” or “National Science Foundation, Switzerland”. This indicator was calculated separately for Spanish and foreign funders, to explore potential differences in the tendency of authors to mention the funder’s country of origin. As a preliminary step to obtain these data, the FA subfield was examined manually and funders were characterized as “foreign” or “Spanish” as appropriate. Doubtful cases were resolved through web searches.

The statistical analyses used to study potential between-area differences in the behavior of funding-related variables included contingency tables, and Kruskal–Wallis and Mann–Whitney tests for non-parametric variables (P < 0.05) (SPSS version 22).

Results

For the 1045 articles we analyzed, a total of 2924 FA–GN entries were collected, i.e. a mean of 2.8 FA–GN entries per article, ranging from 2.5 in Mathematics to 3.4 in the Multidisciplinary area (see detailed data in “Appendix” section). Differences between areas were statistically significant (P < 0.05).

Completeness of WoS in extracting funding data from the FX field

The rate of completeness of WoS in extracting funding information was quite high, since all funders and grants included in the FX field were extracted to the FA and/or GN subfields in 87.8% of the articles. The rate of completeness ranged from 94.3% in the Multidisciplinary area to 85.1% in Engineering/Technology and Biomedicine, but between-area differences were not statistically significant. We wondered whether longer FX fields, which supposedly contain more complex texts, were less well covered than shorter ones. This was indeed the case, since the subset of articles with lost data had significantly longer FX fields (higher number of characters) than articles for which all the information was collected (Z = −4.023, P < 0.001).

Concerning lost information, a funding grant was entirely lost (neither the funder nor the grant number provided in the full text was shown in FA/GN) in 74 articles (7.1% of articles), and was partially lost in other cases in which only the funder (28 articles, 2.7%) or only the grant number (32 articles, 3.1%) was missing. Overall, information was missing in 12.2% of the articles (128 articles).Footnote 5

Accuracy in the distribution of funding data by subfield

Co-funded grants were found in 214 articles (20.5%). In 24.8% of them, different co-funders were recorded in a single FA–GN entry while in 75.2% they were recorded in different entries (Table 1).

Table 1 Distribution of funding data in two special cases: co-funded grants and multiples grants from a given funder

Multiple grants from the same funder were detected in 341 articles (32.6%) (Table 1). These grants tend to be mentioned together in a single FA–GN entry (61.6%) (see, for example, Article A in Fig. 4) but sometimes they were separated in different FA-GN entries (24.9%) (see Article B in Fig. 4). In a small proportion of articles (13.5%) multiple grants mentioned in a single entry co-existed with other(s) separated in different entries.

Focusing on thematic areas, the proportion of articles with co-funded grants ranged from 14.3% in the Multidisciplinary area to 27.6% in Chemistry (with no significant differences by area), whereas the rate of multiple grants from a given funder ranged from 14.6% in Mathematics to 41.6% in Engineering/Technology (P < 0.05). Interestingly, no between-area differences were observed in the way these data were distributed across subfields by WoS in the case of co-funded grants (Fig. 5) or in the case of multiple grants from the same funder (Fig. 6).

Fig. 5
figure 5

Distribution of funding data in the case of co-funded grants, by thematic area

Fig. 6
figure 6

Distribution of funding data in the case of multiple grants from the same funder, by thematic area

Finally, different types of misallocation of data were observed, such as including program or project names instead of the funder in the FA subfield (9.2% of the FA subfields and 20.1% of the articles contained at least one program or project instead of a funder), or splitting a funder’s name across two or more FA–GN entries as if there were more than one funding agency (observed in 2.7% of the articles) (see Fig. 7 for an example).

Fig. 7
figure 7

Example of a single funder incorrectly split into two FA–GN entries

Completeness of Data Collected in FA-GN Entries

Around half of the FA–GN entries (53.1%) were “completely identified”, that is, they included at least one funder and one grant number in the corresponding subfields. However, funders were present in almost all the FA–GN entries (96.6%), whereas grant numbers appeared in only 56.5% of the cases. In 43.5% of the entries only funders were provided, whereas grants appeared as the only information in only 3.3% of the entries (Table 2).

Table 2 Completeness of FA–GN entries

Turning to thematic areas, the highest proportion of complete FA–GN entries (with data in both FA and GN subfields) was observed in Mathematics (60.2%), whereas the lowest proportion was found in the Multidisciplinary area (41.2%). Interestingly, this latter area also had the highest share of entries that contained only grant data (9.2%) (Fig. 8). Overall, between-area differences in the completeness profile of FA–GN entries were statistically significant (P < 0.001).

Fig. 8
figure 8

Completeness of FA–GN entries by thematic area

Identification of the funder’s country of origin

Funders included in the FA subfield were classified as Spanish (61.3%) or foreign (36.9%) according to their country of origin (3.6% of funders were not classified because there were no data in FA or the data included did not correspond to any funder). For this classification, the funders included in FA were examined to look for the following evidence: first, a country name (e.g. “Spain” or “Italy”) or an adjectival form (e.g. “Spanish” or “Italian”) in the funder´s name; second, cities or regions which could be ascribed unequivocally to countries; and third, well-known funders or programs in Spain or other countries which could be ascribed unequivocally to their corresponding country or supranational region.

As seen in Table 3, the names of around 42% of Spanish funders and 50% of foreign funders included a country name or an adjectival form. In most of the remaining cases, we could determine the country of origin of the funder with variable degrees of difficulty and precision. In many cases the country was identifiable unequivocally when cities were mentioned, although caution was needed because of homonymous cities in different countries (for example, Cordoba is a city in Spain and also in Argentina). With regard to funding from Spain, the proportion of funders’ names that included a region but not a country was high (22.5%) probably because of research funding bodies administered by regional governments (e.g. the Generalitat Valenciana in the autonomous region of Valencia in Spain). Finally, some well-known funding institutions and programs could be ascribed to countries or supranational regions unequivocally. Examples included funders such as ANR in France, MIUR in Italy or MICINN in Spain,Footnote 6 as well as programs such as the European Framework Program in the European Union. We identified approximately 9.5% of Spanish funders and 22.1% of foreign funders by means of web searches.

Table 3 Distribution of funders by national/foreign origin and way of identification

We observed some differences among thematic areas in how the national or foreign origin of funders was identified (Figs. 9 and 10). The behavior of the Multidisciplinary area deserves special mention: on the one hand, a country name appeared in a high share of entries; on the other hand, the national or foreign origin of the funder was identifiable from the name of the institution or program less often than in the remaining areas (Figs. 9 and 10). In fact, in this area web searches were needed to identify the type of funder in many cases, especially for foreign funders. Overall, between-area differences were more significant for foreign funders, where the country or region was identifiable for around 60% of funders in Chemistry but only 40% of those in Mathematics (Fig. 10).

Fig. 9
figure 9

Distribution of Spanish funders by way of identification and thematic area

Fig. 10
figure 10

Distribution of foreign funders by way of identification and thematic area

Discussion and conclusions

This study focuses on the funding acknowledgment section of a sample of articles covered by the WoS to analyze the completeness of the database in extracting data from the FX field to the FA and GN subfields, its accuracy in the distribution of data by subfields, and the completeness of the collected data. Our analysis shows that there is room for improvement in both the completeness in the data extracting process (some funding information was lost in 12% of the articles), and in the accuracy of the subfield distribution of the data, which was not always consistent.

Completeness of WoS in extracting funding data

Our findings on the extraction of funding data from the FX subfield are consistent with those described by Koier and Horlings (2015), who observed some loss of information in the extraction of funders from the acknowledgment section in 12% of the articles in a study of the scientific output of two Dutch programs in climate change research. Interestingly, these authors also showed some loss of information in the previous stage, i.e. in the ability of WoS to recognize and retrieve funding acknowledgments from original papers. The fact that the location of funding acknowledgments in papers may vary from a general acknowledgment section to a specific funding section or even within a footnote hinders their recognition and subsequent extraction (Grassano et al. 2017). Further analysis of this issue was beyond the objectives of our study, but should be kept in mind in future studies of the contents of the acknowledgments.

Accuracy in the distribution of funding data by subfield

Our findings pinpoint some issues with inconsistency in the distribution of funding data by subfield. On one hand, the names of projects and programs are sometimes the only information included in the FA subfield, where it is assumed that funders will appear. In some cases this occurs because the funder’s name is not provided by the authors, as has also been detected in other cases in which the funder was mentioned, but was included by WoS in a separate entry. On the other hand, we found inconsistencies in the way that co-funded grants and multiple grants from the same funder were recorded, since sometimes they appeared in a single FA–GN entry, whereas on other occasions they were disaggregated into different entries. This means that the basic structure of “one FA–GN entry = one funding grant” was not always maintained. Therefore the number of FA–GN entries is not an accurate measure of the number of sources of funding, which is an important consideration from a bibliometric perspective. In fact, our data suggests that the number of FA–GN entries is a better proxy for the number of funders than for the number of grants, since the latter may be underestimated due the presence of multiple grants within a given FA–GN entry (in 256 articles comprising 24% of the 1045 papers analyzed here). Although the presence of multiple funders in a given FA has been also detected, and may reduce the validity of the number of FA–GN entries as a measure of the number of funders, this type of co-occurrence was less common (in 53 articles, i.e. 5% of the 1045 papers we analyzed).

Completeness of collected data

With regard to the rate of completeness of the FA and GN subfields, a given funder was almost always included in the FA subfield (97% of FA–GN entries) whereas specific grant information appeared in only about half of the entries. This suggests that authors very often provide incomplete information about their funding grants, giving priority to the funder over the grant number. However, other factors may also contribute to the incompleteness in the FA and GN subfields:

  • Some data may be lost by the database during the extraction process from the original paper or from the FX field. The first issue was not analyzed in this study, although we did find that the grant number was lost in the extraction from the FX field in around 10% of the papers (in 3.1% of the papers only the GN information was lost; in 7.1% both GN and FA data were lost).

  • Financial support might not be associated to a specific grant number in some cases, such in those involving “core funding” from institutional resources. Nonetheless, this type of funding is rarely acknowledged by authors (Grassano et al. 2017).

  • In some cases such as in co-funded grants, the basic structure “one FA–GN = one funding grant” was not preserved. Co-funders may be distributed in different FA–GN entries, whereas the grant number appears only once, so the additional GN subfield/s remain empty (in around 14% of the papers in our study).

These factors per se, however, cannot explain the high percentage of empty GN subfields, which in many cases can be attributed to the omission of information by the author. Our findings thus suggest a need to encourage authors to include more complete data in their funding acknowledgments.

National and foreign research funders

The classification of research funders under the category of “national” or “foreign” was a laborious task because very often the country of the funder was not mentioned in the acknowledgments: it was missing for 58% of Spanish funders and 50% of foreign funders. If the country was not specified, the geographic origin of funding was traced and identified in many cases on the basis of geographic data (regions, cities) and well-known national and international institutions or programs. Web searches were crucial for identifying the funder’s country of origin in the remaining 10–20% of the FA entries. In the light of these results, we stress that the lack of information about the funder´s country can be misleading, since a given funder may have the same or a very similar name in different countries. In this connection, incomplete or inaccurate information can hinder automatic counts and increase the probability of erroneous allocations. We suggest that the inclusion by authors of the country of origin of each research funder is highly desirable.

Differences by thematic area

Our data reveal that the completeness of WoS in covering funding information did not differ significantly among areas, whereas the completeness of FA–GN entries did vary, with the share of complete entries ranging from 41% in the Multidisciplinary area to 60% in Mathematics. This variation may be related to a number of factors such as differences in the number and/or diversity of funders by area, which might influence how authors include funding data as well as how the data are then collected by WoS. In fact, Mathematics was the area with the lowest average number of funders per paper, a situation which probably makes it easier for authors in this area to compose the funding acknowledgments, which are subsequently more likely to be collected accurately by the database. The Multidisciplinary area stands at the other end of the spectrum, with the greatest average number of funders per paper and the lowest share of complete FA–GN entries. In addition, between-area differences in the frequency of co-funded grants and multiple grants from a given agency might also influence the rate of completeness of FA–GN entries, since we have observed some degree of incompleteness in these situations. In summary, the main patterns we found for our entire sample of papers emerged at the level of specific thematic areas, although in some cases small differences between areas were detected which can probably be attributed to the varying degrees of complexity of funding environments in different areas.

Future actions and measures

Both the completeness and the accuracy of the process of funding data extraction are curtailed, we believe, by two important factors: (a) the fact that in many journals funding acknowledgments are included in a general “Acknowledgments” section, mixed with other types of data such as personal and institutional acknowledgments or conflicts of interest; and (b) the incomplete and/or unclear way in which many authors provide funding acknowledgments in their articles. Concerning the first issue, it is clear that using separate headings to collect and identify funding data, conflicts of interest and other types of acknowledgment in journal articles is a helpful measure. In fact, some journals have adopted this practice for the sake of clarity and to avoid misunderstandings (see for example, Virology Journal, https://virologyj.biomedcentral.com/). The use of a specific heading for sources of funding (called “Funding”, “Financial support”, “Financial data”, etc.) has already been suggested as a way to facilitate the correct identification of funding data (Henderson et al. 2003).

Regarding the incompleteness of data included by authors, we observed different examples in this study, such as the non-inclusion of grant numbers or the inclusion of the program name instead of the name of the corresponding funder. This suggests that authors need clearer guidelines concerning both the information that should be included and the format in which it should be provided. At present, major funders require grant-holders to acknowledge the support received, but instructions about how to present this information are frequently vague. That is the case, for example, at the Ministry of Economy and Competitiveness in Spain, which funds the Spanish National Plan for Research (see, for example, the main call for research project proposals, BOE-A-2013-10258, which mentions only the need to acknowledge support from the Ministry). More specific statements can be observed in other countries such as Canada (see, for example, http://canada.pch.gc.ca/eng/1428491994616) and the USA (https://grants.nih.gov/grants/acknow.htm) or supranational organizations such as the European Union (COST 2015). Unfortunately, funders often mention only the requirement to acknowledge funding, but provide no specific advice or instructions on the exact content or wording.

Our data suggest that authors need clearer guidelines on aspects such as the type of information to include (funder, grant number), the format and notation of the funder’s name (full name, acronym, country), and the desirable format and notation of the information in cases that involve research support from multiple funders or through multiple grants. Authors should be encouraged to use a standardized form of the funders’ name in their acknowledgments, since the lack of standardization has been recognized as an important problem in previous studies (Sirtes 2013). Consensus between research funders and publishers would be desirable to ensure that both actors request the same information from authors. In this connection, an interesting initiative was launched by the Research Information Network in the UK (Research Information Network 2008), as the result of discussions with UK funders and publishers, to offer recommendations for authors and publishers regarding the content of funding acknowledgments. This group proposes that research articles should have a funding acknowledgment in the form of a sentence as follows:

This work was supported by the Medical Research Council [grant number xxxx].

In other words, they suggest including the funder’s name written out in full, followed by the grant number in square brackets. In the case of multiple funders and grants, RIN proposes the following format:

This work was supported by the Wellcome Trust [grant numbers xxxx, yyyy]; the Natural Environment Research Council [grant number zzzz]; and the Economic and Social Research Council [grant number aaaa].

This is an interesting example of notation, which could also be improved with the inclusion of the funder’s country of origin (if applicable). This information is relevant given the increasing internationalization of research and the existence of funder homonyms in different countries. In fact, having detailed guidelines for funding acknowledgments at the national or even international level can be an important step forward in enhancing the collection of funding data.

From the perspective of bibliographic databases, standardization of funding data is essential to enable information retrieval. Useful strategies to achieve this goal include the creation of lists with standardized names of funders and/or including specific notation to designate grants. Among the latter, the Pubmed system is noteworthy in that it includes grants and contracts according to the notation: grant number/funder/country. Concerning standardized lists of funders, one collaborative project worth highlighting is FundRef (http://crossref.org/fundref/), launched by CrossRef in late May 2013 in collaboration with Elsevier and other partner organizations. Interestingly, it seeks to obtain a standardized list of funders worldwide, which is incorporated into journals’ manuscript tracking system so that authors can select their funder from the list and provide additional information if needed (e.g. grant numbers). This tool would ensure the use of standardized names for all funders included in the system, and would be updated periodically.

Regarding bibliometric studies, caution is needed when dealing with funding acknowledgments. If we rely on FA and GN data, we should keep in mind that small pieces of funding information could be lost by WoS in the extraction of data from the FX field, and that some inconsistencies in the distribution of data by subfield do exist. From our point of view, this does not undermines the usefulness of studies based on FA and GN data conducted with descriptive or exploratory purposes, although turning to FX could be necessary if an exhaustive coverage of data is required. In any event, we consider that certain level of inaccuracy in the results of funding acknowledgment studies needs to be assumed for the time being. Some potential errors in the analyses, such as those derived from the lack of standardization of funders’ names, can be reduced or prevented through an in-depth cleaning of data before analyses (Rigby 2011). However, other problems remain, such as the fact that authors very often provide incomplete information about their funding grants that may lead to confusion and errors in the identification of funders and grants. It is for this reason that improving procedures for disclosing funding sources by authors and their subsequent collection by journals and databases are both crucial for increasing the completeness of registered data and, in the long term, the quality and reliability of bibliometric studies.

In summary, we consider that further efforts are needed to increase the accurate disclosure of funding sources by authors and the capacity of databases to collect this information. Initiatives designed to achieve these aims will benefit researchers (who need more assistance at the time of writing their acknowledgments), funders (to better track the outcomes of the research they fund), publishers (which will gain from uniform information across articles and journals) and bibliometricians (since more relevant and reliable studies would be possible).