Introduction

Electronic commerce, commonly known as e-commerce or ecommerce (EC), consists of the buying and selling of products or services over electronic systems such as the internet and other related networks. The amount of trade conducted electronically has grown extraordinarily with widespread Internet usage. The use of commerce is conducted in this way, spurring and drawing on innovations in electronic funds transfer, supply chain management, Internet marketing, online transaction processing, electronic data interchange (EDI), inventory management systems, and automated data collection systems. Modern electronic commerce typically uses the World Wide Web at least at some point in the transaction’s lifecycle, although it can encompass a wider range of technologies such as e-mail as well.

A large percentage of EC is conducted entirely electronically for virtual items such as access to premium content on a website, but most EC involves the transportation of physical items in some way. Online retailers are sometimes known as e-tailers and online retail is sometimes known as e-tail. Almost all big retailers have EC presence on the world wide web.

EC that is conducted between businesses is referred to as business-to-business or B2B. B2B can be open to all interested parties or limited to specific, pre-qualified participants. EC that is conducted between businesses and consumers, on the other hand, is referred to as business-to-consumer or B2C. This is the type of EC conducted by companies such as Amazon.com.

EC is generally considered to be the sales aspect of e-business. It also consists of the exchange of data to facilitate the financing and payment aspects of the business transactions.

This paper surveys EC technology trends and forecasts using bibliometric analysis from 1996 to July 2015 with topic as “e-commerce” in SSCI database. The bibliometric analytical technique is used to examine the topic in SSCI journals from 1996 to July 2015, we found 5429 articles with EC. This paper implemented and classified EC literatures using the eight categories as: publication year, citation, country/territory, institute name, document type, language, research area and source title for different distribution status in order to explore how EC technology trends and applications have developed in this period.

For verifying the analysis result, the paper will perform by the following steps to verify the reliability of Lotka’s Law: (1) Collect data (2) List author and literature distribution Table (3) Calculate n value (slope) (4) Calculate c value (5) Utilizing K–S (Kolmogorov–Smirnov, K–S) test to evaluate if matched Lotka’s Law. (Potter 1988)

The paper provides an EC roadmap to guide future research and abstract the trend information so that EC researchers can save some time since core knowledge will be concentrated in EC core categories. In higher quality publications, it is very common for “success breeds success” phenomenon.

Methodology and data retrieval strategy

Methodology

One of the main areas in bibliometric research concerns the application of bibliometric laws (Luor et al. 2014). The three most commonly used laws in bibliometrics are: Lotka’s law of scientific productivity, Bradford’s law of scatter, and Zipf’s law of word occurrence.

Lotka’s Law describes the frequency of publication by authors in a given field. It states that “the number (of authors) making n contributions is about 1/n2 of those making one; and the proportion of all contributors, that make a single contribution, is about 60 percent” (Tsay 2003). This means that out of all the authors in a given field, 60 % will have just one publication, and 15 % will have two publications (1/22 times 0.60). 7 % of authors will have three publications (1/32 times 0.60), and so on. According to Lotka’s Law of scientific productivity, only 0.6 % of the authors in a field will produce more than ten articles. Lotka’s Law, when applied to large bodies of literature over a fairly long period of time, can be accurate in general, but not statistically exact. It is often used to estimate the frequency with which authors will appear in an online catalog (Tsay 2003).

Lotka’s law is generally useful for understanding the productivity patterns of authors in a bibliography (Coille 1977; Gupta 1987; Nicholls 1989; Lour et al. 2014; Pao 1985; Rao 1980; Vlachy 1978). In this article, Lotka’s Law is selected to perform bibliometric analysis to check on literature record count versus accumulated authors between 1996 and July 2015 to perform author productivity inspection for discovering historical review and collecting the results for research tendency forecast in the near future. For verifying the analysis result, the paper implements KS test to evaluate if the result matched Lotka’s Law.

Data retrieval strategy

The data retrieval strategy (shown as Fig. 1) demonstrates as below:

Fig. 1
figure 1

Data retrieval strategy diagram

1. Retrieving technology trends and forecasts in eight categories by Bibliometrics.

2. Retrieving the distribution of author article production by KS test.

Based on the above information, governments and institutes may infer collective tendencies and demands of scientific researcher in EC and formulate appropriate training strategies and policies in the future.

Research results

This research is accessing the Social Science Citation Index (SSCI) on Web of Science created by ISI. The result is summarizing those 5429 paper indexes which topic is “e-commerce” from 1996 to July 2015, shown as Fig. 2. Obviously, the literature production of EC is rising since 1998 and citation is also increasing steadily and gradually by every year. It indicates the research of EC is very popular in the highly exploration period, referred to Fig. 3. The research of EC reached the highest record in 2014.

Fig. 2
figure 2

The tendency chart of literature growth of EC (source SSCI database)

Fig. 3
figure 3

Citation of EC in each year (source SSCI database)

By viewing on Table 1, they displayed that the distribution of country/territory from 1996 to July 2015, USA is a champion with 1954 record counts (35.99 %), following by People R. China, Taiwan, England and South Korea which achieved the record counts as 596(10.98 %), 433(7.98 %), 380(7.00 %) and 283(5.21 %) oppositely. For the distribution of institution name as shown in Table 1, City University of Hong Kong is a champion with 83 record counts (1.53 %), following by University of Minnesota, Hong Kong Polytech University, University of Wisconsin and National University of Singapore which achieved the record counts as 60(1.11 %), 53(0.98 %), 52(0.96 %) and 50(0.92 %) oppositely.

Table 1 Distribution of top 25 countries/territories and institutes from 1996 to July 2015

From the Table 2, it indicated that the most popular publication document type is article (4845 record counts, 89.24 %), following by proceedings paper (756 record counts, 13.93 %) and editorial material (211 record counts, 5.98 %). The most popular language for literature is using English (5345 record counts, 98.45 %) in the research domain of EC, following by German (41 record counts, 0.76 %), see the following Table 2. The result shows that article document type and English language are still the main trend in EC research domain.

Table 2 Distribution of top 25 document types and languages from 1996 to July 2015

In the Table 3, it is important to summarize the trend information for EC researchers since core knowledge will be concentrated in core categories and to get understanding about the distribution of top 25 subjects in future research trends and forecasts. Focus on the right categories, researchers will catch the core research information. The top five ranking of research subjects are computer science (2829 record counts, 52.11 %), following by business economics (1764 record counts, 32.49 %), engineering (1055 record counts, 19.43 %), information science and library science (663 record counts, 12.21 %), and operations research and management science (520 record counts, 9.58 %).

Table 3 Distribution of top 25 research areas and source titles from 1996 to July 2015

Moreover, it also discovered top 25 important research sources for EC researchers. According to these core Journals, it will be easy to recognize the developing trends and contents in EC. The top five ranking of research sources are Lecture Notes in computer science (400 record counts, 7.37 %), following by Expert Systems with Applications (123 record counts, 2.27 %), International Journal of Electronic Commerce (123 record counts, 2.27 %), Electronic Commerce Research and Applications (120 record counts, 2.21 %), and Decision Support Systems (114 record counts, 2.10 %).

The literatures productivity analysis of EC by Lotka’s Law

The section will perform by the following steps to verify the reliability of Lotka’s Law:

  1. 1.

    Collect data

  2. 2.

    List author and literature distribution Table

  3. 3.

    Calculate n value (slope)

  4. 4.

    Calculate c value

  5. 5.

    Utilizing K–S (Kolmogorov–Smirnov, K–S) est to evaluate if the result matched Lotka’s Law

1. Collect data and

2. List author and literature distribution table:

In Table 4, it calculated the author quantity by the equality method from 9284 literatures which retrieved by index on SSCI. Thus, it is obtained altogether 6175 of authors on research aspect of EC.

Table 4 Calculation of author productivity of EC

3. Calculate n value (slope):

By the result of calculation on Table 5, it could bring into Lotka’s Law’s equation as below to calculate n value:

$$n = \frac{{N\sum {XY} - \sum X \sum Y }}{{N\sum {X^{2} } - \left( {\sum {X^{2} } } \right)}}$$
(1)

We can refer Table 5 for the values in equation, then n = −1.310626

Table 5 Calculation of the exponent n for EC

4. Calculate c value:

After that, we also found c = 0.26187474, the equation is shown as below:

$$c = \frac{1}{{\sum\nolimits_{1}^{p - 1} {\frac{1}{{x^{n} }} + \frac{1}{{(n - 1)p^{n - 1} }} + \frac{1}{{2p^{n} }} + \frac{n}{{24(p - 1)^{n + 1} }}} }}$$
(2)

P = 20, x = 1–19

When we got n = −1.310626 and c = 0.26187474, it explored:

$$f\left( x \right) = 0.26187474/x^{1.310626}$$
(3)

Referring to the data from Table 6, authors with only one literature is 78.30 % (100 %−21.70 % = 78.30 %), which is close to primitive c value 78.25 % generated by Lotka’s law. After that, it can follow the calculation to get n and c value by the least squares law, carry onto the further proceeding examination for Lotka’s law compliance.

Table 6 The K–S test for EC

According to Pao (1989) suggestion, the absolute value of n should be between 1.2 and 3.8 which formed by the generalized Lotka’s Law, the result is matched the reference data by observation. The distribution chart is shown as Fig. 4.

Fig. 4
figure 4

Distribution of literature productivity of author on EC

5. Utilizing K–S (Kolmogorov–Smirnov, K–S) test to evaluate if matched Lotka’s Law:

For discussing the value of n and c, we got n = −1.310626 and c = 0.26187474 generated by Lotka’s Law, the result demonstrated that the EC literature author distribution and the primitive Lotka’s Law are matched approximately, refer to Fig. 4. In order to examine whether the theoretical value and the observation value are tallied, the paper use KS test to evaluate the suitability of Lotka’s Law. Regarding the n and c value which gained by the formula, it is possible to calculate the expected value and the accumulation value of author, following by K–S test examination.

From Table 6, we can find D max [D max = ABS Value Fo(x) − Sn(x)] = 0.5407. According to K–S test, the threshold value is:

$$1.63/\sqrt {6175} = 0.2097$$
(4)

Because D max is bigger than the K–S test threshold value, the result also indicated that the distribution of author productivity is not matched by the Lotka’s Law. The consequence means the Lotka’s Law is not suitable for the literature author productivity distribution in EC research domain (Potter 1981).

Conclusion

EC is one of fast growing research topics in recently years, the trend forecast of this research field by each kind of literature characteristic and author productivity distribution are in growing period. In this study, it demonstrated that the current EC literatures are still continuously to grow, the main research development facility with delivered the largest production is USA, but Peoples R China, Taiwan, England, South Korea and Canada also have potential to deliver more literatures in the future. In addition, the result shows that article document type and English language are still the main trend in EC research domain.

After checked by K–S test, the distribution of frequency indexes of author productivity in EC is not suitable for Lotka’s Law. The reason why EC does not match is that the number of authors who publish only one article is too large; as a result, the difference between the observed value and the expected value becomes greater than the K–S test critical value. This outcome causes the EC distribution to diverge from the slope of Lotka’s law.

The most relevant disciplines for EC subject categories provided by computer science, business economics, engineering, information science and library science, and operations research and management science. In addition, telecommunications, psychology, social sciences and other topics, government law and mathematics also have the potential in EC research domain.

The most important journals for EC source titles provided by Lecture Notes in Computer Science, Expert Systems with Applications, International Journal of Electronic Commerce, Electronic Commerce Research and Applications, and Decision Support Systems. Furthermore, Lecture Notes in Artificial Intelligence, Chemical Week, Journal of Management Information Systems, Information Management, and Industrial Management and Data Systems.

Focus on the right categories, the researchers may catch the valuable information and research trends. This study provides an EC roadmap to lead future research and to abstract the tendencies. Therefore, EC researchers can save time to browse sources since core knowledge will be concentrated in EC core categories. In higher quality publications, it is very common for “success breeds success” phenomenon.

Implementation

The study results can also support governments and institutes to recognize the scientific research trends and forecasts of EC, and to realize the developing scale of research in EC through analyzing the increases of the article author. The resources are always conflicted, especially for emerging and developing countries, and small and medium enterprises. According to the above information, it will be easy for governments and institutes to organize the proper training strategies and policies in the future.