Keywords

1 Introduction

Open data provided an opportunity to establish a participatory society, develop innovative data-based solutions, discover new business opportunities and make better decisions in the public and private sectors [1]. Data is considered a driver of change and innovation in governance and business processes, but the realization of its potential requires building and functioning in a targeted environment called an ecosystem. The ecosystem in a broader sense represents a system in which entities work in collaboration and interdependently, as well as strive to contribute to the ecosystem [2]. In a narrow sense, an ecosystem is an environment with its own specificities and includes stakeholders who contribute to the ecosystem by sharing resources, knowledge and skills, striving to achieve social and business goals. In many domains, the value potential of data is recognized and consequently increasingly dominate by interdependent services and data exchange between different stakeholders, which leads to the emergence of data ecosystems [3]. Different research has been conducted in the open data management field, but knowledge about data ecosystem management and its operation is limited [1, 4, 5], specifically in the various professional field. Nevertheless, the creation, use and reuse of data [1] are necessary for the successful construction and development of a data-driven culture, as one of the strategically important directions in the growing data economy according to European strategy for data [6]. Accordingly, there is a need to research data ecosystems, that is, to identify the key elements, characteristics and ways how to successfully achieve principles that such a system should satisfy. In this research, an ecosystem approach was applied and it was analyzed how the ecosystem elements contribute to the realization of this approach principles. In this paper, the research questions (RQ) are defined as a following:

  • RQ1: What are the elements of the (open) data ecosystem?

  • RQ2: What characteristics should the (open) data ecosystem satisfy for the purpose of successful operations?

  • RQ3: Which identified elements of the (open) data ecosystem have an impact on the realization of the principles of the ecosystem approach?

The contribution of this research stems from the need to create circular data ecosystems and strengthen specific data environments. A circular data ecosystem presents an environment where stakeholders together contribute to building a common field where they operate in a way that shares their own data and uses data from others, they know data needs and upgrade data flows. To create an environment that successfully leverages its full potential in the creation, use, and reuse of data, an analysis of the ecosystem’s key elements, characteristics, and modes of operation are essential.

The following part provides an explanation of the ecosystem approach used in this research, a description of the problems observed, and the research design. The third part of the paper provides a literature review with an analysis of the results of existing research. The fourth part provides a presentation of the results, while the fifth part provides a discussion and suggestions for further research.

2 Research Approach and Design

2.1 The Ecosystem Approach

Originally, the ecosystem approach implies a strategy for the integrated management of land, water and living resources that promotes conservation and sustainable use in an equitable manner [7,8,9]. This approach seeks to achieve a balance in conservation, sustainable use as well as a fair and equitable sharing of the benefits arising from the use of resources [7,8,9,10]. First, this approach came to settle down in natural science such as biological and environmental science, while later it found useful application in different sciences such as information science. The ecosystem approach has become a guide through various theoretical, conceptual and practical aspects of complex systems relevant to socio-ecological management [11]. Each complex system with its own specific, elements, characteristics and objectives are called an ecosystem. An ecosystem is defined as a system of stakeholders, practices, values and technologies in a particular environment, where the mentioned elements are connected and interdependent [12]. The literature mentions 12 principles (P) on which the ecosystem approach is based. They are shortly mentioned as follow: The management objectives of resources are a matter of societal choice (P1), Decentralized management to the lowest appropriate level (P2), Ecosystem managers should consider their effects (P3), Recognizing potential gains from management (P4), Conservation of ecosystem structure and functioning (P5), Ecosystems must be managed within the limits of their functioning (P6), The ecosystem approach should be undertaken at the appropriate spatial and temporal scales (P7), Long-term objectives for ecosystem management (P8), Management must recognize that change is inevitable (P9), Balance between, and integration of, conservation and use of biological diversity (P10), Processes based on information, including scientific and indigenous and local knowledge, innovations and practices (P11), Involvement of all relevant stakeholders (P12) [7, 9, 13]. The operation of the ecosystem based on these principles is significant for the development of a sustainable and strengthened environment, as, for example, a data ecosystem should be. Essential elements and characteristics of some ecosystem contribute to the realization of these principles. The application of the ecosystem approach is necessary for the analysis of the elements and characteristics of a specific ecosystem, as well as the achievability of the principle.

2.2 Research Problem and Motivation

According to the ecosystem approach, dependence on other parties is necessary for the strategic management and operation of the ecosystem. Governance institutions, which have largely been initiators and promotors of open data initiatives, have launched open data portals, and are increasingly dependent on other organizations that produce high-quality data. Data management in such organizations is becoming increasingly critical. In addition to existing research on data management within organizations, data management in the ecosystem is more difficult to understand and very limited is known about it [4]. Despite the great interest in the data ecosystem, clear definitions are lacking [1]. The information and data flow between different stakeholders are obscure and their roles are not clearly defined [1, 5]. The publication and exchange of data in the private sector are encouraged [6], but the significant needs for different types of valuable government and private data remain superficial and incomprehensible [1]. The impact of open data in specific sectors of public governance has not yet been deeply understood and assessed [1], as well as initiatives at the level of data ecosystems are missing. Also, it is discussed a lot of issues and concludes that there is a paradox that open data (publication need) potential is decreased due to the provision of already enhanced (well-developed) e-commerce and e-government services [14]. Therefore, the observed problems can be classified in a narrow and broad sense.

Several problems were identified in a narrow sense of the data ecosystem:

  • unclear definition of a data ecosystem;

  • the lack of a sound identification of the key elements of the data ecosystem and limited knowledge of its characteristics;

  • the relationship confusion and unclear data flow between the stakeholders of the ecosystem;

  • lack of focus on developing and building the data ecosystem in practice;

  • lack of data management knowledge and skills of those involved in the data ecosystem.

In a broader sense, the mentioned problems are reflected in the limited growth and development of the open data field which causes the following problems:

  • lack of use and reuse of open data

  • questionable sustainability of open data initiatives.

2.3 Research Design

The study was divided into two parts. The first part of the research focused on determining the answers to RQ1 and RQ2. The second part of the research provided an overview of the connection of elements to the principles of the ecosystem approach and provided an answer to RQ3.

For the first part of the research, based on the ecosystem approach, the guideline for reporting systematic reviews Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) was applied. PRISMA guidelines enable the implementation of research through the identification of relevant literature in three phases: Identification, Screening and Included. Papers from the relevant databases in the investigated research field Web of Science (WoS) and Scopus were included in the investigation process. The complex query in the research was: “data ecosystem” OR “open data ecosystem” AND “elements” OR “components”.

Fig. 1.
figure 1

Preferred Reporting Items in research

In the Identification phase (Fig. 1), a list of papers from WoS and Scopus was gathered based on a complex query, and duplicate as well as unavailable papers were removed. In the Screening phase, an analysis of abstracts was made, and papers whose thematically did not correspond to the researched field were excluded from the research. Secondly, papers whose content corresponds to the selected research topic were subjected to a detailed analysis. Focus in literature analysis is placed on papers that deal with the topic of “data ecosystem” or “open data ecosystem”, and which in the content include an analysis of the elements or components of the data ecosystem, as well as the characteristics, pre-requisites or principles of the ecosystem. The Included phase represents the total number of papers included in the results of this research.

3 Literature Review

With the development of appropriate technologies and digital society, data has become a tradable and valuable resource [15]. The establishment of industrial open data ecosystems is still in its infancy with many unknowns and unmapped in how they should be built and operate [16]. Challenges and ambiguity in all their key elements, the development of data ecosystems represent an obstacle [16] and require a research effort in defining the approach. An open data ecosystem is an environment of stakeholders who can mutually exchange, produce and consume data and its provide conditions for the creation, management and maintenance of data exchange initiatives [15]. The conditions that the data ecosystem should satisfy include the multidimensional nature of different perspectives such as economic, technical, operational, legal, social, political and institutional perspectives [15,16,17,18].

The previous literature individually deals with the research of different constructs of data ecosystems. Stakeholders in the data ecosystem are mentioned as one of the most important constructs [1, 19,20,21,22]. They can simultaneously be producers and/or users of data with different roles in the data ecosystem. The activities they carry out take place under different conditions [15]. The next more significant construct is the technologies, standards and tools used in the data ecosystem [22,23,24]. They enable networking in the ecosystem, actions on data and visualization of the data environment [25, 26]. Data, as a purposeful resource in the ecosystem, is the construct around which the most significant activities of the data ecosystem are made [27]. Policy/Governance is responsible for defining policies, guidelines and dissemination of data use and reuse activities [4, 28, 29]. Also, their role is significant in providing the infrastructure and support law perspective in data initiatives, such as control of data publishing obligations and rights as well as care of General Data Protection Regulation (GDPR). The concluding reflection points to a set of heterogeneous, dynamic and evolving constructs in the landscape of the data ecosystem [15].

3.1 Open Data Ecosystem Elements

The open data ecosystem is an environment that gathers stakeholders at multiple levels [18, 21, 30, 31] and infrastructurally supports all processes related to data [25, 32], which are key resources for creating diverse values in society [30]. The purpose of a data ecosystem is to actively create and use data by stakeholders, therefore their needs should be clearly identified and defined. Stakeholders must know each other’s needs in order to be able to contribute to the common goal in the ecosystem such as sharing resources (data), providing support (upgrading of skills and knowledge, infrastructure support), and creating innovations (development of new products, the innovation culture). The ecosystem can be shown through the basic elements and their characteristics with certain contextual conditions that must be met for its process and technological functioning. Key elements of the open data ecosystem are:

  1. 1.

    Stakeholders

  2. 2.

    Infrastructure

  3. 3.

    Data

  4. 4.

    Policy/Governance.

In the literature analysis, emergent entities of the data ecosystem were identified. Entities are identified as elements of the data ecosystem without which it cannot operate. A total of 60 articles were analyzed and the mentioned entities that were partially or completely subject to the treatment of the topic were extracted. One of the most frequently mentioned entities is infrastructure or various infrastructure capacities. As many as 50 articles mention different infrastructure capacities as a significant element in the data ecosystem. Furthermore, 39 articles mentioned stakeholders as an element that is indispensable for the operating of the data ecosystem due to the roles that they, as actors, occupy in the system. The data are highlighted in 38 articles and they are the subject resource of the data ecosystem. Policy/Governance is important for regulatory purposes and is marked in 24 articles.

3.1.1 Stakeholders

Stakeholders in the ecosystem, in addition to their primary goal, such as achieving some business result, have an obligation to support and contribute to the development of the data ecosystem [18, 32]. In the open data ecosystem, the contribution of stakeholders is noticeable through active participation, dynamic interaction, support of open data initiatives and the development of a data-based innovation culture in the sector [33]. Open data ecosystems are developed through the adaptation of stakeholders, their feedback loops and dynamic interaction, and the strengthening of other interdependent factors [18]. This process is supported by the proposal of the triple helix model [31]. The inclusion of all stakeholders in the process of knowledge sharing and innovation creation makes this model useful for ecosystem analysis. In one of the studies, the quintuple helix model was presented, which unites all the stakeholders of the observed ecosystem [31]. The same approach is necessary to identify stakeholders that share the same or similar problems, collaborate with similar partners, and have similar needs [34, 35]. To create a sustainable open data environment in a sector, the roles and data needs must be understood [30, 36]. Typical roles of stakeholders in the open data ecosystem are data providers, infrastructure providers, application developers, support service providers, end-users [21].

3.1.2 Infrastructure

To enable the proper functioning of all processes and an accessible environment to stakeholders, a complex infrastructure is essential in the background of the open data ecosystem [3, 24, 37]. The technical and technological infrastructure includes programs, tools, services, and data preparation activities [18]. The background activities, such as the development and implementation of programs and tools in the ecosystem, aim to make data available to users on portals or platforms [24]. The infrastructure supports the preparation of the data and its publication in an appropriate form, while the users on the portal or platform should find the data very easily and download it for use, which requires a user-friendly interface, appropriate visualization, and support service [18, 38].

Several activities characterize the importance of this element in the ecosystem. First, (a) the infrastructure for preparing and publishing open data on the Internet is important, (b) cleaning, analyzing, enriching, combining, linking, and visualizing data, (c) searching, finding, accessing, browsing, and evaluating metadata, (d) discussing data and providing feedback to the data provider and other stakeholders [16, 18]. These activities cover a wide range of topics that are critical to the development of the open data ecosystem. Some of them are data audit, dataset selection, address and map data, privacy, licensing, high quality publishing, data access, data discovery, supporting public agencies, engaging data users, promoting reuse, and evaluation [25, 39]. The open data ecosystem requires significant infrastructure capacity and a commitment to improving it in order to build an organized, functioning, and sustainable system.

3.1.3 Data

Data is a product of a data ecosystem. Fundamental processes in the data ecosystem are based on and with data, from the generation and collection of data to its availability in the ecosystem [40]. Data in the ecosystem should be easily discoverable, available and known to stakeholders, accessible for download and application, and usable for an operational purpose that provides benefits [41, 42]. The characteristics of datasets can be evaluated based on parameters such as legal, practical, technical, and social aspects [43]. Legal parameters include a machine-readable rights statement, a clear rights statement, licensing and privacy issues. Practical parameters include web accessibility, guaranteed timeliness of data, and quality issues. Technical parameters include appropriate formats and open standards. Social parameters relate to metadata, providing feedback, and promoting data. Open data can be viewed in a variety of ways, and there are several definitions. Rather than provide another formal definition, we prefer to look at the characteristics of what makes data truly open. Finally, the characteristics of data could be included and related to the conceptual model in the following ways: (a) data must be complete, (b) data must be primary, (c) data must be timely, (d) data must be accessible, (e) data must be machine-processable and posted online in a permanent archive, (f) access must be nondiscriminatory, (g) data formats must be non-proprietary, (h) data licensing must be unrestricted and have no usage costs, and (i) data should be as accurate as possible [44].

3.1.4 Policy/Governance

Policy/Governance is an element that determines the progress and development direction of data ecosystem initiatives [4, 45]. First, establishing sustainable and strong data ecosystems requires defining policies at the national level and developing them at lower, sectoral levels [2, 17, 46]. Government efforts should focus on promoting better information sharing in the public and private sectors, supporting open innovation for co-creation of products and services through budgeting and infrastructure, and actively engaging in interoperation within the data ecosystem [47, 48].

3.2 Open Data Ecosystem Characteristics

The success of the data ecosystem depends largely on the collaboration and communication among stakeholders and their joint interaction. A collaborative environment is important for creating a good climate in the environment and for sharing data knowledge and skills [12, 18]. Infrastructural capacities can be enhanced by sharing technological resources. Certainly, it is necessary to research good practices and analyze what kind of infrastructure is desirable in an ecosystem where multiple stakeholders with different needs operate. Infrastructure capacities should be interoperable and stakeholders should be familiar with their functional features. In the ecosystem, it should be outlined what kind of data is desirable and in what form it is acceptable for further use and reuse, therefore certain rules regarding data should be defined. The data ecosystem should be characterized by a culture of experimentation with data. For an open data ecosystem in a certain industry or sector, it is important to build a development strategy and development models at the management and governance levels. Attempts to develop potential products or services can improve business processes within organizations or contribute to the development of socially useful solutions [42, 47, 49, 50].

4 Open Data Ecosystem Conceptualization

The conceptual model of the open data ecosystem (Fig. 2) is presented in terms of the four elements of the data ecosystem and their characteristics to be considered in the detailed elaboration and analysis of specific ecosystems.

Stakeholders in the data ecosystem are important for several reasons. In addition to their primary role in the industry, they also have specific roles in the data ecosystem. They may be data creators or data users, or they may take on other roles. In addition, it is important to identify their potential for creating data, i.e., what useful data they create for other stakeholders in their organization. Identifying data needs, i.e., determining potentially useful data for developing one’s business, requires a great deal of attention.

Infrastructure is the most complex element that requires detailed elaboration because it supports all activities in the ecosystem, including activities related to data and activities between stakeholders. The technologies and techniques selected should enable the easiest possible processing and preparation of data so that they are accessible to stakeholders for use and further processing [24]. The data ecosystem should be a secure environment for stakeholders that supports user-friendly features such as a collaborative platform, APIs, licenses, standards, tools and more [24, 51, 52].

Data is a resource that stakeholders in the ecosystem care about. It should be available in an acceptable format and meet certain characteristics and quality standards to be used for an appropriate business or societal purpose.

Policy/Governance is an element that includes the industry as a data ecosystem in the national open data policy. In addition, detailed elaboration at a lower level is desirable. Initiatives should be a priority of government institutions through supporting activities [4, 53, 54].

Fig. 2.
figure 2

Conceptual Model of Open Data Ecosystem Elements and Characteristics

Proposed elements of open data ecosystem can be explained from perspective of specific sector or a specific industry. For example, stakeholder analysis will explain how to identify roles in the sector and data environment and the potential they have in the creating and using data. In addition, infrastructure element will assess the accessibility and interoperability of technological solutions and principles. Element called data will include identification of what data are available, its quality, and its potential for use in the sector. Element Governance/Policy identify the potential of such an ecosystem and whether government or leading initiatives aim to build and financially support an ecosystem. Future work will include the improvement of proposed model to assess the state of a specific data ecosystem and development of framework. The improvement and framework will be based on the proposed conceptual model in order to assess the identified elements of the data ecosystem. Future research will aim to assess the status of the data ecosystem of individual industries and benchmark at the level of different countries. The data ecosystem should contribute to the fulfillment of the principles of the ecosystem approach through the mentioned elements. Stakeholders are closely related to principles P3, P4, P10, P11 and P12. Infrastructure contributes to the fulfillment of principles P5, P6, P7 and P8. Data covers several principles, namely P6, P7, P8 and P11. Policy/Governance covers most of the principles, namely P1, P2, P3, P6, P7, P8 and P9.

5 Discussion and Future Work

Some issues were discovered during the study project to examine open data ecosystems in a specific sector and industry, which motivated the need for this research. Recognizing the potential, a number of projects have been started to advance the field of open data. Despite all efforts, determining how much contribution has been made and the amount to which stakeholders in a given open data ecosystem have profited is extremely challenging. As a result, open data use and reuse have not grown as they should, and future plans for the sustainability of open data programs are uncertain. To address this, our concept focuses on developing open data ecosystems within the context of a certain sector or industry. The study proposes a model for creating a powerful data environment, which covers stakeholder analysis, infrastructure challenges, data, and governance/policy considerations. Each of the aforementioned elements demands future research to elaborate on and carefully study its properties in the context of a certain sector or industry. Open data knowledge and skills are required for stakeholders to identify and fulfil their responsibilities in the data ecosystem as creators or users. A multidisciplinary approach by stakeholders can support in the creation of a stronger and more sustainable data ecosystem. The most critical issue right now is infrastructural capability. They require investments, a high-quality strategic approach, an overview of the ecosystem, and the operationalization of all ecosystem parts. National data policies continue to exclude the development of sectoral data demands and the objective of creating sector-based ecosystems.

6 Conclusion

Development-oriented open data efforts should be aligned, created, and operationalized as an ecosystem at the sector or industry level so that actual progress can be evaluated. Existing research frequently mentions the open data ecosystem, but rarely in the context of a sector or industry. These results are not surprising considering the research’s complexity, which necessitates industry expertise and abilities connected to the characteristics of the open data ecosystem.

The open data ecosystem is an environment in which stakeholders with different responsibilities but similar interests operate, share, and use specific data, that is enabled by interoperable infrastructure solutions. To create a specialized open data ecosystem, it is required to concentrate on promoting and strengthening a data-driven economy, as well as sharing open data through the deployment and availability of appropriate support technologies. The purpose of data ecosystem development proposed on this paper is to shift stakeholders’ roles from passively informed actors to active co-producers of ecosystem services. Future research will concentrate on developing a framework for assessing open data ecosystems based on the proposed conceptual model, which includes important characteristic elements.