Keywords

1 Introduction

Cloud computing poses both opportunities and challenges for companies and IT professionals. Some of these are technical challenges that can be solved over time, while others are related to uncertainties arising from the commitment to a recent innovation. The objective of this research is to identify some of the uncertainties that IT professionals may have and can discourage them from adopting cloud computing.

An innovation is an idea, practice or object that is perceived as new [17]. Although the newness of cloud computing is certainly debatable, there is no doubt that its introduction challenges our conventional understanding of the location and management of IT infrastructure, the nature of products and services, business processes and practice of its services (both for IT professionals and consumers).

The factors which are potentially affecting the intention of IT professionals in the use of cloud computing to deliver products and services to their customers are taken mainly from the theory of diffusion of innovation [17].

The theory identifies five variables that have a profound influence on the rate of adoption of innovation including: perceived attributes of innovation, the type of decision in innovation, communication channels, the nature of the social system, and changing the promotional efforts of agents. The perceived attributes of innovation are an important predictor of intent in adopting innovations [17].

There are many benefits in taking the services offered by a cloud provider. Its application will depend on the nature, size and needs of the company. The decision to choose the option that best meets customer needs is a complex task due to the appearance of different suppliers.

Each provider has its own pricing policy, a degree of flexibility in offering services and a technical support appropriate to the service that he thinks he should supply. The offer revolves around these three important pillars. Many companies like Amazon, Google, Microsoft and Salesforce have become cloud computing providers.

The open source community is also present as a provider within the business model that offers, looking very active in the area of cloud computing with numerous contributions, especially in virtualization technologies [15].

Virtualization is a key technology for the cloud which allows a more efficient and flexible use of resources. Virtualization is a key element of the cloud for its advantages such as flexibility, isolation and utilization rate of resources. Building a cloud environment often initially involves choosing a management solution for the cloud. Often, this decision is difficult, because each solution has its specific characteristics [6].

2 Theoretical Background

There are many variables that can influence the adoption of cloud computing. Research in this field is still scarce due to its recent emergence and adoption by businesses. Often, technological literature focuses on addressing issues and challenges related to adoption, such as service availability, performance, lack of interoperability standards and difficulty of integration and customization [7, 8, 10].

Moreover it is possible to find several studies that emphasize the importance of confidence, both in the adoption of cloud technology, as in the privacy in data storage. This paper focuses on the importance of various especially significant aspects in the literature of adoption of technological innovations, like Cooperation, Complexity Technology, Training, Top Management Support and Communication [16, 21].

  • Cooperation, either in its internal or external aspects, provides synergies, it reassures users and helps achieve expectations. Internal cooperation is the exerted between the different functional areas of a company [12, 13]. Furthermore, external cooperation refers to the links that the organization maintains with the cloud provider.

  • Top Management Support. It is defined as the active involvement of those responsible for the management in the successful implementation of technology [2]. This active participation materializes mainly through leadership and continuous contact with those who are directly linked to the IT planning [19]. Through these measures, users tend to assimilate the expectations of management, in addition to perceive that those responsible for an organization support its implementation. This increases the employees’ favourable attitudes regarding IT.

  • Training. Training is described as the degree to which a company instructs its employees in the use of a tool in terms of quality and quantity. In a complex information system such as cloud computing, the organization needs to train employees and develop skills for effective use in the future [9]. This reduces the potential stress of staff and provides greater motivation and a better understanding of the benefits of the cloud.

  • Communication. Quality communication occurs when members employ a certain amount of time exchanging information and views, either formally or informally [11]. This type of communication increases the distribution of ideas and improves knowledge transfer, especially when the information transmitted is credible and from reliable sources.

  • Technological complexity. It is defined as the degree by which an innovation is perceived as relatively difficult to understand and use. In the case of cloud computing, the aspects related to the complexity could be the time needed for the development of tasks, application integration with cloud infrastructure, interface design or efficiency in data transfer, etc. [9]. In short, the technological complexity can influence the adoption of a cloud solution.

3 Methodological Framework

3.1 Choquet Fuzzy Integral

Non-additive measures are known in literature as fuzzy measures, monotonic measures and capacities between others. Some additive operators such as simple weighted average, ordered weighted average, quasi arithmetic means, weighted min and weighted max are usually used for aggregation purpose. These operators assume that the attributes are always independent between them. This assumption is not correct in scenarios where in many cases, the attributes are strongly interrelated. According to this, aggregation should not be always carried out using common additive operators instead, Fuzzy Choquet Integrals are useful to aggregate with interrelated attributes [5].

Let’s define a fuzzy measure \(\mu \) on a finite set \(N = \{1,2,\ldots ,n\}\) as a function \(\mu : P(N) \rightarrow [0, 1]\) (where P(N) is the power set of N) satisfying the following conditions:

$$\begin{aligned} \mu (\phi ) = 0 \end{aligned}$$
(1)
$$\begin{aligned} \mu (N)=1 \end{aligned}$$
(2)
$$\begin{aligned} A \subseteq B \text { implies that } \mu (A) \le \mu (B) \end{aligned}$$
(3)

The third condition allows measures that do not satisfy the strong condition of additivity. For our purposes, this means that we can model systems where the high value of an attribute of the system in itself does not indicate deviations unless a set of other attributes show deviations from their usual values at the same time [14].

Fuzzy Choquet Integral is one of the most general formulations when using monotone measures as the basis of aggregation [18]. To formulate the definition, we assume n attributes measures \((c_1,\ldots ,c_n)\) that it generate the corresponding \((s_1,\ldots ,s_n)\) values after an evaluation performed.

The basic properties of the operator are determined by the monotone measure, such as symmetry, additivity and linearity. A discrete Fuzzy Choquet Integral with respect to a monotone measure \(\mu \) is defined as

$$\begin{aligned} C_\mu (s_1,\ldots ,s_n) = \sum _{i=1}^n (s_{(i)}-s_{(i-1)})\mu (C_{(i)}) \end{aligned}$$
(4)

where s(i) denotes a permutation of the \(s_i\) values such that \(s_{(1)} \le s_{(2)} \le \ldots \le s_{(n)}\) and \(C_{(i)} = \{c_{(i)}, c_{(i+1)},\ldots , c_{(n)}\}\).

3.2 Support Vector Machines

Let us start from a binary classification

$$\begin{aligned} \{x_i, y_i\}, i = 1,..., n, y_i \in \{-1,1\}, x_i \in R^n \end{aligned}$$
(5)

where x\(_i\) are data points, and \(y_i\) are labels. The data points are separated with a hyperplane given by \(w^T x + b = 0\), where w is a n-dimensional coefficient vector that is normal to the hyperplane, and b is the offset from the origin (Fig. 1).

The linear SVM obtains an optimal separating margin by solving an optimization problem [20] as follows

$$\begin{aligned} \begin{array}{rl} \text {min}(w,\xi )&{}=\frac{1}{2}||\mathbf {w}||^2 + C\sum _{i=1}^n \xi _i \\ \text {s.t.}&{}y_i (\mathbf {w}^T\mathbf {x}_i + b)\ge 1-\xi _i, \xi _i\ge 0 \end{array} \end{aligned}$$
(6)

where \(\xi _i=\text {max}(0, 1-y_i(w\cdot x_i +b))\) if and only if \(\xi _i\) is the smallest non-negative value satisfying \(y_i(w\cdot x_i+b) \ge 1-\xi _i\).

Figure 2 shows an example of a linear SVM, where the solid line h in the figure is the final SVM solution.

Fig. 1
figure 1

A linear support vector machine for binary classification

Fig. 2
figure 2

From raw data to higher dimensional with kernels

Usually, the classification cannot be done linearly. In order for the linear classification to work well in non-linear data, kernels are introduced [20]. The original input space can be mapped into some higher-dimensional feature space where the training set is linearly separable [4]. With this kind of mapping, the decision function can be expressed as

$$\begin{aligned} g(\mathbf {x})=\text {sgn}\Bigg ( \sum _{i=1}^n \alpha _i \cdot y_i \cdot K(\mathbf {x}_i,\mathbf {x})+b\Bigg ) \end{aligned}$$
(7)

where the kernel function is \(K(\mathbf {x}_i,\mathbf {x})=\phi (\mathbf {x}_i)^T\phi (\mathbf {x}_j)\), \((\mathbf {x}_i)^T\mathbf {x}\) in the input space is represented as the form \(\phi (\mathbf {x}_i)^T\phi (\mathbf {x}_j)\) in the feature space. It is not needed to know the functional form of the mapping \(\phi (\mathbf {x}_i)\) since it is implicitly defined by one selected kernel (Table 1) [1].

Table 1 Kernel types and functions

4 Experimental Results

To evaluate our proposed theoretical approach we performed a survey with data from 128 respondents. All of them are CIO from companies located in Spain.

This paper is going to predict the perceived easy-of-use in cloud business services. For that purpose, the items used for measuring the Perceived easy-of-use construct are the following [2, 3]:

  • Interaction with the cloud computing services is clear and friendly

  • Working with the cloud services do not demand a mental effort

  • Cloud services are simple to use

  • It is straightforward to find some stuff in the cloud service

The construct Top management support is measured by the following items:

  • Top management is interested in cloud services

  • Top management understands the importance of cloud services

  • Top management sponsors cloud services

  • Top management understands the opportunities of cloud services

The items used for measuring the Technological complexity construct are the following:

  • It is hard to understand what cloud services is doing

  • Working with cloud services takes too long

  • Working with cloud services needs a hard training

  • In general terms, working with cloud services is so hard

The construct Communication is measured by the following items:

  • Communication about the cloud services is fluid

  • There is not constraints about cloud services’ communication

  • The information about cloud services is correct

After run the proposal hybrid methodology we checked the results with three common error measures. The first one is the Mean Squared Error (MSE). It measures the average of the squares of the errors and is computed as follows

$$\begin{aligned} \text {MSE} = \frac{1}{n}\cdot \sum _{i=1}^n (\hat{x}_i-x_i)^2 \end{aligned}$$
(8)

where n is the number of experiments, \(x_i\) is the real value and \(\hat{x}_i\) is the estimated one. The second error measure computed is the Root-Mean-Square Error (RMSE). It measures the differences between the estimated values and the real observed values. It is computed as follows

$$\begin{aligned} \text {RMSE} = \sqrt{\frac{1}{n}\cdot \sum _{i=1}^n (\hat{x}_i-x_i)^2} \end{aligned}$$
(9)

The third error measures is the Symmetric Mean Absolute Percentage Error (SMAPE) is based on relative errors. It is usually defined as follows

$$\begin{aligned} \text {SMAPE}=\frac{1}{n}\cdot \sum _{i=1}^n\frac{|\hat{x}_i-x_i|}{\big (|\hat{x}_i|+|x_i|\big )/2}\cdot 100 \end{aligned}$$
(10)

The results are detailed in Table 2.

Table 2 Error measurement

We consider that the results confirm that our proposal is a worthy endeavour.

5 Conclusions

Cloud computing is challenging for companies and IT professionals. The research proposes a hybrid machine learning methodology for predicting the perceived easy-of-use of cloud business services.

For that purpose, we use Choquet Fuzzy Integral and Support Vector Machines. The results confirm that this proposal is a worthy endeavour.