Keywords

8.1 Introduction

Smart learning environment (SLE) [1] is a powerful learning tool that provides on-demand and adaptive support to the learners based on the analysis of their learners needs. In recent years, it has became more popular and a large number of learners, academic institutions, and industries started to use across the worldwide that generates large amount of learners’ data. Smart Competence LEarning Analytics platform (SCALE) [2] is an in-house learning analytics platform designed and developed to track the fine-level learning experiences and translates those experiences into opportunities for customized feedback, reflection, and regulation. It has witnessed that, it has to deal with a large amount of learners data also known as Big data. Hence, it is essential to extract, process, and analyze those learners data in a rapid and an efficient manner to understand the learners various competence measures. Ku [3] analyzed the problems in conventional e-learning environment and stated that “smart learning is not just providing e-learning services through mobile devices, it should have an intelligent customized learning services optimized for the learners’ on a personalized mobile device.” Map Reduce [4] and Dryad [5] are the most popular distributed processing frameworks used in the market to process the Big data. These frameworks split the Big data and distribute those data in parallel across the Hadoop [6] clusters to analyze, transfer, and merge the results. The Big data processing requires vast amount of computation cycles and bandwidth across the clusters and between the analytics engine and the clusters in on-demand manner.

However, SCALE is currently using the conventional networking principle of subscription-based method. Conversely, the main drawback in the conventional networking is allocation of bandwidth in an on-demand manner that requires lots of manual intervention and time-consuming process and involves more laboring cost that increases the total computation time and computation cost. Hence, in this research paper, we propose to marry the software-defined networking (SDN)-based [7] networking principle with SCALE platform, which will create the network paths, inject the flows, and allocate the bandwidth based on the demand. OpenFlow [8] protocol is used to establish the communication between the networking devices. The research works [911] have investigated the advantages of SDN for Big data application processing to reduce the processing time, dynamically creating network paths, etc. The article discussed [12] the capability of SDN to build a large, intelligent network that is capable to handle both structured and unstructured data as part of Big data analytics, and it eliminated the pain of manual administration, which exists in the conventional networking resources. In summary, the main contributions of research work are summarized below:

  1. (A)

    Designing a SDN-based network services for SCALE platform for performing learning analytics in a Smart Learning Environment.

  2. (B)

    Provisioning of network paths, injecting flows, and allocating the bandwidth in a dynamic manner for reducing the total computation time and total computation cost.

  3. (C)

    Creating a SDN-based Hadoop infrastructure with OpenFlow switches and OpenFlow protocols.

  4. (D)

    Analyzing the impact of SDN-based networking services in the SCALE platform.

The rest of this paper is organized as follows: Sect. 8.2 describes the research works, which are closely related and provides background knowledge to support our research work. Section 8.3 discusses the problem statement and the proposed solution to solve the problem. Section 8.4 describes the system architecture for implementing the proposed work. Section 8.5 discusses the performance measures of the proposed system. Section 8.6 concludes the paper with recommendations and future work to be explored further.

8.2 Related Works

Ku [3] analyzed the base technologies for the SLE and examined the problems in conventional e-learning environment. The study has concluded that, it is essential to include context aware technology, Big data technology, and cloud computing in traditional e-learning services to build a SLE. Das et al. [10] proposed a network management framework named FlowComb, which is helpful for Big data application processing to achieve high utilization and low data processing times. Their proposed system is capable to predict the application transfer time based on the software agents installed on application servers. Their experimental results are evident that their proposed system reduces the Big data processing time in a remarkable manner. Benson et al. [13] proposed a framework named as Cloud Networking as a Service (CloudNaaS). It extends the self-provisioning model of providing network devices in an on-demand manner in addition to compute and store devices. Mahenge and Mwangoka [14] designed a cost-effective mobile-based content delivery system to deliver the contents in a resource and bandwidth constraint environment. They claimed that the proposed system has the potential to reduce the bandwidth cost, the server workload, and the Internet usage overhead. They improved the quality of experience and learners’ participation using their proposed system.

8.3 Problem Statement and Solution

SCALE requires a large amount of computation cycles and network bandwidth to process and analyze the collected data in an on-demand manner. Figure 8.1 represents the working mode of SCALE using the conventional  networking structure. Figure 8.2 represents the schematic representation of the solution to solve the above-said problem.

Fig. 8.1
figure 1

Problem representation

Fig. 8.2
figure 2

Schematic representation of solution

SCALE communicates with SDN controller through SDN connector to initiate learners data analytics process. The controller node and cluster nodes should be configured with Open vSwitch (OVS) [15] software that can act as the physical switch, as well as the virtual switch. The queues (Q1–QN) are created in OVS and each queue is assigned with a specific bandwidth value. The virtual network V1 and V2 has been created, and it is allocated with a link bandwidth value of B1 and B2, respectively; it can be modified dynamically based on the demand.

8.4 Proposed System Architecture

The system architecture for implementing the proposed research work is shown in Fig. 8.3.

Fig. 8.3
figure 3

System architecture

The data collector module collects the learners’ data from Moodle, MI-LATTE, CODEX, and Hackystat sensors that consists of both structured and unstructured data. It stores the data in Hadoop Distributed File System (HDFS) clusters through data storage manager. The SDN connector acts as the interface between the OpenFlow switches and SCALE platform using the OpenFlow protocol. The data preprocessor periodically queries, fetches and, preprocesses the learners’ data that remove the unwanted fields, name, and id of the learners for privacy purpose. The preprocessed data are sent to the data clustering manager which is embedded with the particle swarm optimization (PSO) [16] clustering algorithm. It group the similar learners in an optimal manner based on the learners efficiency, quality, and accuracy. The data analyzer component is integrated with various machine learning and statistical algorithms that initiates the learning analytics process across the Hadoop clusters. The SDN controller provisions the network paths in the OpenFlow switches in a dynamic manner and allocates the required bandwidth to network links in each cluster by injecting the flows for performing learning analytics process. Finally, the analyzed learners’ data are fed into the data visualizer to visualize the various competence measures in terms of graphical charts and statistical values.

8.5 Performance Measures

The performance of a SDN-based learning analytics platform is determined by analysis of total computation time and computation cost, which is calculated using Eqs. (8.1) and (8.2). To perform this experiment, we have generated the learners data, which is based on the traces of real-time learners data. The data size ranges from 2 to 32 GB and the OpenFlow switches are configured with a maximum date rate of 200 Mbps. For the first experiment (without SDN), a single queue is created and allocated a bandwidth value of 100 Mbps. The second experiment (with SDN) is conducted by creating multiple queues, which ranges from Q0 to Q4 and bandwidth is allocated in the range of 20–100 Mbps. The proposed work allocates the bandwidth to the learning analytics process of various data sizes in an on-demand manner based on the required bandwidth in SCALE platform. Figures 8.4 and 8.5 represent the total computation time and total computation cost to perform the learning analytics process for the collected learners data. The total computation time (TCTime) is calculated using Eq. (8.1). It is the summation of task distribution time (TDTime), data analyzing time (DATime), results merging time (RMTime), and data transfer time (DTTime). The total computation cost (TCost) is calculated using the Eq. (8.2), which is based on the usage of bandwidth value for the learning analytics process of various data sizes.

$${\text{TC}}_{\text{Time}} = {\text{TD}}_{\text{Time}} + {\text{DA}}_{\text{Time}} + {\text{RM}}_{\text{Time}} + {\text{DT}}_{\text{Time}}$$
(8.1)
$${\text{TC}}_{\text{Cost}} = ({\text{TD}}_{\text{Time}} {\text{ + RM}}_{\text{Time}} {\text{ + DT}}_{\text{Time}} )*{\text{BW\_Usage}}*\$ / {\text{h}}$$
(8.2)
Fig. 8.4
figure 4

Comparison of total computation time

Fig. 8.5
figure 5

Comparison of total computation cost

8.6 Discussion and Recommendation

SLE is capable of providing an on-demand and adaptive support for the learners based on their needs. SCALE is the learning analytics platform to analyze and understand the learners competence measures that requires a huge amount of computation cycles and bandwidth in an on-demand manner. In this research paper, we have investigated the effect of coupling SDN-based networking principle in learning analytics process. From this study and the experimental results, the following things have been identified and recommended as follows:

  • The SLE generates a huge volume of learners’ data known as Big data. The total computation time required to process those data are high that increase the total computation cost.

  • The conventional networking is unsuccessful for provisioning of bandwidth in an on-demand manner due to various limitations. However, SDN has the capability to provision the network paths, allocating the bandwidth to the provisioned network paths in an on-demand manner.

  • The SDN-based network services in Smart Competence LEarning Analytics (SCALE) platform reduces the total computation time and computation cost of learning analytics process.

The future work intends to extend this research study further: (i) to develop an intelligent scheduling mechanism to identify the suitable computation and network nodes that increase the efficiency of learning analytics process. (ii) to extend the SDN-based SCALE platform for video streaming of online lectures and improving the Quality of service.