Keywords

1 Introduction

The modern era of 21st century encompasses a digital backbone. Fresh principles are entering our lives each and every day by use of technology. Things which were once dreams for our forefathers are now existing in reality and serving the basic human needs. Cloud computing is one such aspect that is propagating contemporary human civilization to a next level by its unmatchable services. The fundamental definition of cloud computing is “the usage of computing resources over the internet”. Today cloud computing has acquired a serious level of popularity as compared to existing techniques. The chief contributing aspect to this substantial growth is the “Elasticity” behavior catered by the cloud computing to the user. This elasticity permits the user to increase or decrease the demand of computing resources at any instant of time as per their requirements. This allows the users to lower down the superfluous costs that are generally taxed on them if physical or personnel resources were deployed from their end. Now only usage fee is charged upon them and nothing else. Moreover the maintenance costs also lowers down by shifting to cloud. Cloud computing has completely revolutionized the scenario of business computing. But the continuous evolution of this gem has now turned its application towards diverse fields. One such aspect where cloud computing has spread its wings is “Air Pollution Monitoring”. Currently, a lot of research work is being carried out by different researchers who are leaving no stone unturned to find an antidote to this perilous mess. Number of pollution monitoring devices are deployed by various organizations to carry out the surveillance work. The major problem associated with the assigned devices is that they generate huge amount of data which requires some serious computation. Now cloud computing can be efficient solution to this due to its elastic nature. But what turns out to be boon for one, sometimes turns out to be a bane for the other. The resources that are allocated by the service provider to the devices are sometimes overused and sometimes underused. The major influential factor that leads to this muddle is of improper management of resources. The provider has to bear huge losses for the underutilization part as unutilized resources are just eating up the power without providing any substantial benefits. The over utilization taxes the resources pretty brutally and leads to deterioration of the resources. Hence there is some serious need to address this issue and a dynamic approach is required which can find a blend to allocate appropriate amount of resources in such a manner that utilization factor the resources is always high and succor the provider with some good amount of benefits.

In this paper we have tried to adopt a completely different technique to increase the utility of the resources. This algorithmic technique named as “OCR-SAPM” (optimizing cloud resources to sate air pollution monitoring) endeavors to utilize the unused resources of peculiar host by adjusting the load on different hosts along with considering their capacities to handle the load. The future part is divided into 5 sections. Section 2 discusses the literature survey and analysis the work that has been carried out earlier by various authors to gain the best possible solution. Section 2.1 describes the problematic nature of the mentioned techniques. Section 2.2 discusses the major motivational force behind the work. Section 3 firstly describes the terminology required for the smooth understanding of the algorithm. This section further contains Sects. 3.1 and 3.2 which discusses the “Post and Pillar Problem” and OCR-SAPM algorithm respectively. The subsequent part the results and analysis part followed by conclusion respectively. At last all the references of different works are listed out without help of which this work wouldn’t have been possible.

2 Related Work

For a service provider, an effective management of resources is becoming a convoluted task as with the popularity of cloud computing. Thoughtful efforts are being made towards achieving the desired outcomes. The research under this is not fresh but had been there for quite a time. [2] Under his work provided an appropriate “MLE” method of estimating the resource requirement of the users under severe complexities of IAAS and E-Learning system. The practical implementations were carried out with the help of “Samsung School Solutions” to monitor the effectiveness of the algorithm. Few Additional Resources were always reserved in case of emergency requests. Nagpure in [3] employs a special load balancing server, whose sole purpose is to envisage the upcoming demands. “Skewness” was used to determine the factors responsible for creation of chaos under resource optimization and how it can be dealt. [4] Provides a methodology to counter back and forth occurrence of resource demand using certain bound values. [1] Proposes a proficient technique named as “auto-scaling algorithm” to provide resource allocation for different workflow inputs. [5] Provides a technique to balance the load on the system by considering the response time associated with each task. The author first address the contemporary issues with other load balancing algorithms out there in the world. The algorithms predicts the response time for the incoming requests. In [6], author strives to publish a perfect difference between the static and dynamic techniques out there that helps to balance the load. It also caters a new elasticity policy to help the provider to cater needs in a better and appropriate manner. [7] Works for the multimedia cloud and helps the cloud industry by an algorithm for allocation of resources on “Near-Client-Datacenter” basis. The network analysis was taken into due consideration along with the allocation part in his work. The problem of catering sudden outburst in workload with efficient management and allocation of resources was handled in [8]. Here traffic burst was considered for different applications under different scenarios like using dataset of 1998 World Cup to find out the desired results. Saad and Babar in their taxonomy [9] tries to present the analysis of various categories under “Resource Management” which includes Energy-aware management, SLA-aware management and many more categories. A detailed comparative analysis was provided by the authors to process the fortes and flaws of different techniques. [10] Addresses a very critical, unmanageable problem of energy consumption of the data centers and tries to propose a holistic approach based on multi-criteria decision system. S. Manvi in his work [11] studies the various issues of one of the services of cloud computing namely IAAS and suggests various schemes out in the computing world that are being utilized to counterpoise the issue. An adaptive migration technique for various resources by using skewness and prediction methodology was provided by [12]. The next section explores a brief comparison between certain specified algorithms for specifying the problems that these algorithms are facing (Table 1).

Table 1. Brief comparative analysis between certain specified algorithms.

2.1 Problem Description

Though the proposed works of all the above mentioned author is pretty impressive, but the major stage used for their deployment is either “Mean/Average” based or a completely different “Central monitoring system” based whose purpose is to be a spectator and just monitor the other hosts for any adverse circumstances or for switching task if required. The problem with the latter is that deploying a complete system just for monitoring purpose is not valuable taking into effect that the contemporary systems are way more powerful and efficient. This “Patterned” or synchronous approach is not appropriate as the system can be utilized for some other useful computing and scientific work. Moreover, the problem with the former “Mean” method is that analysis are based on average of a parameter as compared to a bigger picture. As “Mean” can only lower the gap between the available and the optimal but cannot provide an optimal solution which eventually hampers the optimization of resources. Certain Algorithms also uses “Prediction” based methodology to assign resources which most of the times backfires. OCR-SAPM address these problems by switching from a “Patterned” approach to a “Non-Patterned” approach. Here there is no deployment of any central monitoring system, rather there is an integration of the existing systems.

2.2 Motivational Work

Whenever we talk about the advancements in the technology, one name is there that can’t be left unnoticed. The name is “Google”. As we all are very well aware about the drastic effects that are being foreseen by the environmentalists regarding climate change due to snowballing pollution levels, Google has come upfront and took the initiative to put curtains to this mess. At the 11th Annual Clinton Global Annual Google declared his partnership with Aclima, a San-Francisco based sensor manufacturing organization, to utilize its street view cars to generate pollution maps [15] which would help the users using google maps to get instantaneous updates of the pollution levels of different areas. The Cars will be properly equipped with different sensors to monitor different pollutants like CO2, NO2, and Particulate Matter (PM) etc. The data collected by the sensory nodes is transferred to cloud systems, but before transmitting it, the values are amplified refined as per requirements to make a hassle free transfer. The cloud system analyzes the data and provide feedback as RED zones or GREEN zones according to severity of pollution level. Moreover, the data that has been collected is further made available to ENVIRONMENTALIST, public Government agencies etc. to use that data to cater certain useful measures that can help to put curtains to this ever growing hazardous problem. The following figure illustrates the appropriate stepwise process of the above mentioned scenario (Fig. 1).

Fig. 1.
figure 1

An overview of general steps under specified Air pollution monitoring scenario.

Data generation by these sensory devices varies from moderate to gigantic level. This fitful generation makes data processing at the cloud side difficult in terms of resource allocation. Provider cannot simply predict the amount of resources that are required to be allocated. This unevenness of data depends upon the working area of the sensors. For instance, a crowded market place will generate an enormous amount of data after monitoring as compared to a residential area. Hence, our algorithm “OCR-SAPM” considers this chaos and delivers an efficient and effective algorithmic solution for the plight of provider. The next section will address the work that has been proposed under “OCR-SAPM”.

3 Proposed Work

There is a definite set of terminology associated with OCR-SAPM that needs to be addressed before leaping into any subsequent part. The following table describes them all (Table 2).

Table 2. Terminology and Definition.

3.1 Post and Pillar Effect

Before jumping into the pool of OCR-SAPM, there is a concept associated with BOBO and TOBO values that needs to be addressed. The concept is named as “Post and Pillar Effect”. The concept is named after back and forth flow that occurs sometimes due to allocation and deallocation of load i.e. on increasing the load, the overload condition occurs, whereas on decreasing the load, the under load condition occurs. This is known as “Post and Pillar effect”. To make sure that such situation never occurs, the values of BOBO and TOBO for each hosts are assigned using two methods which are discussed as under:

  • Inert Allocation: Under this technique the value BOBO and TOBO are assigned in a static context. Here the TOBO is assigned higher to achieve a greater utilization factor. Now assigning a higher bound also increases the chances of underutilization. To counterpoise this problem we limit the value of BOBO as:

    $$ \varvec{BOBO} < \frac{{\varvec{TOBO}}}{2} $$
    (1)
  • Relative allocation: Under relative allocation the TOBO is settled first, but the value of BOBO is assigned by scrutinizing TOBO. Under this method, there is occurrence of frequent changes to BOBO value according to the load on the system. Under this BOBO is given in terms of TOBO as:

    $$ \varvec{BOBO} < \frac{{\left( {\varvec{MAX}\_\varvec{RA} - \varvec{RAP}} \right) - \left( {\varvec{MAX}\_\varvec{RA} - \varvec{RAP}^{ '} } \right)}}{{\varvec{MAX}\_\varvec{RA} - \varvec{RAP}}}.\varvec{TOBO} $$
    (2)
  • Declaration: Decreasing load does not simultaneously generates under loading situation if BOBO and TOBO are assigned as stated above.

  • Evidence: Initial total load before decrease in load can be given as:

    $$ \left( {\varvec{MAX}\_\varvec{RA} - \varvec{RAP}} \right).\varvec{BOBO} $$
    (3)

Now after the load is decreased, the new total load according to (3) can be given as,

$$ \frac{{\left( {\varvec{MAX}\_\varvec{RA} - \varvec{RAP}} \right)}}{{\left( {\varvec{MAX}\_\varvec{RA} - \varvec{RAP}} \right) - \left( {\varvec{MAX}\_\varvec{RA} - \varvec{RAP}^{ '} } \right)}}.\varvec{BOBO} $$

Were RAP’ is the new proportion of available resources. To make sure that on increasing load, overload condition does not occurs, we must set

$$ \varvec{TOBO} > \frac{{\left( {\varvec{MAX}\_\varvec{RA} - \varvec{RAP}} \right)}}{{\left( {\varvec{MAX}\_\varvec{RA} - \varvec{RAP}} \right) - \left( {\varvec{MAX}\_\varvec{RA} - \varvec{RAP}^{ '} } \right)}}.\varvec{BOBO} $$

Or

$$ \varvec{BOBO} < \frac{{\left( {\varvec{MAX}\_\varvec{RA} - \varvec{RAP}} \right) - \left( {\varvec{MAX}\_\varvec{RA} - \varvec{RAP}^{ '} } \right)}}{{\varvec{MAX}\_\varvec{RA} - \varvec{RAP}}}.\varvec{TOBO} $$
(4)
  • Declaration: Increasing load does not simultaneously generates over loading situation if BOBO and TOBO are assigned as stated above.

  • Evidence: Initial total load before increase in load can be given as:

    $$ \left( {\varvec{MAX}\_\varvec{RA} - \varvec{RAP}} \right).\varvec{TOBO} $$
    (5)

Now after the load is increased, the new total load according to (5) can be given as,

$$ \frac{{\left( {\varvec{MAX}\_\varvec{RA} - \varvec{RAP}} \right)}}{{\left( {\varvec{MAX}\_\varvec{RA} - \varvec{RAP}} \right) + \left( {\varvec{MAX}\_\varvec{RA} - \varvec{RAP}^{ '} } \right)}}.\varvec{TOBO} $$

To make sure that on removing, under load condition does not occurs, we must set,

$$ \varvec{BOBO} < \frac{{\left( {\varvec{MAX}\_\varvec{RA} - \varvec{RAP}} \right)}}{{\left( {\varvec{MAX}\_\varvec{RA} - \varvec{RAP}} \right) + \left( {\varvec{MAX}\_\varvec{RA} - \varvec{RAP}^{ '} } \right)}}.\varvec{TOBO} $$
(6)

Now from (4) and (6),

$$ \frac{{\left( {\varvec{MAX}\_\varvec{RA} - \varvec{RAP}} \right)}}{{\left( {\varvec{MAX}\_\varvec{RA} - \varvec{RAP}} \right) + \left( {\varvec{MAX}\_\varvec{RA} - \varvec{RAP}^{ '} } \right)}}.\varvec{TOBO} > \frac{{\left( {\varvec{MAX}\_\varvec{RA} - \varvec{RAP}} \right) - \left( {\varvec{MAX}\_\varvec{RA} - \varvec{RAP}^{ '} } \right)}}{{\varvec{MAX}\_\varvec{RA} - \varvec{RAP}}}.\varvec{TOBO} > \varvec{BOBO} $$

Hence to make sure that post pillar problem never occurs, set

$$ \varvec{BOBO} < \frac{{\left( {\varvec{MAX}\_\varvec{RA} - \varvec{RAP}} \right) - \left( {\varvec{MAX}\_\varvec{RA} - \varvec{RAP}^{ '} } \right)}}{{\varvec{MAX}\_\varvec{RA} - \varvec{RAP}}}.\varvec{TOBO} $$

3.2 OCR-SAPM Algorithm

The modern industry computers have become so powerful that handling hundreds of users at a particular instance of time is just like a piece of cake. Appointing a scrutinizer over these powerful machines is just a wastage of computing power which can be labeled with some other scientific and beneficial work. In order to put an end to this approach, OCR-SAPM algorithm uses “entangled” approach as stated in diagram. Under this each computing node has a copy of Common_Tab_Map table which gets apprised by each and every other node once the resource are manipulated by incoming or outgoing of Data that requires resources for processing. As below in Fig. 3, as soon as the resources are consumed from node A, all other nodes are apprised by it from A and the Common_Tab_Map is refreshed by all (Fig. 2).

Fig. 2.
figure 2

Entangled Approach for Common_Tab_Map.

The Common_Tab_Map and RAP performs the core functionality of the algorithm. The successful working of algorithm depends upon these two parameters. The greater value of RAP signifies that there is substantial amount of underutilization is prevailing among hosts and vice-versa. Genesis of OCR-SAPM contains a module as Regular_Optimize() which first gets introduced to the incoming data. The incoming CL_data is simply assigned to Host [x] presuming it to be capable enough to provide service. Now, Common_Tab_Map is revised for the existing resources and is sorted out in an aspiring manner. Check_HDL() now joins the party and looks for the ideal host that satisfies two conditions namely, Fulfill Data resource requirement and Improve resource utilization. These conditions are surveyed from Common_Tab_Map in such a manner that the specific host has ample RAP value (minimum) available to provide resource supply without fracturing its TOBO. Now if there is an availability, then the ID of that host is stored in HDL and that CL_Data is assigned to it for processing. Common_Tab_Map is again restructured via sorting.

But, this simple thing can become problematic if there is not even a single host to provide service and the data arrived sticks to the existing host which also unfortunately becomes overloaded (Monitor_OL()) and demands immediate offloading. If this kind of situation arrives then OCR-SAPM transforms itself from “intra host” where exchange was from one host to another of a machine to “inter host” where now it will be carried out between hosts under different machines. OL_OCR_SAPM() is asked for a solution. The OL_OCR_SAPM() first derives the TOBO and BOBO values and along with this, a list named as “CL_Data_List” is prepared for the current data that is under processing on that host. Now in order to fully optimize the resource usage, it is not appropriate to assign any ith CL_Data. To achieve the best results, The CL_Data that requires minimum amount of resources is selected and is stored under, CL_Data_Min. To assign this under a suitable Host, get_HDLR_ID_Common_Tab_Map_OL() is summoned which scrutinizes the Common_Tab_Map for the host (other than current) that can successfully satisfy optimization conditions as well fulfil resource catering service for the CL_Data. The RAP again plays an important role here as selection of host is based on the minimal level i.e. the host having minimum RAP is preferred, provided it does not tax its resources upon selection. If under extremely unfavorable circumstances, a situation comes when there is no host that can cater the service, then we have to apply new nodes to lower the burden on current host.

Till now our central theme for optimization was restricted to overloading situation. But optimization of resources doesn’t always associated with overloading. Sometimes the situation of under loading can also severely obstruct the optimization. Most of the contemporary algorithm doesn’t take this situation into their radar for optimization. OCR-SAPM challenges this problem and provides and efficient solution. MonitorUL() raises an under load condition whenever the RAP becomes greater than BOBO i.e. the maximum resources are unwaged. UL_OCR_SAPM() is called which works likewise as OL_OCR_SAPM() but instead of selecting the CL_Data_Min, it chooses CL_Data_Max i.e. the supreme resource yearning is chosen. On the similar guidelines get_HDLR_ID_Common_Tab_Map_UL() is invoked and on accomplishment, the unrequired freed hosts are Signed-Off to provide resource provider ample and high yielding amount of benefits. The following section discusses the complete pseudo-code OCR-SAPM algorithm (Tables 3, 4, 5, 6, 7, 8, 9 and 10).

Table 3. Algorithm: Regular_Optimize(Common_Tab_Map).
Table 4. Algorithm: Check_HDL (Common_Tab_Map, Resources).
Table 5. Algorithm: OL_OCR_SAPM (Host [x], Common_Tab_Map).
Table 6. Algorithm: get_HDLR_ID_Common_Tab_Map_OL (Common_Tab_Map,Resources).
Table 7. Algorithm: MonitorUL(Common_Tab_Map).
Table 8. Algorithm: MonitorOL(Common_Tab_Map).
Table 9. Algorithm: UL_OCR_SAPM (Host [x], Common_Tab_Map).
Table 10. Algorithm: getHDLR_ID_Common_Tab_Map_UL (Common_Tab_Map,Resources).

3.3 OCR-SAPM Algorithm

4 Results and Analysis

Under this section a detailed graphical analysis is presented containing the percentage increase in the optimization levels using OCR-SAPM algorithm. The Sample data from [13, 14] was used to calculating the results. Pollutants such as Particulate Matter (PM), Ozone (O3), Nitrogen Dioxide (NO2) and Sulfur Dioxide (SO2) were taken into consideration as shown in Table 11.

Table 11. Different values of different pollutants received from different areas. (In AQI).

Simulation experiments were carried out using CloudSim 3.0.3. The inceptive load was assigned to different hosts on different machines using the random data generator function and further the run was augmented by the dataset for evaluation. Comparative analysis were performed using different number of task under different time intervals as described in Figs. 3 and 4. Preliminary analysis clearly reveals that there is significant resource utilization improvement if we switch to the given algorithm.

Fig. 3.
figure 3

Resource utilization comparison. (Color figure online)

Fig. 4.
figure 4

Sign-Off. (Color figure online)

The simulation results clearly indicates the increase in utilization factor of the resources. Yellow and Grey lines indicate the TOBO and BOBO respectively Here TOBO is static, but BOBO keeps on adjusting as per the load. Blue and Orange line signifies the improvement factor discovered with the use of OCR-SAPM as compared to centrally appointed systems.

Figure 4 address the site of immediate load shrinkage which leads to increase in the RAP and as soon as RAP goes greater than BOBO, The load of that host in transferred to other host that can service its needs and also remain under TOBO and the current host is logged off.

5 Conclusion

Air pollution control is one of the utmost important agendas that needs some serious addressing. This paper proposes a dynamic resource optimization technique that caters its services to air pollution monitoring devices for efficient working. The Experimental analysis does reveal that there is significant improvement achieved in utilization part if OCR-SAPM is deployed. The algorithm benefits the resource providing authority with some serious reductions in cost factors by signing off certain unused resources and effectively utilizing the deployed ones. The future work can include to further optimize the algorithm to adapt network fluctuations and unavoidable circumstances related to optimization.