Open Access Open Access  Restricted Access Subscription Access

Cloak-Reduce Load Balancing Strategy for Mapreduce


Affiliations
1 Department of Mathematics and Computer Science, Nazi Boni University, Bobo-Dioulasso, Burkina Faso
 

The advent of Big Data has seen the emergence of new processing and storage challenges. These challenges are often solved by distributed processing.

Distributed systems are inherently dynamic and unstable, so it is realistic to expect that some resources will fail during use. Load balancing and task scheduling is an important step in determining the performance of parallel applications. Hence the need to design load balancing algorithms adapted to grid computing.

In this paper, we propose a dynamic and hierarchical load balancing strategy at two levels: Intra-scheduler load balancing, in order to avoid the use of the large-scale communication network, and inter-scheduler load balancing, for a load regulation of our whole system. The strategy allows improving the average response time of CLOAK-Reduce application tasks with minimal communication.

We first focus on the three performance indicators, namely response time, process latency and running time of MapReduce tasks.


Keywords

Big Data, Distributed Processing, Load Balancing, CLOAK-Reduce, Task Allocation.
User
Notifications
Font Size


  • Cloak-Reduce Load Balancing Strategy for Mapreduce

Abstract Views: 317  |  PDF Views: 166

Authors

Mamadou Diarra
Department of Mathematics and Computer Science, Nazi Boni University, Bobo-Dioulasso, Burkina Faso
Telesphore Tiendrebeogo
Department of Mathematics and Computer Science, Nazi Boni University, Bobo-Dioulasso, Burkina Faso

Abstract


The advent of Big Data has seen the emergence of new processing and storage challenges. These challenges are often solved by distributed processing.

Distributed systems are inherently dynamic and unstable, so it is realistic to expect that some resources will fail during use. Load balancing and task scheduling is an important step in determining the performance of parallel applications. Hence the need to design load balancing algorithms adapted to grid computing.

In this paper, we propose a dynamic and hierarchical load balancing strategy at two levels: Intra-scheduler load balancing, in order to avoid the use of the large-scale communication network, and inter-scheduler load balancing, for a load regulation of our whole system. The strategy allows improving the average response time of CLOAK-Reduce application tasks with minimal communication.

We first focus on the three performance indicators, namely response time, process latency and running time of MapReduce tasks.


Keywords


Big Data, Distributed Processing, Load Balancing, CLOAK-Reduce, Task Allocation.

References