Paper Title
Design And Estimation Of Big Data Analysis Using Mapreduce And Hadoop-A
Abstract
Hadoop is a trendy open-source implementation of the MapReduce programming model. Map-Reduce and
HDFS are the two major mechanisms of Hadoop. To avoid network congestions, a new technique to preprocess intermediate
data between the maps and reduce stages, thus increasing the throughput of Hadoop clusters. These include a serialization
barrier that delays the reduce phase and repetitive merges, disk accesses. To handle large dataset needs to progress the
performance by modifying existing Hadoop system. Describe Hadoop-A, an acceleration framework that optimizes Hadoop
with plugin mechanism implemented for fast data movement, overcoming its existing limitations. A merge algorithm is
introduced to merge data without replication and disk access. In addition, a full pipeline is designed to overlap the shuffle,
merge and reduce phases. Experimental results show that Hadoop-A significantly speeds up data movement in MapReduce.
Keywords— Hadoop, MapReduce, Hadoop Acceleration.
Author - Darshana Wajekar, Sushila Ratre
Published : Volume-3,Issue-6 ( Jun, 2016 )
DOIONLINE Number - IJAECS-IRAJ-DOIONLINE-4792
View Here
|
|
| |
|
PDF |
| |
Viewed - 49 |
| |
Published on 2016-07-12 |
|
|
|
|
|
|