Citation
Abstract
Hadoop Distributed File System (HDFS) and MapReduce programming model are for storage and retrieval of the big data. The Terabytes size file can be easily stored on the HDFS and can be analyzed with MapReduce. HDFS is becoming more popular in recent years as a key building block of integrated grid storage solution in the field of scientific computing. However, due to the nature of HDFS that it cannot support asynchronous write, it is widely confirmed that for the case of sustained high throughput in WAN transfer, single stream per GridFTP transfer is the best solution. GridFTP, designed by using Globus, is one of the most popular protocols for performing data transfers in the Grid environment. In this paper, we take on the challenge of integrating Hadoop with grid, by proposing a new framework called Grid-over-Hadoop by retaining the features of Hadoop and using GridFTP for data transfer.
Download File
Full text not available from this repository.
Official URL or Download Paper: https://www.arpnjournals.com/jeas/volume_18_2015.h...
|
Additional Metadata
Item Type: | Article |
---|---|
Divisions: | Faculty of Computer Science and Information Technology Institute for Mathematical Research |
Publisher: | Asian Research Publishing Network |
Keywords: | Hadoop; HDFS; GridFTP; MapReduce; Globus |
Depositing User: | Ms. Nuraida Ibrahim |
Date Deposited: | 15 Nov 2023 08:54 |
Last Modified: | 15 Nov 2023 08:54 |
URI: | http://psasir.upm.edu.my/id/eprint/44237 |
Statistic Details: | View Download Statistic |
Actions (login required)
View Item |