MOON: MapReduce on opportunistic eNvironments

Heshan Lin*, Xiaosong Ma, Jeremy Archuleta, Wu Chun Feng, Mark Gardner, Zhe Zhang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

122 Citations (Scopus)

Abstract

MapReduce offers an ease-of-use programming paradigm for processing large data sets, making it an attractive model for distributed volunteer computing systems. However, unlike on dedicated resources, where MapReduce has mostly been deployed, such volunteer computing systems have significantly higher rates of node unavailability. Furthermore, nodes are not fully controlled by the MapReduce framework. Consequently, we found the data and task replication scheme adopted by existing MapReduce implementations woefully inadequate for resources with high unavailability. To address this, we propose MOON, short for MapReduce On Opportunistic eNvironments. MOON extends Hadoop, an open-source implementation of MapReduce, with adaptive task and data scheduling algorithms in order to offer reliable MapReduce services on a hybrid resource architecture, where volunteer computing systems are supplemented by a small set of dedicated nodes. Our tests on an emulated volunteer computing system, which uses a 60-node cluster where each node possesses a similar hardware configuration to a typical computer in a student lab, demonstrate that MOON can deliver a three-fold performance improvement to Hadoop in volatile, volunteer computing environments.

Original languageEnglish
Title of host publicationHPDC 2010 - Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
Pages95-106
Number of pages12
DOIs
Publication statusPublished - 2010
Externally publishedYes
Event19th ACM International Symposium on High Performance Distributed Computing, HPDC 2010 - Chicago, IL, United States
Duration: 21 Jun 201025 Jun 2010

Publication series

NameHPDC 2010 - Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing

Conference

Conference19th ACM International Symposium on High Performance Distributed Computing, HPDC 2010
Country/TerritoryUnited States
CityChicago, IL
Period21/06/1025/06/10

Keywords

  • Cloud computing
  • MapReduce
  • Volunteer computing

Fingerprint

Dive into the research topics of 'MOON: MapReduce on opportunistic eNvironments'. Together they form a unique fingerprint.

Cite this