FRESH: Fair and efficient slot configuration and scheduling for Hadoop clusters

Jiayin Wang, Yi Yao, Ying Mao, Bo Sheng, Ningfang Mi

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

30 Scopus citations

Abstract

Hadoop is an emerging framework for parallel big data processing. While becoming popular, Hadoop is too complex for regular users to fully understand all the system parameters and tune them appropriately. Especially when processing a batch of jobs, default Hadoop setting may cause inefficient resource utilization and unnecessarily prolong the execution time. This paper considers an extremely important setting of slot configuration which by default is fixed and static. We proposed an enhanced Hadoop system called FRESH which can derive the best slot setting, dynamically configure slots, and appropriately assign tasks to the available slots. The experimental results show that when serving a batch of MapReduce jobs, FRESH significantly improves the makespan as well as the fairness among jobs.

Original languageEnglish
Title of host publicationProceedings - 2014 IEEE 7th International Conference on Cloud Computing, CLOUD 2014
EditorsCarl Kesselman
PublisherIEEE Computer Society
Pages761-768
Number of pages8
ISBN (Electronic)9781479950638
DOIs
StatePublished - 3 Dec 2014
Event7th IEEE International Conference on Cloud Computing, CLOUD 2014 - Anchorage, United States
Duration: 27 Jun 20142 Jul 2014

Publication series

NameIEEE International Conference on Cloud Computing, CLOUD
ISSN (Print)2159-6182
ISSN (Electronic)2159-6190

Other

Other7th IEEE International Conference on Cloud Computing, CLOUD 2014
Country/TerritoryUnited States
CityAnchorage
Period27/06/142/07/14

Keywords

  • MapReduce
  • Resource Management
  • Scheduling

Fingerprint

Dive into the research topics of 'FRESH: Fair and efficient slot configuration and scheduling for Hadoop clusters'. Together they form a unique fingerprint.

Cite this