On optimization of scientific workflows to support streaming applications in distributed network environments

Qishi Wu, Yi Gu, Xukang Lu, Michelle Zhu, Patrick Brown, Wuyin Lin, Yangang Liu

Research output: Chapter in Book/Report/Conference proceedingConference contributionResearchpeer-review

2 Citations (Scopus)

Abstract

Large-scale data-intensive streaming applications in various science fields feature complex DAG-structured workflows comprised of distributed computing modules with intricate inter-module dependencies. Supporting such workflows in high-performance network environments and optimizing their throughput are crucial to collaborative scientific exploration and discovery. We formulate workflow mapping as a frame rate optimization problem and propose an efficient heuristic solution, which is integrated into the Condor-based Scientific Workflow Automation and Management Platform (SWAMP) in place of Condor's default mapping scheme. The SWAMP system is also augmented with several new components to improve the workflow management process. The performance superiority of the proposed solution is verified using both simulations and a real-life scientific workflow for climate modeling deployed in a distributed heterogeneous network environment.

Original languageEnglish
Title of host publication2010 5th Workshop on Workflows in Support of Large-Scale Science, WORKS 2010
DOIs
StatePublished - 1 Dec 2010
Event2010 5th Workshop on Workflows in Support of Large-Scale Science, WORKS 2010 - New Orleans, LA, United States
Duration: 14 Nov 201014 Nov 2010

Publication series

Name2010 5th Workshop on Workflows in Support of Large-Scale Science, WORKS 2010

Other

Other2010 5th Workshop on Workflows in Support of Large-Scale Science, WORKS 2010
CountryUnited States
CityNew Orleans, LA
Period14/11/1014/11/10

Fingerprint

Automation
Heterogeneous networks
Distributed computer systems
Network performance
Throughput

Keywords

  • Distributed workflow
  • Frame rate
  • Grid computing

Cite this

Wu, Q., Gu, Y., Lu, X., Zhu, M., Brown, P., Lin, W., & Liu, Y. (2010). On optimization of scientific workflows to support streaming applications in distributed network environments. In 2010 5th Workshop on Workflows in Support of Large-Scale Science, WORKS 2010 [5671851] (2010 5th Workshop on Workflows in Support of Large-Scale Science, WORKS 2010). https://doi.org/10.1109/WORKS.2010.5671851
Wu, Qishi ; Gu, Yi ; Lu, Xukang ; Zhu, Michelle ; Brown, Patrick ; Lin, Wuyin ; Liu, Yangang. / On optimization of scientific workflows to support streaming applications in distributed network environments. 2010 5th Workshop on Workflows in Support of Large-Scale Science, WORKS 2010. 2010. (2010 5th Workshop on Workflows in Support of Large-Scale Science, WORKS 2010).
@inproceedings{b6b45023133447a889a524138d6fab0c,
title = "On optimization of scientific workflows to support streaming applications in distributed network environments",
abstract = "Large-scale data-intensive streaming applications in various science fields feature complex DAG-structured workflows comprised of distributed computing modules with intricate inter-module dependencies. Supporting such workflows in high-performance network environments and optimizing their throughput are crucial to collaborative scientific exploration and discovery. We formulate workflow mapping as a frame rate optimization problem and propose an efficient heuristic solution, which is integrated into the Condor-based Scientific Workflow Automation and Management Platform (SWAMP) in place of Condor's default mapping scheme. The SWAMP system is also augmented with several new components to improve the workflow management process. The performance superiority of the proposed solution is verified using both simulations and a real-life scientific workflow for climate modeling deployed in a distributed heterogeneous network environment.",
keywords = "Distributed workflow, Frame rate, Grid computing",
author = "Qishi Wu and Yi Gu and Xukang Lu and Michelle Zhu and Patrick Brown and Wuyin Lin and Yangang Liu",
year = "2010",
month = "12",
day = "1",
doi = "10.1109/WORKS.2010.5671851",
language = "English",
isbn = "9781424489893",
series = "2010 5th Workshop on Workflows in Support of Large-Scale Science, WORKS 2010",
booktitle = "2010 5th Workshop on Workflows in Support of Large-Scale Science, WORKS 2010",

}

Wu, Q, Gu, Y, Lu, X, Zhu, M, Brown, P, Lin, W & Liu, Y 2010, On optimization of scientific workflows to support streaming applications in distributed network environments. in 2010 5th Workshop on Workflows in Support of Large-Scale Science, WORKS 2010., 5671851, 2010 5th Workshop on Workflows in Support of Large-Scale Science, WORKS 2010, 2010 5th Workshop on Workflows in Support of Large-Scale Science, WORKS 2010, New Orleans, LA, United States, 14/11/10. https://doi.org/10.1109/WORKS.2010.5671851

On optimization of scientific workflows to support streaming applications in distributed network environments. / Wu, Qishi; Gu, Yi; Lu, Xukang; Zhu, Michelle; Brown, Patrick; Lin, Wuyin; Liu, Yangang.

2010 5th Workshop on Workflows in Support of Large-Scale Science, WORKS 2010. 2010. 5671851 (2010 5th Workshop on Workflows in Support of Large-Scale Science, WORKS 2010).

Research output: Chapter in Book/Report/Conference proceedingConference contributionResearchpeer-review

TY - GEN

T1 - On optimization of scientific workflows to support streaming applications in distributed network environments

AU - Wu, Qishi

AU - Gu, Yi

AU - Lu, Xukang

AU - Zhu, Michelle

AU - Brown, Patrick

AU - Lin, Wuyin

AU - Liu, Yangang

PY - 2010/12/1

Y1 - 2010/12/1

N2 - Large-scale data-intensive streaming applications in various science fields feature complex DAG-structured workflows comprised of distributed computing modules with intricate inter-module dependencies. Supporting such workflows in high-performance network environments and optimizing their throughput are crucial to collaborative scientific exploration and discovery. We formulate workflow mapping as a frame rate optimization problem and propose an efficient heuristic solution, which is integrated into the Condor-based Scientific Workflow Automation and Management Platform (SWAMP) in place of Condor's default mapping scheme. The SWAMP system is also augmented with several new components to improve the workflow management process. The performance superiority of the proposed solution is verified using both simulations and a real-life scientific workflow for climate modeling deployed in a distributed heterogeneous network environment.

AB - Large-scale data-intensive streaming applications in various science fields feature complex DAG-structured workflows comprised of distributed computing modules with intricate inter-module dependencies. Supporting such workflows in high-performance network environments and optimizing their throughput are crucial to collaborative scientific exploration and discovery. We formulate workflow mapping as a frame rate optimization problem and propose an efficient heuristic solution, which is integrated into the Condor-based Scientific Workflow Automation and Management Platform (SWAMP) in place of Condor's default mapping scheme. The SWAMP system is also augmented with several new components to improve the workflow management process. The performance superiority of the proposed solution is verified using both simulations and a real-life scientific workflow for climate modeling deployed in a distributed heterogeneous network environment.

KW - Distributed workflow

KW - Frame rate

KW - Grid computing

UR - http://www.scopus.com/inward/record.url?scp=78751488983&partnerID=8YFLogxK

U2 - 10.1109/WORKS.2010.5671851

DO - 10.1109/WORKS.2010.5671851

M3 - Conference contribution

SN - 9781424489893

T3 - 2010 5th Workshop on Workflows in Support of Large-Scale Science, WORKS 2010

BT - 2010 5th Workshop on Workflows in Support of Large-Scale Science, WORKS 2010

ER -

Wu Q, Gu Y, Lu X, Zhu M, Brown P, Lin W et al. On optimization of scientific workflows to support streaming applications in distributed network environments. In 2010 5th Workshop on Workflows in Support of Large-Scale Science, WORKS 2010. 2010. 5671851. (2010 5th Workshop on Workflows in Support of Large-Scale Science, WORKS 2010). https://doi.org/10.1109/WORKS.2010.5671851