Making time-stepped applications tick in the cloud

Tao Zou; Guozhang Wang; Marcos António Vaz Salles; David Bindel; Alan Demers; Johannes Gehrke; Walker White

doi:10.1145/2038916.2038936

Making time-stepped applications tick in the cloud

Tao Zou, Guozhang Wang, Marcos António Vaz Salles, David Bindel, Alan Demers, Johannes Gehrke, Walker White

Datalogisk Institut

17 Citationer (Scopus)

Abstract

Scientists are currently evaluating the cloud as a new platform. Many important scientific applications, however, perform poorly in the cloud. These applications proceed in highly parallel discrete time-steps or "ticks," using logical synchronization barriers at tick boundaries. We observe that network jitter in the cloud can severely increase the time required for communication in these applications, significantly increasing overall running time. In this paper, we propose a general parallel framework to process time-stepped applications in the cloud. Our framework exposes a high-level, data-centric programming model which represents application state as tables and dependencies between states as queries over these tables. We design a jitter-tolerant runtime that uses these data dependencies to absorb latency spikes by (1) carefully scheduling computation and (2) replicating data and computation. Our data-driven approach is transparent to the scientist and requires little additional code. Our experiments show that our methods improve performance up to a factor of three for several typical timestepped applications.

Originalsprog	Engelsk
Titel	Proceedings of the 2nd ACM Symposium on Cloud Computing
Antal sider	14
Forlag	Association for Computing Machinery
Publikationsdato	2011
Artikelnummer	20
ISBN (Trykt)	978-1-4503-0976-9
DOI	https://doi.org/10.1145/2038916.2038936
Status	Udgivet - 2011
Begivenhed	2nd ACM Symposium on Cloud Computing - Cascais, Portugal Varighed: 26 okt. 2011 → 28 okt. 2011 Konferencens nummer: 2

Konference

Konference	2nd ACM Symposium on Cloud Computing
Nummer	2
Land/Område	Portugal
By	Cascais
Periode	26/10/2011 → 28/10/2011

Adgang til dokumentet

10.1145/2038916.2038936

Citationsformater

@inproceedings{8d2cd5ab3d7744a98f0d77b602459d14,

title = "Making time-stepped applications tick in the cloud",

abstract = "Scientists are currently evaluating the cloud as a new platform. Many important scientific applications, however, perform poorly in the cloud. These applications proceed in highly parallel discrete time-steps or {"}ticks,{"} using logical synchronization barriers at tick boundaries. We observe that network jitter in the cloud can severely increase the time required for communication in these applications, significantly increasing overall running time. In this paper, we propose a general parallel framework to process time-stepped applications in the cloud. Our framework exposes a high-level, data-centric programming model which represents application state as tables and dependencies between states as queries over these tables. We design a jitter-tolerant runtime that uses these data dependencies to absorb latency spikes by (1) carefully scheduling computation and (2) replicating data and computation. Our data-driven approach is transparent to the scientist and requires little additional code. Our experiments show that our methods improve performance up to a factor of three for several typical timestepped applications.",

author = "Tao Zou and Guozhang Wang and {Vaz Salles}, {Marcos Ant{\'o}nio} and David Bindel and Alan Demers and Johannes Gehrke and Walker White",

year = "2011",

doi = "10.1145/2038916.2038936",

language = "English",

isbn = "978-1-4503-0976-9",

booktitle = "Proceedings of the 2nd ACM Symposium on Cloud Computing",

publisher = "Association for Computing Machinery",

note = "2nd ACM Symposium on Cloud Computing, SOCC 2011 ; Conference date: 26-10-2011 Through 28-10-2011",

}

TY - GEN

T1 - Making time-stepped applications tick in the cloud

AU - Zou, Tao

AU - Wang, Guozhang

AU - Vaz Salles, Marcos António

AU - Bindel, David

AU - Demers, Alan

AU - Gehrke, Johannes

AU - White, Walker

N1 - Conference code: 2

PY - 2011

Y1 - 2011

N2 - Scientists are currently evaluating the cloud as a new platform. Many important scientific applications, however, perform poorly in the cloud. These applications proceed in highly parallel discrete time-steps or "ticks," using logical synchronization barriers at tick boundaries. We observe that network jitter in the cloud can severely increase the time required for communication in these applications, significantly increasing overall running time. In this paper, we propose a general parallel framework to process time-stepped applications in the cloud. Our framework exposes a high-level, data-centric programming model which represents application state as tables and dependencies between states as queries over these tables. We design a jitter-tolerant runtime that uses these data dependencies to absorb latency spikes by (1) carefully scheduling computation and (2) replicating data and computation. Our data-driven approach is transparent to the scientist and requires little additional code. Our experiments show that our methods improve performance up to a factor of three for several typical timestepped applications.

AB - Scientists are currently evaluating the cloud as a new platform. Many important scientific applications, however, perform poorly in the cloud. These applications proceed in highly parallel discrete time-steps or "ticks," using logical synchronization barriers at tick boundaries. We observe that network jitter in the cloud can severely increase the time required for communication in these applications, significantly increasing overall running time. In this paper, we propose a general parallel framework to process time-stepped applications in the cloud. Our framework exposes a high-level, data-centric programming model which represents application state as tables and dependencies between states as queries over these tables. We design a jitter-tolerant runtime that uses these data dependencies to absorb latency spikes by (1) carefully scheduling computation and (2) replicating data and computation. Our data-driven approach is transparent to the scientist and requires little additional code. Our experiments show that our methods improve performance up to a factor of three for several typical timestepped applications.

U2 - 10.1145/2038916.2038936

DO - 10.1145/2038916.2038936

M3 - Article in proceedings

SN - 978-1-4503-0976-9

BT - Proceedings of the 2nd ACM Symposium on Cloud Computing

PB - Association for Computing Machinery

T2 - 2nd ACM Symposium on Cloud Computing

Y2 - 26 October 2011 through 28 October 2011

ER -

Making time-stepped applications tick in the cloud

Abstract

Konference

Adgang til dokumentet

Fingeraftryk

Citationsformater