Abstract
Scientists are currently evaluating the cloud as a new platform. Many important scientific applications, however, perform poorly in the cloud. These applications proceed in highly parallel discrete time-steps or "ticks," using logical synchronization barriers at tick boundaries. We observe that network jitter in the cloud can severely increase the time required for communication in these applications, significantly increasing overall running time. In this paper, we propose a general parallel framework to process time-stepped applications in the cloud. Our framework exposes a high-level, data-centric programming model which represents application state as tables and dependencies between states as queries over these tables. We design a jitter-tolerant runtime that uses these data dependencies to absorb latency spikes by (1) carefully scheduling computation and (2) replicating data and computation. Our data-driven approach is transparent to the scientist and requires little additional code. Our experiments show that our methods improve performance up to a factor of three for several typical timestepped applications.
Original language | English |
---|---|
Title of host publication | Proceedings of the 2nd ACM Symposium on Cloud Computing |
Number of pages | 14 |
Publisher | Association for Computing Machinery |
Publication date | 2011 |
Article number | 20 |
ISBN (Print) | 978-1-4503-0976-9 |
DOIs | |
Publication status | Published - 2011 |
Event | 2nd ACM Symposium on Cloud Computing - Cascais, Portugal Duration: 26 Oct 2011 → 28 Oct 2011 Conference number: 2 |
Conference
Conference | 2nd ACM Symposium on Cloud Computing |
---|---|
Number | 2 |
Country/Territory | Portugal |
City | Cascais |
Period | 26/10/2011 → 28/10/2011 |