Abstract
The emerging interest in Massively Parallel Stream Processing Engines (MPSPEs), which are able to process longstanding computations over data streams with ever-growing velocity at a large-scale cluster, calls for efficient dynamic resource management techniques to avoid any waste of resources and/or excessive processing latency. In this paper, we propose an approach to integrate dynamic resource management with passive fault-tolerance mechanisms in a MP-SPE so that we can harvest the checkpoints prepared for failure recovery to enhance the efficiency of dynamic load migrations. To maximize the opportunity of reusing checkpoints for fast load migration, we formally define a checkpoint allocation problem and provide a pragmatic algorithm to solve it. We implement all the proposed techniques on top of Apache Storm, an open-source MPSPE, and conduct extensive experiments using a real dataset to examine various aspects of our techniques. The results show that our techniques can greatly improve the efficiency of dynamic resource reconfiguration without imposing significant overhead or latency to the normal job execution.
Originalsprog | Engelsk |
---|---|
Titel | Proceedings of the 24th ACM International Conference on Information and Knowledge Management |
Antal sider | 10 |
Forlag | Association for Computing Machinery |
Publikationsdato | 17 okt. 2015 |
Sider | 13-22 |
ISBN (Elektronisk) | 978-1-4503-3794-6 |
DOI | |
Status | Udgivet - 17 okt. 2015 |
Udgivet eksternt | Ja |
Begivenhed | 24th ACM International Conference on Information and Knowledge Management - Melbourne, Australien Varighed: 18 okt. 2015 → 23 okt. 2015 Konferencens nummer: 24 |
Konference
Konference | 24th ACM International Conference on Information and Knowledge Management |
---|---|
Nummer | 24 |
Land/Område | Australien |
By | Melbourne |
Periode | 18/10/2015 → 23/10/2015 |