Dynamic resource management in a massively parallel stream processing engine

Kasper Grud Skat Madsen; Yongluan Zhou

doi:10.1145/2806416.2806449

Dynamic resource management in a massively parallel stream processing engine

Kasper Grud Skat Madsen, Yongluan Zhou

13 Citations (Scopus)

Abstract

The emerging interest in Massively Parallel Stream Processing Engines (MPSPEs), which are able to process longstanding computations over data streams with ever-growing velocity at a large-scale cluster, calls for efficient dynamic resource management techniques to avoid any waste of resources and/or excessive processing latency. In this paper, we propose an approach to integrate dynamic resource management with passive fault-tolerance mechanisms in a MP-SPE so that we can harvest the checkpoints prepared for failure recovery to enhance the efficiency of dynamic load migrations. To maximize the opportunity of reusing checkpoints for fast load migration, we formally define a checkpoint allocation problem and provide a pragmatic algorithm to solve it. We implement all the proposed techniques on top of Apache Storm, an open-source MPSPE, and conduct extensive experiments using a real dataset to examine various aspects of our techniques. The results show that our techniques can greatly improve the efficiency of dynamic resource reconfiguration without imposing significant overhead or latency to the normal job execution.

Original language	English
Title of host publication	Proceedings of the 24th ACM International Conference on Information and Knowledge Management
Number of pages	10
Publisher	Association for Computing Machinery
Publication date	17 Oct 2015
Pages	13-22
ISBN (Electronic)	978-1-4503-3794-6
DOIs	https://doi.org/10.1145/2806416.2806449
Publication status	Published - 17 Oct 2015
Externally published	Yes
Event	24th ACM International Conference on Information and Knowledge Management - Melbourne, Australia Duration: 18 Oct 2015 → 23 Oct 2015 Conference number: 24

Conference

Conference	24th ACM International Conference on Information and Knowledge Management
Number	24
Country/Territory	Australia
City	Melbourne
Period	18/10/2015 → 23/10/2015

Access to Document

10.1145/2806416.2806449

Cite this

Dynamic resource management in a massively parallel stream processing engine. / Madsen, Kasper Grud Skat; Zhou, Yongluan.

Proceedings of the 24th ACM International Conference on Information and Knowledge Management. Association for Computing Machinery, 2015. p. 13-22.

Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review

Madsen, KGS & Zhou, Y 2015, Dynamic resource management in a massively parallel stream processing engine. in Proceedings of the 24th ACM International Conference on Information and Knowledge Management. Association for Computing Machinery, pp. 13-22, 24th ACM International Conference on Information and Knowledge Management, Melbourne, Australia, 18/10/2015. https://doi.org/10.1145/2806416.2806449

@inproceedings{79b2912e6507452e9866605bfa06ea09,

title = "Dynamic resource management in a massively parallel stream processing engine",

abstract = "The emerging interest in Massively Parallel Stream Processing Engines (MPSPEs), which are able to process longstanding computations over data streams with ever-growing velocity at a large-scale cluster, calls for efficient dynamic resource management techniques to avoid any waste of resources and/or excessive processing latency. In this paper, we propose an approach to integrate dynamic resource management with passive fault-tolerance mechanisms in a MP-SPE so that we can harvest the checkpoints prepared for failure recovery to enhance the efficiency of dynamic load migrations. To maximize the opportunity of reusing checkpoints for fast load migration, we formally define a checkpoint allocation problem and provide a pragmatic algorithm to solve it. We implement all the proposed techniques on top of Apache Storm, an open-source MPSPE, and conduct extensive experiments using a real dataset to examine various aspects of our techniques. The results show that our techniques can greatly improve the efficiency of dynamic resource reconfiguration without imposing significant overhead or latency to the normal job execution.",

author = "Madsen, {Kasper Grud Skat} and Yongluan Zhou",

year = "2015",

month = oct,

day = "17",

doi = "10.1145/2806416.2806449",

language = "English",

pages = "13--22",

booktitle = "Proceedings of the 24th ACM International Conference on Information and Knowledge Management",

publisher = "Association for Computing Machinery",

note = "24th ACM International Conference on Information and Knowledge Management, CIKM 2015 ; Conference date: 18-10-2015 Through 23-10-2015",

}

TY - GEN

T1 - Dynamic resource management in a massively parallel stream processing engine

AU - Madsen, Kasper Grud Skat

AU - Zhou, Yongluan

N1 - Conference code: 24

PY - 2015/10/17

Y1 - 2015/10/17

N2 - The emerging interest in Massively Parallel Stream Processing Engines (MPSPEs), which are able to process longstanding computations over data streams with ever-growing velocity at a large-scale cluster, calls for efficient dynamic resource management techniques to avoid any waste of resources and/or excessive processing latency. In this paper, we propose an approach to integrate dynamic resource management with passive fault-tolerance mechanisms in a MP-SPE so that we can harvest the checkpoints prepared for failure recovery to enhance the efficiency of dynamic load migrations. To maximize the opportunity of reusing checkpoints for fast load migration, we formally define a checkpoint allocation problem and provide a pragmatic algorithm to solve it. We implement all the proposed techniques on top of Apache Storm, an open-source MPSPE, and conduct extensive experiments using a real dataset to examine various aspects of our techniques. The results show that our techniques can greatly improve the efficiency of dynamic resource reconfiguration without imposing significant overhead or latency to the normal job execution.

AB - The emerging interest in Massively Parallel Stream Processing Engines (MPSPEs), which are able to process longstanding computations over data streams with ever-growing velocity at a large-scale cluster, calls for efficient dynamic resource management techniques to avoid any waste of resources and/or excessive processing latency. In this paper, we propose an approach to integrate dynamic resource management with passive fault-tolerance mechanisms in a MP-SPE so that we can harvest the checkpoints prepared for failure recovery to enhance the efficiency of dynamic load migrations. To maximize the opportunity of reusing checkpoints for fast load migration, we formally define a checkpoint allocation problem and provide a pragmatic algorithm to solve it. We implement all the proposed techniques on top of Apache Storm, an open-source MPSPE, and conduct extensive experiments using a real dataset to examine various aspects of our techniques. The results show that our techniques can greatly improve the efficiency of dynamic resource reconfiguration without imposing significant overhead or latency to the normal job execution.

U2 - 10.1145/2806416.2806449

DO - 10.1145/2806416.2806449

M3 - Article in proceedings

SP - 13

EP - 22

BT - Proceedings of the 24th ACM International Conference on Information and Knowledge Management

PB - Association for Computing Machinery

T2 - 24th ACM International Conference on Information and Knowledge Management

Y2 - 18 October 2015 through 23 October 2015

ER -

Dynamic resource management in a massively parallel stream processing engine

Abstract

Conference

Access to Document

Fingerprint

Cite this