Abstract
There is a recent interest in building MapReduce-style platforms for fast data processing, such as MapReduce online [2] and Muppet [5]. In this paper, we highlight the need for dynamic load management in a distributed data stream processing system and present Enorm, a MapReduce-style data stream processing platform with the focus on techniques to achieve dynamic resource management, i.e. the ability to dynamically balance the workload among the running instances and scale the resource usage according to the runtime workload fluctuations. The original MapReduce framework is designed for batched processing and dynamic scaling can only be achieved between batches. To address this problem, we propose a MapReduce-style computation framework and a set of corresponding adaptation strategies that can perform dynamic scaling on the fly with low processing latency.
Original language | English |
---|---|
Title of host publication | Proceedings of the 31st IEEE International Conference on Data Engineering Workshops |
Number of pages | 4 |
Publisher | IEEE |
Publication date | 19 Jun 2015 |
Pages | 10-13 |
ISBN (Electronic) | 978-1-4799-8442-8 |
DOIs | |
Publication status | Published - 19 Jun 2015 |
Externally published | Yes |
Event | 31st IEEE International Conference on Data Engineering Workshops - Seoul, Korea, Republic of Duration: 13 Apr 2015 → 17 Apr 2015 |
Conference
Conference | 31st IEEE International Conference on Data Engineering Workshops |
---|---|
Country/Territory | Korea, Republic of |
City | Seoul |
Period | 13/04/2015 → 17/04/2015 |