Online data partitioning in distributed database systems

Kaiji Chen, Yongluan Zhou, Yu Cao

10 Citations (Scopus)

Abstract

Most previous studies on automatic database partitioning focus on deriving a (near-)optimal (re)partition scheme for a specific pair of database and query workload, and overlook the problem of how to efficiently deploy the derived partition scheme into the underlying database system. In fact, deploying a (re)partition scheme is often non-trivial and challenging, especially in a distributed OLTP system where the repartitioning is expected to take place online without interrupting or disrupting the processing of normal transactions. In this paper, we propose SOAP, a system framework for scheduling online database repartitioning for OLTP workloads. SOAP aims to minimize the time frame for executing the repartition operations while guaranteeing the correctness and performance of the concurrent processing of normal transactions. SOAP packages the repartition operations into repartition transactions, and then mixes them with the normal transactions for holistic scheduling optimization. SOAP uses a cost-based approach to rank the scheduling priorities of the repartition transactions, and leverages a feedback model from control theory to determine in which order and at which frequency the repartition transactions should be scheduled for execution. When the system is under heavy workload or resource shortage, SOAP goes a step further by allowing repartition operations to piggyback onto the normal transactions so as to mitigate resource contention. We have built a prototype on top of PostgreSQL and conducted a comprehensive experimental study on Amazon EC2 to validate SOAP's significant performance advantages.
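The scheduling idea described in the abstract can be illustrated with a minimal sketch. It is not SOAP's actual algorithm: the `benefit / cost` ranking, the proportional-integral controller, the latency target, and every name below are assumptions made only for illustration. The sketch ranks pending repartition transactions by an assumed cost-based score, then uses feedback on the observed latency of normal transactions to decide how many of them to admit in the next scheduling interval.

```python
from dataclasses import dataclass


@dataclass
class RepartitionTxn:
    name: str
    benefit: float  # assumed: estimated gain for normal transactions once applied
    cost: float     # assumed: estimated cost of executing this repartition transaction


def rank_by_benefit_density(txns):
    """Order repartition transactions by benefit per unit cost, highest first."""
    return sorted(txns, key=lambda t: t.benefit / t.cost, reverse=True)


class PIController:
    """Hypothetical proportional-integral controller for the admission rate."""

    def __init__(self, target_latency_ms, kp=0.05, ki=0.01, max_rate=10.0):
        self.target = target_latency_ms
        self.kp = kp
        self.ki = ki
        self.max_rate = max_rate
        self.integral = 0.0

    def next_rate(self, observed_latency_ms):
        # Positive error means normal transactions are faster than the target,
        # so there is headroom for more repartition work.
        error = self.target - observed_latency_ms
        self.integral += error
        rate = self.kp * error + self.ki * self.integral
        # Clamp to the valid range: never negative, never above max_rate.
        return min(self.max_rate, max(0.0, rate))


def schedule_interval(queue, controller, observed_latency_ms):
    """Split the ranked queue into (admitted now, still pending)."""
    n = int(controller.next_rate(observed_latency_ms))
    return queue[:n], queue[n:]


pending = rank_by_benefit_density([
    RepartitionTxn("a", benefit=10.0, cost=5.0),
    RepartitionTxn("b", benefit=9.0, cost=1.0),
    RepartitionTxn("c", benefit=4.0, cost=4.0),
])
controller = PIController(target_latency_ms=100.0)
admitted, pending_after = schedule_interval(pending, controller, observed_latency_ms=60.0)
```

When the observed latency rises above the target, the error turns negative and the controller admits no repartition transactions. That is the situation in which the abstract says SOAP falls back to piggybacking repartition operations onto normal transactions, which this sketch does not model.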

Original language: English
Title: Advances in Database Technology — EDBT 2015 : Proceedings of the 18th International Conference on Extending Database Technology, Brussels, Belgium, March 23-27, 2015
Editors: Gustavo Alonso, Floris Geerts, Lucian Popa, Pablo Barceló, Jens Teubner, Martín Ugarte, Jan van den Bussche, Jan Paredaens
Number of pages: 12
Publisher: OpenProceedings.org
Publication date: 2015
Pages: 1-12
ISBN (Electronic): 978-3-89318-067-7
DOI
Status: Published - 2015
Published externally: Yes
Event: 18th International Conference on Extending Database Technology - Brussels, Belgium
Duration: 23 Mar 2015 to 27 Mar 2015

Conference

Conference: 18th International Conference on Extending Database Technology
Country/Territory: Belgium
City: Brussels
Period: 23/03/2015 to 27/03/2015
