Online data partitioning in distributed database systems

Kaiji Chen, Yongluan Zhou, Yu Cao

10 Citations (Scopus)

Abstract

Most previous studies on automatic database partitioning focus on deriving a (near-)optimal (re)partition scheme for a specific pair of database and query workload, and overlook the problem of how to efficiently deploy the derived partition scheme into the underlying database system. In fact, deploying a (re)partition scheme is often non-trivial and challenging, especially in a distributed OLTP system where the repartitioning is expected to take place online without interrupting or disrupting the processing of normal transactions. In this paper, we propose SOAP, a system framework for scheduling online database repartitioning for OLTP workloads. SOAP aims to minimize the time frame of executing the repartition operations while guaranteeing the correctness and performance of the concurrent processing of normal transactions. SOAP packages the repartition operations into repartition transactions, and then mixes them with the normal transactions for holistic scheduling optimization. SOAP uses a cost-based approach to rank the scheduling priorities of the repartition transactions, and leverages a feedback model from control theory to determine in which order and at which frequency the repartition transactions should be scheduled for execution. When the system is under heavy workload or short of resources, SOAP goes a step further by allowing repartition operations to piggyback onto the normal transactions so as to mitigate resource contention. We have built a prototype on top of PostgreSQL and conducted a comprehensive experimental study on Amazon EC2 to validate SOAP's significant performance advantages.
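
The abstract does not give SOAP's cost model or its controller, so the following is only a minimal sketch of the general idea, not the paper's algorithm. It assumes a benefit-per-cost ranking and a simple proportional-integral controller on observed normal-transaction latency; the names `RepartitionTxn`, `rank_by_benefit_density`, `PIController` and `schedule`, and all gains and thresholds, are hypothetical. Piggybacking onto normal transactions is not modeled.

```python
from dataclasses import dataclass


@dataclass
class RepartitionTxn:
    name: str
    benefit: float  # hypothetical: expected saving once the move is done
    cost: float     # hypothetical: expected cost of executing the move


def rank_by_benefit_density(txns):
    """Order repartition transactions by benefit per unit cost, highest first."""
    return sorted(txns, key=lambda t: t.benefit / t.cost, reverse=True)


class PIController:
    """Proportional-integral controller on normal-transaction latency (assumed signal)."""

    def __init__(self, target_latency_ms, kp=0.05, ki=0.01, max_batch=10):
        self.target = target_latency_ms
        self.kp = kp
        self.ki = ki
        self.max_batch = max_batch
        self.integral = 0.0

    def next_batch_size(self, observed_latency_ms):
        # Positive error means there is headroom; negative error means overload.
        error = self.target - observed_latency_ms
        self.integral += error
        raw = self.kp * error + self.ki * self.integral
        # Clamp to a whole number of transactions in [0, max_batch].
        return max(0, min(self.max_batch, round(raw)))


def schedule(txns, latencies_ms, controller):
    """Return, per interval, the names of the repartition transactions released."""
    queue = rank_by_benefit_density(txns)
    plan = []
    for latency in latencies_ms:
        n = controller.next_batch_size(latency)
        plan.append([t.name for t in queue[:n]])
        queue = queue[n:]
    return plan
```

In this sketch the controller releases more repartition transactions when normal transactions run faster than the target latency and throttles to zero when they run slower, which is one simple way to trade repartitioning speed against interference.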

Original language: English
Title of host publication: Advances in Database Technology — EDBT 2015: Proceedings of the 18th International Conference on Extending Database Technology, Brussels, Belgium, March 23-27, 2015
Editors: Gustavo Alonso, Floris Geerts, Lucian Popa, Pablo Barceló, Jens Teubner, Martín Ugarte, Jan van den Bussche, Jan Paredaens
Number of pages: 12
Publisher: OpenProceedings.org
Publication date: 2015
Pages: 1-12
ISBN (Electronic): 978-3-89318-067-7
Publication status: Published - 2015
Externally published: Yes
Event: 18th International Conference on Extending Database Technology - Brussels, Belgium
Duration: 23 Mar 2015 - 27 Mar 2015

Conference

Conference: 18th International Conference on Extending Database Technology
Country/Territory: Belgium
City: Brussels
Period: 23/03/2015 - 27/03/2015
