SemStore: a semantic-preserving distributed RDF triple store

Buwen Wu, Yongluan Zhou, Pingpeng Yuan, Hai Jin, Ling Liu

28 Citations (Scopus)

Abstract

The flexibility of the RDF data model has attracted an increasing number of organizations to store their data in an RDF format. With the rapid growth of RDF datasets, we envision that it is inevitable to deploy a cluster of computing nodes to process large-scale RDF data in order to deliver desirable query performance. In this paper, we address the challenging problems of data partitioning and query optimization in a scale-out RDF engine. We identify that existing approaches only focus on using fine-grained structural information for data partitioning, and hence fail to localize many types of complex queries. We then propose a radically different approach, where a coarse-grained structure, namely Rooted Sub-Graph (RSG), is used as the partition unit. By doing so, we can capture structural information at a much greater scale and hence are able to localize many complex queries. We also propose a k-means partitioning algorithm for allocating the RSGs onto the computing nodes as well as a query optimization strategy to minimize the inter-node communication during query processing. An extensive experimental study using benchmark datasets and a real dataset shows that our engine, SemStore, outperforms existing systems by orders of magnitude in terms of query response time.

Original language: English
Title: Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management
Number of pages: 10
Publisher: Association for Computing Machinery
Publication date: 3 Nov 2014
Pages: 509-518
ISBN (Electronic): 978-1-4503-2598-1
DOI
Status: Published - 3 Nov 2014
Externally published: Yes
Event: 23rd ACM International Conference on Conference on Information and Knowledge Management - Shanghai, China
Duration: 3 Nov 2014 to 7 Nov 2014

Conference

Conference: 23rd ACM International Conference on Conference on Information and Knowledge Management
Country/Territory: China
City: Shanghai
Period: 03/11/2014 to 07/11/2014
