Skip to main navigation Skip to search Skip to main content

How representative is a SPARQL benchmark? An analysis of RDF triplestore benchmarks

  • Muhammad Saleem
  • , Syed Ahmad Chan Bukhari
  • , Gábor Szárnyas
  • , Qaiser Mehmood
  • , Felix Conrads
  • , Axel Cyrille Ngonga Ngomo
  • University of Leipzig
  • Yale University School of Medicine
  • Hungarian Academy of Sciences
  • Budapest University of Technology and Economics
  • University of Galway
  • Paderborn University

Research output: Chapter in Book or Conference Publication/ProceedingConference Publicationpeer-review

39 Citations (Scopus)

Abstract

Triplestores are data management systems for storing and querying RDF data. Over recent years, various benchmarks have been proposed to assess the performance of triplestores across different performance measures. However, choosing the most suitable benchmark for evaluating triplestores in practical settings is not a trivial task. This is because triplestores experience varying workloads when deployed in real applications. We address the problem of determining an appropriate benchmark for a given real-life workload by providing a fine-grained comparative analysis of existing triplestore benchmarks. In particular, we analyze the data and queries provided with the existing triplestore benchmarks in addition to several real-world datasets. Furthermore, we measure the correlation between the query execution time and various SPARQL query features and rank those features based on their significance levels. Our experiments reveal several interesting insights about the design of such benchmarks. With this fine-grained evaluation, we aim to support the design and implementation of more diverse benchmarks. Application developers can use our result to analyze their data and queries and choose a data management system.

Original languageEnglish
Title of host publicationThe Web Conference 2019 - Proceedings of the World Wide Web Conference, WWW 2019
PublisherAssociation for Computing Machinery, Inc
Pages1623-1633
Number of pages11
ISBN (Electronic)9781450366748
DOIs
Publication statusPublished - 13 May 2019
Externally publishedYes
Event2019 World Wide Web Conference, WWW 2019 - San Francisco, United States
Duration: 13 May 201917 May 2019

Publication series

NameThe Web Conference 2019 - Proceedings of the World Wide Web Conference, WWW 2019

Conference

Conference2019 World Wide Web Conference, WWW 2019
Country/TerritoryUnited States
CitySan Francisco
Period13/05/1917/05/19

Fingerprint

Dive into the research topics of 'How representative is a SPARQL benchmark? An analysis of RDF triplestore benchmarks'. Together they form a unique fingerprint.

Cite this