The official code repository of our VLDB 2021 paper Columnar Storage and List-based Processing for Graph Database Management Systems.
This repository contains the dataset, queries and the versions of the Graphflow system that we use in our paper.
This repository contains 2 versions of GraphflowDB.
- GF-RV [link]: The baseline version that implements vanilla row-based storage and a volcano-based processor.
- GF-CL [link]: Version of GraphflowDB that implements our novel column-oriented storage and the List-based Processor.
Both projects follow common instructions for building and benchmarking that can be found here.
We provide 2 datasets that we use for system comparison in the paper.
Users can use their own datasets to test their own benchmark queries. Instructions for creating own dataset can be found here.
We test our system on 2 leading benchmarks: JOB and LDBC SNB. Since Graphflow do not support some advanced SQL features, we modify some queries as needed. We include the exact queries that we use here in the format that can directly be run on the builts.
- JOB Benchmark Queries [link]
- LDBC SNB Interactive Complex Queries [link]
- LDBC SNB Interactive Short Queries [link]
This project is licensed under the MIT License - see the LICENSE file for details.
Created at Data Systems Group, University of Waterloo, Canada.