LDBC SNB DuckDB implementation

DuckDB implementation of the LDBC Social Network Benchmark's Interactive workload.

Setup

Grab DuckDB:

```
scripts/get.sh
```

Generating the data set

The data set must be generated before it is loaded into the database. No preprocessing is required. To generate data sets for DuckDB, use the same settings as for PostgreSQL, i.e. the Hadoop-based Datagen's CsvMergeForeign serializer classes.

Running the benchmark

Set the following environment variable based on your data source:

```
export DUCKDB_CSV_DIR=`pwd`/../postgres/test-data
```
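A wrong path in DUCKDB_CSV_DIR may only surface once loading starts, so it can help to check the directory first. The following is a minimal sketch and not part of the repository: `check_csv_dir` is a hypothetical helper, and the assumption that the generated data set contains files ending in `.csv` should be adjusted if your Datagen output differs.

```shell
# Hypothetical helper (not part of the repository): verify that a CSV
# directory looks usable before exporting it as DUCKDB_CSV_DIR.
check_csv_dir() {
  dir="$1"
  if [ -z "$dir" ]; then
    echo "error: no directory given" >&2
    return 1
  fi
  if [ ! -d "$dir" ]; then
    echo "error: '$dir' is not a directory" >&2
    return 1
  fi
  # Assumption: the data set has at least one .csv file under the directory.
  if ! find "$dir" -type f -name '*.csv' | grep -q .; then
    echo "error: no .csv files found under '$dir'" >&2
    return 1
  fi
  echo "ok: $dir"
}

# Example call (commented out so the snippet has no side effects):
# check_csv_dir "$(pwd)/../postgres/test-data" \
#   && export DUCKDB_CSV_DIR="$(pwd)/../postgres/test-data"
```

The function returns a non-zero status on any failed check, so it can be chained with `&&` in front of the export.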

Loading the data set

Load the data set as follows:

```
scripts/load.sh
```

Running the benchmark driver

The instructions below explain how to run the benchmark driver in one of the three modes (create validation parameters, validate, benchmark). For more details on the driver modes, check the "Driver modes" section of the main README.

Create validation parameters

Edit the driver/benchmark.properties file. Make sure that the ldbc.snb.interactive.scale_factor, ldbc.snb.interactive.updates_dir, and ldbc.snb.interactive.parameters_dir properties are set correctly and are in sync.
Run the script:
```
driver/create-validation-parameters.sh
```
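To check by eye that the three properties are in sync, it can be convenient to print them side by side before running the driver. The sketch below is an illustration under assumptions, not part of the repository: `get_prop` and `show_snb_props` are hypothetical helpers, and they assume simple `key=value` lines without line continuations.

```shell
# Hypothetical helper: print the value of KEY from a Java-style
# .properties file (simple "key=value" lines only, no continuations).
get_prop() {
  file="$1"
  key="$2"
  awk -v k="$key" '
    /^[ \t]*#/ { next }            # skip comment lines
    index($0, "=") == 0 { next }   # skip lines without "="
    {
      name = substr($0, 1, index($0, "=") - 1)
      gsub(/^[ \t]+|[ \t]+$/, "", name)
      if (name == k) {
        value = substr($0, index($0, "=") + 1)
        gsub(/^[ \t]+|[ \t]+$/, "", value)
        print value
        exit
      }
    }
  ' "$file"
}

# Print the three properties that have to agree with each other.
show_snb_props() {
  file="$1"
  for key in \
    ldbc.snb.interactive.scale_factor \
    ldbc.snb.interactive.updates_dir \
    ldbc.snb.interactive.parameters_dir
  do
    printf '%s = %s\n' "$key" "$(get_prop "$file" "$key")"
  done
}

# Example call (commented out so the snippet has no side effects):
# show_snb_props driver/benchmark.properties
```

The helpers only display the values. Whether the directories actually match the scale factor still has to be judged against your own data layout.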

Validate

Edit the driver/validate.properties file. Make sure that the validate_database property points to the file you would like to validate against.
Run the script:
```
driver/validate.sh
```

Benchmark

Edit the driver/benchmark.properties file. Make sure that the ldbc.snb.interactive.scale_factor, ldbc.snb.interactive.updates_dir, and ldbc.snb.interactive.parameters_dir properties are set correctly and are in sync.
Run the script:
```
driver/benchmark.sh
```

Reload between runs

⚠️ The default workload contains updates which are persisted in the database. Therefore, the database needs to be reloaded or restored from backup before each run. Use the provided scripts/backup-database.sh and scripts/restore-database.sh scripts to achieve this.
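When several runs are executed back to back, the restore step can be placed in front of each run. The following is a minimal sketch, assuming a backup was created once with scripts/backup-database.sh after loading and that both scripts can be called without arguments. `run_with_restore` is a hypothetical wrapper and not part of the repository.

```shell
# Hypothetical wrapper: restore the database before every benchmark run.
# Arguments: number of runs, restore command, benchmark command.
run_with_restore() {
  runs="$1"
  restore_cmd="$2"
  bench_cmd="$3"
  i=1
  while [ "$i" -le "$runs" ]; do
    echo "run $i of $runs: restoring database"
    "$restore_cmd" || return 1   # stop if the restore fails
    echo "run $i of $runs: starting benchmark"
    "$bench_cmd" || return 1     # stop if the benchmark fails
    i=$((i + 1))
  done
}

# Example call (commented out so the snippet has no side effects):
# run_with_restore 3 scripts/restore-database.sh driver/benchmark.sh
```

Passing the commands as arguments keeps the loop independent of the script locations and stops at the first failing step, so a failed restore never leads to a benchmark run on a modified database.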
