Orchid Storage is a work in progress
Orchid is an open source project. Help us in the effort to build a truly decentralized, incentive-aligned
storage system.
Check out the Orchid Storage Litepaper
and join the discussion on the Orchid Subreddit.
This repository contains work in progress on the file encoding CLI and server framework.
Orchid Storage is a (truly) decentralized storage system capable of maintaining files and issuing payments based on data availability even when the client is offline. This is accomplished through incentive-aligned providers, non-interactive verification protocols, and efficient rebuilding and migration capabilities.
In this demo implementation, files are imported into a local repository directory structure, where they are optionally encrypted with AES using an RSA key and then encoded into a specified number of shards. Shards are linearly coded in a "k of n" scheme such that only a specified subset of the shards must be recovered to reconstruct the file. The use of Twin Coding (see below) allows individual shards to be reconstructed with minimal bandwidth by cooperating providers.
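As a rough illustration of the "k of n" idea (not the repository's actual encoder; see twin_coding.py for that), the sketch below treats each group of k data symbols as the coefficients of a polynomial over a small prime field and stores one evaluation per shard, so any k shards recover the data:

P = 257  # field prime; byte values 0..255 fit as symbols

def encode(data_symbols, n):
    # Evaluate the degree-(k-1) polynomial with these coefficients at n points.
    return [sum(c * pow(x, j, P) for j, c in enumerate(data_symbols)) % P
            for x in range(1, n + 1)]

def poly_mul(a, b):
    # Multiply two polynomials (ascending coefficients) mod P.
    out = [0] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            out[i + j] = (out[i + j] + ai * bj) % P
    return out

def decode(points, k):
    # Lagrange-interpolate the k coefficients from any k (x, shard_value) pairs.
    xs, ys = zip(*points[:k])
    coeffs = [0] * k
    for i, (xi, yi) in enumerate(zip(xs, ys)):
        num, denom = [1], 1
        for j, xj in enumerate(xs):
            if j != i:
                num = poly_mul(num, [-xj % P, 1])   # multiply by (x - xj)
                denom = denom * (xi - xj) % P
        scale = yi * pow(denom, P - 2, P) % P       # yi / denom mod P
        for d, c in enumerate(num):
            coeffs[d] = (coeffs[d] + scale * c) % P
    return coeffs

data = [10, 200, 33]                                 # k = 3 data symbols
shards = encode(data, 5)                             # n = 5 shards
surviving = [(2, shards[1]), (4, shards[3]), (5, shards[4])]  # any 3 suffice
assert decode(surviving, 3) == data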
KZG Polynomial Commitments are generated for each shard. These commitments can later be used by either an interactive verifier or an on-chain contract without access to the original data to verify data availability.
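The toy sketch below shows the algebraic identity behind a KZG opening proof, worked over a plain prime field rather than elliptic-curve groups (so it is neither hiding nor binding); in the real scheme the same check happens "in the exponent" via a pairing and a trusted-setup point, which is what lets a verifier or contract check an evaluation without the original data:

P = 2**61 - 1  # a prime field modulus (illustrative)

def poly_eval(coeffs, x):
    return sum(c * pow(x, i, P) for i, c in enumerate(coeffs)) % P

def divide_by_linear(coeffs, z):
    # Synthetic division of p(x) by (x - z): returns (quotient q, remainder r),
    # where r = p(z) by the remainder theorem.
    n = len(coeffs) - 1
    q = [0] * n
    q[n - 1] = coeffs[n]
    for j in range(n - 1, 0, -1):
        q[j - 1] = (coeffs[j] + z * q[j]) % P
    r = (coeffs[0] + z * q[0]) % P
    return q, r

# "Shard data" interpreted as polynomial coefficients.
p = [17, 42, 99, 7]

# Prover side: answer a challenge at point z with the value y and a witness
# polynomial q such that p(x) - y = (x - z) * q(x).
z = 123456
q, y = divide_by_linear(p, z)

# Verifier side: check the identity at a point s. In real KZG, p(s) and q(s)
# are only available as group elements (the commitment and the proof), and
# the check uses a pairing; here we evaluate them directly for illustration.
s = 987654321  # stands in for the structured-reference-string point
assert (poly_eval(p, s) - y) % P == poly_eval(q, s) * ((s - z) % P) % P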
Shards are distributed to one or more providers as desired. An individual provider may host a full complement of shards, or the shards may be split across providers for maximum distribution.
Periodic challenges are issued to providers, either interactively or via a source of on-chain randomness, requiring proof of specific data components. Providers collect payments by proving data availability. These payments can be optimized to require zero on-chain transactions when the client is available online but can still function via a non-interactive contract mechanism in the absence of the client.
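One way such a non-interactive challenge could be derived (the function and names below are hypothetical, not the project's protocol) is to hash a public randomness value together with a file identifier to pick the chunk that must be proven, so the provider and an on-chain verifier agree without the client being online:

import hashlib

def challenge_index(beacon_randomness: bytes, file_id: str, num_chunks: int) -> int:
    # Derive a deterministic chunk index from public randomness plus a file id.
    digest = hashlib.sha256(beacon_randomness + file_id.encode()).digest()
    return int.from_bytes(digest, "big") % num_chunks

# The provider would prove availability of this chunk (e.g. with a KZG opening)
# and submit the proof to collect payment.
idx = challenge_index(b"\x07" * 32, "foo_file.dat", num_chunks=1024)
print(f"prove chunk {idx}")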
A key aspect of the Orchid Storage project is the use of an efficient encoding scheme that minimizes bandwidth costs incurred during migration of distributed data through providers over time.
Twin Coding is a hybrid encoding scheme that works with any two linear coding schemes, combining them to achieve a space-bandwidth tradeoff that minimizes the amount of data transferred between storage nodes in order to recover a lost shard. In contrast to a traditional erasure scheme, in which restoring a lost node requires a full reconstruction of the original file, Twin Coding recovers a lost data shard with data transfer totalling exactly the size of the lost shard, with no additional overhead.
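A back-of-the-envelope comparison makes the tradeoff concrete (the numbers are illustrative, assuming a 1 GB file split k = 16 ways):

# With a plain erasure code, rebuilding one lost shard means first downloading
# k full shards (the whole file); with Twin Coding, k helper nodes each send a
# single combined symbol and the total transfer equals the lost shard's size.
file_size = 1_000_000_000  # bytes
k = 16
shard_size = file_size // k

naive_repair_bandwidth = k * shard_size      # k full shards = whole file
twin_coding_repair_bandwidth = shard_size    # k helpers send shard_size / k each

print(f"shard size:            {shard_size:>13,} bytes")
print(f"plain erasure repair:  {naive_repair_bandwidth:>13,} bytes")
print(f"twin coding repair:    {twin_coding_repair_bandwidth:>13,} bytes")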
This repository contains an implementation of Twin Coding, as well as a command line API for encoding files, decoding files with erasures, and optimally recovering lost shards.
See twin_coding.py
for an explanation of the algorithm, example code, and a link to the original paper.
For more information, check out the Orchid Storage Litepaper
# Create a virtual environment
python3 -m venv venv
# Activate the virtual environment
# For macOS and Linux:
source venv/bin/activate
# For Windows:
.\venv\Scripts\activate
# Install the dependencies
pip install -r requirements.txt
The STRHOME
(storage home) environment variable is a path that determines the location of the default
repository
folder and providers.jsonc
data stores. During development you can source the provided
env.sh
script to automatically set STRHOME
and PYTHONPATH
to the project folder and activate the venv
in that folder.
export STRHOME=[Project Folder]
export PATH=$PATH:"$STRHOME"
export PYTHONPATH="$STRHOME"
# Generate a test file
dd if=/dev/urandom of="foo_file.dat" bs=1K count=1 status=none
# Import a file into the default local repository with default encoding
storage.sh import "foo_file.dat"
# Optionally, generate a test encryption key
# ssh-keygen -t rsa -f test_key -N ""
# ... and encrypt the file as it is imported into the default local repository with default encoding
# storage.sh import --key_path "test_key" "foo_file.dat"
# List the repository
storage.sh repo list
# Start a test provider server cluster
examples/test-cluster.sh start 5001 5002 5003 5004 5005
# Confirm that the test servers are running
examples/test-cluster.sh list
# "Discover" these providers, adding them to our known provider list
# This will normally be done via the directory service and performed at file push time.
providers.sh add 5001 5002 5003 5004 5005
# List the known providers
providers.sh list
# Start the monitor application (in another window)
# tmux split
monitor.sh --update 1
# Push the file by name
# (Observe the availability of the file in the monitor)
storage.sh push foo_file.dat
# Verify the file data availability by issuing KZG challenges to the servers.
# (Observe the verified time update in the monitor)
storage.sh verify foo_file.dat
# Delete a shard from one of the providers
# (Observe the availability is reduced as a unique shard is lost)
storage.sh delete_shard --provider 5001 foo_file.dat --node_type 0 --node_index 0
# Request that the provider rebuild the lost node from specified other nodes in the cluster.
storage.sh request_repair --to_provider 5001 foo_file.dat --node_type 0 --node_index 0 --from_providers 5002 5003 5004
...
# Shut down the servers
examples/test-cluster.sh stop
See the CLI Documentation for detailed CLI usage.