Skip to content

lukeneil/delta-sharing

Repository files navigation

Delta Lake Logo

Build and Test

Delta Sharing is an open API to share Delta Lake tables between data providers and data recipients.

Introduction

This repo includes a Pandas connector and an Apache Spark connector that implement the Delta Sharing Protocol, and a reference implementation of a Delta Sharing server.

Data Share Profile

Pandas Connector

Apache Spark Connector

You can add the Spark connector as a dependency using your favorite build tool.

Maven

<dependency>
  <groupId>io.delta</groupId>
  <artifactId>delta-sharing-spark_2.12</artifactId>
  <version>0.1.0</version>
</dependency>

SBT

libraryDependencies += "io.delta" %% "delta-sharing-spark" % "0.1.0"

Compatibility with Apache Spark Versions

Delta Sharing Spark Connector currently requires Apache Spark 3.0.0 and above.

Delta Sharing Server

Delta Sharing Protocol

Delta Sharing Protocol document provides a specification of the Delta Sharing protocol.

Reporting issues

We use GitHub Issues to track community reported issues. You can also contact the community for getting answers.

Building

Delta Sharing Spark Connector and Delta Sharing Server are compiled using SBT.

To compile, run

build/sbt compile

To generate artifacts, run

build/sbt package

To execute tests, run

build/sbt test

Refer to SBT docs for more commands.

Contributing

We welcome contributions to Delta Sharing. See our CONTRIBUTING.md for more details.

We also adhere to the Delta Lake Code of Conduct.

License

Apache License 2.0, see LICENSE.

Community

There are two mediums of communication within the Delta Lake community.

About

An open protocol for secure data sharing

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Scala 82.8%
  • Python 15.1%
  • Shell 2.1%