Delta Sharing is an open API to share Delta Lake tables between data providers and data recipients.
This repo includes a Pandas connector and an Apache Spark connector that implement the Delta Sharing Protocol, and a reference implementation of a Delta Sharing server.
You can add the Spark connector as a dependency using your favorite build tool.
<dependency>
<groupId>io.delta</groupId>
<artifactId>delta-sharing-spark_2.12</artifactId>
<version>0.1.0</version>
</dependency>
libraryDependencies += "io.delta" %% "delta-sharing-spark" % "0.1.0"
Delta Sharing Spark Connector currently requires Apache Spark 3.0.0 and above.
Delta Sharing Protocol document provides a specification of the Delta Sharing protocol.
We use GitHub Issues to track community reported issues. You can also contact the community for getting answers.
Delta Sharing Spark Connector and Delta Sharing Server are compiled using SBT.
To compile, run
build/sbt compile
To generate artifacts, run
build/sbt package
To execute tests, run
build/sbt test
Refer to SBT docs for more commands.
We welcome contributions to Delta Sharing. See our CONTRIBUTING.md for more details.
We also adhere to the Delta Lake Code of Conduct.
Apache License 2.0, see LICENSE.
There are two mediums of communication within the Delta Lake community.
-
Public Slack Channel
-
Public Mailing list