[Datasets] Add local and S3 filesystem test coverage for file-based datasources. #17158
We need to add parquet tests here (currently S3 parquet read support is totally broken since we don't import pyarrow.fs before deserialization of the filesystem impl).
This reverts commit 5ef83dee68a937c7903fcd69407fe31be20b3e16.
This PR only adds filesystem test coverage for the file-based datasources (JSON and CSV). I'm planning to add coverage for Parquet in a separate PR to keep the diff small.
Still hitting the infinite recursion here when pickling.
Can you ask Yi for help on ideas in resolving this problem?
On Wed, Aug 11, 2021, 8:56 AM Clark Zinzow wrote:
> Still hitting the infinite recursion here when pickling.
CC @iycheng @suquark, also going to ping y'all in the Slack thread that has more info.
Datasets tests are looking good! This should be ready to merge.
🎉
This PR adds back |
PR #16929 removed opencv-python-headless. PR #17158 added it back but did not use it. [A reviewer](#17158 (comment)) flagged this because it breaks Python 3.9 (no wheel is available for installation).
Ensures that manually specified local and S3 filesystems are covered in our file-based datasources tests.
Related issue number
Closes #17084
Checks
I've run scripts/format.sh to lint the changes in this PR.