Issue #2574: pipe storage cp shall start data upload before traversing full source hierarchy (S3 provider) #2597

Merged 22 commits on Jul 9, 2024

@ekazachkova ekazachkova commented Apr 11, 2022

This PR implements issue #2574:

  • pipe storage cp/mv operations - batch transfer (data upload starts before the full source hierarchy is traversed) is enabled when:

    • the S3 provider is used
    • --verify-destination is not specified (this option requires a full source listing)
    • --file-list is not specified
  • pipe storage ls operation - full listing is enabled when:

    • the S3 provider is used
    • --recursive is specified
    • --show_versions is not specified
    • --page (page size limit) is not specified
  • pipe storage rm operation - added page size support for the S3 provider

  • the batch size can be tuned via the CP_CLI_STORAGE_BATCH_SIZE environment variable (default: 1000)

  • CP_CLI_STORAGE_ASYNC_BATCH_ENABLE=[True|False] - enables asynchronous batch transfer: the listing runs in a separate thread (default: False)
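
For readers who want a picture of the batching idea without opening the diff, here is a minimal sketch. It is not the code from this PR: `iter_batches`, `iter_batches_async`, `transfer_all` and the `max_pending` parameter are hypothetical names, the way the two environment variables are parsed is an assumption, and a plain Python iterable stands in for the S3 listing.

```python
import os
import queue
import threading
from itertools import islice

# Variable names and defaults are taken from the PR description;
# the parsing logic here is an assumption, not the pipe-cli implementation.
BATCH_SIZE = int(os.getenv("CP_CLI_STORAGE_BATCH_SIZE", "1000"))
ASYNC_ENABLED = os.getenv("CP_CLI_STORAGE_ASYNC_BATCH_ENABLE", "False").lower() == "true"


def iter_batches(items, batch_size):
    """Yield lists of at most batch_size items, pulling from the source lazily."""
    iterator = iter(items)
    while True:
        batch = list(islice(iterator, batch_size))
        if not batch:
            return
        yield batch


def iter_batches_async(items, batch_size, max_pending=2):
    """Same batches as iter_batches, but the listing runs in a separate thread."""
    pending = queue.Queue(maxsize=max_pending)  # bounds how far listing runs ahead
    done = object()  # sentinel that marks the end of the listing

    def produce():
        try:
            for batch in iter_batches(items, batch_size):
                pending.put(batch)
            pending.put(done)
        except Exception as error:
            # Hand listing failures over to the consumer thread.
            pending.put(error)

    threading.Thread(target=produce, daemon=True).start()
    while True:
        batch = pending.get()
        if batch is done:
            return
        if isinstance(batch, Exception):
            raise batch
        yield batch


def transfer_all(items, transfer, batch_size=BATCH_SIZE, use_async=ASYNC_ENABLED):
    """Transfer items batch by batch; the first transfer starts after one batch is listed."""
    batches = iter_batches_async if use_async else iter_batches
    count = 0
    for batch in batches(items, batch_size):
        for item in batch:
            transfer(item)
            count += 1
    return count


if __name__ == "__main__":
    # Stand-in for a source listing; a real listing would page through the storage.
    listing = ("file-%d" % i for i in range(5))
    transfer_all(listing, print, batch_size=2)
```

In the synchronous mode listing and transfer alternate on one thread, so at most one batch of items is held in memory at a time. In the asynchronous mode the bounded queue lets the listing run at most `max_pending` batches ahead of the transfer; if the consumer stops early, the daemon producer thread is simply abandoned.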

…g full source hierarchy - async batch collection for cp/mv operation
…ipe-storage-cp

# Conflicts:
#	pipe-cli/src/utilities/datastorage_operations.py
@ekazachkova ekazachkova requested a review from tcibinan April 11, 2022 16:44
…g full source hierarchy - refactor cp/mv paging
…g full source hierarchy - refactor cp/mv paging
…_2574-batch-pipe-storage-cp

# Conflicts:
#	pipe-cli/src/utilities/datastorage_operations.py
#	pipe-cli/src/utilities/storage/s3.py
…g full source hierarchy (S3 provider) - cleanups
…g full source hierarchy (S3 provider) - async batch
…g full source hierarchy (S3 provider) - disable async batch by default
…g full source hierarchy (S3 provider) - cleanup
…g full source hierarchy (S3 provider) - cleanup
…g full source hierarchy (S3 provider) - cleanup
…g full source hierarchy (S3 provider) - cleanup
@ekazachkova ekazachkova requested a review from SilinPavel July 5, 2024 08:54
…g full source hierarchy (S3 provider) - support local paths
…g full source hierarchy (S3 provider) - cleanup
…g full source hierarchy (S3 provider) - cleanup
@SilinPavel SilinPavel merged commit e73c60c into develop Jul 9, 2024
4 checks passed
ekazachkova added a commit that referenced this pull request Jul 15, 2024
…g full source hierarchy (S3 and Local provider) (#2597)

(cherry picked from commit e73c60c)
3 participants