[CH] Fully Support writing parquet and mergetree in spark 3.5.x with delta protocol #7028

Open
@baibaichen

Description

This is an umbrella issue.

Previously, #6705 was just a POC to prove that we can implement Delta Write based on ColumnarWriteFilesExec.

  1. [GLUTEN-7028][CH][Part-1] Using PushingPipelineExecutor to write merge tree #7029
  2. [GLUTEN-7028][CH][Part-2] Refactor: Move MergeTree related UT to mergetree module #7279
  3. [GLUTEN-7028][CH][Part-3] Refactor: Move mergetree related codes to backends-clickhouse #7234
  4. [GLUTEN-7028][CH][Part-4] Refactor DeltaMergeTreeFileFormat to read table configuration from deltalog's metadata #7170
  5. [GLUTEN-7028][CH][Part-5] Refactor: add NativeOutputWriter to unify CHDatasourceJniWrapper #7395
  6. [GLUTEN-7028][CH][Part-6] Introduce MergeTreeDelayedCommitProtocol #7506
  7. [GLUTEN-7028][CH][Part-7] Support one pipeline write for mergetree #7788
  8. [GLUTEN-7028][CH][Part-8] Support one pipeline write for partition mergetree #7924
  9. [GLUTEN-7028][CH][Part-9] Collecting Delta stats for parquet #7993
  10. [GLUTEN-7028][CH][Part-10] Collecting Delta stats for MergeTree #8029
  11. [GLUTEN-7028][CH][Part-11] Support write parquet files with bucket #8052
  12. [GLUTEN-7028][CH][Part-12] Add Local SortExec for Partition Write in one pipeline mode #8237
  13. [GLUTEN-7028][CH][Part-13] Support partition with escape value #8158
  14. [GLUTEN-7028][CH][Part-14] Refactor Case Sensitive Support for MergeTree #8346
  15. [GLUTEN-7028][CH][Part-15] [MINOR] Fix UTs #8364

backlog
