Antalya 26.3 Backport of #99521, #100150 - Add Arrow and Parquet format support for UUID data type #1774
Open
mkmkme wants to merge 2 commits into antalya-26.3
…rquet-uuid Add Arrow and Parquet format support for UUID data type
…et-builder Improve Arrow Parquet writer to include UUID logical type
Note for reviewer
Besides #99521, #100150 was backported as well, since it is a follow-up PR that fixes an issue in #99521. The tests pass locally (apart from the ones that couldn't run on the local machine).
Changelog category (leave one):
Changelog entry (a user-readable short description of the changes that goes to CHANGELOG.md):
Adds native support for importing and exporting UUID data types in Arrow and Parquet formats. Users can now directly query and transfer UUID data between ClickHouse and other data tools without manual string conversions or workarounds. Top-level UUIDs are inferred automatically from the logical type, and nested UUIDs are supported through an explicit schema hint (ClickHouse#99521 by @ivanmantova).
Exporting UUIDs to Parquet via the Arrow encoder now includes the correct UUID type annotation, eliminating the need to manually cast FixedString(16) data when reading the files back into ClickHouse or other systems (ClickHouse#100150 by @ivanmantova).
Documentation entry for user-facing changes
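For context on the FixedString(16) workaround mentioned in the changelog entry: the Parquet UUID logical type annotates a 16-byte fixed-length binary holding the UUID in big-endian byte order. The sketch below is not code from this PR and does not touch ClickHouse or Arrow. It uses only the Python standard library to show the kind of manual conversion a reader had to perform when a file carried raw 16-byte values without the annotation. The helper names `uuid_to_fixed16` and `fixed16_to_uuid` are hypothetical and exist only for illustration.

```python
import uuid


def uuid_to_fixed16(value: uuid.UUID) -> bytes:
    """Return the 16-byte big-endian form of a UUID (hypothetical helper)."""
    # uuid.UUID.bytes is the 128-bit value as 16 big-endian bytes.
    return value.bytes


def fixed16_to_uuid(raw: bytes) -> uuid.UUID:
    """Rebuild a UUID from a raw 16-byte value (hypothetical helper)."""
    if len(raw) != 16:
        raise ValueError(f"expected 16 bytes, got {len(raw)}")
    return uuid.UUID(bytes=raw)


# Round trip: this is the manual step that the type annotation makes unnecessary.
example = uuid.UUID("61f0c404-5cb3-11e7-907b-a6006ad3dba0")
raw = uuid_to_fixed16(example)
restored = fixed16_to_uuid(raw)
print(raw.hex(), restored)
```

With the UUID annotation written by the exporter, a reader that understands the logical type can skip this step and see UUID values directly.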
...
CI/CD Options
Exclude tests:
Regression jobs to run: