@Jefffrey commented on Dec 2, 2023

Which issue does this PR close?

Closes #5152

Rationale for this change

What changes are included in this PR?

Populate column_orders in Parquet FileMetaData when writing files.
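As a rough illustration of the idea, the sketch below builds one order entry per leaf column and always wraps the result in `Some`. All types and function names here (`SchemaNode`, `ColumnOrder`, `count_leaves`, `build_column_orders`, `example_schema`) are simplified stand-ins invented for this example, not the parquet crate's actual API or the code in this PR.

```rust
// Hypothetical stand-ins for illustration only; NOT the parquet crate's types.

/// A simplified schema node: a leaf (primitive column) or a group of children.
#[allow(dead_code)]
#[derive(Debug)]
enum SchemaNode {
    Leaf(String),
    Group(String, Vec<SchemaNode>),
}

/// Simplified column order. Only a type-defined order is modelled here.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum ColumnOrder {
    TypeDefinedOrder,
}

/// Count the leaf (primitive) columns by walking the schema depth-first.
fn count_leaves(node: &SchemaNode) -> usize {
    match node {
        SchemaNode::Leaf(_) => 1,
        SchemaNode::Group(_, children) => children.iter().map(count_leaves).sum(),
    }
}

/// Build the optional `column_orders` value: one entry per leaf column.
/// The field is optional, but for simplicity it is always populated.
fn build_column_orders(root: &SchemaNode) -> Option<Vec<ColumnOrder>> {
    let column_orders = vec![ColumnOrder::TypeDefinedOrder; count_leaves(root)];
    Some(column_orders)
}

/// A small example schema with three leaf columns: id, point.x, point.y.
fn example_schema() -> SchemaNode {
    SchemaNode::Group(
        "schema".to_string(),
        vec![
            SchemaNode::Leaf("id".to_string()),
            SchemaNode::Group(
                "point".to_string(),
                vec![
                    SchemaNode::Leaf("x".to_string()),
                    SchemaNode::Leaf("y".to_string()),
                ],
            ),
        ],
    )
}
```

Calling `build_column_orders(&example_schema())` yields a `Some` holding three `TypeDefinedOrder` entries, one per leaf column. Group nodes contribute no entry of their own.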

Are there any user-facing changes?

@github-actions bot added the parquet label (Changes to the parquet crate) on Dec 2, 2023
Comment on lines +334 to +337
// This field is optional, perhaps in cases where no min/max fields are set
// in any Statistics or ColumnIndex object in the whole file.
// But for simplicity we always set this field.
let column_orders = Some(column_orders);

Comment on lines +327 to +328
// Currently only supported ColumnOrder is TypeDefinedOrder so we set this
// for all leaf nodes.

In the future there may be additional supported orders; see apache/parquet-format#221
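If further orders are added, readers would presumably need a fallback for orders they do not recognise. The sketch below shows one conservative way to do that, assuming the reader should ignore min/max statistics whenever it does not understand a column's declared order. The names `DeclaredOrder`, `MinMax`, and `usable_min_max`, and the numeric tag on `Unknown`, are hypothetical and exist only for this example.

```rust
// Hypothetical reader-side sketch; these are NOT the parquet crate's types.

/// A column's declared order as seen by an illustrative reader.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum DeclaredOrder {
    /// The one order this reader understands.
    TypeDefined,
    /// An order this reader does not recognise, identified by a made-up tag.
    Unknown(u32),
}

/// Simplified min/max statistics for a single column.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
struct MinMax {
    min: i64,
    max: i64,
}

/// Return the statistics only when the declared order is understood.
/// For an unrecognised order the statistics are dropped rather than trusted.
fn usable_min_max(order: DeclaredOrder, stats: Option<MinMax>) -> Option<MinMax> {
    match order {
        DeclaredOrder::TypeDefined => stats,
        DeclaredOrder::Unknown(_) => None,
    }
}
```

With this shape, a new order only needs a new enum variant and a new match arm, while older readers keep treating it as unknown and skip the statistics.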

@tustvold merged commit f16d2f5 into apache:master on Dec 5, 2023
@Jefffrey deleted the parquet_write_col_order branch on December 5, 2023 12:33


Successfully merging this pull request may close these issues.

Parquet: ColumnOrder not being written when writing parquet files

2 participants