@mbutrovich mbutrovich commented Apr 15, 2025

Which issue does this PR close?

Partially addresses #1441. Fixes 3 more test cases.

Rationale for this change

Better support for Parquet INT96 data that originated from Spark 64-bit microsecond timestamps.
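For readers unfamiliar with the encoding, here is a minimal sketch of how a Parquet INT96 timestamp maps to 64-bit microseconds. It is not the code from this PR or its dependencies; the names `int96_to_micros` and `micros_to_int96` are hypothetical. An INT96 value is 12 little-endian bytes: 8 bytes of nanoseconds within the day followed by a 4-byte Julian day number. Decoding straight to microseconds matters because a 64-bit nanosecond count only covers roughly the years 1677 to 2262, while Spark's microsecond timestamps span a much wider range.

```rust
/// Julian day number of the Unix epoch (1970-01-01).
const JULIAN_DAY_OF_EPOCH: i64 = 2_440_588;
const MICROS_PER_DAY: i64 = 86_400_000_000;

/// Hypothetical helper: decode a 12-byte INT96 value into microseconds
/// since the Unix epoch. Sub-microsecond digits are truncated.
/// Sketch only: plain arithmetic, no overflow handling for extreme day values.
fn int96_to_micros(bytes: [u8; 12]) -> i64 {
    let mut nanos = [0u8; 8];
    nanos.copy_from_slice(&bytes[0..8]);
    let mut day = [0u8; 4];
    day.copy_from_slice(&bytes[8..12]);

    let nanos_of_day = i64::from_le_bytes(nanos);
    let julian_day = i64::from(i32::from_le_bytes(day));

    (julian_day - JULIAN_DAY_OF_EPOCH) * MICROS_PER_DAY + nanos_of_day / 1_000
}

/// Hypothetical inverse, used here only to build example inputs.
fn micros_to_int96(micros: i64) -> [u8; 12] {
    // Euclidean division keeps the time-of-day part non-negative
    // for timestamps before the epoch.
    let julian_day = (micros.div_euclid(MICROS_PER_DAY) + JULIAN_DAY_OF_EPOCH) as i32;
    let nanos_of_day = micros.rem_euclid(MICROS_PER_DAY) * 1_000;

    let mut out = [0u8; 12];
    out[0..8].copy_from_slice(&nanos_of_day.to_le_bytes());
    out[8..12].copy_from_slice(&julian_day.to_le_bytes());
    out
}
```

Going through days and microseconds rather than a total nanosecond count is what keeps far-past or far-future dates representable in an `i64`.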

What changes are included in this PR?

Most of the changes were in dependencies:

This PR just enables the new feature that was added in the PRs above.

How are these changes tested?

Re-enable 4 Comet tests for the native readers.

@mbutrovich mbutrovich marked this pull request as draft April 15, 2025 15:40
@mbutrovich mbutrovich changed the title from "Int96 fr" to "fix: better int96 support for experimental native scans" Apr 15, 2025
# Conflicts:
#	native/Cargo.lock
#	native/Cargo.toml
#	native/core/Cargo.toml
#	native/core/src/parquet/parquet_support.rs
@mbutrovich mbutrovich marked this pull request as ready for review April 17, 2025 12:22
@mbutrovich
Contributor Author

This should be ready for review now that #1563 merged. I am investigating an issue with complex types and int96 that prevents me from enabling the fuzz test for int96.

@andygrove andygrove requested a review from parthchandra April 17, 2025 16:44
@mbutrovich
Contributor Author

apache/datafusion#15763

I opened an issue for nested INT96. For now we'll keep INT96 disabled in CometFuzzTestSuite. I will open a PR early next week to re-enable it for primitive-type schemas only, and then work on the fix.

@andygrove andygrove merged commit bfcb968 into apache:main Apr 18, 2025
78 checks passed
@mbutrovich mbutrovich deleted the int96_fr branch April 21, 2025 18:45