Description
Is your feature request related to a problem or challenge? Please describe what you are trying to do.
RowFilter is the API used to apply predicates when reading Parquet data.
Although RowFilter itself is public, its predicates field is only public within the crate (pub(crate)). Today the only way to apply predicates using RowFilter is through ParquetRecordBatchReader or ParquetRecordBatchStreamReader. This blocks use cases where we need to read Parquet data without going through either of those readers. Exposing the predicates would make those use cases possible.
Describe the solution you'd like
Mark the predicates field of RowFilter as public (pub) instead of public within the crate (pub(crate)).
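To illustrate what a public predicates field would allow, here is a minimal sketch of a custom read loop that walks a filter's predicates directly. Everything in it is a simplified, hypothetical stand-in written for this issue: the Predicate trait, the RowFilter struct, FnPredicate, and apply_filter are not the parquet crate's types or signatures, and rows are modelled as plain i64 values rather than record batches. It also assumes predicates are applied in order, each one seeing only the rows that passed the previous ones.

```rust
// Simplified stand-ins for illustration only; NOT the parquet crate's types.

/// Hypothetical analogue of a row-level predicate: returns one bool per row.
trait Predicate {
    fn evaluate(&mut self, rows: &[i64]) -> Vec<bool>;
}

/// Hypothetical analogue of RowFilter, with the predicates field fully public
/// as this issue requests.
struct RowFilter {
    pub predicates: Vec<Box<dyn Predicate>>,
}

/// Wraps a closure so it can be used as a predicate.
struct FnPredicate<F: FnMut(i64) -> bool>(F);

impl<F: FnMut(i64) -> bool> Predicate for FnPredicate<F> {
    fn evaluate(&mut self, rows: &[i64]) -> Vec<bool> {
        rows.iter().map(|&v| (self.0)(v)).collect()
    }
}

/// A custom reader loop: apply each predicate in order and keep only the
/// rows that pass all of them. This needs direct access to `predicates`.
fn apply_filter(filter: &mut RowFilter, rows: &[i64]) -> Vec<i64> {
    let mut selected: Vec<i64> = rows.to_vec();
    for predicate in filter.predicates.iter_mut() {
        let mask = predicate.evaluate(&selected);
        selected = selected
            .into_iter()
            .zip(mask)
            .filter(|(_, keep)| *keep)
            .map(|(v, _)| v)
            .collect();
    }
    selected
}

fn main() {
    let predicates: Vec<Box<dyn Predicate>> = vec![
        Box::new(FnPredicate(|v: i64| v > 2)),
        Box::new(FnPredicate(|v: i64| v % 2 == 0)),
    ];
    let mut filter = RowFilter { predicates };
    let kept = apply_filter(&mut filter, &[1, 2, 3, 4, 5, 6]);
    println!("{:?}", kept); // prints [4, 6]
}
```

With the field restricted to the crate, a loop like apply_filter can only live inside the library itself, which is why downstream readers currently have to go through the two built-in readers.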
Describe alternatives you've considered
Maintaining a forked version of arrow-rs, which is not ideal.
Additional context
It would also be nice to expose ReadPlanBuilder and ReadPlan, so that users can construct their own reader instead of relying on ParquetRecordBatchReader or ParquetRecordBatchStreamReader.