-
Notifications
You must be signed in to change notification settings - Fork 1.1k
Open
Labels
enhancementAny new improvement worthy of a entry in the changelogAny new improvement worthy of a entry in the changelog
Description
Is your feature request related to a problem or challenge? Please describe what you are trying to do.
This ticket collected the Utf8View slower than Ut8 cases and try to improve it.
Mostly the cases happen when the string has same 4bytes prefix, but one of the string to compare is larger than 12 bytes, it will make it happen.
Describe the solution you'd like
Make Utf8View regression cases faster.
-
- Add reproducer cases which the Utf8View will slower than Utf8
-
- Add code implementation to improve the Utf8View regression cases
Describe alternatives you've considered
Make Utf8View regression cases faster.
Additional context
Make Utf8View regression cases faster.
From the benchmark testing from datafusion sort tpch, there are regressions about the Utf8View compare:
We'd better to improve it from arrow-rs, so we can benefit a lot for datafusion.
alamb
Metadata
Metadata
Assignees
Labels
enhancementAny new improvement worthy of a entry in the changelogAny new improvement worthy of a entry in the changelog