Modified the loop that counts non-ASCII characters in _string_from_ge… #14

thomasdullien · 2019-03-26T19:54:00Z

When scanning lots of data, the existing code can spend significant amounts of time in the loop that checks for non-ASCII characters.

Unfortunately, all compilers I tested failed to properly vectorize the loop; on the other hand, it is easy to just use a bitmask + popcount instruction to check 8 characters per loop iteration. This should also be nice for speculative execution, as there are no branches to mispredict inside the loop any more. In my benchmarks, the new version is about 6-8x faster.

This may lead to a performance regression on pre-2010 intel CPUs - but I am not sure that matters still?

…t_bytes to be vectorized; provides ~8x speedup on modern CPUs.

Modified the loop that counts non-ASCII characters in _string_from_ge…

c9ac29e

…t_bytes to be vectorized; provides ~8x speedup on modern CPUs.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Modified the loop that counts non-ASCII characters in _string_from_ge… #14

Modified the loop that counts non-ASCII characters in _string_from_ge… #14

Uh oh!

thomasdullien commented Mar 26, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Modified the loop that counts non-ASCII characters in _string_from_ge… #14

Are you sure you want to change the base?

Modified the loop that counts non-ASCII characters in _string_from_ge… #14

Uh oh!

Conversation

thomasdullien commented Mar 26, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant