faster `isfinite` #46166

oscardssmith · 2022-07-25T14:04:46Z

julia> function mytest!(f)
           s=0
           for x in typemin(UInt32):typemax(UInt32)
               s += f(reinterpret(Float32, x)) 
           end
           s
       end
mytest! (generic function with 1 method)

#old
julia> @btime mytest!(isfinite)
  1.971 s (0 allocations: 0 bytes)
4278190080

#new
julia> @btime mytest!(isfinite)
  1.770 s (0 allocations: 0 bytes)
4278190080

This has been tested exhaustively for Float32 and Float16

mikmoore · 2022-07-25T14:55:17Z

Nice!

It seems brave to assume that x-x === zero(x) for any finite AbstractFloat, since we really have no documented interface or conditions for AbstractFloat. This method would be unsuitable for any mutable (including Base's very own BigFloat, which thankfully has its own method) or some not-completely-unreasonable type where x-x === -zero(x) for some values.

I think this method should apply only to IEEEFloat. The AbstractFloat method should remain intact, become even more generic (e.g., isfinite(x::AbstractFloat) = !(isnan(x) | isinf(x))), or be removed entirely and fall back to Real.

giordano · 2022-07-25T19:34:24Z

base/float.jl

 isnan(x::AbstractFloat) = (x != x)::Bool
 isnan(x::Number) = false

+isfinite(x::IEEEFloat) = x - x === zero(x)


As I said in #46163 (comment), I feel like this should be iszero(x - x), and also iszero can be optimised for IEEFloat with x === zero(x), instead of x == zero(x)

julia> iszero(-0.0) true

so that would break the optimization. The reason for this change is that === on floating points can turn into a bit-compare rather than a more expensive floating point compare.

Right, sad 😕

Instead then:

julia> Base.iszero(x::Float64) = reinterpret(UInt64, x) & 0x7fff_ffff_ffff_ffff === UInt(0)

(btime cannot distinguish these three approaches in performance, and returns 3.5 ns for all of them)

I'll define the initially-proposed solution
isfinite_1(x::IEEEFloat) = x-x===zero(x).
Another contender, related to the iszero(x-x) variant but using the other half of the fact that x-x can only take the values +0.0 or NaN, is
isfinite_2(x::IEEEFloat) = !isnan(x-x).
Of course, now that we're limited to IEEEFloat we can instead reach for bit-twiddling
isfinite_3(x::IEEEFloat) = (reinterpret(Unsigned,x) & Base.exponent_mask(typeof(x))) != Base.exponent_mask(typeof(x))

julia> code_native(isfinite_1,(Float64,);debuginfo=:none) vsubsd %xmm0, %xmm0, %xmm0 vmovq %xmm0, %rax testq %rax, %rax sete %al retq julia> code_native(isfinite_2,(Float64,);debuginfo=:none) vsubsd %xmm0, %xmm0, %xmm0 vucomisd %xmm0, %xmm0 setnp %al retq julia> code_native(isfinite_3,(Float64,);debuginfo=:none) vmovq %xmm0, %rax movabsq $9218868437227405312, %rcx # imm = 0x7FF0000000000000 andnq %rcx, %rax, %rax setne %al retq

Compared to isfinite_1, isfinite_2 is one instruction shorter in isolation (on my x86). isfinite_3 is the same number of instructions as isfinite_1 but one of those is a hoistable movabsq. Another advantage is that it requires no floating point operations. Note that, after inlining, this may not be what these functions look like in the wild.

Keep in mind that mytest! is not a very canonical use of isfinite. Applying a @code_native shows this fact. That said, I see the _1 and _2 variants benchmarking identically and _3 about 15% faster.

EDIT:
Actually, I'm increasingly a fan of the !isnan(x-x) variant. While it's not quite as fast as bit twiddling in this nanobenchmark, I think that it has the benefit of clarity. Further, I think that it /would/ be a suitable ::AbstractFloat definition.

nlw0 · 2022-07-27T08:28:05Z

How about this?

isfinite_4(x::IEEEFloat) = x + one(x) > x

oscardssmith · 2022-07-27T12:48:20Z

I would think that wouldn't be better since > on floating point isn't completely trivial. It's also wrong since 2.0^53 +1.0 == 2.0^53

nlw0 · 2022-07-27T20:07:56Z

Well observed... The point was just that it this handles the -0.0 case, and naturally fails for NaN or Inf.... Maybe there could be a simple way to generate a valid successor, but that would still fail for whatever is the highest valid number? oh well

faster isfinite

5b71313

oscardssmith added performance Must go faster maths Mathematical functions labels Jul 25, 2022

restrict optimized method to IEEFloats

047cf3a

giordano reviewed Jul 25, 2022

View reviewed changes

oscardssmith closed this Sep 23, 2022

giordano deleted the oscardssmith-faster-isfinite branch September 23, 2022 13:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

faster `isfinite` #46166

faster `isfinite` #46166

Uh oh!

oscardssmith commented Jul 25, 2022

Uh oh!

mikmoore commented Jul 25, 2022

Uh oh!

giordano Jul 25, 2022

Uh oh!

oscardssmith Jul 25, 2022

Uh oh!

oscardssmith Jul 25, 2022

Uh oh!

giordano Jul 25, 2022

Uh oh!

vtjnash Jul 25, 2022

Uh oh!

mikmoore Jul 25, 2022 •

edited

Loading

Uh oh!

nlw0 commented Jul 27, 2022 •

edited

Loading

Uh oh!

oscardssmith commented Jul 27, 2022 •

edited

Loading

Uh oh!

nlw0 commented Jul 27, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Uh oh!

faster isfinite #46166

faster isfinite #46166

Uh oh!

Conversation

oscardssmith commented Jul 25, 2022

Uh oh!

mikmoore commented Jul 25, 2022

Uh oh!

giordano Jul 25, 2022

Choose a reason for hiding this comment

Uh oh!

oscardssmith Jul 25, 2022

Choose a reason for hiding this comment

Uh oh!

oscardssmith Jul 25, 2022

Choose a reason for hiding this comment

Uh oh!

giordano Jul 25, 2022

Choose a reason for hiding this comment

Uh oh!

vtjnash Jul 25, 2022

Choose a reason for hiding this comment

Uh oh!

mikmoore Jul 25, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

nlw0 commented Jul 27, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

oscardssmith commented Jul 27, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

nlw0 commented Jul 27, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

faster `isfinite` #46166

faster `isfinite` #46166

mikmoore Jul 25, 2022 •

edited

Loading

nlw0 commented Jul 27, 2022 •

edited

Loading

oscardssmith commented Jul 27, 2022 •

edited

Loading