- 
                Notifications
    You must be signed in to change notification settings 
- Fork 13.9k
Add SIMD funnel shift and round-to-even intrinsics #142078
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
| r? @wesleywiser rustbot has assigned @wesleywiser. Use  | 
| Some changes occurred to the intrinsics. Make sure the CTFE / Miri interpreter cc @rust-lang/miri, @RalfJung, @oli-obk, @lcnr Some changes occurred to the platform-builtins intrinsics. Make sure the cc @antoyo, @GuillaumeGomez, @bjorn3, @calebzulawski, @programmerjake | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I believe you'd also need to implement this operation in MIRI.
| 
 Please add Miri to the list. :) (Doesn't have to be in the first PR.) | 
| 
 Both  | 
| 
 You should be able to have more or less the same implementation for cg_gcc by putting the code in this file. The complicated part is to find the equivalent intrinsic names for GCC. | 
| 
 If you want to block this PR on submitting a T-libs-api ACP, you can do that, but I don't see why you would. | 
| 
 I only found the intrinsic name for x86 as  
 I don't think this should blocked on adding these to int types, just thought this might be a nice place to discuss it. I will go ahead and submit an independent ACP for that | 
| 
 It seems GCC doesn't have an arch-independent builtin for this, so I guess it's better for this PR to not implement the intrinsic for cg_gcc. | 
| ok thanks | 
| Some changes occurred in compiler/rustc_codegen_cranelift cc @bjorn3 Some changes occurred in compiler/rustc_codegen_gcc | 
| I have implemented  | 
| The Miri subtree was changed cc @rust-lang/miri | 
        
          
                library/core/src/intrinsics/simd.rs
              
                Outdated
          
        
      | pub unsafe fn simd_fshl<T>(a: T, b: T, shift: T) -> T; | ||
|  | ||
| /// Funnel Shifts vector right elementwise, with UB on overflow. | ||
| /// | ||
| /// Concatenates `a` and `b` elementwise (with `a` in the most significant half), | ||
| /// creating a vector of the same length, but with each element being twice as | ||
| /// wide. Then shift this vector right elementwise by `shift`, shifting in zeros, | ||
| /// and extract the least significant half of each of the elements. If `a` and `b` | ||
| /// are the same, this is equivalent to an elementwise rotate right operation. | ||
| /// | ||
| /// `T` must be a vector of integers. | ||
| /// | ||
| /// # Safety | ||
| /// | ||
| /// Each element of `shift` must be less than `<int>::BITS`. | ||
| #[rustc_intrinsic] | ||
| #[rustc_nounwind] | ||
| pub unsafe fn simd_fshr<T>(a: T, b: T, shift: T) -> T; | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
So, I have been thinking and I can't get over the bit where the current mnemonic sounds like it has something to do with floats in my head. I think we might want to call out these names as simd_funnel_shl and simd_funnel_shr? I looked around and we get names like __funnelshift_lc and __funnelshift_rc for similar intrinsics (e.g. CUDA), and I know that some languages actually do support shifts on floats, with... sometimes interesting semantics.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah, simd_funnel_sh{l,r} sounds nice enough (with the scalar functions as funnel_sh{l,r}). I will change the name
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I have changed the names. Btw @workingjubilee as you seem to be interested, should I r? you?
simd_fsh{l,r} and simd_round_ties_even| ☔ The latest upstream changes (presumably #141700) made this pull request unmergeable. Please resolve the merge conflicts. | 
| ☔ The latest upstream changes (presumably #142259) made this pull request unmergeable. Please resolve the merge conflicts. | 
| @workingjubilee does this need any more changes, or is it ready to be merged now? | 
| I think we're good. @bors r+ rollup | 
| Wait. @bors r- | 
| // Unary integer intrinsics | ||
| if matches!( | ||
| name, | ||
| sym::simd_bswap | sym::simd_bitreverse | sym::simd_ctlz | sym::simd_ctpop | sym::simd_cttz | ||
| sym::simd_bswap | ||
| | sym::simd_bitreverse | ||
| | sym::simd_ctlz | ||
| | sym::simd_ctpop | ||
| | sym::simd_cttz | ||
| | sym::simd_funnel_shl | ||
| | sym::simd_funnel_shr | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
funnel shift isn't exactly unary, now is it? probably should update the comment and admit this is our junk drawer of "everything else" right now.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
though on line 2388 there is an actual binary operation on SIMD integers
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yeah I just thought it would be nice to bunch them with other integer intrinsics (like an integer equivalent of simd_simple_float_intrinsic). If you want I can also add simd_saturating_{add,sub} to this list, but the annoying thing is that they use InvalidMonomorphization::ExpectedVectorElementType (which is badly named imo) for some reason, where all others use InvalidMonomorphization::UnsupportedOperation, so it would change some ui test messages if I do it. Should I do it, or just change the comment for now?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Many of these intrinsic error types are poorly named -- feel free to adjust the names (and messages) if it makes sense. :) Just please don't spend too much complexity on this: stable code cannot even "see" these checks, so it's not worth having a lot of code to make them extra nice.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I really don't want to do that in this PR. I will send another PR that will refactor intrinsic.rs. For now, @workingjubilee I have changed the comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Erm, @workingjubilee a smalll ping
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Whoops. Hi!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hm, I don't see the comment change but I don't want to delay this further because yes it should land before a refactor. ^^;
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Oh it seems like I just forgot to push it 🤦♂️
| @bors r+ | 
Rollup of 4 pull requests Successful merges: - #142078 (Add SIMD funnel shift and round-to-even intrinsics) - #142214 (`tests/ui`: A New Order [9/N]) - #142417 (`tests/ui`: A New Order [12/N]) - #143030 (Fix suggestion spans inside macros for the `unused_must_use` lint) r? `@ghost` `@rustbot` modify labels: rollup
Rollup merge of #142078 - sayantn:more-intrinsics, r=workingjubilee Add SIMD funnel shift and round-to-even intrinsics This PR adds 3 new SIMD intrinsics - `simd_funnel_shl` - funnel shift left - `simd_funnel_shr` - funnel shift right - `simd_round_ties_even` (vector version of `round_ties_even_fN`) TODO (future PR): implement `simd_fsh{l,r}` in miri, cg_gcc and cg_clif (it is surprisingly hard to implement without branches, the common tricks that rotate uses doesn't work because we have 2 elements now. e.g, the `-n&31` trick used by cg_gcc to implement rotate doesn't work with this because then `fshl(a, b, 0)` will be `a | b`) [#t-compiler > More SIMD intrinsics](https://rust-lang.zulipchat.com/#narrow/channel/131828-t-compiler/topic/More.20SIMD.20intrinsics/with/522130286) `@rustbot` label T-compiler T-libs A-intrinsics F-core_intrinsics r? `@workingjubilee`
Rollup of 4 pull requests Successful merges: - rust-lang/rust#142078 (Add SIMD funnel shift and round-to-even intrinsics) - rust-lang/rust#142214 (`tests/ui`: A New Order [9/N]) - rust-lang/rust#142417 (`tests/ui`: A New Order [12/N]) - rust-lang/rust#143030 (Fix suggestion spans inside macros for the `unused_must_use` lint) r? `@ghost` `@rustbot` modify labels: rollup
…jubilee
Add SIMD funnel shift and round-to-even intrinsics
This PR adds 3 new SIMD intrinsics
 - `simd_funnel_shl` - funnel shift left
 - `simd_funnel_shr` - funnel shift right
 - `simd_round_ties_even` (vector version of `round_ties_even_fN`)
TODO (future PR): implement `simd_fsh{l,r}` in miri, cg_gcc and cg_clif (it is surprisingly hard to implement without branches, the common tricks that rotate uses doesn't work because we have 2 elements now. e.g, the `-n&31` trick used by cg_gcc to implement rotate doesn't work with this because then `fshl(a, b, 0)` will be `a | b`)
[#t-compiler > More SIMD intrinsics](https://rust-lang.zulipchat.com/#narrow/channel/131828-t-compiler/topic/More.20SIMD.20intrinsics/with/522130286)
`@rustbot` label T-compiler T-libs A-intrinsics F-core_intrinsics
r? `@workingjubilee`
    …jubilee
Add SIMD funnel shift and round-to-even intrinsics
This PR adds 3 new SIMD intrinsics
 - `simd_funnel_shl` - funnel shift left
 - `simd_funnel_shr` - funnel shift right
 - `simd_round_ties_even` (vector version of `round_ties_even_fN`)
TODO (future PR): implement `simd_fsh{l,r}` in miri, cg_gcc and cg_clif (it is surprisingly hard to implement without branches, the common tricks that rotate uses doesn't work because we have 2 elements now. e.g, the `-n&31` trick used by cg_gcc to implement rotate doesn't work with this because then `fshl(a, b, 0)` will be `a | b`)
[#t-compiler > More SIMD intrinsics](https://rust-lang.zulipchat.com/#narrow/channel/131828-t-compiler/topic/More.20SIMD.20intrinsics/with/522130286)
`@rustbot` label T-compiler T-libs A-intrinsics F-core_intrinsics
r? `@workingjubilee`
    
This PR adds 3 new SIMD intrinsics
simd_funnel_shl- funnel shift leftsimd_funnel_shr- funnel shift rightsimd_round_ties_even(vector version ofround_ties_even_fN)TODO (future PR): implement
simd_fsh{l,r}in miri, cg_gcc and cg_clif (it is surprisingly hard to implement without branches, the common tricks that rotate uses doesn't work because we have 2 elements now. e.g, the-n&31trick used by cg_gcc to implement rotate doesn't work with this because thenfshl(a, b, 0)will bea | b)#t-compiler > More SIMD intrinsics
@rustbot label T-compiler T-libs A-intrinsics F-core_intrinsics
r? @workingjubilee