optimizations: better modeling and codegen for apply and svec calls #59548
Conversation
    elseif is_known_call(stmt, Core._apply_iterate, compact)
        length(stmt.args) >= 4 || continue
        lift_apply_args!(compact, idx, stmt, 𝕃ₒ)
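For context, a splatted call `f(xs...)` lowers to `Core._apply_iterate(Base.iterate, f, xs)`, so a well-formed statement carries at least four arguments (the `_apply_iterate` callee, the iteration function, the target function, and one or more argument collections), which is what the `length(stmt.args) >= 4` guard checks. A minimal sketch for inspecting that lowering:

```julia
# Lower a splatted call and print the resulting CodeInfo; it contains a
# Core._apply_iterate(Base.iterate, f, xs) statement, i.e. four arguments.
lowered = Meta.lower(Main, :(f(xs...)))
println(lowered)
```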
Just commenting for reference, and we don't have to do this in this PR, but I started to think it'd be better to make this kind of optimization independent of `sroa_pass!`.
We might want to instead just rename the pass to GVN or MemSSAOpt, since doing everything in one pass is probably a lot more efficient, and either alternative name would reflect that this pass does general memory-value-replacement optimizations.
Force-pushed from fde4599 to 66687c1.
- Use `svec` instead of `tuple` for arguments (a better match for the ABI, which will require boxes).
- Directly forward a single `svec` argument, both in the runtime and in codegen, without copying.
- Optimize all consistent builtin functions of constant arguments, not just the ones with special tfuncs, reducing code duplication and divergence.
- Codegen for `svec()` directly, so the optimizer can see each store (and doesn't have to build the whole thing on the stack first).
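For context on the first point, `Core.svec` builds a `Core.SimpleVector` of boxed elements, which is why it matches the boxed argument-passing ABI better than a `tuple`; a short illustration:

```julia
# Core.svec creates a SimpleVector; every element is stored as a boxed
# value, matching the boxed argument-array ABI used for apply calls.
sv = Core.svec(1, 2.0, "three")
@assert sv isa Core.SimpleVector
@assert length(sv) == 3
@assert sv[1] === 1
```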
Force-pushed from 66687c1 to 19ad3be.
Without a release store, it seems LLVM considers it a data race to have read the initial state on another thread. Marking this as a release store seems sufficient to prevent that optimization. It is also more consistent with how we initialize and write to most other structs, particularly since #55767. Fixes #59547.
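To illustrate the memory-ordering point at the Julia level (a conceptual analogue only, with made-up names, not the codegen change itself):

```julia
# A release store pairs with an acquire load: a thread that acquires the
# stored value is guaranteed to also observe everything written before
# the release, so the compiler cannot treat the read as a data race.
mutable struct Box
    @atomic value::Int
end

b = Box(0)
@atomic :release b.value = 42   # writer publishes with release ordering
v = @atomic :acquire b.value    # reader pairs with an acquire load
@assert v == 0 || v == 42
```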
Backported to 1.12.
Further improves the implementation from #59548. Specifically, uses `widenconst` to enable conversion of `tuple` calls that have become `PartialStruct`, and removes incorrect comments and unused arguments. Also adds some Julia-IR level tests.
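For readers unfamiliar with the inference lattice, `widenconst` collapses extended lattice elements such as `Const` and `PartialStruct` down to a plain Julia type, which is what lets the pass recognize a partially-constant `tuple` call like any other; a minimal sketch, assuming the `Core.Compiler` names:

```julia
# widenconst maps inference lattice elements back to ordinary types; a
# PartialStruct widens to its declared struct/tuple type the same way.
const CC = Core.Compiler
@assert CC.widenconst(Core.Const(1)) === Int
```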
Adds a dedicated `_svec_len_nothrow` function that does more precise `:nothrow` modeling of `Core._svec_len`, which was introduced in #59548.
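A hedged sketch of what such a dedicated predicate can look like (hypothetical code, not the actual implementation):

```julia
# Core._svec_len only throws when its argument is not a SimpleVector, so
# :nothrow can be concluded whenever the argument type is precise enough.
function _svec_len_nothrow(argtypes::Vector{Any})
    length(argtypes) == 1 || return false
    return Core.Compiler.widenconst(argtypes[1]) <: Core.SimpleVector
end

_svec_len_nothrow(Any[Core.SimpleVector])  # true
_svec_len_nothrow(Any[Any])                # false
```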