aviatesk commented Dec 1, 2021

This commit sets up basic infrastructure for benchmarking the
Julia-level compilation pipeline.
`InferenceBenchmarks` is based on `InferenceBenchmarker <: AbstractInterpreter`,
which maintains its own global inference cache, so it allows us to
run the compilation pipeline multiple times without reusing caches
generated by previous runs.
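For reference, here is a heavily simplified sketch of what such a custom interpreter can look like; everything except the `InferenceBenchmarker` name is illustrative, the `Core.Compiler` interface is version-dependent and only partially shown, and the actual definition in this PR differs:

```julia
# Minimal sketch (not the actual code added in this PR) of an interpreter
# that carries its own inference cache; the Core.Compiler interface is
# version-dependent and only partially shown here.
using Core.Compiler: AbstractInterpreter, NativeInterpreter,
    InferenceParams, OptimizationParams, InferenceResult, get_world_counter

struct InferenceBenchmarker <: AbstractInterpreter
    native::NativeInterpreter            # reuse the default parameters
    inf_cache::Vector{InferenceResult}   # local inference cache
end
InferenceBenchmarker() = InferenceBenchmarker(NativeInterpreter(), InferenceResult[])

# Forward the required queries to the wrapped NativeInterpreter, except for
# the inference cache, which is kept local so that constructing a fresh
# InferenceBenchmarker for each run discards all previously cached results.
Core.Compiler.InferenceParams(interp::InferenceBenchmarker) = InferenceParams(interp.native)
Core.Compiler.OptimizationParams(interp::InferenceBenchmarker) = OptimizationParams(interp.native)
Core.Compiler.get_world_counter(interp::InferenceBenchmarker) = get_world_counter(interp.native)
Core.Compiler.get_inference_cache(interp::InferenceBenchmarker) = interp.inf_cache
```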

I set up a top-level benchmark group named `"inference"` (`InferenceBenchmarks`),
which is composed of the following subgroups:

- `"inference"`: benchmarks the overall Julia-level compilation pipeline
- `"abstract interpretation"`: benchmarks only abstract interpretation,
  i.e. without optimization
- `"optimization"`: benchmarks only optimization

Here is an example of a benchmark result obtained by comparing two
`JuliaLang/julia` commits, `5c357e9` and `d515f05`:

```julia
# built on 5c357e9
using BenchmarkTools, BaseBenchmarks
BaseBenchmarks.load!("inference")
results = run(BaseBenchmarks.SUITE; verbose = true)
BenchmarkTools.save("5c357e9.json", results)

# built on d515f05
using BenchmarkTools, BaseBenchmarks
BaseBenchmarks.load!("inference")
results = run(BaseBenchmarks.SUITE; verbose = true)
BenchmarkTools.save("d515f05.json", results)

# compare
using BenchmarkTools, BaseBenchmarks
base = BenchmarkTools.load("5c357e9.json")[1]
target = BenchmarkTools.load("d515f05.json")[1]
```
```
julia> leaves(regressions(judge(minimum(target), minimum(base))))
Any[]

julia> leaves(improvements(judge(minimum(target), minimum(base))))
6-element Vector{Any}:
 (Any["inference", "inference", "rand(Float64)"], TrialJudgement(-2.85% => invariant))
 (Any["inference", "inference", "sin(42)"], TrialJudgement(-2.44% => invariant))
 (Any["inference", "inference", "abstract_call_gf_by_type"], TrialJudgement(-1.97% => invariant))
 (Any["inference", "inference", "println(::QuoteNode)"], TrialJudgement(-0.96% => invariant))
 (Any["inference", "optimization", "sin(42)"], TrialJudgement(+1.26% => invariant))
 (Any["inference", "optimization", "println(::QuoteNode)"], TrialJudgement(-6.97% => improvement))
```

This result is very satisfying: the refactor in `d515f05` indeed
improved Julia-level compilation performance by avoiding domtree
construction in the SROA pass in many cases.

aviatesk force-pushed the inf branch 4 times, most recently from a8f0a4a to d386165 on December 1, 2021
aviatesk commented Dec 1, 2021

The failure on Julia nightly is because this newly added benchmark suite hasn't been tuned yet, so it gets tuned to something like `evals=2`: https://github.com/JuliaCI/BaseBenchmarks.jl/runs/4379154336?check_suite_focus=true#step:5:7094
With more than one evaluation per sample, the `setup` phase is not re-run before each evaluation, which causes the failure.

I confirmed this benchmark suite works correctly on my machine.
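
For illustration (a generic BenchmarkTools example, not one of the benchmarks in this suite), this is how a setup-dependent benchmark breaks once more than one evaluation runs per `setup`:

```julia
using BenchmarkTools

# `setup` runs once per sample, so with `evals = 2` the second evaluation
# sees the state the first evaluation left behind and errors.
b = @benchmarkable pop!(v) setup=(v = [1])
run(b; samples = 1, evals = 2)  # second `pop!` hits an empty vector
```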

vtjnash commented Dec 1, 2021

I think you need to specify `evals=1` to `@benchmarkable`
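
For example (again a generic sketch, not the exact benchmark definitions in this suite), `evals` can be fixed when the benchmark is declared:

```julia
using BenchmarkTools

# With `evals = 1` every evaluation is preceded by its own `setup` run,
# so each measurement starts from fresh state.
b = @benchmarkable pop!(v) setup=(v = [1]) evals=1
run(b; samples = 100)
```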

aviatesk commented Dec 2, 2021

Even though I set it manually here?

vtjnash commented Dec 2, 2021

That will work, assuming no other code later calls tune

aviatesk commented Dec 2, 2021

Ah, `evals = 2` is specified for our test case:

```julia
@test begin
    run(BaseBenchmarks.SUITE, verbose = true, samples = 1,
        evals = 2, gctrial = false, gcsample = false);
    true
end
```
