Skip to content

Exclude lib*jl from coverage also for CUSTATEVEC, CUTENSOR, and CUTENSORNET #2669

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
Feb 20, 2025

Conversation

kshyatt
Copy link
Member

@kshyatt kshyatt commented Feb 20, 2025

No description provided.

@kshyatt
Copy link
Member Author

kshyatt commented Feb 20, 2025

Enzyme error doesn't look like it could be related?

@kshyatt kshyatt requested a review from maleadt February 20, 2025 18:02
@maleadt maleadt merged commit 8b93480 into master Feb 20, 2025
1 of 2 checks passed
@maleadt maleadt deleted the ksh/lib_excl branch February 20, 2025 19:47
Copy link
Contributor

@github-actions github-actions bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

CUDA.jl Benchmarks

Benchmark suite Current: d50ecb4 Previous: 1786fd1 Ratio
latency/precompile 46228266955.5 ns 46250691468.5 ns 1.00
latency/ttfp 7050680444 ns 6940238537 ns 1.02
latency/import 3671506646 ns 3620061997 ns 1.01
integration/volumerhs 9611309.5 ns 9624100 ns 1.00
integration/byval/slices=1 146704 ns 146960 ns 1.00
integration/byval/slices=3 425172 ns 425196 ns 1.00
integration/byval/reference 144845 ns 144970 ns 1.00
integration/byval/slices=2 285923 ns 285990 ns 1.00
integration/cudadevrt 103196 ns 103427 ns 1.00
kernel/indexing 14011 ns 14103 ns 0.99
kernel/indexing_checked 14533 ns 14753 ns 0.99
kernel/occupancy 636.8930817610063 ns 647.6424242424242 ns 0.98
kernel/launch 2034 ns 2043.6 ns 1.00
kernel/rand 15442 ns 14449 ns 1.07
array/reverse/1d 19681 ns 19824 ns 0.99
array/reverse/2d 25046 ns 25406 ns 0.99
array/reverse/1d_inplace 10406 ns 11185 ns 0.93
array/reverse/2d_inplace 12091 ns 12761 ns 0.95
array/copy 20713 ns 21193 ns 0.98
array/iteration/findall/int 157550 ns 160244 ns 0.98
array/iteration/findall/bool 138495.5 ns 140488 ns 0.99
array/iteration/findfirst/int 153423.5 ns 154305 ns 0.99
array/iteration/findfirst/bool 154710 ns 155713.5 ns 0.99
array/iteration/scalar 71127 ns 72903 ns 0.98
array/iteration/logical 213720 ns 217936 ns 0.98
array/iteration/findmin/1d 41240 ns 41697 ns 0.99
array/iteration/findmin/2d 93741 ns 94800 ns 0.99
array/reductions/reduce/1d 41006 ns 44033 ns 0.93
array/reductions/reduce/2d 51481 ns 45300 ns 1.14
array/reductions/mapreduce/1d 35866.5 ns 42732 ns 0.84
array/reductions/mapreduce/2d 51039 ns 47788 ns 1.07
array/broadcast 20822 ns 21076 ns 0.99
array/copyto!/gpu_to_gpu 11591 ns 11783.5 ns 0.98
array/copyto!/cpu_to_gpu 207953 ns 211543 ns 0.98
array/copyto!/gpu_to_cpu 243288.5 ns 244994 ns 0.99
array/accumulate/1d 108442 ns 109266 ns 0.99
array/accumulate/2d 79828 ns 80430 ns 0.99
array/construct 1286.5 ns 1296 ns 0.99
array/random/randn/Float32 49454 ns 45708 ns 1.08
array/random/randn!/Float32 26460 ns 26943 ns 0.98
array/random/rand!/Int64 27095 ns 27059 ns 1.00
array/random/rand!/Float32 8511.666666666666 ns 8715.666666666666 ns 0.98
array/random/rand/Int64 35446 ns 38201 ns 0.93
array/random/rand/Float32 13018 ns 13271 ns 0.98
array/permutedims/4d 61077 ns 61202 ns 1.00
array/permutedims/2d 55146 ns 55721.5 ns 0.99
array/permutedims/3d 55637 ns 56694 ns 0.98
array/sorting/1d 2777083.5 ns 2776604 ns 1.00
array/sorting/by 3368139.5 ns 3367703.5 ns 1.00
array/sorting/2d 1084395.5 ns 1085001.5 ns 1.00
cuda/synchronization/stream/auto 1044.5 ns 1013.8461538461538 ns 1.03
cuda/synchronization/stream/nonblocking 6406.2 ns 6380.1 ns 1.00
cuda/synchronization/stream/blocking 809.0760869565217 ns 829.824074074074 ns 0.97
cuda/synchronization/context/auto 1171 ns 1194.4 ns 0.98
cuda/synchronization/context/nonblocking 6675 ns 6668 ns 1.00
cuda/synchronization/context/blocking 944.9795918367347 ns 935.85 ns 1.01

This comment was automatically generated by workflow using github-action-benchmark.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants