Closed
Description
Name and Version
36c1532 (master)
Operating systems
Linux
Which llama.cpp modules do you know to be affected?
libllama (core library)
Command line
./bin/test-backend-ops -o IM2COL -b CUDA0 test
./bin/test-backend-ops -o IM2COL -b Vulkan0 test
Problem description & steps to reproduce
Steps to reproduce:
Add a test case such as test_cases.emplace_back(new test_im2col(GGML_TYPE_F32, GGML_TYPE_F16, GGML_TYPE_F16, {5, 5, 1, 32}, {3, 4, 1, 32}, 1, 1, 0, 0, 1, 1, true));
to test-backend-ops, or use the branch at
https://github.com/etasnadi/llama.cppxx/tree/im2col_issue
Then run the commands listed under "Command line".
Problem:
The new test case uses a non-square kernel (KW != KH; here KW = 3, KH = 4). It fails on both the CUDA and Vulkan backends on my device.
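To make the KW != KH case concrete, here is a minimal CPU reference sketch of 2D im2col. It is not the ggml implementation: the names `Im2colParams`, `conv_out_size`, and `im2col_ref` are hypothetical, and the memory layouts (source as [N, IC, IH, IW], destination as [N, OH, OW, IC*KH*KW], both row-major) are assumptions chosen for illustration. With the shapes from the test case it gives OW = 3 and OH = 2. A square kernel cannot expose an index mix-up between KW and KH, because swapping the two changes nothing when they are equal.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Output extent along one axis (standard convolution size formula).
static int64_t conv_out_size(int64_t in, int64_t k, int s, int p, int d) {
    return (in + 2 * p - d * (k - 1) - 1) / s + 1;
}

// Hypothetical parameter bundle; field names mirror the test case arguments.
struct Im2colParams {
    int64_t IW, IH, IC, N; // input width, height, channels, batch
    int64_t KW, KH;        // kernel width, height (may differ)
    int s0, s1;            // stride along width, height
    int p0, p1;            // padding along width, height
    int d0, d1;            // dilation along width, height
};

// Reference im2col. Assumed layouts (row-major):
//   src: [N, IC, IH, IW]
//   dst: [N, OH, OW, IC*KH*KW]
static std::vector<float> im2col_ref(const std::vector<float> & src, const Im2colParams & p) {
    const int64_t OW = conv_out_size(p.IW, p.KW, p.s0, p.p0, p.d0);
    const int64_t OH = conv_out_size(p.IH, p.KH, p.s1, p.p1, p.d1);
    assert((int64_t) src.size() == p.N * p.IC * p.IH * p.IW);

    const int64_t patch = p.IC * p.KH * p.KW; // elements per output position
    std::vector<float> dst(p.N * OH * OW * patch, 0.0f);

    for (int64_t n = 0; n < p.N; ++n)
    for (int64_t oh = 0; oh < OH; ++oh)
    for (int64_t ow = 0; ow < OW; ++ow)
    for (int64_t ic = 0; ic < p.IC; ++ic)
    for (int64_t kh = 0; kh < p.KH; ++kh)
    for (int64_t kw = 0; kw < p.KW; ++kw) {
        // Width uses (s0, d0, p0, KW); height uses (s1, d1, p1, KH).
        const int64_t iw = ow * p.s0 + kw * p.d0 - p.p0;
        const int64_t ih = oh * p.s1 + kh * p.d1 - p.p1;
        const int64_t di = ((n * OH + oh) * OW + ow) * patch
                         + ic * (p.KH * p.KW) + kh * p.KW + kw;
        if (iw >= 0 && iw < p.IW && ih >= 0 && ih < p.IH) {
            dst[di] = src[((n * p.IC + ic) * p.IH + ih) * p.IW + iw];
        } // out-of-bounds taps stay zero (padding)
    }
    return dst;
}
```

For the failing case one would call `im2col_ref(src, {5, 5, 1, 32, 3, 4, 1, 1, 0, 0, 1, 1})` with a 32*1*5*5 input buffer. The result holds 32 * 2 * 3 * 12 = 2304 elements and could be compared against a backend's output when narrowing down the mismatch.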
First Bad Commit
No response
Relevant log output
[IM2COL] NMSE = 0.705221575 > 0.000000100 IM2COL(type_input=f32,type_kernel=f16,dst_type=f16,ne_input=[5,5,1,32],ne_kernel=[3,4,1,32],s0=1,s1=1,p0=0,p1=0,d0=1,d1=1,is_2D=1): FAIL
ggml_backend_cuda_graph_compute: disabling CUDA graphs due to GPU architecture
ggml_backend_cuda_graph_compute: disabling CUDA graphs due to GPU architecture
ggml_backend_cuda_graph_compute: disabling CUDA graphs due to GPU architecture
ggml_backend_cuda_graph_compute: disabling CUDA graphs due to GPU architecture
ggml_backend_cuda_graph_compute: disabling CUDA graphs due to GPU architecture
ggml_backend_cuda_graph_compute: disabling CUDA graphs due to GPU architecture