-
Notifications
You must be signed in to change notification settings - Fork 470
Description
Great work on LlamaSharp. The new version was easy to integrate with and works great with the llama-2-7b-chat.Q4_K_M.gguf model.
I am writing a WPF app and my dev machine has CUDA installed. I used the CUDA backed and it works great, uses the GPU fine.
I added the CPU backend because I am not sure, if the other client computers would have CUDA installed or even have a NVidia GPU. When doing so, the speed dropped and it was not using the GPU.
Is it possible to do CUDA detection and chose the CUDA backend over the CPU backend?
Something like this, with CUDA checking?
#65
An Auto detection Universal Backend package that contains all the DLLs and picks the correct one will work great for clients apps where the developer cannot control the machine specs.
Thanks,
Ash