Auto loaded correct LlamaSharp backend for WPF app

Great work on LlamaSharp. The new version was easy to integrate with and works great with the llama-2-7b-chat.Q4_K_M.gguf model.

I am writing a WPF app and my dev machine has CUDA installed. I used the CUDA backed and it works great, uses the GPU fine. 

I added the CPU backend because I am not sure, if the other client computers would have CUDA installed or even have a NVidia GPU. When doing so, the speed dropped and it was not using the GPU. 

Is it possible to do CUDA detection and chose the CUDA backend over the CPU backend?
Something like this, with CUDA checking? 
https://github.com/SciSharp/LLamaSharp/pull/65

An Auto detection Universal Backend package that contains all the DLLs and picks the correct one will work great for clients apps where the developer cannot control the machine specs.

Thanks,
Ash



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Auto loaded correct LlamaSharp backend for WPF app #154

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Auto loaded correct LlamaSharp backend for WPF app #154

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions