Add `n_gpu_layers` to llama.cpp #4679

garrettsutula · 2023-05-14T18:36:43Z

Add `n_gpu_layers` param to Llama.cpp model & embedding

Adds an `n_gpu_layers` parameter to the Llama.cpp model and embedding implementations so that models can be loaded and run with GPU support. See the upstream Llama.cpp PR for more info: ggml-org/llama.cpp#1412
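For reviewers, here is a rough sketch of how an optional `n_gpu_layers` value could be forwarded to the underlying model constructor. It is not the diff from this PR: `LlamaCppSettings` and `build_model_params` are hypothetical names made up for illustration, and the sketch assumes the backend constructor (for example `llama_cpp.Llama` from llama-cpp-python) accepts an `n_gpu_layers` keyword argument.

```python
from dataclasses import dataclass
from typing import Any, Dict, Optional


@dataclass
class LlamaCppSettings:
    """Hypothetical settings holder; not the wrapper class touched by this PR."""

    model_path: str
    n_ctx: int = 512
    # None means "do not pass the option", so the backend default stays in effect.
    n_gpu_layers: Optional[int] = None


def build_model_params(settings: LlamaCppSettings) -> Dict[str, Any]:
    """Collect keyword arguments intended for a call like llama_cpp.Llama(**params)."""
    params: Dict[str, Any] = {
        "model_path": settings.model_path,
        "n_ctx": settings.n_ctx,
    }
    # Forward n_gpu_layers only when the caller set it explicitly.
    if settings.n_gpu_layers is not None:
        params["n_gpu_layers"] = settings.n_gpu_layers
    return params


# Placeholder model path; no model file is opened in this sketch.
cpu_params = build_model_params(LlamaCppSettings(model_path="model.bin"))
gpu_params = build_model_params(
    LlamaCppSettings(model_path="model.bin", n_gpu_layers=32)
)
print(cpu_params)
print(gpu_params)
```

Keeping the value optional and omitting it when unset means existing CPU-only setups would see no change in the arguments passed to the backend.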

Who can review?

Community members can review the PR once tests pass. Tag maintainers/contributors who might be interested:

@hwchase17 @agola11

dev2049 · 2023-05-15T22:54:22Z

thanks @garrettsutula! seems to be a duplicate of #4739 so will close if that's ok

garrettsutula added 2 commits May 14, 2023 14:30

Add n_gpu_layers to llama.cpp model

7885121

Add n_gpu_layers to llama.cpp embedding

81bac20

dev2049 closed this May 15, 2023
