moejay commented May 13, 2023

Add an n_gpu_layers param to the llama-cpp integration that allows offloading some processing to the GPU.

This is in preparation for the added GPU support in llama-cpp-python (abetlen/llama-cpp-python#203), for when it is approved and merged.

This is the llama-cpp commit that this PR adds support for
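
For illustration, a minimal usage sketch of the new parameter through the LangChain wrapper, assuming this PR and the updated binding are installed (the model path and layer count are illustrative):

```python
# Sketch: offload part of the model to the GPU via the new n_gpu_layers param.
from langchain.llms import LlamaCpp

llm = LlamaCpp(
    model_path="./models/ggml-model-q4_0.bin",  # illustrative local model path
    n_gpu_layers=32,  # number of layers to offload to the GPU
)
print(llm("Q: What is the capital of France? A:"))
```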

Before submitting

  • No new integration

Who can review?

Thanks all for the wonderful project!

m0sh1x2 commented May 14, 2023

Is there an easy way to test all of the functionality in one go, or are there quite a lot of chained merges below?

moejay (Author) commented May 14, 2023

> Is there an easy way to test all of the functionality in one go, or are there quite a lot of chained merges below?

Not that I can think of (this is dependent on just one PR, though), so maybe it's not too bad to test.

You need the llama-cpp-python binding to be updated; for that, you could pip install from https://github.com/moejay/llama-cpp-python (that's my fork with the update applied). See the rough sketch below.
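
For example, a rough smoke test against the binding directly (a sketch; the model path is illustrative):

```python
# Install the updated fork first, e.g.:
#   pip install git+https://github.com/moejay/llama-cpp-python
from llama_cpp import Llama

# If the constructor accepts n_gpu_layers without a TypeError,
# the updated binding is in place.
llm = Llama(
    model_path="./models/ggml-model-q4_0.bin",  # any local GGML model
    n_gpu_layers=20,  # layers to offload to the GPU
)
out = llm("Q: Name three colors. A:", max_tokens=32)
print(out["choices"][0]["text"])
```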

I can add some more detailed testing instructions in the PR description when I come back later this evening if that helps.

I did notice that a bunch of the examples were broken (there are probably existing issues for those already; maybe I'll be able to contribute some fixes later on).

moejay (Author) commented May 15, 2023

Looks like the same change was merged a bit later here, so this should now work with the latest llama-cpp-python bindings.

dev2049 (Contributor) commented May 17, 2023

duplicate of #4739

dev2049 closed this May 17, 2023