This repository was archived by the owner on Jun 24, 2024. It is now read-only.

Conversation

@LLukas22
Contributor

Closes #378.

Adds custom context scaling to llama, falcon, gpt-j, gpt-neox.

Adds an Option<ggml::CustomRoPEArguments> parameter to the ModelParameters.

Adds the optional --rope-base and --rope-scaling cli parameters.
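
To make the new surface concrete, here is a minimal, self-contained sketch of the shapes described above; the struct and field names are illustrative stand-ins for the `ggml::CustomRoPEArguments` / `ModelParameters` additions, not the crate's exact definitions, and the 4k→8k values are just an example:

```rust
// Illustrative stand-ins for the shapes this PR describes; the real
// `CustomRoPEArguments` lives in the `ggml` crate and its exact fields may differ.
#[derive(Debug, Clone, Copy)]
struct CustomRoPEArguments {
    frequency_base: usize, // corresponds to --rope-base (LLaMA default: 10_000)
    frequency_scale: f32,  // corresponds to --rope-scaling
}

#[derive(Debug, Default)]
struct ModelParameters {
    context_size: usize,
    rope_arguments: Option<CustomRoPEArguments>, // None => use the model's defaults
}

fn main() {
    // Request an 8k context from a model trained on 4k by halving the rotation speed.
    let params = ModelParameters {
        context_size: 8192,
        rope_arguments: Some(CustomRoPEArguments {
            frequency_base: 10_000,
            frequency_scale: 0.5,
        }),
    };
    println!("{params:?}");
}
```

Omitting the flags would presumably leave the option as `None` and keep the model's stock RoPE behaviour.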

Collaborator

@philpax philpax left a comment


Code looks good. What's the easiest way to test it?

@LLukas22
Contributor Author

  1. Sample command for an 8k context with LLaMA 2:
    cargo run --release --features cublas -- infer -a llama -m "C:\Users\lkreu\Downloads\llama-2-13b-chat.ggmlv3.q5_K_M.bin" -p "A llama riding a crab" --use-gpu --rope-scaling 0.5 --num-ctx-tokens 8192 --ignore-eos --stats

  2. Sit back and get some coffee ☕ (8192 tokens is a lot of tokens to generate)

A 16k context is also possible by setting rope-scaling to 0.25, but then I don't have enough VRAM to run inference on my GPU.
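
For reference, the scale factor appears to act as linear (SuperHOT-style) position interpolation: positions are multiplied by the factor before the rotary angles are computed, so 0.5 stretches LLaMA 2's 4096-token training window over 8192 positions and 0.25 over 16384. A small self-contained sketch of that arithmetic (head dimension and positions are illustrative):

```rust
// Minimal illustration of linear (SuperHOT-style) RoPE scaling: the rotary angle
// for position m and dimension pair i is theta_i = base^(-2i/d); scaling multiplies
// the position by the factor, so positions beyond the trained context map back
// into the range the model was trained on.
fn rope_angle(position: usize, pair_index: usize, head_dim: usize, base: f32, scale: f32) -> f32 {
    let theta = base.powf(-2.0 * pair_index as f32 / head_dim as f32);
    scale * position as f32 * theta
}

fn main() {
    let (base, head_dim) = (10_000.0_f32, 128);
    // With --rope-scaling 0.5, position 8191 lands at angle 4095.5 * theta_i,
    // i.e. inside the 0..4096 range a 4k-trained LLaMA 2 model has seen.
    let scaled = rope_angle(8191, 0, head_dim, base, 0.5);
    let unscaled = rope_angle(4095, 0, head_dim, base, 1.0);
    println!("scaled angle at 8191: {scaled:.1}, unscaled angle at 4095: {unscaled:.1}");
}
```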

@LLukas22
Contributor Author

The generated text gets repetitive after some time, but I guess that's a sampler/setting issue.
lama_story.txt

@philpax
Collaborator

philpax commented Jul 28, 2023

Great work! I just tested it with LLongMa-2; it's a bit finicky, but that shouldn't be a problem for us. I've revised the names a little to match llama.cpp / refer to frequency, but the rest is the same. Will merge once CI passes 🚀

@philpax philpax merged commit 9fe9f19 into rustformers:main Jul 28, 2023
@hhamud hhamud mentioned this pull request Aug 7, 2023


Successfully merging this pull request may close these issues.

Implement SuperHOT/interpolated RoPE support
