Closed as not planned
Labels: stale (stale issue will be autoclosed soon)
Description
Loading a Microsoft Phi-4-mini-instruct model (4-bit quantization) fails with:
unknown pre-tokenizer type: 'gpt-4o'
This issue was already addressed in llama.cpp b4792.
Reproduction Steps
LLamaWeights.LoadFromFile(new ModelParams("Phi-4-mini-instruct-Q4_K_M.gguf"));
Environment & Configuration
- Operating system: Windows 11
- .NET runtime version: 9
- LLamaSharp version: 0.21.0
- CUDA version (if you are using cuda backend): 12
Known Workarounds
- Download the newer llama.cpp release b4792.
- Use NativeLibraryConfig.LLama.WithLibrary to load the downloaded llama.dll.
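The workaround above might look roughly like the following. This is a minimal sketch, not a verified fix: the DLL path is a hypothetical location, and it assumes that WithLibrary accepts the full path to the native library and is called before any other LLamaSharp API touches the native backend. A llama.dll newer than the one LLamaSharp 0.21.0 was built against is not guaranteed to be binary compatible.

```csharp
using LLama;
using LLama.Common;
using LLama.Native;

// Hypothetical path: wherever the llama.dll from the b4792 release was extracted.
const string nativeLibraryPath = @"C:\llama.cpp\b4792\llama.dll";

// Point LLamaSharp at the downloaded library instead of the bundled one.
// Assumption: this has to happen before the native library is first loaded.
NativeLibraryConfig.LLama.WithLibrary(nativeLibraryPath);

// Same call as in the reproduction steps, now served by the newer native library.
var parameters = new ModelParams("Phi-4-mini-instruct-Q4_K_M.gguf");
using var weights = LLamaWeights.LoadFromFile(parameters);
```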