Skip to content

[BUG]: Unknown pre-tokenizer type: 'gpt-4o' #1128

@koenigst

Description

@koenigst

Description

Loading a Microsoft Phi-4-mini-instruct (4bit quantization) model fails with:
unknown pre-tokenizer type: 'gpt-4o'
This issue was already addressed in llama.cpp b4792.

Reproduction Steps

LLamaWeights.LoadFromFile(new ModelParams("Phi-4-mini-instruct-Q4_K_M.gguf"));

Environment & Configuration

  • Operating system: Windows 11
  • .NET runtime version: 9
  • LLamaSharp version: 0.21.0
  • CUDA version (if you are using cuda backend): 12

Known Workarounds

Metadata

Metadata

Assignees

No one assigned

    Labels

    staleStale issue will be autoclosed soon

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions