
Conversation

@avigny

@avigny avigny commented Jun 10, 2025

Purpose

Fixes #13622
Fixes #17585
Fixes #20028

This PR is similar to #16096 (hermes tool parser)

In summary

Repairs tool calls in streaming mode for (older) models with tokenizer version < v11

The model output is incrementally parsed with ijson, which emits events indicating which part of the tool call is currently being streamed. For more details, see _extract_tool_calls_streaming_pre_v11_tokenizer (a rough sketch of the ijson event flow is shown after this list).
Quick unit tests added in tests/tool_use/test_mistral_tool_parser.py; see test_extract_tool_calls_streaming_pre_v11_tokenizer

Adds support for tool calls in streaming mode for recent models (tokenizer version >= v11)

See _extract_tool_calls_streaming for implementation details
Test added for mistralai/Mistral-Small-3.2-24B-Instruct-2506 in tests/tool_use/test_mistral_tool_parser.py
Quick unit tests added in tests/tool_use/test_mistral_tool_parser.py; see test_extract_tool_calls_streaming
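
To illustrate the pre-v11 path, here is a minimal, hypothetical sketch (not the PR's actual code) of ijson's push-mode parsing: chunks of the model output are fed in one at a time and ijson emits (prefix, event, value) tuples that indicate which part of the tool call has been parsed so far. The chunk contents below are made up for the example.

import ijson

events = ijson.sendable_list()      # target list that collects emitted events
parser = ijson.parse_coro(events)   # coroutine we can push byte chunks into

# hypothetical deltas following the [TOOL_CALLS] token (pre-v11 array format)
chunks = [b'[{"name": "get_we', b'ather", "arguments": {"ci', b'ty": "Paris"}}]']
for chunk in chunks:
    parser.send(chunk)
parser.close()

for prefix, event, value in events:
    # e.g. ('item.name', 'string', 'get_weather')
    #      ('item.arguments.city', 'string', 'Paris')
    print(prefix, event, value)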

Test Plan

I've added a test file tests/tool_use/test_mistral_tool_parser.py for easy and fast testing. This file works similarly to the existing tests/tool_use/test_jamba_tool_parser.py.

It tests the parsing functions with a mocked model output, which makes it easy to exercise edge cases.

Use pytest tests/tool_use/test_mistral_tool_parser.py to run this test file.

Test added for mistralai/Mistral-Small-3.2-24B-Instruct-2506 in tests/tool_use/test_mistral_tool_parser.py

(Optional) Documentation Update

I believe no documentation update is needed

@github-actions

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which covers a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build in the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either add the ready label to the PR or enable auto-merge.

🚀

Contributor

@gemini-code-assist gemini-code-assist bot left a comment


Summary of Changes

Hello @avigny, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request addresses an issue related to streaming tool calls for Mistral models by replacing the previous partial_json_parser-based implementation with a more robust, custom stateful parser. This new approach aims to accurately extract tool call information, including names and arguments, as tokens are streamed, improving the reliability of tool use functionality in streaming mode. The changes include a significant rewrite of the parsing logic and the addition of comprehensive test coverage.

Highlights

  • Refactor Streaming Parser: The core logic for parsing Mistral tool calls during streaming has been completely rewritten.
  • Removed Dependency: The dependency on the partial_json_parser library for streaming tool call extraction has been removed.
  • New Parsing Mechanism: Introduced a custom stateful parsing mechanism using regex and json.JSONDecoder.raw_decode to incrementally extract tool call names and arguments from the raw token stream (see the short illustration after this list).
  • Comprehensive Tests: Added a new, extensive test file (tests/tool_use/test_mistral_tool_parser.py) with various test cases covering both streaming and non-streaming scenarios for Mistral tool calls, including single and multiple tool calls with different argument structures.
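
As a short illustration of the mechanism mentioned above (a hypothetical snippet, not the PR's code): json.JSONDecoder.raw_decode decodes the leading JSON value in a buffer and reports where it ends, so any trailing, not-yet-parsed text can be kept for the next pass.

import json

decoder = json.JSONDecoder()

# hypothetical buffer: complete arguments followed by text that is not parsed yet
buffer = '{"location": "New York, NY"}[TOOL_CALLS]get_forecast'

obj, end = decoder.raw_decode(buffer)  # parses only the first JSON value
print(obj)           # {'location': 'New York, NY'}
print(buffer[end:])  # '[TOOL_CALLS]get_forecast' -> kept for the next pass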
Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page; here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point in your pull request by creating an issue comment (i.e., a comment on the pull request page) using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands.

Feature              | Command             | Description
Code Review          | /gemini review      | Performs a code review for the current pull request in its current state.
Pull Request Summary | /gemini summary     | Provides a summary of the current pull request in its current state.
Comment              | @gemini-code-assist | Responds in comments when explicitly tagged, both in issue comments and review comments.
Help                 | /gemini help        | Displays a list of available commands.

Customization

To customize the Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist is currently in preview and may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counterproductive. You can react with 👍 and 👎 on @gemini-code-assist comments to provide feedback.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Footnotes

  1. Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double-check it and use code with caution.

Contributor

@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request refactors the streaming tool call parsing logic for Mistral models and adds a comprehensive test suite. The core change involves replacing partial_json_parser with a custom regex and json.raw_decode-based approach for more fine-grained control over the streaming process. The new tests cover a variety of scenarios. The review includes stylistic suggestions for the tests and points for consideration regarding complexity and state management in the new parsing logic.

avigny added 4 commits June 11, 2025 10:12
Tests are similar to the ones added for Jamba models in vllm-project#9154

Signed-off-by: avigny <[email protected]>
@avigny avigny force-pushed the mistral-tool-parser-streaming-update branch from c468495 to d6d17c1 on June 11, 2025 08:13
@avigny avigny marked this pull request as ready for review June 11, 2025 09:25
@avigny avigny requested a review from aarnphm as a code owner June 11, 2025 09:25
@avigny
Author

avigny commented Jun 11, 2025

@hibukipanim I did run the test you provided in your issue description #17585 (comment) and got the following output:

ChoiceDeltaToolCall(index=0, id='j6OY9szTS', function=ChoiceDeltaToolCallFunction(arguments=None, name='mcp_confluence'), type='function')
ChoiceDeltaToolCall(index=0, id=None, function=ChoiceDeltaToolCallFunction(arguments='{"', name=None), type=None)
ChoiceDeltaToolCall(index=0, id=None, function=ChoiceDeltaToolCallFunction(arguments='query', name=None), type=None)
ChoiceDeltaToolCall(index=0, id=None, function=ChoiceDeltaToolCallFunction(arguments='":', name=None), type=None)
ChoiceDeltaToolCall(index=0, id=None, function=ChoiceDeltaToolCallFunction(arguments=' "', name=None), type=None)
ChoiceDeltaToolCall(index=0, id=None, function=ChoiceDeltaToolCallFunction(arguments='co', name=None), type=None)
ChoiceDeltaToolCall(index=0, id=None, function=ChoiceDeltaToolCallFunction(arguments='ffee', name=None), type=None)
ChoiceDeltaToolCall(index=0, id=None, function=ChoiceDeltaToolCallFunction(arguments='",', name=None), type=None)
ChoiceDeltaToolCall(index=0, id=None, function=ChoiceDeltaToolCallFunction(arguments=' "', name=None), type=None)
ChoiceDeltaToolCall(index=0, id=None, function=ChoiceDeltaToolCallFunction(arguments='limit', name=None), type=None)
ChoiceDeltaToolCall(index=0, id=None, function=ChoiceDeltaToolCallFunction(arguments='":', name=None), type=None)
ChoiceDeltaToolCall(index=0, id=None, function=ChoiceDeltaToolCallFunction(arguments=' ', name=None), type=None)
ChoiceDeltaToolCall(index=0, id=None, function=ChoiceDeltaToolCallFunction(arguments='1', name=None), type=None)
ChoiceDeltaToolCall(index=0, id=None, function=ChoiceDeltaToolCallFunction(arguments='}', name=None), type=None)

It seems to fix your issue.
Please let me know if I missed something.

@avigny avigny changed the title Mistral tool parser streaming update [Bugfix] Mistral tool parser streaming update Jun 11, 2025
@PedroMiolaSilva

@avigny hey!

I've been trying to test your solution, but with no success. This is what I'm doing:

source ../.env
export MODEL_ID=unsloth/Mistral-Small-3.2-24B-Instruct-2506-FP8
export MODEL_ID_PORT=8000
export MODEL_ID_GPU=0

docker run \
--runtime nvidia \
-e VLLM_USE_V1=1 \
--gpus all \
--ipc=host \
-p "${MODEL_ID_PORT}:8000" \
--env "HUGGING_FACE_HUB_TOKEN=${HUGGING_FACE_HUB_TOKEN}" \
--env "HF_HUB_OFFLINE=0" \
-v "${HF_HOME}:/root/.cache/huggingface" \
-v "./mistral_tool_parser.py:/usr/local/lib/python3.12/dist-packages/vllm/entrypoints/openai/tool_parsers/mistral_tool_parser.py" \
vllm/vllm-openai:latest \
-v "$(pwd):/app" \
--model ${MODEL_ID} \
--tool-call-parser mistral \
--chat-template /app/template.jinja \
--enable-auto-tool-choice \
--limit-mm-per-prompt 'image=1' \
--tokenizer_mode mistral \
--config_format mistral \
--load_format mistral \
--max-model-len 64000 \
--gpu-memory-utilization 0.8

Where template.jinja is this one and mistral_tool_parser.py is the one that you've created.

I'm using this test request:

curl -X POST \
   http://localhost:8000/v1/chat/completions \
   -H "Content-Type: application/json" \
   -d '{
   "model": "unsloth/Mistral-Small-3.2-24B-Instruct-2506-FP8",
   "messages": [
    {"role":"system","content":"You have access to the weather tool. You should call this tool when you think it makes sense"},
     {"role": "user", "content": "What'\''s the weather in New York?"}
   ],
   "tools": [
     {
       "type": "function",
       "function": {
         "name": "get_weather",
         "description": "Get the current weather in a given location",
         "parameters": {
           "type": "object",
           "properties": {
             "location": {
               "type": "string",
               "description": "The city and state, e.g. San Francisco, CA"
             }
           },
           "required": ["location"]
         }
       }
     }
   ]
 }'

When I set stream to false, I'm getting this response:

{"id":"chatcmpl-0dc2b75406114cbcb4f95735ccfdb094","object":"chat.completion","created":1751490167,"model":"unsloth/Mistral-Small-3.2-24B-Instruct-2506-FP8","choices":[{"index":0,"message":{"role":"assistant","reasoning_content":null,"content":"[TOOL_CALLS]get_weather{\"location\": \"New York, NY\"}","tool_calls":[]},"logprobs":null,"finish_reason":"stop","stop_reason":null}],"usage":{"prompt_tokens":112,"total_tokens":127,"completion_tokens":15,"prompt_tokens_details":null},"prompt_logprobs":null,"kv_transfer_params":null}

And this error:

ERROR 07-02 14:00:22 [mistral_tool_parser.py:160] Error in extracting tool call from response.
ERROR 07-02 14:00:22 [mistral_tool_parser.py:160] Traceback (most recent call last):
ERROR 07-02 14:00:22 [mistral_tool_parser.py:160]   File "/usr/local/lib/python3.12/dist-packages/vllm/entrypoints/openai/tool_parsers/mistral_tool_parser.py", line 131, in extract_tool_calls
ERROR 07-02 14:00:22 [mistral_tool_parser.py:160]     function_call_arr = json.loads(tool_content)
ERROR 07-02 14:00:22 [mistral_tool_parser.py:160]                         ^^^^^^^^^^^^^^^^^^^^^^^^
ERROR 07-02 14:00:22 [mistral_tool_parser.py:160]   File "/usr/lib/python3.12/json/__init__.py", line 346, in loads
ERROR 07-02 14:00:22 [mistral_tool_parser.py:160]     return _default_decoder.decode(s)
ERROR 07-02 14:00:22 [mistral_tool_parser.py:160]            ^^^^^^^^^^^^^^^^^^^^^^^^^^
ERROR 07-02 14:00:22 [mistral_tool_parser.py:160]   File "/usr/lib/python3.12/json/decoder.py", line 338, in decode
ERROR 07-02 14:00:22 [mistral_tool_parser.py:160]     obj, end = self.raw_decode(s, idx=_w(s, 0).end())
ERROR 07-02 14:00:22 [mistral_tool_parser.py:160]                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
ERROR 07-02 14:00:22 [mistral_tool_parser.py:160]   File "/usr/lib/python3.12/json/decoder.py", line 356, in raw_decode
ERROR 07-02 14:00:22 [mistral_tool_parser.py:160]     raise JSONDecodeError("Expecting value", s, err.value) from None
ERROR 07-02 14:00:22 [mistral_tool_parser.py:160] json.decoder.JSONDecodeError: Expecting value: line 1 column 1 (char 0)
ERROR 07-02 14:00:22 [mistral_tool_parser.py:160] 
ERROR 07-02 14:00:22 [mistral_tool_parser.py:160] During handling of the above exception, another exception occurred:
ERROR 07-02 14:00:22 [mistral_tool_parser.py:160] 
ERROR 07-02 14:00:22 [mistral_tool_parser.py:160] Traceback (most recent call last):
ERROR 07-02 14:00:22 [mistral_tool_parser.py:160]   File "/usr/local/lib/python3.12/dist-packages/vllm/entrypoints/openai/tool_parsers/mistral_tool_parser.py", line 137, in extract_tool_calls
ERROR 07-02 14:00:22 [mistral_tool_parser.py:160]     raw_tool_call = self.tool_call_regex.findall(tool_content)[0]
ERROR 07-02 14:00:22 [mistral_tool_parser.py:160]                     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^^^
ERROR 07-02 14:00:22 [mistral_tool_parser.py:160] IndexError: list index out of range

When I set stream=true, I don't receive any errors, but the response does not have tool calls:

data: {"id":"chatcmpl-028934e8ee754938943457f631313546","object":"chat.completion.chunk","created":1751490269,"model":"unsloth/Mistral-Small-3.2-24B-Instruct-2506-FP8","choices":[{"index":0,"delta":{"role":"assistant","content":""},"logprobs":null,"finish_reason":null}]}

Am I doing something wrong here?

@rdlh

rdlh commented Jul 3, 2025

Looks like this PR unfortunately doesn't fix the issues on Mistral Small 3.2.

API call:

{
    "stream": false,
    "temperature": 0.15,
    "top_p": 1.0,
    "tool_choice": "auto",
    "model": "mistralai/Mistral-Small-3.2-24B-Instruct-2506",
    "messages": [
        {
            "role": "user",
            "content": "Hi ! What's the result of 95478415 / 4571 ?"
        }
    ],
    "tools": [
        {
            "type":"function",
            "function": {
            "name":"calculator",
            "description":"Perform a basic calculation using ruby syntax for arithmetic operations.",
            "parameters": {
                "type":"object",
                "properties": {
                "calculation": {
                    "type":"string",
                    "description":"A basic arithmetic calculation in python language (e.g., \"2+2\", \"10*3\", \"45/9\").",
                    "required":["calculation"]
                }
                },
                "required":["calculation"]
            }
            }
        }
    ]
}

I still have this error:

ERROR 07-03 01:55:20 [mistral_tool_parser.py:166] Error in extracting tool call from response.
ERROR 07-03 01:55:20 [mistral_tool_parser.py:166] Traceback (most recent call last):
ERROR 07-03 01:55:20 [mistral_tool_parser.py:166]     function_call_arr = json.loads(tool_content)

Here are some logs:

=== model_output ===
[TOOL_CALLS]calculator{"calculation": "95478415 / 4571"}
=== tool_content ===
calculator{"calculation": "95478415 / 4571"}

Please note that this issue is NOT happening when using "tool_choice": "required".

@avigny
Author

avigny commented Jul 3, 2025

Yes, you're both right!
I believe I branched off and started working on my fix before #19193, which introduced the use of fn_name_regex from the model tokenizer.
I'll try to port this to the extract_tool_calls_streaming method.

Thanks for finding this!
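
For illustration only, here is a minimal sketch of splitting the v11+ output into a name and JSON arguments; the pattern below is an assumed stand-in, since the actual fn_name_regex comes from the model tokenizer:

import json
import re

# assumed stand-in pattern; the real fn_name_regex is provided by the tokenizer
fn_name_regex = re.compile(r"([a-zA-Z0-9_-]+)(\{.*\})", re.DOTALL)

tool_content = 'calculator{"calculation": "95478415 / 4571"}'
match = fn_name_regex.match(tool_content)
name, arguments = match.group(1), json.loads(match.group(2))
print(name, arguments)  # calculator {'calculation': '95478415 / 4571'}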

@gaby

gaby commented Jul 4, 2025

Any update on getting this merged?

@DarkLight1337
Member

cc @aarnphm

@sjuxax
Contributor

sjuxax commented Jul 4, 2025

So I did more complete testing and found this wasn't working that well after all -- I was getting the same errors reported above. Not sure what happened in my initial testing. But I've since taken it and have a working implementation, for streaming at least, at https://github.com/sjuxax/vllm/tree/Mistral3.2-tool-call-fix. I'm going to cherry-pick it onto #20471 in a sec. Using that branch should then work with quantized HF models and tool calling.


@PedroMiolaSilva PedroMiolaSilva left a comment


I think replacing lines 127-139 with the code below will fix it for non-streaming:

# First, split on the tool call token and discard the first item, because it is empty
raw_tool_calls = model_output.split(self.bot_token)[1:]
function_call_arr = []
for raw_tool_call in raw_tool_calls:
    tool_name = raw_tool_call.split("{")[0]
    tool_arguments_begin = raw_tool_call.find("{")
    tool_arguments = raw_tool_call[tool_arguments_begin:]
    function_call_arr.append({
        "name": tool_name,
        "arguments": json.loads(tool_arguments),
    })
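
For illustration, here is (essentially) that snippet run standalone against the non-streaming output reported above; bot_token stands in for self.bot_token:

import json

bot_token = "[TOOL_CALLS]"  # stands in for self.bot_token
model_output = '[TOOL_CALLS]get_weather{"location": "New York, NY"}'

raw_tool_calls = model_output.split(bot_token)[1:]
function_call_arr = []
for raw_tool_call in raw_tool_calls:
    tool_name = raw_tool_call.split("{")[0]
    tool_arguments = raw_tool_call[raw_tool_call.find("{"):]
    function_call_arr.append({"name": tool_name, "arguments": json.loads(tool_arguments)})

print(function_call_arr)
# [{'name': 'get_weather', 'arguments': {'location': 'New York, NY'}}]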

sjuxax pushed a commit to sjuxax/vllm that referenced this pull request Jul 4, 2025
@mergify mergify bot removed the needs-rebase label Oct 7, 2025
@mergify

mergify bot commented Oct 16, 2025

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @avigny.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

@mergify mergify bot added the needs-rebase label Oct 16, 2025
…-streaming-update

# Conflicts:
#	vllm/entrypoints/openai/tool_parsers/mistral_tool_parser.py
@mergify mergify bot removed the needs-rebase label Oct 16, 2025
Signed-off-by: avigny <[email protected]>
@avigny
Author

avigny commented Oct 16, 2025

About the errors during streaming parsing: I've added a try/except to match the other parsers.
In case of an error, the streaming parser returns None, as other tool parsers do. I don't think there is an easy way to return the whole model output without duplicated or missing tokens, since in some cases tokens are buffered before being returned in a delta tool call...

except Exception:
    logger.exception("Error trying to handle streaming tool call.")
    logger.debug(
        "Skipping chunk as a result of tool streaming extraction error"
    )
    return None

except Exception:
    logger.exception("Error trying to handle streaming tool call.")
    return None  # do not stream a delta. skip this token ID.

@bbrowning
Contributor

@avigny I didn't review all of the commits from scratch again, but the latest commits checking just for the bot_token_id (vs text) and error handling look good to me. Thanks for getting this in good shape!!

@avigny avigny requested a review from hmellor October 24, 2025 12:38
@avigny avigny force-pushed the mistral-tool-parser-streaming-update branch from ed1a188 to 47ce613 on October 24, 2025 12:48
@alew3

alew3 commented Oct 26, 2025

@avigny I can see you are putting in a lot of work getting mistral tool calling to work! Thanks for all your effort! Can you update what the current status is?

@avigny
Author

avigny commented Oct 27, 2025

@avigny I can see you are putting in a lot of work getting mistral tool calling to work! Thanks for all your effort! Can you update what the current status is?

Hi @alew3, I'm essentially waiting for a code owner green light to get this PR merged :)

@DarkLight1337
Member

cc @chaunceyjiang can you take a look?

Collaborator

@patrickvonplaten patrickvonplaten left a comment


Very cool!

@hmellor hmellor dismissed their stale review October 31, 2025 15:42

Large model test is no longer being added

@alew3

alew3 commented Oct 31, 2025

Looking forward to this merge!

@Blake-Martin-code

If this gets merged and incorporated into a new release of the vLLM Docker image by early next week, y'all are for real goated. That would be perfect timing for me.

Happy Halloween!

