Conversation

DouweM (Collaborator) commented Jul 24, 2025

Ollama + Qwen3 will emit <think>\n</think>\n\n ahead of tool calls. When using run_stream, we don't want to treat that whitespace-only text as the final result, as that would stop the run before the tool calls are even handled.

(See example code below for context on this output)

Before, the TextPart(content='\n\n') is treated as the final result:

PartStartEvent(index=0, part=ThinkingPart(content='')),
PartDeltaEvent(index=0, delta=ThinkingPartDelta(content_delta='\n')),
PartStartEvent(index=1, part=TextPart(content='\n\n')),
FinalResultEvent(tool_name=None, tool_call_id=None),
PartStartEvent(index=2, part=ToolCallPart(tool_name='roll_dice', args='{}', tool_call_id='call_q2vb1z22')),    
...

After, only the actual text following the tool call is treated as the final result (if the tool call were a final_result call, that call itself would be treated as the final result):

PartStartEvent(index=0, part=ThinkingPart(content=''))
PartDeltaEvent(index=0, delta=ThinkingPartDelta(content_delta='\n'))
PartStartEvent(index=1, part=ToolCallPart(tool_name='roll_dice', args='{}', tool_call_id='call_frxbo7tq'))
...
PartStartEvent(index=1, part=TextPart(content='You'))
FinalResultEvent(tool_name=None, tool_call_id=None)
PartDeltaEvent(index=1, delta=TextPartDelta(content_delta=' rolled'))
PartDeltaEvent(index=1, delta=TextPartDelta(content_delta=' a'))
PartDeltaEvent(index=1, delta=TextPartDelta(content_delta=' '))
PartDeltaEvent(index=1, delta=TextPartDelta(content_delta='5'))
PartDeltaEvent(index=1, delta=TextPartDelta(content_delta='.'))
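
Conceptually, the fix comes down to not letting whitespace-only text start the final result. A minimal sketch of that check (a hypothetical helper for illustration, not the actual pydantic-ai code):

def starts_final_result(text: str) -> bool:
    # Whitespace-only content, like the '\n\n' emitted after </think>,
    # isn't evidence that the model is producing its final answer.
    return bool(text.strip())


assert not starts_final_result('\n\n')  # padding after </think>: keep waiting
assert starts_final_result('You')       # real text: the final result starts here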

Code

import asyncio
import random

from pydantic_ai import Agent
from pydantic_ai.models.openai import OpenAIModel
from pydantic_ai.providers.openai import OpenAIProvider


def roll_dice() -> str:
    """Roll a six-sided die and return the result."""
    return str(random.randint(1, 6))


# --- Agent and Model Configuration ---


ollama_model = OpenAIModel(
    model_name='qwen3:1.7b',
    provider=OpenAIProvider(base_url='http://localhost:11434/v1'),
    settings={'temperature': 0.2},
)

default_system_prompt = """
You are a helpful assistant.
Do not ask any follow-up questions.
"""

tools = [roll_dice]

agent = Agent(model=ollama_model, tools=tools, system_prompt=default_system_prompt, output_type=str)


async def main():
    prompt = 'roll the dice'

    async with agent.iter(prompt) as agent_run:
        async for node in agent_run:
            if Agent.is_model_request_node(node) or Agent.is_call_tools_node(node):
                async with node.stream(agent_run.ctx) as request_stream:
                    async for event in request_stream:
                        print(event, flush=True)


# ---

if __name__ == '__main__':
    asyncio.run(main())
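
For reference, the same early termination showed up when streaming with run_stream directly. A minimal variant using the agent defined above (a sketch assuming pydantic-ai's StreamedRunResult.stream_text API):

async def stream_final_text():
    # Before the fix, the whitespace-only TextPart emitted after </think>
    # was treated as the final result, so the stream ended before the
    # roll_dice tool call was ever handled.
    async with agent.run_stream('roll the dice') as result:
        async for delta in result.stream_text(delta=True):
            print(delta, end='', flush=True)
        print()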

DouweM self-assigned this Jul 24, 2025
github-actions bot commented Jul 24, 2025

Docs Preview

commit: d145396
Preview URL: https://91e4334f-pydantic-ai-previews.pydantic.workers.dev

DouweM marked this pull request as ready for review July 24, 2025 19:20
Base automatically changed from streaming-think-tags to main July 24, 2025 19:23
DouweM closed this Jul 24, 2025
DouweM force-pushed the streaming-ignore-leading-newlines branch from 55d6c93 to 7eb4491 on July 24, 2025 19:24
DouweM added 2 commits July 24, 2025 19:25
# Conflicts:
#	pydantic_ai_slim/pydantic_ai/models/huggingface.py
DouweM reopened this Jul 24, 2025
DouweM enabled auto-merge (squash) July 24, 2025 20:58
DouweM merged commit 41dd069 into main Jul 24, 2025
16 checks passed
DouweM deleted the streaming-ignore-leading-newlines branch July 24, 2025 21:16
KRRT7 pushed a commit to aseembits93/pydantic-ai that referenced this pull request Jul 24, 2025