fix(core): remove orphaned ToolMessages in trim_message #33265

yashv6655 · 2025-10-04T01:13:41Z

Description

Fixes a bug where trim_messages with strategy="last" could create invalid message histories by orphaning ToolMessages when their corresponding AIMessage with tool_calls was trimmed away.

Issue

When trimming message history, if a ToolMessage was included in the trimmed result but its corresponding AIMessage (containing the tool call that the ToolMessage responds to) was removed, this created an orphaned ToolMessage with a tool_call_id that references a non-existent tool call. This invalid message history would be rejected by most LLM APIs.

Fix

Added a _remove_orphaned_tool_messages() helper function that:

- Scans the trimmed messages for all valid tool_call_ids from AIMessages
- Filters out any ToolMessages whose tool_call_id doesn't match a valid tool call
- Returns a cleaned message list with orphaned ToolMessages removed

This function is called in _first_max_tokens() before returning, which fixes both strategy="first" and strategy="last" (since "last" internally uses "first" with reversed messages).
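The PR diff is not reproduced here, so the following is only a minimal sketch of what such a helper might look like, not the merged implementation. It uses small stand-in dataclasses rather than the real `langchain_core.messages` classes, and the function name `remove_orphaned_tool_messages` (without the leading underscore) is hypothetical.

```python
from dataclasses import dataclass, field


# Stand-in message types (hypothetical; the real classes live in
# langchain_core.messages and carry more fields).
@dataclass
class HumanMessage:
    content: str


@dataclass
class AIMessage:
    content: str = ""
    tool_calls: list = field(default_factory=list)  # each item: {"id": ...}


@dataclass
class ToolMessage:
    tool_call_id: str
    content: str = ""


def remove_orphaned_tool_messages(messages):
    """Drop ToolMessages whose tool_call_id has no matching AIMessage tool call."""
    # Pass 1: collect every tool call id announced by an AIMessage.
    valid_ids = {
        call["id"]
        for msg in messages
        if isinstance(msg, AIMessage)
        for call in msg.tool_calls
    }
    # Pass 2: keep everything except ToolMessages that reference an unknown id.
    return [
        msg
        for msg in messages
        if not isinstance(msg, ToolMessage) or msg.tool_call_id in valid_ids
    ]


trimmed = [
    ToolMessage(tool_call_id="tool1"),  # its AIMessage was trimmed away
    AIMessage(tool_calls=[{"id": "tool2"}]),
    ToolMessage(tool_call_id="tool2"),
    HumanMessage("end"),
]
cleaned = remove_orphaned_tool_messages(trimmed)
# The same helper applied to the reversed list, as would happen if it ran
# on the reversed input that strategy="last" is described as using.
cleaned_reversed = remove_orphaned_tool_messages(list(reversed(trimmed)))
```

Because the valid ids are collected in a separate first pass, the result does not depend on whether a ToolMessage appears before or after its AIMessage. That matters if the helper runs on a reversed list.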

Example

Before (broken):

trimmed_messages = trim_messages(messages, strategy="last", token_counter=len, max_tokens=5)
# Returns: [ToolMessage(tool_call_id="abc123"), HumanMessage(...), ...]
# Invalid! ToolMessage references a tool call that's not in the trimmed history

After (fixed):

trimmed_messages = trim_messages(messages, strategy="last", token_counter=len, max_tokens=5)
# Returns: [HumanMessage(...), AIMessage(...), ...]
# Valid! Orphaned ToolMessage was automatically removed

Issue

Resolves #33245

Dependencies

None - this is a pure bug fix with no new dependencies.

Testing

Added 5 comprehensive unit tests covering various orphaning scenarios.

codspeed-hq · 2025-10-04T01:20:46Z

CodSpeed WallTime Performance Report

Merging #33265 will not alter performance

Comparing yashv6655:fix/core/trim-messages-tool-call-orphaning (891903a) with master (7f5be6b)¹

⚠️

Unknown Walltime execution environment detected

Using the Walltime instrument on standard Hosted Runners will lead to inconsistent data.

For the most accurate results, we recommend using CodSpeed Macro Runners: bare-metal machines fine-tuned for performance measurement consistency.

Summary

✅ 13 untouched

¹ No successful run was found on master (46b87e4) during the generation of this report, so 7f5be6b was used instead as the comparison base. There might be some changes unrelated to this pull request in this report.

eyurtsev · 2025-10-04T01:23:12Z

Hi @yashv6655! Thank you for the PR. I haven't reviewed it in detail yet, but I noticed that the example in the issue is missing a parameter.

If you look at the how-to docs:

https://python.langchain.com/docs/how_to/trim_messages/#trimming-based-on-message-count

We recommend adding end_on explicitly so that only valid chat histories are produced.

Could you confirm whether this resolves the issue for you?

I'm basically wondering whether this is a bug vs. a devx issue (i.e., the API isn't intuitive).

yashv6655 · 2025-10-04T01:41:55Z

@eyurtsev Thanks for the feedback! You're right that end_on=("human", "tool") prevents the specific issue in the bug report.

However, I believe the fix is still needed:

1. The parameter is optional

The bug report used:

trim_messages(messages, strategy="last", token_counter=len, max_tokens=5)

The docs recommend end_on=("human", "tool"), but it's optional, so users may not realize it is needed to prevent invalid histories.

2. `end_on` doesn't prevent all orphaning cases

Even with proper usage, orphaned ToolMessages can still occur:

messages = [
    HumanMessage("start"),
    AIMessage(tool_calls=[{"id": "tool1", ...}]),
    ToolMessage(tool_call_id="tool1"),
    AIMessage(tool_calls=[{"id": "tool2", ...}]),
    ToolMessage(tool_call_id="tool2"),
    HumanMessage("end"),
]

trim_messages(
    messages,
    max_tokens=4,
    token_counter=len,
    strategy="last",
    end_on=("human", "tool"),
)
# Result: [ToolMessage(tool1), AIMessage(tool2), ToolMessage(tool2), HumanMessage]
# ToolMessage(tool1) is orphaned

end_on controls the final message type, but doesn't prevent orphaning in the middle of the trimmed sequence.
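The scenario above can be reproduced without the library. The sketch below is a simplification under stated assumptions: `Msg`, `keep_last`, and `orphaned_ids` are hypothetical stand-ins, `keep_last` only models strategy="last" with token_counter=len (one token per message), and end_on is modeled only by checking the type of the final message.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Msg:
    type: str  # "human", "ai", or "tool"
    tool_call_id: Optional[str] = None  # set on tool messages
    tool_call_ids: List[str] = field(default_factory=list)  # set on ai messages


def keep_last(messages, max_count):
    """Keep the last max_count messages (each message counts as one token)."""
    return messages[-max_count:] if max_count > 0 else []


def orphaned_ids(messages):
    """Return tool_call_ids of tool messages with no matching ai tool call."""
    known = {cid for m in messages if m.type == "ai" for cid in m.tool_call_ids}
    return [
        m.tool_call_id
        for m in messages
        if m.type == "tool" and m.tool_call_id not in known
    ]


history = [
    Msg("human"),
    Msg("ai", tool_call_ids=["tool1"]),
    Msg("tool", tool_call_id="tool1"),
    Msg("ai", tool_call_ids=["tool2"]),
    Msg("tool", tool_call_id="tool2"),
    Msg("human"),
]

window = keep_last(history, 4)
# The window ends on an allowed type, so an end_on-style check passes...
ends_on_allowed_type = window[-1].type in ("human", "tool")
# ...yet the first message in the window still references a dropped tool call.
orphans = orphaned_ids(window)
```

Here the window ends on a human message, so a check on the final message type is satisfied, while `orphans` still reports the tool message whose AIMessage fell outside the window.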

fix(core): remove orphaned ToolMessages in trim_message

4233237

yashv6655 requested a review from eyurtsev as a code owner October 4, 2025 01:13

github-actions bot added core Related to the package `langchain-core` and removed core Related to the package `langchain-core` labels Oct 4, 2025

yashv6655 changed the title ~~Fix(core): remove orphaned ToolMessages in trim_message~~ fix(core): remove orphaned ToolMessages in trim_message Oct 4, 2025

github-actions bot added the core Related to the package `langchain-core` label Oct 4, 2025

Merge branch 'master' into fix/core/trim-messages-tool-call-orphaning

64261fc

eyurtsev self-assigned this Oct 4, 2025

Merge branch 'master' into fix/core/trim-messages-tool-call-orphaning

891903a
