[InferenceClient] Server-side auto-routing for conversational task #1810

Wauplin · 2025-10-17T14:59:52Z

Equivalent Python PR: huggingface/huggingface_hub#3448

Now that we have server-side routing on https://router.huggingface.co/v1/chat/completions, it's best to use it in the JS client for the conversational task: the routing logic is centralized between the JS and Python clients, and it saves one HTTP call. We still keep client-side routing for all other tasks.
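To illustrate what server-side routing means for a caller, here is a minimal sketch of a single request to the router endpoint, assuming it accepts an OpenAI-style chat completion body (`model`, `messages`) and a bearer token. `buildChatCompletionRequest`, `ChatMessage`, and `PreparedRequest` are hypothetical names for illustration and are not taken from this PR's diff or from `@huggingface/inference`.

```typescript
// Hypothetical helper: a sketch only, not the client's actual implementation.
interface ChatMessage {
  role: "system" | "user" | "assistant";
  content: string;
}

interface PreparedRequest {
  url: string;
  init: { method: string; headers: Record<string, string>; body: string };
}

// Endpoint quoted in the PR description.
const ROUTER_CHAT_URL = "https://router.huggingface.co/v1/chat/completions";

function buildChatCompletionRequest(
  model: string,
  messages: ChatMessage[],
  accessToken: string
): PreparedRequest {
  return {
    url: ROUTER_CHAT_URL,
    init: {
      method: "POST",
      headers: {
        // Assumption: standard bearer-token authentication.
        Authorization: `Bearer ${accessToken}`,
        "Content-Type": "application/json",
      },
      // No provider is resolved on the client: only the model id is sent,
      // and the router is left to pick the provider.
      body: JSON.stringify({ model, messages }),
    },
  };
}
```

The prepared request can be passed straight to `fetch(url, init)`. The point of the sketch is that no provider lookup happens before the chat completion call, which is where the saved HTTP call comes from.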

Server-side auto-routing for conversational task

9fbd133

Wauplin requested review from SBrandeis, hanouticelina and julien-c as code owners October 17, 2025 14:59

fix test

7df2074

Wauplin mentioned this pull request Oct 17, 2025

[InferenceClient] Server-side auto-routing for conversational task huggingface/huggingface_hub#3448

Open

Wauplin changed the title ~~Server-side auto-routing for conversational task~~ [InferenceClient] Server-side auto-routing for conversational task Oct 17, 2025

fix test

f067cd1
