Skip to content

Conversation

Wauplin
Copy link
Contributor

@Wauplin Wauplin commented Oct 17, 2025

Equivalent Python PR: huggingface/huggingface_hub#3448

Discussed in private DMs.

Now that we have server-side routing on https://router.huggingface.co/v1/chat/completions, it's best to use it in the JS client (centralized logic between JS and Python clients + saves 1 HTTP call). We still keep client-side routing for all other tasks.

@Wauplin Wauplin changed the title Server-side auto-routing for conversational task [InferenceClient] Server-side auto-routing for conversational task Oct 17, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant