Skip to content

Conversation

rymc
Copy link
Contributor

@rymc rymc commented Jul 10, 2025

Backend.validate() always issued a smoke-test call to the legacy /v1/completions endpoint, even when the caller specified GUIDELLM__PREFERRED_ROUTE=chat_completions. This breaks validation against deployments that expose only the chat-completions route.

The PR makes backend validation honor the GUIDELLM__PREFERRED_ROUTE setting. Instead of always using the /v1/completions endpoint, it now chooses between text_completions and chat_completions based on the configured preference.

📦 Build Artifacts Available
The build artifacts (.whl and .tar.gz) have been successfully generated and are available for download: https://github.com/neuralmagic/guidellm/actions/runs/16188636004/artifacts/3501922873.
They will be retained for up to 30 days.

Copy link
Collaborator

@markurtz markurtz left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is great, thanks!

@markurtz markurtz merged commit 4d46368 into vllm-project:main Jul 10, 2025
15 checks passed
DaltheCow pushed a commit that referenced this pull request Jul 15, 2025
`Backend.validate()` always issued a smoke-test call to the legacy
/v1/completions endpoint, even when the caller specified
`GUIDELLM__PREFERRED_ROUTE=chat_completions`. This breaks validation
against deployments that expose only the chat-completions route.

The PR makes backend validation honor the `GUIDELLM__PREFERRED_ROUTE`
setting. Instead of always using the /v1/completions endpoint, it now
chooses between text_completions and chat_completions based on the
configured preference.

Signed-off-by: dalthecow <[email protected]>
DaltheCow pushed a commit that referenced this pull request Jul 15, 2025
`Backend.validate()` always issued a smoke-test call to the legacy
/v1/completions endpoint, even when the caller specified
`GUIDELLM__PREFERRED_ROUTE=chat_completions`. This breaks validation
against deployments that expose only the chat-completions route.

The PR makes backend validation honor the `GUIDELLM__PREFERRED_ROUTE`
setting. Instead of always using the /v1/completions endpoint, it now
chooses between text_completions and chat_completions based on the
configured preference.

Signed-off-by: dalthecow <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants