[RFC]: How to use OpenAI's "Reasoning Levels" (low/medium/high) in vLLM?

### Motivation.

Hi,

According to OpenAI's documentation, there is a feature called Reasoning Levels that allows controlling the depth of reasoning in a model response. The levels are:
```
Reasoning levels
You can adjust the reasoning level that suits your task across three levels:

Low: Fast responses for general dialogue.  
Medium: Balanced speed and detail.  
High: Deep and detailed analysis.  
The reasoning level can be set in the system prompts, e.g., "Reasoning: high".
```

I would like to know:

1. How can I use this in vLLM? Should I just include "Reasoning: high" in the system prompt, or is there another preferred way to activate it?
2. What is the default reasoning level if I don’t explicitly set it?



### Proposed Change.

None

### Feedback Period.

_No response_

### CC List.

_No response_

### Any Other Things.

_No response_

### Before submitting a new issue...

- [x] Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the [documentation page](https://docs.vllm.ai/en/latest/), which can answer lots of frequently asked questions.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[RFC]: How to use OpenAI's "Reasoning Levels" (low/medium/high) in vLLM? #22359

Motivation.

Proposed Change.

Feedback Period.

CC List.

Any Other Things.

Before submitting a new issue...

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

[RFC]: How to use OpenAI's "Reasoning Levels" (low/medium/high) in vLLM? #22359

Description

Motivation.

Proposed Change.

Feedback Period.

CC List.

Any Other Things.

Before submitting a new issue...

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions