[Feature]: Support OpenAI speech-to-text interface `v1/audio/[transcriptions,translations]`

### 🚀 The feature, motivation and pitch

Now that we have support for Whisper (https://github.com/vllm-project/vllm/pull/11280), we should consider implementing OpenAI's explicit speech-to-text API. Documentation is here https://platform.openai.com/docs/guides/speech-to-text


### Example of `v1/audio/transcriptions`

```python
from openai import OpenAI
client = OpenAI()

audio_file= open("/path/to/file/audio.mp3", "rb")
transcription = client.audio.transcriptions.create(
    model="whisper-1", 
    file=audio_file
)

print(transcription.text)
```


### Example of `v1/audio/translations`

```python
from openai import OpenAI
client = OpenAI()

audio_file = open("/path/to/file/german.mp3", "rb")
transcription = client.audio.translations.create(
    model="whisper-1", 
    file=audio_file,
)

print(transcription.text)
```

### Alternatives

_No response_

### Additional context

_No response_

### Before submitting a new issue...

- [x] Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the [documentation page](https://docs.vllm.ai/en/latest/), which can answer lots of frequently asked questions.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Feature]: Support OpenAI speech-to-text interface `v1/audio/[transcriptions,translations]` #12130

🚀 The feature, motivation and pitch

Example of `v1/audio/transcriptions`

Example of `v1/audio/translations`

Alternatives

Additional context

Before submitting a new issue...

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

[Feature]: Support OpenAI speech-to-text interface v1/audio/[transcriptions,translations] #12130

Description

🚀 The feature, motivation and pitch

Example of v1/audio/transcriptions

Example of v1/audio/translations

Alternatives

Additional context

Before submitting a new issue...

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

[Feature]: Support OpenAI speech-to-text interface `v1/audio/[transcriptions,translations]` #12130

Example of `v1/audio/transcriptions`

Example of `v1/audio/translations`