[Bug]: Two BOS when using chat

### Your current environment

vLLM 0.8.4

### 🐛 Describe the bug

Calling `LLM.chat` on `meta-llama/Llama-3.2-1B-Instruct` produces a prompt that starts with two BOS tokens instead of one. The script below prints the first two prompt token IDs followed by the tokenizer's BOS token ID:

```python
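from vllm import LLM

# Load the model with a small GPU memory budget.
llm = LLM("meta-llama/Llama-3.2-1B-Instruct",
          gpu_memory_utilization=0.3)
prompt = [{"role": "user", "content": "Are you ok?"}]
out = llm.chat(prompt)
# Compare the first two prompt token IDs with the tokenizer's BOS token ID.
print(out[0].prompt_token_ids[:2], llm.get_tokenizer().bos_token_id)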
```

```
[128000, 128000] 128000
```
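
To make the symptom easier to check in a test, here is a minimal sketch of a helper that counts leading BOS tokens in a list of token IDs, plus one that collapses a repeated BOS down to a single token. Both `count_leading_bos` and `collapse_leading_bos` are hypothetical names introduced for illustration and are not part of vLLM. The trailing IDs in the sample list are arbitrary placeholders; only the BOS ID `128000` comes from the output above.

```python
from typing import List, Sequence


def count_leading_bos(token_ids: Sequence[int], bos_token_id: int) -> int:
    """Return how many consecutive BOS tokens start the sequence."""
    count = 0
    for token_id in token_ids:
        if token_id != bos_token_id:
            break
        count += 1
    return count


def collapse_leading_bos(token_ids: Sequence[int], bos_token_id: int) -> List[int]:
    """Return a copy of token_ids with at most one leading BOS token."""
    n = count_leading_bos(token_ids, bos_token_id)
    if n <= 1:
        return list(token_ids)
    # Keep exactly one of the n leading BOS tokens.
    return list(token_ids[n - 1:])


BOS_ID = 128000  # BOS token ID reported in the output above
sample_ids = [BOS_ID, BOS_ID, 11, 12]  # 11 and 12 are placeholder IDs

print(count_leading_bos(sample_ids, BOS_ID))
print(collapse_leading_bos(sample_ids, BOS_ID))
```

With the reproduction script, `count_leading_bos(out[0].prompt_token_ids, llm.get_tokenizer().bos_token_id)` would be expected to return 1 once the bug is fixed. Collapsing the tokens after the fact is only a diagnostic aid here, not a proposed fix for how the chat prompt is tokenized.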

### Before submitting a new issue...

- [x] Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the [documentation page](https://docs.vllm.ai/en/latest/), which can answer lots of frequently asked questions.

[Bug]: Two BOS when using chat #16853
