[V1][Structured Output] Enable Speculative Decoding with Structured Outputs #751

shen-shanshan · 2025-05-05T01:51:52Z

What this PR does / why we need it?

Enable speculative decoding with structured outputs, adapted from vllm-project/vllm#14702.
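
As a rough illustration of the idea (a toy sketch, not the code in this PR or in vllm-project/vllm#14702): when draft tokens are proposed speculatively, only the longest run of them that the structured-output grammar still accepts can be kept. The `accept_draft_tokens` helper and the bracket "grammar" below are hypothetical names made up for this example.

```python
from typing import Callable, List


def brackets_ok(tokens: List[str]) -> bool:
    """Toy grammar (assumption): a valid prefix never closes more brackets than it opened."""
    depth = 0
    for tok in tokens:
        if tok == "[":
            depth += 1
        elif tok == "]":
            depth -= 1
        else:
            return False  # any other token is outside the toy grammar
        if depth < 0:
            return False
    return True


def accept_draft_tokens(
    prefix: List[str],
    draft: List[str],
    is_valid_prefix: Callable[[List[str]], bool],
) -> List[str]:
    """Keep draft tokens until the first one the grammar rejects."""
    accepted: List[str] = []
    for tok in draft:
        if not is_valid_prefix(prefix + accepted + [tok]):
            break  # this token and everything after it is discarded
        accepted.append(tok)
    return accepted


# One "[" has already been generated; the second "]" would unbalance the output.
print(accept_draft_tokens(["["], ["]", "]", "["], brackets_ok))  # prints [']']
```

A real implementation would advance the grammar state incrementally instead of re-checking the whole prefix for each draft token; the re-check here only keeps the sketch short.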

Does this PR introduce any user-facing change?

Find more details at vllm-project/vllm#14702.

How was this patch tested?

TODO:

Run tests/v1/entrypoints/llm/test_struct_output_generate.py once speculative decoding is supported in vllm-ascend V1.

Signed-off-by: Shanshan Shen <[email protected]>

Signed-off-by: shen-shanshan <[email protected]>

shen-shanshan marked this pull request as draft May 5, 2025 01:51

shen-shanshan changed the title ~~[V1][Structured Output][Spec Decode] Enable Speculative Decoding with Structured Outputs~~ [V1][Core] Enable Speculative Decoding with Structured Outputs May 5, 2025

Enable Speculative Decoding with Structured Outputs

78a0479

Signed-off-by: Shanshan Shen <[email protected]>

shen-shanshan force-pushed the v1-gd branch from dcb597e to 78a0479 Compare May 8, 2025 07:38

shen-shanshan marked this pull request as ready for review May 8, 2025 07:38

format

32ae411

Signed-off-by: shen-shanshan <[email protected]>

shen-shanshan changed the title ~~[V1][Core] Enable Speculative Decoding with Structured Outputs~~ [V1][Structured Output] Enable Speculative Decoding with Structured Outputs May 8, 2025

shen-shanshan mentioned this pull request May 8, 2025

[Feature]: Add Support for Guided Decoding (Structured Output) #177

Closed

20 tasks

shen-shanshan marked this pull request as draft May 9, 2025 01:23

wangxiyuan mentioned this pull request Jun 4, 2025

[release] 0.9.0rc1 release checklist #904

Closed

76 tasks

shen-shanshan closed this Jul 17, 2025
