vllm-project / llm-compressor Public

Notifications You must be signed in to change notification settings
Fork 246
Star 2k

Code
Issues 59
Pull requests 37
Discussions
Actions
Projects
Wiki
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Wiki
Security
Insights

Pull requests: vllm-project/llm-compressor

Labels 11 Milestones 0

New pull request New

37 Open 839 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

[WIP] Add in-memory caching for lm-eval base model results

#1898 opened Oct 3, 2025 by rahul-tuli • Draft

Created FAQ page first draft

#1896 opened Oct 2, 2025 by cajeonrh

Loading…

[Training] Fix tokenizer attribute of SessionMixin ready

When a PR is ready for review

#1895 opened Oct 1, 2025 by kylesayrs • Draft

add gpt oss nvfp4 example

#1885 opened Sep 30, 2025 by shanjiaz • Draft

Add awq activation fp8 support in loss compute

#1873 opened Sep 27, 2025 by Bluedyson

Loading…

Add block quantization e2e test

#1867 opened Sep 25, 2025 by shanjiaz • Draft

[Dependencies] update lm_eval version pin ready

When a PR is ready for review

#1862 opened Sep 24, 2025 by brian-dellabetta

Loading…

[Logging] clean up CompressionLogger verbosity ready

When a PR is ready for review

#1861 opened Sep 23, 2025 by brian-dellabetta

Loading…

[MoE Calibration] Simplify MoE calibration interface

#1851 opened Sep 22, 2025 by sairampillai

Loading…

MSE observer for NVFP4

#1840 opened Sep 17, 2025 by shubhra

Loading…

Updating base.py (parallel calibration and model #1809)

#1837 opened Sep 17, 2025 by aashvgit

Loading…

ready label check ready

When a PR is ready for review

#1832 opened Sep 17, 2025 by brian-dellabetta

Loading…

1 task done

Add file to linearize and quantize the gpt-oss models

#1831 opened Sep 17, 2025 by shubhra

Loading…

[Observers] Small observers cleanup, add e2e quantization tests

#1830 opened Sep 17, 2025 by kylesayrs

Loading…

[Quantization] Group Activation Quantization

#1811 opened Sep 12, 2025 by kylesayrs • Draft

add support for per-head attention quantization

#1791 opened Sep 2, 2025 by eldarkurtic

Loading…

[MXFP4] Add mxfp4 support

#1783 opened Aug 28, 2025 by dsikka • Draft

[Transform] Spinquant R3 ready

When a PR is ready for review

#1778 opened Aug 27, 2025 by kylesayrs

Loading…

[Tracing] Support Cohere Vision, Decouple vision tower from first layer ready

When a PR is ready for review

#1710 opened Aug 6, 2025 by kylesayrs

Loading…

[Example] [VLM] Gemma3n

#1696 opened Jul 31, 2025 by kylesayrs • Draft

[Autowrapper] Support Gemma3n, autowrapper improvements ready

When a PR is ready for review

#1693 opened Jul 30, 2025 by kylesayrs

Loading…

1686 Logic matching refactor

#1687 opened Jul 28, 2025 by ved1beta

Loading…

[AWQ] Allow for activation quantization ready

When a PR is ready for review

#1682 opened Jul 24, 2025 by brian-dellabetta

Loading…

add quantization_w4a4_fp4 qwen3 example

#1681 opened Jul 24, 2025 by wangwenmingaa

Loading…

[KV Cache] support kv cache int8 per channel quantization ready

When a PR is ready for review

#1663 opened Jul 19, 2025 by Eviannn

Loading…

Previous 1 2 Next

Previous Next

ProTip! Follow long discussions with comments:>50.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!