Skip to content

Pull requests: vllm-project/llm-compressor

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Created FAQ page first draft
#1896 opened Oct 2, 2025 by cajeonrh Loading…
[Training] Fix tokenizer attribute of SessionMixin ready When a PR is ready for review
#1895 opened Oct 1, 2025 by kylesayrs Draft
add gpt oss nvfp4 example
#1885 opened Sep 30, 2025 by shanjiaz Draft
Add awq activation fp8 support in loss compute
#1873 opened Sep 27, 2025 by Bluedyson Loading…
Add block quantization e2e test
#1867 opened Sep 25, 2025 by shanjiaz Draft
[Dependencies] update lm_eval version pin ready When a PR is ready for review
#1862 opened Sep 24, 2025 by brian-dellabetta Loading…
[Logging] clean up CompressionLogger verbosity ready When a PR is ready for review
#1861 opened Sep 23, 2025 by brian-dellabetta Loading…
MSE observer for NVFP4
#1840 opened Sep 17, 2025 by shubhra Loading…
ready label check ready When a PR is ready for review
#1832 opened Sep 17, 2025 by brian-dellabetta Loading…
1 task done
add support for per-head attention quantization
#1791 opened Sep 2, 2025 by eldarkurtic Loading…
[MXFP4] Add mxfp4 support
#1783 opened Aug 28, 2025 by dsikka Draft
[Transform] Spinquant R3 ready When a PR is ready for review
#1778 opened Aug 27, 2025 by kylesayrs Loading…
[Tracing] Support Cohere Vision, Decouple vision tower from first layer ready When a PR is ready for review
#1710 opened Aug 6, 2025 by kylesayrs Loading…
[Example] [VLM] Gemma3n
#1696 opened Jul 31, 2025 by kylesayrs Draft
[Autowrapper] Support Gemma3n, autowrapper improvements ready When a PR is ready for review
#1693 opened Jul 30, 2025 by kylesayrs Loading…
1686 Logic matching refactor
#1687 opened Jul 28, 2025 by ved1beta Loading…
[AWQ] Allow for activation quantization ready When a PR is ready for review
#1682 opened Jul 24, 2025 by brian-dellabetta Loading…
add quantization_w4a4_fp4 qwen3 example
#1681 opened Jul 24, 2025 by wangwenmingaa Loading…
[KV Cache] support kv cache int8 per channel quantization ready When a PR is ready for review
#1663 opened Jul 19, 2025 by Eviannn Loading…
ProTip! Follow long discussions with comments:>50.