-
Notifications
You must be signed in to change notification settings - Fork 246
Pull requests: vllm-project/llm-compressor
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[WIP] Add in-memory caching for lm-eval base model results
#1898
opened Oct 3, 2025 by
rahul-tuli
•
Draft
[Dependencies] update When a PR is ready for review
lm_eval
version pin
ready
#1862
opened Sep 24, 2025 by
brian-dellabetta
Loading…
[Logging] clean up CompressionLogger verbosity
ready
When a PR is ready for review
#1861
opened Sep 23, 2025 by
brian-dellabetta
Loading…
[MoE Calibration] Simplify MoE calibration interface
#1851
opened Sep 22, 2025 by
sairampillai
Loading…
Updating base.py (parallel calibration and model #1809)
#1837
opened Sep 17, 2025 by
aashvgit
Loading…
ready label check
ready
When a PR is ready for review
#1832
opened Sep 17, 2025 by
brian-dellabetta
Loading…
1 task done
[Observers] Small observers cleanup, add e2e quantization tests
#1830
opened Sep 17, 2025 by
kylesayrs
Loading…
[Transform] Spinquant R3
ready
When a PR is ready for review
#1778
opened Aug 27, 2025 by
kylesayrs
Loading…
[Tracing] Support Cohere Vision, Decouple vision tower from first layer
ready
When a PR is ready for review
#1710
opened Aug 6, 2025 by
kylesayrs
Loading…
[Autowrapper] Support Gemma3n, autowrapper improvements
ready
When a PR is ready for review
#1693
opened Jul 30, 2025 by
kylesayrs
Loading…
[AWQ] Allow for activation quantization
ready
When a PR is ready for review
#1682
opened Jul 24, 2025 by
brian-dellabetta
Loading…
[KV Cache] support kv cache int8 per channel quantization
ready
When a PR is ready for review
#1663
opened Jul 19, 2025 by
Eviannn
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.