File tree
1,979 files changed
+185459
-157813
lines changed
- .buildkite
- nightly-benchmarks/scripts
- scripts
- hardware_ci
- tpu
- .github
- ISSUE_TEMPLATE
- workflows
- benchmarks
- auto_tune
- cutlass_benchmarks
- disagg_benchmarks
- kernels
- deepgemm
- multi_turn
- cmake
- external_projects
- csrc
- attention
- mla
- cutlass_sm100_mla
- device
- kernel
- core
- cpu
- cutlass_extensions
- moe
- marlin_moe_wna16
- quantization
- cutlass_w4a8
- fp4
- fused_kernels
- gptq_marlin
- machete
- w8a8
- cutlass
- c3x
- moe
- fp8
- amd
- nvidia
- int8
- rocm
- docker
- docs
- api
- vllm
- assets
- deployment
- design/cuda_graphs
- community
- configuration
- contributing
- model
- deployment
- frameworks
- integrations
- design
- features
- quantization
- getting_started
- installation
- cpu
- gpu
- mkdocs/hooks
- models
- extensions
- serving
- training
- usage
- examples
- offline_inference
- basic
- kv_load_failure_recovery
- logits_processor
- pooling
- online_serving
- dashboards
- grafana
- perses
- disaggregated_serving_p2p_nccl_xpyd
- disaggregated_serving
- elastic_ep
- openai_embedding_long_text
- pooling
- structured_outputs
- others
- requirements
- tests
- async_engine
- basic_correctness
- benchmarks
- compile
- piecewise
- config
- core
- block
- e2e
- cuda
- detokenizer
- distributed
- engine
- entrypoints
- llm
- offline_mode
- openai
- correctness
- tool_parsers
- pooling
- correctness
- llm
- openai
- evals
- gpt_oss
- gsm8k
- configs
- fastsafetensors_loader
- kernels
- attention
- core
- mamba
- moe
- modular_kernel_tools
- quantization
- kv_transfer
- lora
- metrics
- mistral_tool_use
- model_executor
- model_loader
- fastsafetensors_loader
- runai_model_streamer
- tensorizer_loader
- models
- language
- generation_ppl_test
- generation
- pooling_mteb_test
- pooling
- multimodal
- generation
- vlm_utils
- pooling
- processing
- quantization
- mq_llm_engine
- multimodal
- plugins_tests
- plugins
- lora_resolvers
- prithvi_io_processor_plugin/prithvi_io_processor
- vllm_add_dummy_model
- vllm_add_dummy_model
- vllm_add_dummy_platform
- vllm_add_dummy_platform
- quantization
- reasoning
- runai_model_streamer_test
- samplers
- speculative_decoding/speculators
- standalone_tests
- tokenization
- tool_use
- mistral
- tools
- tpu
- lora
- tracing
- transformers_utils
- utils_
- v1
- attention
- core
- cudagraph
- distributed
- e2e
- engine
- entrypoints
- llm
- openai
- responses
- executor
- generation
- kv_connector
- nixl_integration
- unit
- kv_offload
- logits_processors
- metrics
- sample
- shutdown
- spec_decode
- structured_output
- tpu
- worker
- tracing
- worker
- vllm_test_utils
- vllm_test_utils
- weight_loading
- worker
- tools
- ep_kernels
- pre_commit
- profiler
- nsys_profile_tools
- vllm
- adapter_commons
- assets
- attention
- backends
- mla
- layers
- ops
- utils
- benchmarks
- lib
- compilation
- config
- core
- block
- device_allocator
- distributed
- device_communicators
- eplb
- kv_transfer
- kv_connector
- v1
- p2p
- kv_lookup_buffer
- kv_pipe
- engine
- multiprocessing
- output_processor
- entrypoints
- cli
- benchmark
- openai
- tool_parsers
- executor
- inputs
- logging_utils
- lora
- layers
- ops
- ipex_ops
- torch_ops
- triton_ops
- xla_ops
- punica_wrapper
- model_executor
- layers
- fla/ops
- fused_moe
- configs
- mamba
- ops
- quantization
- compressed_tensors
- schemes
- transform
- schemes
- kernels
- mixed_precision
- scaled_mm
- quark
- schemes
- utils
- rotary_embedding
- model_loader
- models
- warmup
- multimodal
- platforms
- plugins
- io_processors
- lora_resolvers
- profiler
- ray
- reasoning
- transformers_utils
- chat_templates
- configs
- speculators
- processors
- tokenizers
- triton_utils
- usage
- utils
- v1
- attention/backends
- mla
- core
- sched
- engine
- executor
- kv_offload
- backends
- worker
- metrics
- pool
- sample
- logits_processor
- ops
- tpu
- spec_decode
- structured_output
- worker
- worker
Lines changed: 1 addition & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
368 | 368 | | |
369 | 369 | | |
370 | 370 | | |
371 | | - | |
| 371 | + | |
372 | 372 | | |
373 | 373 | | |
374 | 374 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
181 | 181 | | |
182 | 182 | | |
183 | 183 | | |
184 | | - | |
185 | | - | |
| 184 | + | |
186 | 185 | | |
187 | | - | |
188 | 186 | | |
189 | 187 | | |
190 | 188 | | |
191 | 189 | | |
192 | | - | |
193 | | - | |
| 190 | + | |
194 | 191 | | |
195 | | - | |
196 | 192 | | |
197 | 193 | | |
198 | 194 | | |
| |||
Lines changed: 1 addition & 7 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
365 | 365 | | |
366 | 366 | | |
367 | 367 | | |
368 | | - | |
369 | | - | |
| 368 | + | |
370 | 369 | | |
371 | 370 | | |
372 | 371 | | |
| |||
455 | 454 | | |
456 | 455 | | |
457 | 456 | | |
458 | | - | |
459 | | - | |
460 | | - | |
461 | | - | |
462 | | - | |
463 | 457 | | |
464 | 458 | | |
465 | 459 | | |
| |||
This file was deleted.
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
76 | 76 | | |
77 | 77 | | |
78 | 78 | | |
79 | | - | |
| 79 | + | |
80 | 80 | | |
81 | 81 | | |
82 | 82 | | |
| |||
150 | 150 | | |
151 | 151 | | |
152 | 152 | | |
153 | | - | |
154 | | - | |
155 | | - | |
156 | | - | |
157 | | - | |
| 153 | + | |
| 154 | + | |
| 155 | + | |
| 156 | + | |
| 157 | + | |
| 158 | + | |
| 159 | + | |
| 160 | + | |
| 161 | + | |
| 162 | + | |
158 | 163 | | |
159 | 164 | | |
160 | 165 | | |
| |||
163 | 168 | | |
164 | 169 | | |
165 | 170 | | |
| 171 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
8 | 8 | | |
9 | 9 | | |
10 | 10 | | |
11 | | - | |
| 11 | + | |
12 | 12 | | |
13 | 13 | | |
14 | 14 | | |
15 | 15 | | |
16 | 16 | | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
17 | 36 | | |
18 | 37 | | |
19 | 38 | | |
20 | 39 | | |
21 | 40 | | |
22 | 41 | | |
23 | | - | |
| 42 | + | |
| 43 | + | |
24 | 44 | | |
| 45 | + | |
25 | 46 | | |
26 | 47 | | |
27 | 48 | | |
| |||
43 | 64 | | |
44 | 65 | | |
45 | 66 | | |
46 | | - | |
| 67 | + | |
| 68 | + | |
| 69 | + | |
47 | 70 | | |
48 | 71 | | |
49 | 72 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
86 | 86 | | |
87 | 87 | | |
88 | 88 | | |
89 | | - | |
90 | | - | |
91 | | - | |
92 | | - | |
93 | 89 | | |
94 | 90 | | |
95 | 91 | | |
| |||
167 | 163 | | |
168 | 164 | | |
169 | 165 | | |
170 | | - | |
171 | | - | |
172 | | - | |
173 | | - | |
174 | | - | |
175 | | - | |
176 | 166 | | |
177 | 167 | | |
178 | 168 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
58 | 58 | | |
59 | 59 | | |
60 | 60 | | |
61 | | - | |
62 | | - | |
63 | | - | |
64 | | - | |
65 | | - | |
| 61 | + | |
| 62 | + | |
66 | 63 | | |
67 | 64 | | |
68 | 65 | | |
| |||