Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
86 commits
Select commit Hold shift + click to select a range
21d7d67
Functionalized patterns in prep for utility
ProExpertProg Sep 6, 2025
f3b4cf1
TEMP Mostly working
ProExpertProg Sep 9, 2025
cdad3c0
TEMP: fixed rmsnorm issue (TODO assert dtypes in fused norm_quant ker…
ProExpertProg Sep 12, 2025
8e4a56f
rms works fully now, had to remove more conversions (and add them in …
ProExpertProg Sep 16, 2025
e151e6d
quant works except (torch,torch)
ProExpertProg Sep 16, 2025
14fdc8b
quant with fix for pure torch, broke others
ProExpertProg Sep 18, 2025
05a65f3
ALL WORKS
ProExpertProg Sep 18, 2025
e6b394e
Add TODO
ProExpertProg Sep 20, 2025
d96913a
Cleanup test_fusion.py, added extra layer of rms/quant
ProExpertProg Sep 25, 2025
b172747
Functionalize attn+quant patterns
ProExpertProg Sep 25, 2025
1ae80c6
Move global vllm_config to pass manager
ProExpertProg Sep 25, 2025
77835fd
Attention fusion works with custom ops
ProExpertProg Sep 25, 2025
1277999
Remove V0 attn fusion test
ProExpertProg Sep 25, 2025
d843a67
Add triton attn test to attn+quant fusion
ProExpertProg Sep 26, 2025
cdd1529
Flat product for better test names/visibility
ProExpertProg Sep 26, 2025
141a37e
Fix rmsnorm
ProExpertProg Sep 26, 2025
c6d6c3b
Refactor E2E attn fusion test
ProExpertProg Sep 26, 2025
490ac86
Add TP=2 test (untested)
ProExpertProg Sep 26, 2025
d0b1b56
improve tests by adding more cases
ProExpertProg Sep 26, 2025
47b4688
TEMP working on caplog
ProExpertProg Sep 27, 2025
ae7f56f
Temp MP workaround P2
ProExpertProg Sep 30, 2025
eb899a4
Temp MP workaround P3
ProExpertProg Sep 30, 2025
a2aa978
Test for caplog utils
ProExpertProg Oct 1, 2025
21a9f9f
Fixed tests, passing with 2.8, 2.9 tbd
ProExpertProg Oct 2, 2025
66a35a9
Update tests/compile/backend.py
ProExpertProg Oct 2, 2025
7eb1364
Update csrc/layernorm_kernels.cu
ProExpertProg Oct 2, 2025
5fef180
clean up fullgraph tests
ProExpertProg Oct 2, 2025
db479ae
TEMP allreduce fusion
ProExpertProg Oct 2, 2025
54189a9
allreduce fusion working (custom ops on)
ProExpertProg Oct 3, 2025
b7f52bf
allreduce fusion working with/without custom ops (except fp4)
ProExpertProg Oct 3, 2025
d09a278
allreduce fusion working with/without custom ops (with fp4)
ProExpertProg Oct 3, 2025
c8675ff
log depyf folder, fix context for TestBackend, fix pattern dump
ProExpertProg Oct 3, 2025
d3f95fe
fullgraph allreduce test update requirements
ProExpertProg Oct 3, 2025
4dbfcf7
Move e2e tests to new file, add to test pipeline
ProExpertProg Oct 3, 2025
31d0127
Add e2e fusions to fullgraph test (should work with Triton backend), …
ProExpertProg Oct 3, 2025
c653d24
Fix spelling, precommit
ProExpertProg Oct 4, 2025
1756f67
add back fp4
ProExpertProg Oct 4, 2025
5619bc3
clean up e2e tests
ProExpertProg Oct 10, 2025
32989d8
add pattern for final allreduce in model
ProExpertProg Oct 10, 2025
46ee626
add more comprehensive testing for quantfp8 (-rmsnorm+-quant still fa…
ProExpertProg Oct 10, 2025
a1c7fdb
add more comprehensive testing for allreduce-rmsnorm, fix fp4 (-rmsno…
ProExpertProg Oct 10, 2025
c3264d8
Fix partial match rmsnorm+quant, fix allreduce+rmsnorm match
ProExpertProg Oct 10, 2025
095277c
Simplify matcher utils by using RMSNorm.forward_static
ProExpertProg Oct 10, 2025
52f78ce
Add allreduce test to 2-gpu test
ProExpertProg Oct 11, 2025
1b1a63e
Fix e2e allreduce fusion test
ProExpertProg Oct 11, 2025
0d6e550
fix func test
ProExpertProg Oct 12, 2025
26892df
fix pass manager test
ProExpertProg Oct 12, 2025
3547b87
fix sequence parallelism test
ProExpertProg Oct 12, 2025
af1ffa7
PR review
ProExpertProg Oct 15, 2025
97b3ff2
Merge remote-tracking branch 'upstream/main' into luka/custom-op-matc…
ProExpertProg Oct 15, 2025
b5f89e5
Cleanup test_full_graph.py
ProExpertProg Oct 15, 2025
f6429e4
Cleanup test_fusion_attn.py
ProExpertProg Oct 15, 2025
8a363d3
Slight improvement for E2E fusion
ProExpertProg Oct 15, 2025
12a7c6d
Tests & docs for flat_product
ProExpertProg Oct 15, 2025
db16ee1
Merge branch 'main' into luka/custom-op-matching-2
ProExpertProg Oct 15, 2025
8ffb474
Remove/fix TODOs
ProExpertProg Oct 15, 2025
2a6299c
Fix e2e test patterns
ProExpertProg Oct 15, 2025
465ce58
Update tests/compile/test_fusion.py
ProExpertProg Oct 15, 2025
bb0254a
Merge branch 'main' into luka/custom-op-matching-2
ProExpertProg Oct 15, 2025
bcd95b5
Fix func test
ProExpertProg Oct 15, 2025
db2b1c7
Smaller model for e2e fusion test
ProExpertProg Oct 15, 2025
a3ebf0a
fix fp8 quant tests
ProExpertProg Oct 15, 2025
3943257
Restore original torch.Parameter behavior in RMSNorm
ProExpertProg Oct 15, 2025
532cbcf
Add comment to test_logger
ProExpertProg Oct 15, 2025
7e6f5b3
add flat_product example
ProExpertProg Oct 15, 2025
24f1298
PR comments: cleanup fusion passes, & matching
ProExpertProg Oct 15, 2025
de7405b
PR comments: add _custom_op suffix
ProExpertProg Oct 15, 2025
6253d5b
Add e2e to L40 distributed, move tests to start of B200 distributed
ProExpertProg Oct 15, 2025
876ef22
Fix tests, PR feedback
ProExpertProg Oct 15, 2025
e99a759
Break up B200 tests, move allreduce to H200
ProExpertProg Oct 15, 2025
a226864
Merge branch 'main' into luka/custom-op-matching-2
ProExpertProg Oct 16, 2025
ae581e1
Fix attention fusion test numerics
ProExpertProg Oct 16, 2025
c03b29b
Remove inductor graph partition from unit test (included in e2e tests)
ProExpertProg Oct 16, 2025
d2e0489
Relax tolerance for L40 fusion test
ProExpertProg Oct 16, 2025
65ef5fd
Merge branch 'main' into luka/custom-op-matching-2
ProExpertProg Oct 16, 2025
d4fe977
Fix NamedTuple
ProExpertProg Oct 16, 2025
6319e39
Update test durations
ProExpertProg Oct 16, 2025
e34d36d
More tweaking of precision
ProExpertProg Oct 16, 2025
f72ee43
Split original pr
ilmarkov Sep 4, 2025
c4c0215
Update bench
ilmarkov Sep 5, 2025
309d79e
Update threshold configuration
ilmarkov Sep 8, 2025
afcfd73
Move all_reduce from custom op in fused_moe
ilmarkov Sep 8, 2025
0248dcd
Linter fixes
ilmarkov Oct 16, 2025
18e4771
Upd
ilmarkov Oct 16, 2025
1debd8e
Merge branch 'main' into imarkov/fused_allreduce_torch_native
ilmarkov Oct 21, 2025
9516d2b
Upd after review
ilmarkov Oct 21, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Loading