* gemlite integration in torchao
Summary:
This PR adds support for gemlite kernels in torchao using a subclass
integration with the gemlite_uintx_weight_only constructor. This works
for int4 grouped and ungrouped asymmetric weight-only quantization and
int8 symmetric ungrouped quantization for fp16 models. TP support
through DTensor is included in this PR.
In the process of integrating gemlite into AQT, I also made some fixes to
a few quant primitives that are now being used but previously were not.
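For readers unfamiliar with the terminology, the following is a minimal
pure-Python sketch of what "grouped asymmetric weight-only quantization"
means numerically. It is an illustration only, not the torchao or gemlite
implementation: the function names, the use of the group minimum as a
floating-point zero point, and the rounding scheme are all assumptions
made for this example.

```python
def quantize_asymmetric(weights, group_size, bits=4):
    """Quantize a flat list of floats to unsigned integer codes, one
    (scale, zero) pair per group. Hypothetical helper for illustration."""
    qmax = (1 << bits) - 1  # 15 for 4-bit codes
    codes, scales, zeros = [], [], []
    for start in range(0, len(weights), group_size):
        group = weights[start:start + group_size]
        lo, hi = min(group), max(group)
        # Asymmetric: the range [lo, hi] is mapped onto [0, qmax].
        # Fall back to a scale of 1.0 when the group is constant.
        scale = (hi - lo) / qmax or 1.0
        scales.append(scale)
        zeros.append(lo)  # assumption: zero point kept in the float domain
        for w in group:
            q = round((w - lo) / scale)
            codes.append(min(qmax, max(0, q)))  # clamp to the valid range
    return codes, scales, zeros


def dequantize(codes, scales, zeros, group_size):
    """Reconstruct approximate floats from codes and per-group parameters."""
    return [
        zeros[i // group_size] + scales[i // group_size] * q
        for i, q in enumerate(codes)
    ]
```

Setting group_size to the full row length corresponds to the "ungrouped"
case, where a single scale and zero point cover the whole row.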
Test Plan:
test_integration.py -k "test_gemlite_layout"
test_affine_quantized_tensor_parallel.py -k "test_tp_gemlite"
see benchmarks.sh for gemlite benchmarks as well.
new gemlite integration using pip install
tests ran
fixing gemlite to do int4 matmul instead of fp16 fp16
running tests
more testing
AQT integration wip
Wip
testing on gemlite a100_int8_tuning branch
gemlite subclass testing bitpacking 8 bits
bug fixing stuff
hicham fixes
new benchmarks
testing gemlite 8 bit
WIP
tp support
wip
final
* fixing regressions