Fix 4bit quantization with blocksize = 4096 #1160

matthewdouglas · 2024-03-29T15:50:14Z

Fixes #1157. Not a particularly practical use case, but I would consider it a bug nonetheless.

matthewdouglas · 2024-03-29T15:52:24Z

csrc/ops.cu


  if(blocksize == 4096)
-    kQuantizeBlockwise<T, 4096, 4, STOCHASTIC, 0><<<num_blocks, 1024>>>(code, A, absmax, out, rand, rand_offset, n);
+    kQuantizeBlockwise<T, 4096, 4, STOCHASTIC, DATA_TYPE><<<num_blocks, 1024>>>(code, A, absmax, out, rand, rand_offset, n);


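Somewhat subtle, but here's where the issue was: the blocksize == 4096 branch hard-coded the last template argument to 0 instead of passing DATA_TYPE through.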
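To make the effect of that one-token change concrete, here is a minimal Python model of the dispatch, not the actual CUDA code. The constant names and the values 1 and 2 are assumptions for illustration; only the hard-coded 0 comes from the diff above. It also assumes the other block sizes already forwarded the data type, which is consistent with the issue being specific to 4096.

```python
# Hypothetical constants for illustration; 0 is the value that was
# hard-coded in the blocksize == 4096 branch of the diff.
GENERAL_8BIT, FP4, NF4 = 0, 1, 2


def kernel_data_type_buggy(blocksize, data_type):
    """Data type the kernel is instantiated with before the fix."""
    if blocksize == 4096:
        return GENERAL_8BIT  # the caller's data_type is silently ignored
    return data_type


def kernel_data_type_fixed(blocksize, data_type):
    """Data type the kernel is instantiated with after the fix."""
    return data_type  # forwarded for every block size


buggy_4096 = kernel_data_type_buggy(4096, NF4)
fixed_4096 = kernel_data_type_fixed(4096, NF4)
```

In this model a caller asking for NF4 at blocksize 4096 gets a kernel instantiated with the 0 data type before the fix, and the requested type after it.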

matthewdouglas · 2024-03-29T15:53:41Z

tests/test_functional.py

-def test_fp4_quant(dtype):
+@pytest.mark.parametrize("quant_type", ["fp4", "nf4"])
+@pytest.mark.parametrize("blocksize", [64, 128, 256, 512, 1024, 2048, 4096])
+def test_4bit_quant(dtype, quant_type, blocksize):


Previously only fp4 with blocksize=64 was tested here. I expanded the tests to cover nf4 and every block size up to 4096, but note that accuracy degrades with the larger block sizes.
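The degradation at larger block sizes is expected from how blockwise absmax scaling works: each block shares one scale, so a larger block is more likely to contain an outlier that stretches the scale for every value in it. The sketch below illustrates this in pure Python. It is not the bitsandbytes implementation; the evenly spaced 16-entry codebook is a stand-in for the real FP4/NF4 codebooks, and all function names are hypothetical.

```python
import random

# Stand-in 16-entry codebook: evenly spaced values in [-1, 1].
# The real FP4/NF4 codebooks differ; this is only for illustration.
CODE = [-1.0 + 2.0 * i / 15 for i in range(16)]


def quantize_blockwise(values, blocksize):
    """Return (indices, absmax) using one absmax scale per block."""
    indices, absmax = [], []
    for start in range(0, len(values), blocksize):
        block = values[start:start + blocksize]
        scale = max(abs(v) for v in block) or 1.0  # avoid dividing by zero
        absmax.append(scale)
        for v in block:
            normed = v / scale
            # Pick the nearest codebook entry to the normalized value.
            indices.append(min(range(16), key=lambda i: abs(CODE[i] - normed)))
    return indices, absmax


def dequantize_blockwise(indices, absmax, blocksize):
    """Map indices back to values using each block's stored scale."""
    return [CODE[idx] * absmax[pos // blocksize] for pos, idx in enumerate(indices)]


def mean_abs_error(values, blocksize):
    idx, absmax = quantize_blockwise(values, blocksize)
    restored = dequantize_blockwise(idx, absmax, blocksize)
    return sum(abs(a - b) for a, b in zip(values, restored)) / len(values)


rng = random.Random(0)
data = [rng.gauss(0.0, 1.0) for _ in range(8192)]
err_small = mean_abs_error(data, 64)
err_large = mean_abs_error(data, 4096)
print(f"blocksize=64: {err_small:.4f}  blocksize=4096: {err_large:.4f}")
```

On normally distributed data the round-trip error at blocksize 4096 comes out larger than at 64, which is why a parametrized test over block sizes would plausibly need looser tolerances at the high end.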

github-actions · 2024-03-29T15:54:04Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

abhilash1910

This is a good find, LGTM.

Titus-von-Koeller · 2024-04-02T10:28:59Z

Cool! I'm checking with Tim for a review. Will get back to you.

TimDettmers · 2024-04-02T13:31:08Z

This looks good, thanks for catching it!

matthewdouglas added 2 commits March 29, 2024 11:34

Fix 4bit quantization with blocksize=4096

c17fb8e

fix formatting for install_cuda.py

a471456

abhilash1910 approved these changes Mar 30, 2024

Titus-von-Koeller merged commit 76885a4 into bitsandbytes-foundation:main Apr 2, 2024
