[QNN] Fix per-channel broadcast with invalid axes #10936
Merged
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Per-channel quantization for broadcast ops was supported in #10718, but apparently an invalid
BroadcastAttrscan be created, where the channel axis can be set to 1 even though the input shape is of rank 1. I've hit this error when working on the quantized BERT model with the latestmain. The error can be reproduced by running the test (disabled on CI since it is slow)tvm/tests/python/unittest/test_meta_schedule_integration.py
Line 220 in c2488ac
This PR adds a workaround for cases like that, but I have a feeling that something needs to be fixed on the FQ2I side, where those invalid axes are created.
@sfvaroglu @AndrewZhaoLuo @mbrookhart @anwang2009