Commit 52bcb65
Add INT16 support to rescale operation
Pull Request resolved: #13802
Add INT16 support for RequantizeNode rescale operations in the ExecuTorch ARM backend.
This follows the pattern established for linear, mul, sigmoid, tanh, slice, view/transpose, cat, and FCNode operations.
Changes:
- Add INT16 dtype validation support in op_rescale.py
- Enable rescale operations for 16A8W quantization configuration
The 16A8W configuration uses 16-bit activations with 8-bit weights, enabling higher precision for activations while maintaining weight efficiency. RequantizeNode rescale operations are essential for proper quantization scaling in the 16A8W pipeline.
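
For illustration, a minimal sketch of what a rescale with INT16 input validation could look like. This is not the code in op_rescale.py: the names `rescale`, `INT_RANGES`, and `SUPPORTED_INPUT_DTYPES` are hypothetical, and the sketch uses a floating-point scale, whereas a real backend may encode the scale as an integer multiplier and shift.

```python
# Hypothetical sketch of a requantizing rescale; names are illustrative only.
INT_RANGES = {
    "int8": (-128, 127),
    "int16": (-32768, 32767),
    "int32": (-2**31, 2**31 - 1),
}

# Assumption: int16 is accepted alongside int8 and int32, as in a 16A8W setup.
SUPPORTED_INPUT_DTYPES = ("int8", "int16", "int32")


def rescale(values, in_dtype, out_dtype, scale, in_zp=0, out_zp=0):
    """Map quantized integers from one scale/zero-point to another."""
    if in_dtype not in SUPPORTED_INPUT_DTYPES:
        raise ValueError(f"unsupported input dtype: {in_dtype}")
    if out_dtype not in INT_RANGES:
        raise ValueError(f"unsupported output dtype: {out_dtype}")

    lo, hi = INT_RANGES[out_dtype]
    result = []
    for v in values:
        # Remove the input zero point, apply the scale ratio, add the output
        # zero point. Python's round() rounds halves to the nearest even value.
        q = round((v - in_zp) * scale) + out_zp
        # Saturate to the output dtype's representable range.
        result.append(max(lo, min(hi, q)))
    return result


# 16-bit activations rescaled down to 8 bits with a scale ratio of 1/256.
print(rescale([1000, -32768, 32767], "int16", "int8", 1 / 256))
```

The final clamp matters when narrowing from 16 to 8 bits: the largest int16 value rounds to 128 at this scale and is saturated to 127.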
ghstack-source-id: 308860606
@exported-using-ghexport
Differential Revision: [D80513725](https://our.internmc.facebook.com/intern/diff/D80513725/)
1 parent c3d7aa6 commit 52bcb65
1 file changed, +11 -4 lines changed