You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Similar to the torch reference ops we have for attention and RoPE, create pure Python implementation for each quantized op to make it easier to debug our pipeline.