This PR refactors BF16Legalize to enable more f32 computations.
It also splits BF16Legalize into two steps:
- BF16ComputeLegalize changes all computation to f32 while keeping
  the external BF16 storage.
- BF16StorageLegalize changes all storage to u16.
BF16 kernels now accept tvm.nd.array values created with the bfloat16 type.
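
As a rough illustration of the two steps described above, the sketch below models a bf16 value as its 16-bit pattern (the u16 storage view) and performs the arithmetic in f32 (the compute view). It is plain Python and does not use TVM. The helper names (`f32_to_bf16_bits`, `bf16_bits_to_f32`, `bf16_add`) are hypothetical, and the round-to-nearest-even conversion is an assumption about how the narrowing is done, not a copy of the pass.

```python
import struct


def f32_to_bits(x: float) -> int:
    """Reinterpret a value (rounded to f32) as its 32-bit pattern."""
    return struct.unpack("<I", struct.pack("<f", x))[0]


def bits_to_f32(bits: int) -> float:
    """Reinterpret a 32-bit pattern as an f32 value."""
    return struct.unpack("<f", struct.pack("<I", bits & 0xFFFFFFFF))[0]


def bf16_bits_to_f32(b16: int) -> float:
    """Widen bf16 storage (u16) to f32: bf16 is the upper 16 bits of f32."""
    return bits_to_f32((b16 & 0xFFFF) << 16)


def f32_to_bf16_bits(x: float) -> int:
    """Narrow f32 to bf16 storage (u16) with round-to-nearest-even.

    Assumption: NaN and overflow handling are omitted for brevity.
    """
    bits = f32_to_bits(x)
    rounding_bias = 0x7FFF + ((bits >> 16) & 1)
    return ((bits + rounding_bias) >> 16) & 0xFFFF


def bf16_add(a_bits: int, b_bits: int) -> int:
    """Add two bf16 values: load u16, compute in f32, store back as u16."""
    a = bf16_bits_to_f32(a_bits)
    b = bf16_bits_to_f32(b_bits)
    return f32_to_bf16_bits(a + b)


if __name__ == "__main__":
    one = f32_to_bf16_bits(1.0)
    one_and_half = f32_to_bf16_bits(1.5)
    total = bf16_add(one, one_and_half)
    print(hex(total), bf16_bits_to_f32(total))
```

In this sketch only the load and store boundaries touch the 16-bit representation; everything in between is ordinary f32 arithmetic, which is the separation the two passes are described as making.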