Releases: ROCm/TransformerEngine
v2.1 ROCm
What's Changed
- Enable Multi-latent attention
- gfx950 support
- Add release wheels building support
- Remove rocBLAS support
- Add layernorm Triton kernels
Upstream changes:
https://github.com/NVIDIA/TransformerEngine/releases/tag/v2.1
https://github.com/NVIDIA/TransformerEngine/releases/tag/v2.0
Full Changelog: v1.14_rocm...v2.1_rocm
v1.14 ROCm
[CI] deprecate praxis installation and tests
- Removed praxis installation and related test setup from `ci/jax.sh`
- Installed `flax>=0.7.1` with `typing_extensions>=4.12.2`
v1.13 ROCm
[CI] deprecate praxis installation and tests
- Removed praxis installation and related test setup from `ci/jax.sh`
- Installed `flax>=0.7.1` with `typing_extensions>=4.12.2`
v1.9 ROCm
[ROCm] backport rmsnorm triton kernels into rocm v1.9 (#169)
- [ROCm] backport rmsnorm triton kernels into rocm v1.9
- [ROCm] use single worker for CI
v1.12 ROCm
v1.12_rocm IFU release v1.12
v1.11 ROCm
[PyTorch] Drop FA as an installation requirement (#1226) (#125)
Upstream cherry-pick 161b1d9 + partially e762592
Co-authored-by: Charlene Yang