2.9.0/final.md
11 lines changed: 0 additions & 11 deletions
@@ -204,7 +204,6 @@ We move enabling `pin_memory` back inside `BaseDataLoaderIter`. This is required
- Support GQA for flash attention ([#157893](https://github.com/pytorch/pytorch/pull/157893))
## CUDA
-
- MXFP8 grouped GEMM support for `torch._scaled_grouped_mm` + submodule bump ([#162209](https://github.com/pytorch/pytorch/pull/162209))
- Add getter for CUDA graph exec to allow mutation of captured kernel params ([#161294](https://github.com/pytorch/pytorch/pull/161294))
- Implement support for `cudnn_batch_norm_out` kernel to replace the autogen approach ([#123020](https://github.com/pytorch/pytorch/pull/123020))
@@ -214,7 +213,6 @@ We move enabling `pin_memory` back inside `BaseDataLoaderIter`. This is required
## Dynamo
- Experimental API for ahead-of-time compiling models in fullgraph mode ([#161383](https://github.com/pytorch/pytorch/pull/161383))
-
- Toggle erroring/resume on graph break with `torch._dynamo.error_on_graph_break` ([#161739](https://github.com/pytorch/pytorch/pull/161739), [#161747](https://github.com/pytorch/pytorch/pull/161747))
- Add a hook for recompilations ([#157961](https://github.com/pytorch/pytorch/pull/157961))
## Export
@@ -264,18 +262,14 @@ We move enabling `pin_memory` back inside `BaseDataLoaderIter`. This is required
## ROCm
- OCP Micro-scaling Format (mx-fp8/mx-fp4) Support ([#151360](https://github.com/pytorch/pytorch/pull/151360))
-
- Support experimental CU carveout `torch._C._set_sm_carveout_experimental()` ([#149466](https://github.com/pytorch/pytorch/pull/149466))
-
- Add FP8 rowwise support to `_scaled_grouped_mm` ([#159075](https://github.com/pytorch/pytorch/pull/159075))
## XPU
- Enable `FlexAttention` on Intel GPU ([#143553](https://github.com/pytorch/pytorch/pull/143553))
-
- Enable `_int_mm` on Intel GPU ([#157769](https://github.com/pytorch/pytorch/pull/157769))
# Improvements
## AOTDispatcher
- Skip logging in fp8 activation quantization if there are no nodes to be quantized ([#158129](https://github.com/pytorch/pytorch/pull/158129))
- Add `aot_export_joint_with_descriptors` and `aot_compile_joint_with_descriptors` ([#158715](https://github.com/pytorch/pytorch/pull/158715))
-
- Allow keeping input mutations in the graph for `_aot_export_function` ([#157730](https://github.com/pytorch/pytorch/pull/157730))
- Extract out `prepare_aot_module_simplified` for use in next PR ([#158319](https://github.com/pytorch/pytorch/pull/158319))
- Rename modules in AOTAutograd ([#158449](https://github.com/pytorch/pytorch/pull/158449))
- Track descriptors for all inputs/outputs of AOTAutograd traced graph ([#158624](https://github.com/pytorch/pytorch/pull/158624))
@@ -291,14 +285,11 @@ We move enabling `pin_memory` back inside `BaseDataLoaderIter`. This is required
- Build `libtorch` without NVSHMEM ([#160910](https://github.com/pytorch/pytorch/pull/160910))
## Composability
-
- Set `enable_gqa` for `aten._scaled_dot_product_attention_math` decomp ([#158604](https://github.com/pytorch/pytorch/pull/158604))
-
- Meta implementation for `aten._scaled_dot_product_attention_math_for_mps` ([#159695](https://github.com/pytorch/pytorch/pull/159695))
- Meta implementation for `aten.add.Scalar` ([#161332](https://github.com/pytorch/pytorch/pull/161332))