AttributeError: 'FullyShardedDataParallelPlugin' object has no attribute 'activation_checkpointing'

### System Info


```
Traceback (most recent call last):
  File "/workspace/run/run_llm.py", line 717, in <module>
    main()
  File "/workspace/run/run_llm.py", line 644, in main
    trainer = Trainer(
  File "/opt/conda/lib/python3.10/site-packages/transformers/trainer.py", line 342, in __init__
    self.create_accelerator_and_postprocess()
  File "/opt/conda/lib/python3.10/site-packages/transformers/trainer.py", line 3900, in create_accelerator_and_postprocess
    "activation_checkpointing", fsdp_plugin.activation_checkpointing
AttributeError: 'FullyShardedDataParallelPlugin' object has no attribute 'activation_checkpointing'
```

https://github.com/huggingface/transformers/blob/aea761499f4b1193f2706f471442da6f9df65d65/src/transformers/trainer.py#L3893-L3907

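The `FullyShardedDataParallelPlugin` class in [accelerate](https://github.com/huggingface/accelerate) **v0.22.0** does not have an `activation_checkpointing` attribute, but the class on the **main** branch does. The `Trainer` code linked above reads that attribute unconditionally, so it raises the `AttributeError` when run against v0.22.0.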

**v0.22.0**
https://github.com/huggingface/accelerate/blob/6b3e559926afc4b9a127eb7762fc523ea0ea656a/src/accelerate/utils/dataclasses.py#L778

**main**
https://github.com/huggingface/accelerate/blob/739b135f8367becb67ffaada12fe76e3aa60fefd/src/accelerate/utils/dataclasses.py#L783
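One way to confirm which side of this difference an installed version falls on is to check whether the dataclass declares the field. The sketch below is only illustrative: `has_dataclass_field`, `PluginWithoutField`, and `PluginWithField` are hypothetical names, and the two classes are stand-ins rather than the real accelerate classes.

```python
from dataclasses import dataclass, fields


def has_dataclass_field(cls, name: str) -> bool:
    """Return True if the dataclass `cls` declares a field called `name`."""
    return any(f.name == name for f in fields(cls))


# Hypothetical stand-ins for the two plugin variants; NOT the real accelerate classes.
@dataclass
class PluginWithoutField:
    limit_all_gathers: bool = False


@dataclass
class PluginWithField:
    limit_all_gathers: bool = False
    activation_checkpointing: bool = False


print(has_dataclass_field(PluginWithoutField, "activation_checkpointing"))  # False
print(has_dataclass_field(PluginWithField, "activation_checkpointing"))  # True
```

With accelerate installed, the same helper could presumably be pointed at `accelerate.utils.FullyShardedDataParallelPlugin`, since both linked files define the class in `dataclasses.py`.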



### Who can help?

_No response_

### Information

- [ ] The official example scripts
- [X] My own modified scripts

### Tasks

- [ ] An officially supported task in the `examples` folder (such as GLUE/SQuAD, ...)
- [X] My own task or dataset (give details below)

### Reproduction

-

### Expected behavior

-

	# post accelerator creation setup
	if self.is_fsdp_enabled:
	    fsdp_plugin = self.accelerator.state.fsdp_plugin
	    fsdp_plugin.limit_all_gathers = self.args.fsdp_config.get(
	        "limit_all_gathers", fsdp_plugin.limit_all_gathers
	    )
	    fsdp_plugin.activation_checkpointing = self.args.fsdp_config.get(
	        "activation_checkpointing", fsdp_plugin.activation_checkpointing
	    )
	    if fsdp_plugin.activation_checkpointing and self.args.gradient_checkpointing:
	        raise ValueError(
	            "The activation_checkpointing in FSDP config and the gradient_checkpointing in training arg "
	            "can't be set to True simultaneously. Please use FSDP's activation_checkpointing logic "
	            "when using FSDP."
	        )
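A defensive variant of the setup above could read the attribute with a default instead of assuming it exists. This is a minimal sketch, not the fix applied upstream: `LegacyPlugin` and `apply_fsdp_config` are hypothetical names, and `LegacyPlugin` only mimics a plugin that lacks `activation_checkpointing`.

```python
from dataclasses import dataclass


@dataclass
class LegacyPlugin:
    """Hypothetical stand-in for a plugin without `activation_checkpointing`."""

    limit_all_gathers: bool = False


def apply_fsdp_config(plugin, fsdp_config: dict, gradient_checkpointing: bool) -> None:
    """Mirror the post-creation setup, tolerating a missing attribute."""
    plugin.limit_all_gathers = fsdp_config.get(
        "limit_all_gathers", plugin.limit_all_gathers
    )
    # Fall back to False when the plugin class does not define the attribute.
    current = getattr(plugin, "activation_checkpointing", False)
    plugin.activation_checkpointing = fsdp_config.get(
        "activation_checkpointing", current
    )
    if plugin.activation_checkpointing and gradient_checkpointing:
        raise ValueError(
            "activation_checkpointing and gradient_checkpointing "
            "can't be set to True simultaneously."
        )


plugin = LegacyPlugin()
apply_fsdp_config(plugin, {"limit_all_gathers": True}, gradient_checkpointing=True)
print(plugin.limit_all_gathers, plugin.activation_checkpointing)  # True False
```

Note that a guard like this only avoids the `AttributeError`. A plugin version that does not know about `activation_checkpointing` would not act on the value, so upgrading accelerate to a version that defines the attribute is likely still needed to use the feature.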

AttributeError: 'FullyShardedDataParallelPlugin' object has no attribute 'activation_checkpointing' #25988
