You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Updates SkillGen documentation for data gen command and success rates (#3702)
# SkillGen documentation: data gen command cleanup and success-rate
guidance
## Description
This PR updates the SkillGen documentation in
`docs/source/overview/imitation-learning/skillgen.rst`:
- Removes a redundant `--headless` flag from a data generation command
example.
- Adds a note with success-rate guidelines and training recommendations
for cube stacking and bin cube stacking.
- Clarifies minor text details, including correcting the planning phase
order to “Retreat → Contact → Approach” for consistency.
Motivation: Improve clarity, set realistic expectations on data
generation and downstream policy performance, and align docs with actual
planner behavior.
Dependencies: None
## Type of change
- Documentation update
## Checklist
- [x] I have read and understood the [contribution
guidelines](https://isaac-sim.github.io/IsaacLab/main/source/refs/contributing.html)
- [x] I have run the [`pre-commit` checks](https://pre-commit.com/) with
`./isaaclab.sh --format`
- [x] I have made corresponding changes to the documentation
- [x] My changes generate no new warnings
- [ ] I have added tests that prove my fix is effective or that my
feature works
- [ ] I have updated the changelog and the corresponding version in the
extension's `config/extension.toml` file
- [x] I have added my name to the `CONTRIBUTORS.md` or my name already
exists there
**Expected Success Rates and Recommendations for Cube Stacking and Bin Cube Stacking Tasks**
401
+
402
+
* SkillGen data generation and downstream policy success are sensitive to the task and the quality of dataset annotation, and can show high variance.
403
+
* For cube stacking and bin cube stacking, data generation success is typically 40% to 70% when the dataset is properly annotated per the instructions.
404
+
* Behavior Cloning (BC) policy success from 1000 generated demonstrations trained for 2000 epochs (default) is typically 40% to 85% for these tasks, depending on data quality.
405
+
* Recommendation: Train for the default 2000 epochs with about 1000 generated demonstrations, and evaluate multiple checkpoints saved after the 1000th epoch to select the best-performing policy.
0 commit comments