docs/source/overview/reinforcement-learning · e0a8df23ccd382572ac5b5b3faac0511ef2fc929 · kevin / KincoActuatorIsaacLab

Adds a configuration example for Student-Teacher Distillation (#3100) · 6c06a58b

Clemens Schwarke authored Sep 05, 2025

# Description

This PR adds a configuration class to distill a walking policy for
ANYmal D as an example. The training is run almost the same way as a
normal PPO training. The only difference is that a policy checkpoint
needs to be passed via the `--load_run` CLI argument, to serve as the
teacher.

Additionally, the `RslRlDistillationRunnerCfg` got moved to the correct
file.

## Type of change

- New feature (non-breaking change which adds functionality)

## Checklist

- [x] I have run the [`pre-commit` checks](https://pre-commit.com/) with
`./isaaclab.sh --format`
- [ ] I have made corresponding changes to the documentation
- [x] My changes generate no new warnings
- [ ] I have added tests that prove my fix is effective or that my
feature works
- [ ] I have updated the changelog and the corresponding version in the
extension's `config/extension.toml` file
- [ ] I have added my name to the `CONTRIBUTORS.md` or my name already
exists there

---------
Co-authored-by: Kelly Guo <kellyg@nvidia.com>

6c06a58b

Name	Last commit	Last update
..
index.rst		Loading commit data...
performance_benchmarks.rst		Loading commit data...
rl_existing_scripts.rst		Loading commit data...
rl_frameworks.rst		Loading commit data...
training_guide.rst		Loading commit data...

index.rst

Download source code

Download this directory