    Fixes imitation learning workflow for lift environment (#451) · 83d62e21
    James Smith authored
    # Description
    
    This PR fixes the imitation learning workflow so that the
    `collect_demonstrations`, `train`, and `play` scripts all run without
    throwing exceptions. I haven't validated that training actually produces
    a successful policy, only that the loss decreases within the first few
    iterations.
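
The "loss decreases within the first few iterations" check could be automated as a small smoke test. The following is only a minimal sketch under stated assumptions: the helper name `loss_decreased`, the `window` parameter, and the sample loss values are all hypothetical and are not part of this PR, Orbit, or Robomimic. It compares the mean of the first few logged losses against the mean of the last few.

```python
from typing import Sequence


def loss_decreased(losses: Sequence[float], window: int = 3) -> bool:
    """Return True if the mean of the last `window` losses is lower than
    the mean of the first `window` losses.

    Hypothetical helper for illustration; not part of Orbit or Robomimic.
    """
    if window <= 0:
        raise ValueError("window must be positive")
    if len(losses) < 2 * window:
        raise ValueError("need at least 2 * window loss values")
    head = sum(losses[:window]) / window   # average loss at the start
    tail = sum(losses[-window:]) / window  # average loss at the end
    return tail < head


if __name__ == "__main__":
    # Made-up per-iteration losses for illustration, not from a real run.
    example_losses = [2.31, 2.05, 1.88, 1.74, 1.69, 1.52]
    print(loss_decreased(example_losses))
```

A check like this only shows that optimization is making progress; it says nothing about whether the resulting policy succeeds at the task.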
    
    A follow-up task might be to make sure that the chosen observation terms
    can still result in a good policy, but that is outside the scope of
    updating this workflow for API changes in Orbit and Robomimic.
    
    Fixes #387 
    
    ## Type of change
    
    - Bug fix (non-breaking change which fixes an issue)
    
    ## Screenshot
    
    ## Checklist
    
    - [x] I have run the [`pre-commit` checks](https://pre-commit.com/) with
    `./orbit.sh --format`
    - [ ] I have made corresponding changes to the documentation
    - [x] My changes generate no new warnings
    - [ ] I have added tests that prove my fix is effective or that my
    feature works
    - [x] I have run all the tests with `./orbit.sh --test` and they pass
    - [x] I have updated the changelog and the corresponding version in the
    extension's `config/extension.toml` file
    - [x] I have added my name to the `CONTRIBUTORS.md` or my name already
    exists there
    
    ---------
    Signed-off-by: James Smith <142246516+jsmith-bdai@users.noreply.github.com>
    Co-authored-by: Mayank Mittal <12863862+Mayankm96@users.noreply.github.com>
Name
rl_games
robomimic
rsl_rl
sb3
skrl