Improves policy behavior of Franka Cabinet and Allegro (#111)

# Description Improves policy behavior for training Franka Cabinet direct environment and Repose Cube Allegro direct environments. ## Type of change - Bug fix (non-breaking change which fixes an issue) ## Checklist - [x] I have run the [`pre-commit` checks](https://pre-commit.com/) with `./isaaclab.sh --format` - [ ] I have made corresponding changes to the documentation - [x] My changes generate no new warnings - [ ] I have added tests that prove my fix is effective or that my feature works - [ ] I have updated the changelog and the corresponding version in the extension's `config/extension.toml` file - [ ] I have added my name to the `CONTRIBUTORS.md` or my name already exists there

Improves policy behavior of Franka Cabinet and Allegro (#111)
# Description Improves policy behavior for training Franka Cabinet direct environment and Repose Cube Allegro direct environments. ## Type of change - Bug fix (non-breaking change which fixes an issue) ## Checklist - [x] I have run the [`pre-commit` checks](https://pre-commit.com/) with `./isaaclab.sh --format` - [ ] I have made corresponding changes to the documentation - [x] My changes generate no new warnings - [ ] I have added tests that prove my fix is effective or that my feature works - [ ] I have updated the changelog and the corresponding version in the extension's `config/extension.toml` file - [ ] I have added my name to the `CONTRIBUTORS.md` or my name already exists there
293a6c2b · Kelly Guo · David Hoeller · a655ad95 · 293a6c2b · 293a6c2b
Commit 293a6c2b authored Aug 30, 2024 by Kelly Guo Committed by David Hoeller Sep 20, 2024
4 changed files
--- a/source/extensions/omni.isaac.lab/omni/isaac/lab/sensors/camera/tiled_camera.py
+++ b/source/extensions/omni.isaac.lab/omni/isaac/lab/sensors/camera/tiled_camera.py
@@ -24,7 +24,7 @@ from ..sensor_base import SensorBase
 from .camera import Camera
 if TYPE_CHECKING:
-    from .camera_cfg import TiledCameraCfg
+    from .tiled_camera_cfg import TiledCameraCfg
 class TiledCamera(Camera):

--- a/source/extensions/omni.isaac.lab_tasks/omni/isaac/lab_tasks/direct/allegro_hand/agents/rl_games_ppo_cfg.yaml
+++ b/source/extensions/omni.isaac.lab_tasks/omni/isaac/lab_tasks/direct/allegro_hand/agents/rl_games_ppo_cfg.yaml
@@ -30,7 +30,7 @@ params:
          val: 0
        fixed_sigma: True
    mlp:
-      units: [512, 512, 256, 128]
+      units: [1024, 512, 256, 128]
      activation: elu
      d2rl: False

--- a/source/extensions/omni.isaac.lab_tasks/omni/isaac/lab_tasks/direct/allegro_hand/agents/rsl_rl_ppo_cfg.py
+++ b/source/extensions/omni.isaac.lab_tasks/omni/isaac/lab_tasks/direct/allegro_hand/agents/rsl_rl_ppo_cfg.py
@@ -21,8 +21,8 @@ class AllegroHandPPORunnerCfg(RslRlOnPolicyRunnerCfg):
    empirical_normalization = True
    policy = RslRlPpoActorCriticCfg(
        init_noise_std=1.0,
-        actor_hidden_dims=[512, 512, 256, 128],
+        actor_hidden_dims=[1024, 512, 256, 128],
-        critic_hidden_dims=[512, 512, 256, 128],
+        critic_hidden_dims=[1024, 512, 256, 128],
        activation="elu",
    )
    algorithm = RslRlPpoAlgorithmCfg(

--- a/source/extensions/omni.isaac.lab_tasks/omni/isaac/lab_tasks/direct/allegro_hand/allegro_hand_env_cfg.py
+++ b/source/extensions/omni.isaac.lab_tasks/omni/isaac/lab_tasks/direct/allegro_hand/allegro_hand_env_cfg.py
@@ -112,7 +112,7 @@ class AllegroHandEnvCfg(DirectRLEnvCfg):
    fall_penalty = 0
    fall_dist = 0.24
    vel_obs_scale = 0.2
-    success_tolerance = 0.1
+    success_tolerance = 0.2
    max_consecutive_success = 0
    av_factor = 0.1
    act_moving_average = 1.0