Flexiv Rizon4s Reach Policy Support by shauryadNv · Pull Request #4526 · isaac-sim/IsaacLab

shauryadNv · 2026-02-03T22:36:27Z

Description

Added support to train a reach policy for the Flexiv Rizon 4s robot. Defined new envs/tasks for this.
Added required robot config for the Flexiv Rizon 4s.
Added documentation for a tutorial on training a reach policy (with both, UR and Rizon).

Type of change

New feature (non-breaking change which adds functionality)
Documentation update

Checklist

I have read and understood the contribution guidelines
I have run the pre-commit checks with ./isaaclab.sh --format
I have made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
I have updated the changelog and the corresponding version in the extension's config/extension.toml file
I have added my name to the CONTRIBUTORS.md or my name already exists there

greptile-apps · 2026-02-03T22:40:36Z

Greptile Overview

Greptile Summary

This PR adds support for training reach policies on the Flexiv Rizon 4s robot, following the same pattern as the existing UR10e implementation.

Key additions:

New robot configuration for Flexiv Rizon 4s (7-DOF arm) with actuator specifications
Complete environment configs for training, play mode, and ROS inference deployment
PPO training configuration with 1500 iterations and 256-128-64 network architecture
Comprehensive documentation tutorial covering both UR10e and Rizon 4s reach policy training
Workspace bounds configured for Rizon 4s: x[0,0.8]m, y[-0.4,0.4]m, z[0.05,0.75]m

Issues found:

Critical: Hardcoded absolute path in flexiv_rizon.py will prevent other users from running this code
Duplicate policy_action_space assignment in ROS inference config
Minor comment inconsistency about visualization setting

Confidence Score: 2/5

This PR cannot be merged as-is due to hardcoded local path that will break for other users
The hardcoded absolute path /home/shauryad/workspaces/... in the robot config is a blocker that must be fixed. The duplicate variable assignment should also be corrected. Otherwise the implementation follows established patterns correctly.
Pay special attention to flexiv_rizon.py which contains a hardcoded path that must be changed to a nucleus directory path before merging

Important Files Changed

Filename	Overview
source/isaaclab_assets/isaaclab_assets/robots/flexiv_rizon.py	adds Flexiv Rizon 4s robot config with hardcoded local path that will break for other users
source/isaaclab_tasks/isaaclab_tasks/manager_based/manipulation/deploy/reach/config/rizon_4s/joint_pos_env_cfg.py	reach environment config for Rizon 4s with workspace bounds and action scale, minor comment inconsistency
source/isaaclab_tasks/isaaclab_tasks/manager_based/manipulation/deploy/reach/config/rizon_4s/ros_inference_env_cfg.py	ROS inference config with duplicate policy_action_space assignment

Sequence Diagram

sequenceDiagram
    participant User
    participant GymEnv as Gym Environment
    participant ReachEnv as Rizon4sReachEnvCfg
    participant Robot as Flexiv Rizon 4s
    participant Policy as PPO Policy
    participant FrameTransformer

    User->>GymEnv: Register Isaac-Deploy-Reach-Rizon4s-v0
    GymEnv->>ReachEnv: Initialize environment config
    ReachEnv->>Robot: Load FLEXIV_RIZON4S_CFG
    Note over Robot: 7-DOF arm with actuator limits
    ReachEnv->>FrameTransformer: Setup ee_frame_wrt_base_frame
    Note over FrameTransformer: Track flange pose relative to base_link
    
    loop Training Episode
        ReachEnv->>ReachEnv: Sample target pose (pos + quat)
        Note over ReachEnv: Workspace: x[0,0.8], y[-0.4,0.4], z[0.05,0.75]
        ReachEnv->>Policy: Observe (joint_pos[7], joint_vel[7], target_pose[7])
        Note over Policy: Input dim: 21
        Policy->>Policy: Forward pass (256→128→64 network)
        Policy->>ReachEnv: Output incremental joint positions[7]
        Note over Policy: Scale: 0.0625
        ReachEnv->>Robot: Apply RelativeJointPositionAction
        Robot->>Robot: Update joint positions
        FrameTransformer->>ReachEnv: Compute end-effector pose
        ReachEnv->>ReachEnv: Calculate reward (pose tracking error)
        ReachEnv->>Policy: Return (obs, reward, done)
    end
    
    User->>GymEnv: Load trained policy for deployment
    GymEnv->>ReachEnv: Initialize ROS inference config
    Note over ReachEnv: Fixed target or ROS commanded
    loop Deployment
        ReachEnv->>Policy: Observe robot state
        Policy->>Robot: Command joint increments
        Robot->>Robot: Execute motion
    end

greptile-apps

_{3 files reviewed, 3 comments}

_{Edit Code Review Agent Settings | Greptile}

source/isaaclab_assets/isaaclab_assets/robots/flexiv_rizon.py

...aclab_tasks/manager_based/manipulation/deploy/reach/config/rizon_4s/ros_inference_env_cfg.py

greptile-apps · 2026-02-03T22:40:41Z

.../isaaclab_tasks/manager_based/manipulation/deploy/reach/config/rizon_4s/joint_pos_env_cfg.py

+        # Disable visualization for the goal pose because the commands are generated wrt to the base frame
+        self.commands.ee_pose.debug_vis = True


comment says "Disable visualization" but code sets debug_vis = True

consider clarifying whether visualization should be enabled or disabled for Rizon 4s and update comment accordingly

...aclab_tasks/manager_based/manipulation/deploy/reach/config/rizon_4s/agents/rsl_rl_ppo_cfg.py

...lab_tasks/isaaclab_tasks/manager_based/manipulation/deploy/reach/config/rizon_4s/__init__.py

AntoineRichard · 2026-02-04T08:26:48Z

Hello, it's very unlikely we will merge this to main. Could you target develop instead? Thanks!

kellyguo11

LGTM, we can merge this once we resolve the asset upload and TODO in the doc

isaaclab-review-bot

🤖 Isaac Lab Review Bot

Summary

This PR adds support for the Flexiv Rizon 4s robot with a reach policy training pipeline. It introduces:

A new robot asset configuration (FLEXIV_RIZON4S_CFG)
Gym environment registrations (train, play, ROS inference)
PPO training configuration
Comprehensive documentation tutorial

Design Assessment

The overall structure follows the established pattern set by the existing UR10e reach deployment config. The new Rizon 4s config properly extends ReachEnvCfg, overrides robot-specific parameters, and registers the expected gym environments. Good adoption of existing abstractions.

Architecture Impact

Additive only — no changes to shared base classes or APIs
Blast radius is limited to the new robot config and documentation
No risk of breaking existing UR10e environments

Implementation Verdict

🟡 Minor issues found — 2 bugs (1 docs build-breaker, 1 missing feature parity), plus minor style observations.

Test Coverage

❌ No tests added (checkbox unchecked in PR template). This is acceptable for a new config if manually validated, but integration tests for environment instantiation would strengthen confidence.

CI Status

✅ pre-commit passed
⏳ license-check, Docker builds, and docs builds are still pending

Findings

See inline comments for details.

docs/source/policy_deployment/index.rst

...aclab_tasks/manager_based/manipulation/deploy/reach/config/rizon_4s/ros_inference_env_cfg.py

isaaclab-review-bot · 2026-03-30T22:06:59Z

...aclab_tasks/manager_based/manipulation/deploy/reach/config/rizon_4s/ros_inference_env_cfg.py

+            "joint4",
+            "joint5",
+            "joint6",
+            "joint7",


⚠️ Suggestion: Use DRY pattern for action_scale_joint_space

The UR10e config uses a cleaner pattern:

self.action_scale_joint_space = [self.joint_action_scale] * self.action_space

Instead of manually listing the scale 7 times. This avoids a subtle bug if action_space is changed but the list length isn't updated.

Suggested change

"joint7",

self.action_scale_joint_space = [self.joint_action_scale] * self.action_space

isaaclab-review-bot · 2026-03-30T22:06:59Z

...aclab_tasks/manager_based/manipulation/deploy/reach/config/rizon_4s/ros_inference_env_cfg.py

+        self.obs_order = ["arm_dof_pos", "arm_dof_vel", "target_pos", "target_quat"]
+        self.policy_action_space = "joint"
+        self.arm_joint_names = [
+            "joint1",


⚠️ Minor: Duplicate policy_action_space assignment

self.policy_action_space = "joint" is set on line 22 and again on line 30. The second assignment is redundant.

Suggested change

"joint1",

self.arm_joint_names = [

source/isaaclab_tasks/docs/CHANGELOG.rst

.../isaaclab_tasks/manager_based/manipulation/deploy/reach/config/rizon_4s/joint_pos_env_cfg.py

source/isaaclab_assets/isaaclab_assets/robots/flexiv.py

isaaclab-review-bot

🤖 Isaac Lab Review Bot — Follow-up

1 new issue found in the latest push.

isaaclab-review-bot · 2026-03-31T20:02:08Z

docs/source/policy_deployment/04_reach/reach_policy.rst

+
+.. figure:: ../../_static/policy_deployment/03_reach/reach_train.jpg
+    :align: center
+    :figwidth: 100%


🐛 Bug: Image extension mismatch — broken figure

The added image file is reach_train.png but this directive references reach_train.jpg. Sphinx will not find the file and the figure will be missing from the rendered docs.

Suggested change

:figwidth: 100%

.. figure:: ../../_static/policy_deployment/03_reach/reach_train.png

isaaclab-review-bot

🤖 Isaac Lab Review Bot — Follow-up #2

Previous image extension bug (.jpg → .png) is fixed ✅

1 new minor docs issue found.

isaaclab-review-bot · 2026-03-31T21:53:56Z

docs/source/policy_deployment/04_reach/reach_policy.rst

+- ``--num_envs 4096``: Runs 4096 parallel environments for efficient data collection
+- ``--video``: Records videos to monitor training progress
+- ``--video_length 720``: Each video captures exactly one full episode (``episode_length_s / (sim.dt * decimation)`` = ``12.0 / (1/120 * 2)`` = 720 steps)
+- ``--video_interval 72000``: Records a video every 72000 environment steps (~every 100 training iterations).


📝 Minor: Incorrect approximation in video interval comment

The comment says "~every 100 training iterations" but with num_steps_per_env = 512:

72000 / 512 ≈ 140 iterations, not 100

The number 72000 comes from 720 steps/episode × 100 = 72000, so this should read:

Suggested change

- ``--video_interval 72000``: Records a video every 72000 environment steps (~every 100 training iterations).

- ``--video_interval 72000``: Records a video every 72000 environment steps (~every 100 episodes).

Made-with: Cursor

docs/source/policy_deployment/04_reach/reach_policy.rst

...aclab_tasks/manager_based/manipulation/deploy/reach/config/rizon_4s/ros_inference_env_cfg.py

Co-authored-by: Ashwin Varghese Kuruttukulam <123109010+ashwinvkNV@users.noreply.github.com> Signed-off-by: shauryadNv <shauryad@nvidia.com>

Co-authored-by: isaaclab-review-bot[bot] <270793704+isaaclab-review-bot[bot]@users.noreply.github.com> Signed-off-by: shauryadNv <shauryad@nvidia.com>

isaaclab-review-bot

🤖 Isaac Lab Review Bot — Follow-up #3

Previous initial_joint_pos concern is now addressed ✅

1 new bug found: syntax error introduced in the latest push.

isaaclab-review-bot · 2026-04-04T00:32:38Z

...aclab_tasks/manager_based/manipulation/deploy/reach/config/rizon_4s/ros_inference_env_cfg.py

+            self.joint_action_scale,
+            self.joint_action_scale,
+        ]
+


🐛 Bug: Duplicate action_scale_joint_space — first assignment missing closing ]

The latest push appears to have appended the new block without removing the original. The first action_scale_joint_space assignment (starting around line 42) is missing its closing bracket ], followed immediately by a second full assignment. This will raise a SyntaxError on import.

Expected: a single assignment. Remove the first (broken) one and keep only:

Suggested change

self.action_scale_joint_space = [

self.joint_action_scale,

self.joint_action_scale,

self.joint_action_scale,

self.joint_action_scale,

self.joint_action_scale,

self.joint_action_scale,

self.joint_action_scale,

]

Or even better (DRY):

self.action_scale_joint_space = [self.joint_action_scale] * self.action_space

huidongc · 2026-04-04T10:53:37Z

docs/source/policy_deployment/04_reach/reach_policy.rst

+
+- `Isaac ROS Manipulation Documentation <https://nvidia-isaac-ros.github.io/reference_workflows/isaac_for_manipulation/index.html>`_
+- Gear Assembly Sim-to-Real Tutorial: :ref:`walkthrough_sim_to_real`
+- Sim-to-Real Policy Transfer for Whole Body Controllers: :ref:`sim2real`


the sim2real ref is not defined and causing a doc build error:

docs/source/policy_deployment/04_reach/reach_policy.rst:670: WARNING: undefined label: 'sim2real' [ref.ref]

Is the ref pointing to docs/source/policy_deployment/00_hover/hover_policy.rst?

Good catch! You're right — the :ref:sim2real`` label is undefined on develop and will cause a doc build error.

I traced it: the .. _sim2real: label is defined in docs/source/experimental-features/newton-physics-integration/sim-to-real.rst, but that file only exists on main, not on develop (the target branch). So the cross-reference is dangling.

It's not pointing to hover_policy.rst — that file only has an external URL reference (.. _README of Sim2Real deployment: ...), not an RST label.

Suggested fix: either (1) remove the sim2real bullet from the Further Resources section since the target page doesn't exist on develop yet, or (2) replace it with a direct external link to the relevant sim-to-real resource (e.g., the HOVER sim2real README or similar). Option 1 is probably cleanest for now — the line can be re-added once sim-to-real.rst lands on develop.

shauryadNv requested a review from ashwinvkNV February 3, 2026 22:36

shauryadNv requested review from Mayankm96, jtigue-bdai, kellyguo11 and pascal-roth as code owners February 3, 2026 22:36

github-actions bot added documentation Improvements or additions to documentation asset New asset feature or request labels Feb 3, 2026

greptile-apps bot reviewed Feb 3, 2026

View reviewed changes

shauryadNv force-pushed the shauryad/flexiv_reach branch from dce1e98 to 638ee9d Compare February 3, 2026 22:47

ashwinvkNV reviewed Feb 3, 2026

View reviewed changes

...aclab_tasks/manager_based/manipulation/deploy/reach/config/rizon_4s/agents/rsl_rl_ppo_cfg.py Outdated Show resolved Hide resolved

ashwinvkNV reviewed Feb 3, 2026

View reviewed changes

...lab_tasks/isaaclab_tasks/manager_based/manipulation/deploy/reach/config/rizon_4s/__init__.py Outdated Show resolved Hide resolved

shauryadNv changed the base branch from main to develop February 5, 2026 18:32

kellyguo11 reviewed Feb 7, 2026

View reviewed changes

AntoineRichard mentioned this pull request Feb 9, 2026

Fixes gear assembly environment for Isaac Sim 6.0 and XYZW quaternions #4559

Merged

shauryadNv force-pushed the shauryad/flexiv_reach branch from 75bfc29 to b9b145d Compare February 17, 2026 19:01

shauryadNv force-pushed the shauryad/flexiv_reach branch from 15acee2 to 27a4644 Compare March 30, 2026 22:02

isaaclab-review-bot bot reviewed Mar 30, 2026

View reviewed changes

isaaclab-review-bot bot reviewed Mar 31, 2026

View reviewed changes

shauryadNv force-pushed the shauryad/flexiv_reach branch from d2e67e2 to f8f6b6e Compare March 31, 2026 20:31

isaaclab-review-bot bot reviewed Mar 31, 2026

View reviewed changes

shauryadNv force-pushed the shauryad/flexiv_reach branch from 3045456 to cd8b319 Compare April 1, 2026 16:14

shauryadNv added 7 commits April 3, 2026 14:23

Adding support for flexiv rizon 4s robot with reach policy training.

bd101a3

Fixed rizon4s movement ranges.

34938c6

Fixed docs for reach policy.

5ef93a0

Updated update logs.

98c3b94

Fixed license dates in new files.

a0278cc

Minor fix.

78450a7

Minor fix.

6b223a9

shauryadNv added 10 commits April 3, 2026 14:23

Fixed Flexiv USD asset source and quaternion convention.

17c3652

Formatting fix.

f550ffe

Better params for Flexiv Rizon 4s setup.

be862a5

Added Flexiv robot configs to lazy_export .pyi stubs after rebase.

95a83be

Made-with: Cursor

Minor fix to flexiv reach env and added fig to reach doc.

db14674

Minor fix in fig ref.

f7bded2

Renamed Flexiv asset file.

81c45bc

Enabled resampling during replay for flexiv reach.

0ccfb6a

Doc cleanup and updates.

ed2d9cf

Updated docs structure.

c61e102

shauryadNv force-pushed the shauryad/flexiv_reach branch 2 times, most recently from 70abd2b to c61e102 Compare April 3, 2026 21:43

ashwinvkNV approved these changes Apr 3, 2026

View reviewed changes

docs/source/policy_deployment/04_reach/reach_policy.rst Outdated Show resolved Hide resolved

...aclab_tasks/manager_based/manipulation/deploy/reach/config/rizon_4s/ros_inference_env_cfg.py Show resolved Hide resolved

shauryadNv and others added 2 commits April 3, 2026 17:24

Update reach doc.

e26a01f

Co-authored-by: Ashwin Varghese Kuruttukulam <123109010+ashwinvkNV@users.noreply.github.com> Signed-off-by: shauryadNv <shauryad@nvidia.com>

Extract inital joint pos for ROS inference with Flexiv.

4bc2190

Co-authored-by: isaaclab-review-bot[bot] <270793704+isaaclab-review-bot[bot]@users.noreply.github.com> Signed-off-by: shauryadNv <shauryad@nvidia.com>

isaaclab-review-bot bot reviewed Apr 4, 2026

View reviewed changes

shauryadNv added 2 commits April 3, 2026 17:34

Minor fix, removed repeated code.

2d8ee4c

Formatting fix.

7e34e76

shauryadNv merged commit ecc0077 into isaac-sim:develop Apr 4, 2026
7 of 8 checks passed

huidongc reviewed Apr 4, 2026

View reviewed changes

		# Disable visualization for the goal pose because the commands are generated wrt to the base frame
		self.commands.ee_pose.debug_vis = True

	"joint7",
	self.action_scale_joint_space = [self.joint_action_scale] * self.action_space

	:figwidth: 100%
	.. figure:: ../../_static/policy_deployment/03_reach/reach_train.png

	- ``--video_interval 72000``: Records a video every 72000 environment steps (~every 100 training iterations).
	- ``--video_interval 72000``: Records a video every 72000 environment steps (~every 100 episodes).

Conversation

shauryadNv commented Feb 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Type of change

Checklist

Uh oh!

greptile-apps bot commented Feb 3, 2026

Greptile Overview

Greptile Summary

Confidence Score: 2/5

Important Files Changed

Sequence Diagram

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

greptile-apps bot Feb 3, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

AntoineRichard commented Feb 4, 2026

Uh oh!

kellyguo11 left a comment

Choose a reason for hiding this comment

Uh oh!

isaaclab-review-bot bot left a comment

Choose a reason for hiding this comment

🤖 Isaac Lab Review Bot

Summary

Design Assessment

Architecture Impact

Implementation Verdict

Test Coverage

CI Status

Findings

Uh oh!

Uh oh!

Uh oh!

isaaclab-review-bot bot Mar 30, 2026

Choose a reason for hiding this comment

⚠️ Suggestion: Use DRY pattern for action_scale_joint_space

Uh oh!

isaaclab-review-bot bot Mar 30, 2026

Choose a reason for hiding this comment

⚠️ Minor: Duplicate policy_action_space assignment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

isaaclab-review-bot bot left a comment

Choose a reason for hiding this comment

🤖 Isaac Lab Review Bot — Follow-up

Uh oh!

isaaclab-review-bot bot Mar 31, 2026

Choose a reason for hiding this comment

🐛 Bug: Image extension mismatch — broken figure

Uh oh!

isaaclab-review-bot bot left a comment

Choose a reason for hiding this comment

🤖 Isaac Lab Review Bot — Follow-up #2

Uh oh!

isaaclab-review-bot bot Mar 31, 2026

Choose a reason for hiding this comment

📝 Minor: Incorrect approximation in video interval comment

Uh oh!

Uh oh!

Uh oh!

isaaclab-review-bot bot left a comment

Choose a reason for hiding this comment

🤖 Isaac Lab Review Bot — Follow-up #3

Uh oh!

isaaclab-review-bot bot Apr 4, 2026

Choose a reason for hiding this comment

🐛 Bug: Duplicate action_scale_joint_space — first assignment missing closing ]

Uh oh!

Uh oh!

shauryadNv commented Feb 3, 2026 •

edited

Loading

⚠️ Suggestion: Use DRY pattern for `action_scale_joint_space`

⚠️ Minor: Duplicate `policy_action_space` assignment

🐛 Bug: Duplicate `action_scale_joint_space` — first assignment missing closing `]`