Dense startup time profiling #5072
Conversation
- Add error handling for YAML whitelist loading (OSError, YAMLError, type validation)
- Use None instead of 0.0 for missing Timer sub-timings with info log
- Warn when IsaacLab source directory is missing (empty prefix list)
- Warn on unmatched whitelist patterns to catch typos
- Use try/finally for cProfile disable to handle exceptions cleanly
- Fix orphaned docstring (convert to comment)
- Improve main() and module docstrings for accuracy
- Remove phase numbering from section comments (avoid confusion)
- Fix Timer comment wording (backends -> environment types)
- Fix top_n docstring (remove "per phase")
- Remove dead print_startup_summary function
The humanoid observations pattern needs a leading wildcard to match the full module path (isaaclab_tasks.manager_based.classic.humanoid...).
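A minimal sketch of why the leading wildcard matters, using Python's `fnmatch` (the label string below is illustrative, modeled on the module path mentioned above):

```python
from fnmatch import fnmatch

# Full dotted module path as it appears in profile labels
label = "isaaclab_tasks.manager_based.classic.humanoid.humanoid_env_cfg:observations"

# Without a leading wildcard, the pattern must match from the start of the string
print(fnmatch(label, "humanoid*"))   # False: the label starts with "isaaclab_tasks"
print(fnmatch(label, "*humanoid*"))  # True: the leading wildcard absorbs the module prefix
```

This is why a pattern intended to match a nested module needs `*` on both sides.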
Greptile Summary

This PR introduces a dense startup-time profiling benchmark.

Three logic-level issues were found.

Confidence Score: 2/5
Sequence Diagram

```mermaid
sequenceDiagram
    participant CLI as __main__
    participant LS as launch_simulation()
    participant M as main()
    participant GYM as gym.make()
    participant ENV as env
    participant BP as BaseIsaacLabBenchmark
    participant BE as Backend (SummaryMetrics etc.)
    CLI->>LS: app_launch_profile.enable() → with launch_simulation(env_cfg, args_cli)
    LS-->>CLI: context entered (app_launch_profile.disable())
    CLI->>M: main(env_cfg, app_launch_profile, app_launch_wall_ms)
    M->>GYM: env_creation_profile.enable() → gym.make(task, cfg)
    GYM-->>M: env
    M->>ENV: env.reset() [inner try/finally disables env_creation_profile]
    ENV-->>M: obs
    M->>ENV: env.step(actions) [first_step_profile try/finally]
    ENV-->>M: obs, reward, ...
    M->>BP: benchmark.add_measurement(phase, SingleMeasurement) ×N phases
    M->>BP: benchmark.update_manual_recorders()
    M->>BP: benchmark._finalize_impl()
    BP->>BE: backend.finalize(output_path, output_filename)
    BE-->>CLI: JSON + console summary printed
    M->>ENV: env.close() [outer finally]
```
Last reviewed commit: "Document reliance on..."
- Make --task required to fail early on missing argument
- Validate whitelist YAML values are list[str] per phase
- Warn on unknown phase names in whitelist config
- Wrap post-env-creation code in try/finally for env.close()
- Remove [UNMATCHED] prefix from placeholder labels to keep dashboard keys stable (matches docstring/RST contract)
- Move fnmatch import to function top, replace warnings.warn with print("[WARNING]") for consistency
- Remove stale "Launch Isaac Sim Simulator first" comment
- Use _ for unused lineno in pstats tuple unpacking
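The whitelist validation and unknown-phase warning described in the list above could look roughly like this sketch (the phase names in `KNOWN_PHASES`, the function name, and the config shape are assumptions for illustration, not the PR's actual code):

```python
# Hypothetical phase names; the benchmark's real phase set may differ.
KNOWN_PHASES = {"imports", "app_launch", "env_creation", "reset", "first_step"}

def validate_whitelist(config: dict) -> dict:
    """Validate that each phase maps to a list of glob-pattern strings."""
    validated = {}
    for phase, patterns in config.items():
        if phase not in KNOWN_PHASES:
            # Warn instead of failing, so a typo in one phase name
            # does not silently drop the rest of the config.
            print(f"[WARNING] Unknown phase '{phase}' in whitelist config; skipping.")
            continue
        if not isinstance(patterns, list) or not all(isinstance(p, str) for p in patterns):
            raise TypeError(f"Whitelist entry for phase '{phase}' must be a list of strings.")
        validated[phase] = patterns
    return validated
```

Raising on a malformed value while only warning on an unknown key matches the commit's split between hard validation and soft typo detection.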
@greptile
Move the try/finally guarding env.close() to wrap everything after gym.make() succeeds, so env.reset() failures also trigger cleanup.
@greptile
```python
{"name": "seed", "data": args_cli.seed},
{"name": "num_envs", "data": args_cli.num_envs},
{"name": "top_n", "data": args_cli.top_n},
{"name": "presets", "data": get_preset_string(hydra_args)},
```
env_creation_profile left enabled on BaseException
except Exception does not catch KeyboardInterrupt, SystemExit, or any other BaseException subclass. If gym.make() is interrupted by a signal or a hard abort, the profile object is left in the enabled state for the remainder of the process lifetime, which silently continues accumulating CPU-time into the stale profile.
The standard pattern is to use a finally clause (or a bare except) so the profile is always stopped:
```python
env_creation_profile.enable()
try:
    env = gym.make(args_cli.task, cfg=env_cfg)
except BaseException:
    env_creation_profile.disable()
    raise
```

Or, more idiomatically, fold the disable into a try/finally around the whole block, the same way first_step_profile is handled a few lines below.
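A runnable sketch of the try/finally variant, with `gym.make()` replaced by a hypothetical stand-in that simulates a Ctrl-C mid-creation:

```python
import cProfile

def make_env():
    # Hypothetical stand-in for gym.make(); simulates an interrupt during creation.
    raise KeyboardInterrupt

profile = cProfile.Profile()
interrupted = False
try:
    profile.enable()
    try:
        env = make_env()
    finally:
        profile.disable()  # runs even for KeyboardInterrupt/SystemExit
except KeyboardInterrupt:
    interrupted = True  # the profiler was stopped before the interrupt propagated
```

Because `finally` runs for every BaseException subclass, the profile can never be left accumulating CPU time after an abort.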
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
Signed-off-by: Antoine RICHARD <antoiner@nvidia.com>
except Exception misses KeyboardInterrupt and SystemExit, leaving the profiler enabled for the rest of the process lifetime.
Startup Profiling Benchmark — Code Review
Overall this is a solid, well-structured addition. The five-phase profiling architecture is clean, the whitelist/fnmatch system is genuinely useful for dashboard stability, and the error handling has clearly gotten multiple passes of improvement. A few issues below — one actual bug, a couple of design nits.
Verdict: Approve with nits. The seed bug (comment 1) should be fixed before merge; the rest are minor.
```python
# Override config with CLI args
env_cfg.scene.num_envs = args_cli.num_envs if args_cli.num_envs is not None else env_cfg.scene.num_envs
env_cfg.sim.device = args_cli.device if args_cli.device is not None else env_cfg.sim.device
env_cfg.seed = args_cli.seed
```
Bug: unconditional seed override clobbers config default.
```python
env_cfg.seed = args_cli.seed
```

When --seed is not passed, args_cli.seed is None. This unconditionally overwrites whatever env_cfg.seed was — either the task config's default or the conditional assignment at module-level line 166.
Line 166 already does the right thing:
```python
env_cfg.seed = args_cli.seed if args_cli.seed is not None else env_cfg.seed
```

This line should match that pattern, or just be removed since the module-level assignment already ran:

```python
# Either:
env_cfg.seed = args_cli.seed if args_cli.seed is not None else env_cfg.seed
# Or just delete this line — line 166 already handled it.
```
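The None-means-unset pattern behind this fix generalizes to any argparse-driven override. A self-contained sketch (the parser here is illustrative, not the script's actual argument setup):

```python
import argparse

# Illustrative parser; the real script defines its arguments elsewhere.
parser = argparse.ArgumentParser()
parser.add_argument("--seed", type=int, default=None)  # None signals "not passed"

config_default_seed = 42  # stand-in for the task config's default seed

# Run without --seed: the config default must survive.
args = parser.parse_args([])
seed = args.seed if args.seed is not None else config_default_seed

# Run with --seed 7: the CLI value wins.
args2 = parser.parse_args(["--seed", "7"])
seed2 = args2.seed if args2.seed is not None else config_default_seed
```

An unconditional `config_seed = args.seed` would silently set the seed to None in the first case, which is exactly the bug flagged above.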
```python
# -- Create the benchmark instance ------------------------------------------

env_cfg.seed = args_cli.seed if args_cli.seed is not None else env_cfg.seed
```
Nit: module-level seed assignment is redundant with main(). This line correctly does the conditional assignment, but then main() line 206 unconditionally overwrites it (see other comment). If you fix line 206 to be conditional too, consider removing one of the two assignments to avoid confusion about which one "wins".
Since env_cfg is passed into main() and main() also sets num_envs and device there, it'd be cleaner to do all config overrides in one place — inside main() — and remove this line.
```python
# Finalize benchmark output
benchmark.update_manual_recorders()
benchmark._finalize_impl()
```
Minor: calling private _finalize_impl() directly.
I know other benchmark scripts (benchmark_non_rl.py) do this too, so this is a pre-existing pattern rather than a problem you introduced. Still worth noting: if BaseIsaacLabBenchmark ever gains a public finalize() wrapper (which it should), this should switch to it.
```python
imports_profile.disable()

if torch.cuda.is_available() and torch.cuda.is_initialized():
```
Design note: cuda.is_initialized() check may miss lazy CUDA init.
At this point in the script, CUDA may not yet be initialized (torch was just imported but no tensor ops have run). If CUDA isn't initialized, is_initialized() returns False and the synchronize is skipped — which is correct, since there's nothing to sync. But it means the wall-clock imports_time_end doesn't account for any deferred GPU init that gets triggered later.
This is fine for a profiling script (you're measuring import time, not GPU init time), but worth a comment explaining the intent — especially since the same pattern at lines 225 and 245 does catch active GPU work.
```python
for func_key, (_, _, tottime, cumtime, callers) in stats.stats.items():
    filename, _, funcname = func_key
    if _is_isaaclab(filename):
        label = _make_label(filename, funcname)
```
Observation (not a bug): external functions can appear with abbreviated labels. When an external function (e.g., torch.nn.modules.linear:forward) is directly called by an IsaacLab function, _make_label truncates the path to site-packages-relative or last-3-components. This is fine for readability but means the label isn't a valid Python import path — it's a display-only string. Worth a brief docstring note on _make_label that labels are for display, not programmatic use.
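A sketch of the display-only truncation described here, following the "last 3 components" rule from the comment (the helper name and exact behavior are assumptions; the PR's `_make_label` may differ):

```python
def make_display_label(filename: str, funcname: str) -> str:
    """Shorten a file path to its last 3 components for display.

    The result is a human-readable label, NOT a valid Python import path:
    the module prefix above the last 3 components is discarded.
    """
    parts = filename.replace("\\", "/").split("/")
    stem = "/".join(parts[-3:])
    if stem.endswith(".py"):
        stem = stem[:-3]
    return f"{stem.replace('/', '.')}:{funcname}"

# A deep site-packages path collapses to a short, dotted display label.
label = make_display_label(
    "/usr/lib/python3.10/site-packages/torch/nn/modules/linear.py", "forward"
)
```

Trying to `import` such a label would fail, which is why a docstring note marking it display-only is worthwhile.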
```python
# Sample random actions
actions = (
    torch.rand(env.unwrapped.num_envs, env.unwrapped.single_action_space.shape[0], device=env.unwrapped.device)
```
Nit: single_action_space may not exist for all env types.
For DirectMARLEnv, the action space structure is different — single_action_space exists but might be a Dict space, not a Box, so .shape[0] would fail. The type signature of main() accepts DirectMARLEnvCfg, but this line assumes a flat action space.
Since this is a benchmarking script (not a general-purpose tool), this is acceptable if the docs/help text clarify it's for single-agent RL envs. But if MARL envs are intended to be supported, this needs a check.
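If such a check is desired, a guard along these lines could fail fast on non-flat spaces (the space object below is a plain stand-in, not a gymnasium class, and the function name is hypothetical):

```python
from types import SimpleNamespace

def flat_action_dim(action_space) -> int:
    """Return the flat action dimension, rejecting composite (Dict-style) spaces."""
    shape = getattr(action_space, "shape", None)
    if not shape:  # Dict/composite spaces expose no usable flat shape
        raise TypeError(
            "This benchmark assumes a flat (Box-like) action space; "
            f"got {type(action_space).__name__} without a usable .shape"
        )
    return shape[0]

# Usage with a stand-in for a flat Box space:
box_like = SimpleNamespace(shape=(6,))
dim = flat_action_dim(box_like)
```

The explicit TypeError turns a confusing attribute error deep inside `torch.rand` into a message naming the actual limitation.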
```python
if isinstance(measurement, StatisticalMeasurement):
    unit_str = f" {measurement.unit.strip()}" if (measurement.unit and measurement.unit.strip()) else ""
    value = f"{self._format_scalar(measurement.mean)}{unit_str}"
elif isinstance(measurement, SingleMeasurement):
```
Good addition. The catch-all rendering for unknown phases is clean and doesn't break existing behavior. One minor style nit: the unit_str construction is duplicated between the frametime_phase block above and this new block. Could extract a helper like _format_measurement_value(measurement) to DRY it up, but that's cosmetic.
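The suggested extraction could look roughly like this (the `Measurement` dataclass is a minimal stand-in for the benchmark's measurement types, and the helper name follows the comment's suggestion):

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class Measurement:
    """Minimal stand-in for SingleMeasurement/StatisticalMeasurement."""
    value: float
    unit: Optional[str] = None

def format_measurement_value(m: Measurement) -> str:
    # Append the unit only when it is present and non-blank,
    # mirroring the duplicated unit_str construction in both branches.
    unit_str = f" {m.unit.strip()}" if (m.unit and m.unit.strip()) else ""
    return f"{m.value:.3f}{unit_str}"
```

Both the frametime_phase block and the new catch-all branch could then call the one helper, so a future change to unit formatting happens in one place.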
@AntoineRichard nice feature
Description
Allows for dense startup time profiling.
Type of change
Checklist

- [ ] I have run the `pre-commit` checks with `./isaaclab.sh --format`
- [ ] I have updated the changelog and the corresponding version in the extension's `config/extension.toml` file
- [ ] I have added my name to `CONTRIBUTORS.md` or my name already exists there