Conversation
**Important:** Review skipped — auto reviews are disabled on this repository. Please check the settings in the CodeRabbit UI or the ⚙️ Run configuration.

Configuration used: Path: `.coderabbit.yaml` · Review profile: CHILL · Plan: Pro
📝 Walkthrough

This PR refactors the ESM-2 TE model hierarchy from

Changes
Sequence Diagram(s)

```mermaid
sequenceDiagram
    participant User
    participant Export as export.py
    participant Convert as convert.py
    participant HFModel as HF ESM2 Model
    participant TEModel as TE ESM2 Model
    participant StateMgmt as State Management
    User->>Export: export_hf_checkpoint(tag)
    Export->>HFModel: Load HF checkpoint
    HFModel-->>Export: model + config
    Export->>Convert: convert_esm_hf_to_te(model_hf)
    Convert->>TEModel: Initialize TE model (empty)
    Convert->>StateMgmt: extract state dicts
    StateMgmt->>StateMgmt: Apply QKV packing transforms
    StateMgmt->>StateMgmt: Apply embedding padding transforms
    StateMgmt->>StateMgmt: Apply FC/bias transforms
    StateMgmt->>TEModel: Load transformed state
    TEModel-->>Convert: Converted model
    Convert-->>Export: TE model ready
    Export->>Export: Save checkpoint + config
    Export->>Export: Smoke test load in bf16
    Export-->>User: Exported checkpoint path
```

```mermaid
sequenceDiagram
    participant Client as vLLM Client
    participant VLLM as vLLM Engine
    participant TEModel as TE ESM2 Model
    participant Embeddings as Embeddings Layer
    participant Encoder as TE Encoder
    Client->>VLLM: Create LLM instance (pooling mode)
    VLLM->>TEModel: Load checkpoint
    VLLM->>TEModel: Initialize model
    Client->>VLLM: embed(sequences)
    VLLM->>TEModel: forward(input_ids, attention_mask)
    TEModel->>Embeddings: Get token embeddings
    Embeddings-->>TEModel: Embedded tokens
    TEModel->>Encoder: Apply TE encoder layers
    Encoder-->>TEModel: Encoded hidden states
    TEModel->>TEModel: Extract last token + L2 normalize
    TEModel-->>VLLM: Pooled embeddings
    VLLM-->>Client: Normalized embeddings
```
Estimated code review effort: 🎯 4 (Complex) | ⏱️ ~75 minutes

Possibly related PRs
Suggested reviewers
Poem
🚥 Pre-merge checks: ✅ Passed checks (3 passed)

✏️ Tip: You can configure your own custom pre-merge checks in the settings.

✨ Finishing Touches: 🧪 Generate unit tests (beta)
📝 Coding Plan
💡 Tip: CodeRabbit can scan for known vulnerabilities in your dependencies using OSV Scanner. OSV Scanner will automatically detect and report security vulnerabilities in your project's dependencies. No additional configuration is required.
@gagank1: Are you working on a readme?

---
/ok to test |
@gagank1, there was an error processing your request: See the following link for more information: https://docs.gha-runners.nvidia.com/cpr/e/1/

---
/ok to test |
@gagank1, there was an error processing your request: See the following link for more information: https://docs.gha-runners.nvidia.com/cpr/e/1/

---
/ok to test c34c09b

---
@coderabbitai review
✅ Actions performed: Review triggered.

---
Actionable comments posted: 5
♻️ Duplicate comments (3)
bionemo-recipes/recipes/esm2_peft_te/example_8m_checkpoint/esm_nv.py (1)
405-407: Same `_tied_weights_keys` dict type concern. See comment on `modeling_esm_te.py`.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@bionemo-recipes/recipes/esm2_peft_te/example_8m_checkpoint/esm_nv.py` around lines 405-407, the class variable `_tied_weights_keys` uses the modern built-in generic `dict[...]` typing, which may be incompatible with the rest of the codebase; change its annotation to use `typing.Dict[str, str]` (and ensure `Dict` is imported) or use `typing.Mapping` if immutability is desired, mirroring the fix applied in `modeling_esm_te.py` so the declaration becomes `ClassVar[Dict[str, str]]` with the same key/value entries retained.

bionemo-recipes/recipes/esm2_native_te/example_8m_checkpoint/esm_nv.py (1)
405-407: Same `_tied_weights_keys` dict type concern as in `modeling_esm_te.py`. See comment on the canonical file — HF expects `list[str]`, not `dict[str, str]`.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@bionemo-recipes/recipes/esm2_native_te/example_8m_checkpoint/esm_nv.py` around lines 405-407, the `_tied_weights_keys` typed as `ClassVar[dict[str, str]]` should be changed to `ClassVar[list[str]]` to match HF expectations (same fix as in `modeling_esm_te.py`); replace the dict literal with a list of the relevant parameter names (e.g. `["lm_head.decoder.weight", "model.embeddings.word_embeddings.weight"]`) and update any usages that assume dict semantics to use the list order or explicit pairing where needed.

bionemo-recipes/recipes/esm2_accelerate_te/example_8m_checkpoint/esm_nv.py (1)
405-407: Same `_tied_weights_keys` dict type concern. See comment on `modeling_esm_te.py`.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@bionemo-recipes/recipes/esm2_accelerate_te/example_8m_checkpoint/esm_nv.py` around lines 405-407, `_tied_weights_keys` is annotated as a mutable dict ClassVar, which repeats the same typing concern as in `modeling_esm_te.py`; change the annotation to an immutable mapping type (e.g., `ClassVar[Mapping[str, str]]` from `typing`) and, to avoid accidental mutation, assign a read-only view (e.g., `types.MappingProxyType({"lm_head.decoder.weight": "model.embeddings.word_embeddings.weight"})`); update the import list to include `typing.Mapping` and `types` if not present, and mirror the same pattern used/fixed in `modeling_esm_te.py`.
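The three duplicate comments above propose slightly different annotations for the same attribute. A minimal sketch of the two main options — the class names here are hypothetical, while the tied-weight key strings are the ones quoted in the comments:

```python
# Illustrative sketch only: class names are hypothetical; the key/value strings
# are the ones quoted in the review comments above.
from typing import ClassVar, Dict, List


class EsmTEWithDictKeys:
    # Option A (esm2_peft_te / esm2_accelerate_te suggestion): keep the mapping
    # semantics but use typing.Dict instead of the built-in generic dict[...].
    _tied_weights_keys: ClassVar[Dict[str, str]] = {
        "lm_head.decoder.weight": "model.embeddings.word_embeddings.weight",
    }


class EsmTEWithListKeys:
    # Option B (esm2_native_te suggestion): HF-style list[str] of parameter
    # names whose storage is tied together.
    _tied_weights_keys: ClassVar[List[str]] = [
        "lm_head.decoder.weight",
        "model.embeddings.word_embeddings.weight",
    ]
```

Which option is correct depends on the transformers version in use; the review itself is split between the two.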
🧹 Nitpick comments (4)
bionemo-recipes/vllm/launch.sh (1)
50-50: `exec $DOCKER_CMD` is unquoted — word splitting will break paths with spaces.

If `PROJECT_ROOT` contains spaces (e.g., `/home/user/my projects/...`), the `-v` argument will be incorrectly split into multiple tokens. Use a Bash array to avoid this:

🔧 Proposed fix (array-based approach)

Replace the string-based `DOCKER_CMD` with an array throughout the script:

```diff
-DOCKER_CMD="docker run -itd ..."
+DOCKER_CMD=("docker" "run" "-itd" "--gpus" "all" "--network" "host" "--ipc=host" "-e" "HF_TOKEN" "--rm" "--name" "${CONTAINER}_dev")

 if [ "$MOUNT_DIR" = true ]; then
     PROJECT_ROOT="$(cd "$(dirname "${BASH_SOURCE[0]}")/../.." && pwd)"
-    DOCKER_CMD="$DOCKER_CMD -v ${PROJECT_ROOT}:/workspace/bionemo-framework"
+    DOCKER_CMD+=("-v" "${PROJECT_ROOT}:/workspace/bionemo-framework")
 fi

-DOCKER_CMD="$DOCKER_CMD $CONTAINER /bin/bash"
+DOCKER_CMD+=("$CONTAINER" "/bin/bash")

-exec $DOCKER_CMD
+exec "${DOCKER_CMD[@]}"
```

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@bionemo-recipes/vllm/launch.sh` at line 50, the `exec $DOCKER_CMD` call uses an unquoted string, which allows word splitting and breaks mount paths with spaces (e.g., `PROJECT_ROOT` used in `-v`). Convert the string `DOCKER_CMD` into a Bash array (e.g., `DOCKER_CMD=(docker run ...)`) and update all places that build/append to `DOCKER_CMD` so they push elements into the array, then replace `exec $DOCKER_CMD` with `exec "${DOCKER_CMD[@]}"` so each argument (including the `-v` PROJECT_ROOT value) is preserved; update any helper code that concatenates `DOCKER_CMD` to use array operations instead.

bionemo-recipes/vllm/Dockerfile (2)
2-3: Base image is hosted on an internal NVIDIA GitLab registry — not pullable outside NVIDIA.

`gitlab-master.nvidia.com:5005/dl/dgx/vllm:main-py3.43005406-devel` requires internal network/credential access. The commented-out `nvcr.io/nvidia/vllm:26.01-py3` alternative on line 1 is the publicly accessible equivalent. Once an NGC release with vLLM ≥ 0.14 becomes available, switching to the public image will make this recipe usable by external contributors without additional setup.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@bionemo-recipes/vllm/Dockerfile` around lines 2-3, the Dockerfile's FROM line currently uses an internal image tag `gitlab-master.nvidia.com:5005/dl/dgx/vllm:main-py3.43005406-devel`, which is not pullable externally; replace that base image with the public equivalent `nvcr.io/nvidia/vllm:26.01-py3` (or parametrize the base via a build ARG) in the FROM instruction so external contributors can build without internal credentials, and retain a short comment noting the vLLM >= 0.14 requirement and to switch back when an official public image with the needed version is available.
30-30: Pin the `transformer_engine` version for reproducible builds.

`pip install --no-build-isolation transformer_engine[pytorch]` with no version specifier will install whichever version is latest at build time. TE releases frequently and has had breaking API changes between major versions (e.g., 1.x → 2.x). A silent version bump can break the integration without any change to this file.

🔧 Proposed fix

```diff
-RUN pip install --no-build-isolation transformer_engine[pytorch]
+RUN pip install --no-build-isolation "transformer_engine[pytorch]==<verified_version>"
```

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@bionemo-recipes/vllm/Dockerfile` at line 30, the Dockerfile currently installs `transformer_engine` without a version pin; change that RUN line to install a specific, tested TE release by including an exact version specifier (for example: `RUN pip install --no-build-isolation transformer_engine==<MAJOR.MINOR.PATCH>[pytorch]`) so builds are reproducible and won't break on upstream major/minor bumps; optionally add a short comment noting the chosen compatible version.

bionemo-recipes/vllm/test_esm2_golden_values.py (1)
46-63: `sys.path.insert` + `os.chdir` is fragile for test infrastructure.

`sys.path.insert(0, ...)` at module level (line 46) and `os.chdir` inside `fresh_export` make this script sensitive to working directory and import order. This is acceptable for a standalone validation script run manually inside a container, but consider adding a note that this is not designed to run as part of a pytest suite.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@bionemo-recipes/vllm/test_esm2_golden_values.py` around lines 46 - 63, The test module mutates import paths and working directory (sys.path.insert(0, ...), os.chdir(...) inside fresh_export) which is fragile for pytest; update the file to document this by adding a clear module-level comment or docstring near sys.path.insert and a brief note on fresh_export explaining it intentionally changes cwd for export_hf_checkpoint and that the script is not intended to be run under pytest/parallel test runners (referencing sys.path.insert, ESM2_MODEL_DIR, and fresh_export by name), or alternatively guard execution with a main-check so pytest won't import/run it implicitly.
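One low-risk way to make the `os.chdir` pattern discussed above less fragile is a context manager that always restores the previous working directory. This is a sketch under assumptions: the real `fresh_export` calls `export_hf_checkpoint`, which is replaced here by a marker-file write, and the export directory is illustrative.

```python
# Illustrative sketch; fresh_export here is a stand-in for the real function,
# with a marker file standing in for the actual checkpoint export.
import contextlib
import os


@contextlib.contextmanager
def pushd(path):
    """Temporarily change the working directory, always restoring it."""
    prev = os.getcwd()
    os.chdir(path)
    try:
        yield
    finally:
        os.chdir(prev)


def fresh_export(export_dir="/tmp/esm2_export_demo"):
    os.makedirs(export_dir, exist_ok=True)
    with pushd(export_dir):
        # export_hf_checkpoint(...) would run here, writing into the cwd
        open(os.path.join(os.getcwd(), "exported.txt"), "w").close()
    return export_dir


if __name__ == "__main__":  # guard so pytest does not execute this on import
    before = os.getcwd()
    fresh_export()
    assert os.getcwd() == before
```

The `__main__` guard also addresses the reviewer's alternative suggestion of preventing pytest from running the script implicitly.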
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@bionemo-recipes/vllm/Dockerfile`:
- Around line 1-36: Add a non-root user and switch to it in the Dockerfile so
the container does not run as root; create a user/group (e.g., bionemo), chown
the application directory (/workspace/bionemo) and any cache/venv paths to that
user, and add a final USER bionemo line. Locate the Dockerfile sections around
WORKDIR /workspace/bionemo and COPY . . and insert user creation
(useradd/addgroup or groupadd) and chown before switching context, then add USER
bionemo at the end so subsequent runtime processes run unprivileged.
In `@bionemo-recipes/vllm/launch.sh`:
- Around line 36-40: The script hardcodes "--name vllm_dev" in DOCKER_CMD which
prevents multiple instances and misrepresents the positional $CONTAINER
argument; update the usage comment to clarify that the positional argument is
the image name (or image_name) and change the DOCKER_CMD assignments in the
HEADLESS branch to use a container name derived from $CONTAINER (for example
--name "$CONTAINER" or --name "${CONTAINER}_dev") instead of "vllm_dev" so the
image argument is also used as the container name and avoids name collisions.
In `@bionemo-recipes/vllm/README.md`:
- Around line 21-23: Change the fenced code block that currently uses the wrong
language tag; locate the block containing the shell command "python
test_esm2_golden_values.py" in the README and replace the opening fence language
identifier from ```python to ```bash so the command is treated as a shell
snippet and highlighted correctly.
- Line 17: The README's "or use `launch.sh`" is misleading because `launch.sh
--mount_dir` mounts the repo to `/workspace/bionemo-framework` while the manual
`docker run` mounts to `/workspace/bionemo`, causing `python
test_esm2_golden_values.py` to fail when run from WORKDIR; fix by either (A)
update `launch.sh` to mount the project root into `/workspace/bionemo` instead
of `/workspace/bionemo-framework` (adjust mount target and any downstream path
references in launch.sh), or (B) update README.md to explicitly document that
`launch.sh --mount_dir` mounts at `/workspace/bionemo-framework` and instruct
users to cd into the correct subdirectory (where `test_esm2_golden_values.py`
lives) before running the test; mention the affected files `launch.sh`,
`README.md`, and `test_esm2_golden_values.py` so reviewers can locate the
changes.
In `@bionemo-recipes/vllm/test_esm2_golden_values.py`:
- Around line 147-210: The script currently only prints comparisons and never
fails; add assertions that enforce the golden-value tolerances using RTOL and
ATOL: for each pair in pairs (refer to the pairs list and variables a, b),
assert np.allclose(a, b, rtol=RTOL, atol=ATOL) (or equivalently assert
(np.abs(a-b) <= ATOL + RTOL * np.abs(b)).all()) and fail the test if not, and
also assert cosine_sim(a, b) exceeds a sensible threshold or that exact is True
when ATOL/RTOL are zero; add per-sequence assertions inside the per-sequence
loop to ensure each sequence max-diff <= ATOL + RTOL * max(|b_i|) so the test
fails on unacceptable drift.
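The assertions described above could look like the following sketch. `RTOL`, `ATOL`, and `cosine_sim` mirror names from the test script, but their values, the cosine threshold, and the demo arrays are illustrative, not taken from the repository.

```python
# Sketch of the requested golden-value assertions; tolerances and the demo
# arrays are illustrative.
import numpy as np

RTOL, ATOL = 0.0, 1e-2


def cosine_sim(a, b):
    a = a.ravel().astype(np.float64)
    b = b.ravel().astype(np.float64)
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))


def check_pair(name, a, b, min_cos=0.999):
    # global tolerance check, matching np.allclose semantics
    assert np.allclose(a, b, rtol=RTOL, atol=ATOL), f"{name}: max |diff| too large"
    assert cosine_sim(a, b) > min_cos, f"{name}: cosine similarity too low"
    # per-sequence drift check, as suggested for the per-sequence loop
    for i, (ai, bi) in enumerate(zip(a, b)):
        max_diff = np.abs(ai - bi).max()
        assert max_diff <= ATOL + RTOL * np.abs(bi).max(), f"{name} seq {i}: drift {max_diff}"


emb_a = np.random.default_rng(0).normal(size=(4, 8))
emb_b = emb_a + 1e-4  # within tolerance
check_pair("demo", emb_a, emb_b)
```

With assertions like these, an out-of-tolerance pair raises `AssertionError` instead of merely printing a row in the summary table.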
---
Duplicate comments:
In `@bionemo-recipes/recipes/esm2_accelerate_te/example_8m_checkpoint/esm_nv.py`:
- Around line 405-407: _tied_weights_keys is annotated as a mutable dict
ClassVar which repeats the same typing concern as in modeling_esm_te.py; change
the annotation to an immutable mapping type (e.g., ClassVar[Mapping[str, str]]
from typing) and, to avoid accidental mutation, assign a read-only view (e.g.,
types.MappingProxyType({"lm_head.decoder.weight":
"model.embeddings.word_embeddings.weight"})); update the import list to include
typing.Mapping and types if not present and mirror the same pattern used/fixed
in modeling_esm_te.py.
In `@bionemo-recipes/recipes/esm2_native_te/example_8m_checkpoint/esm_nv.py`:
- Around line 405-407: The _tied_weights_keys typed as ClassVar[dict[str, str]]
should be changed to ClassVar[list[str]] to match HF expectations (same fix as
in modeling_esm_te.py); replace the dict literal with a list of the relevant
parameter names (e.g. ["lm_head.decoder.weight",
"model.embeddings.word_embeddings.weight"]) and update any usages that assume
dict semantics to use the list order or explicit pairing where needed.
In `@bionemo-recipes/recipes/esm2_peft_te/example_8m_checkpoint/esm_nv.py`:
- Around line 405-407: The class variable _tied_weights_keys uses the modern
built-in generic dict[...] typing which may be incompatible with the rest of the
codebase; change its annotation to use typing.Dict[str, str] (and ensure Dict is
imported) or use typing.Mapping if immutability is desired, mirroring the fix
applied in modeling_esm_te.py so the declaration becomes ClassVar[Dict[str,
str]] with the same key/value entries retained.
---
Nitpick comments:
In `@bionemo-recipes/vllm/Dockerfile`:
- Around line 2-3: The Dockerfile's FROM line currently uses an internal image
tag "gitlab-master.nvidia.com:5005/dl/dgx/vllm:main-py3.43005406-devel" which is
not pullable externally; replace that base image with the public equivalent
"nvcr.io/nvidia/vllm:26.01-py3" (or parametrize the base via a build ARG) in the
FROM instruction so external contributors can build without internal
credentials, and retain a short comment noting the vLLM>=0.14 requirement and to
switch back when an official public image with the needed version is available.
- Line 30: The Dockerfile currently installs transformer_engine without a
version pin (the RUN pip install --no-build-isolation
transformer_engine[pytorch] line); change that to install a specific, tested TE
release by updating that RUN to include an exact version specifier (for example:
RUN pip install --no-build-isolation
transformer_engine==<MAJOR.MINOR.PATCH>[pytorch]) so builds are reproducible and
won’t break on upstream major/minor bumps; optionally add a short comment noting
the chosen compatible version.
In `@bionemo-recipes/vllm/launch.sh`:
- Line 50: The exec $DOCKER_CMD call uses an unquoted string which allows
word-splitting and breaks mount paths with spaces (e.g., PROJECT_ROOT used in
-v). Convert the string DOCKER_CMD into a Bash array (e.g., DOCKER_CMD=(docker
run ...)) and update all places that build/append to DOCKER_CMD so they push
elements into the array, then replace exec $DOCKER_CMD with exec
"${DOCKER_CMD[@]}" so each argument (including the -v PROJECT_ROOT value) is
preserved; update any helper code that concatenates DOCKER_CMD to use array
operations instead.
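The word-splitting failure mode described above is easy to reproduce outside Docker. This standalone sketch (the path is illustrative, not taken from `launch.sh`) contrasts the string and array forms:

```shell
#!/usr/bin/env bash
# Standalone sketch: contrast string vs array command building when a mount
# path contains a space (path here is illustrative, not from launch.sh).
set -u

PROJECT_ROOT="/tmp/my project"

# String form: after expansion the whole string is word-split, so the single
# -v argument becomes three tokens (-v, /tmp/my, project:/workspace/...).
CMD_STRING="-v ${PROJECT_ROOT}:/workspace/bionemo-framework"
set -- $CMD_STRING
echo "string form: $# tokens"

# Array form: each element stays exactly one argument
# (-v and '/tmp/my project:/workspace/...').
CMD_ARRAY=(-v "${PROJECT_ROOT}:/workspace/bionemo-framework")
set -- "${CMD_ARRAY[@]}"
echo "array form: $# tokens"
```

Passed to `docker run`, the string form would hand Docker a truncated mount source and a stray positional argument, while the array form preserves the intended `-v` pair.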
In `@bionemo-recipes/vllm/test_esm2_golden_values.py`:
- Around line 46-63: The test module mutates import paths and working directory
(sys.path.insert(0, ...), os.chdir(...) inside fresh_export) which is fragile
for pytest; update the file to document this by adding a clear module-level
comment or docstring near sys.path.insert and a brief note on fresh_export
explaining it intentionally changes cwd for export_hf_checkpoint and that the
script is not intended to be run under pytest/parallel test runners (referencing
sys.path.insert, ESM2_MODEL_DIR, and fresh_export by name), or alternatively
guard execution with a main-check so pytest won't import/run it implicitly.
ℹ️ Review info
Configuration used: Path: .coderabbit.yaml
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (20)
- bionemo-recipes/models/esm2/convert.py
- bionemo-recipes/models/esm2/export.py
- bionemo-recipes/models/esm2/modeling_esm_te.py
- bionemo-recipes/models/esm2/tests/test_cp_bshd.py
- bionemo-recipes/models/esm2/tests/test_cp_thd.py
- bionemo-recipes/models/esm2/tests/test_distributed_fp8.py
- bionemo-recipes/models/esm2/tests/test_distributed_strategies.py
- bionemo-recipes/models/esm2/tests/test_modeling_esm_te.py
- bionemo-recipes/recipes/esm2_accelerate_te/example_8m_checkpoint/esm_nv.py
- bionemo-recipes/recipes/esm2_native_te/example_8m_checkpoint/esm_nv.py
- bionemo-recipes/recipes/esm2_native_te/tests/test_stop_and_go.py
- bionemo-recipes/recipes/esm2_native_te/train_ddp.py
- bionemo-recipes/recipes/esm2_native_te/train_ddp_cp.py
- bionemo-recipes/recipes/esm2_native_te/train_fsdp2.py
- bionemo-recipes/recipes/esm2_native_te/train_fsdp2_cp.py
- bionemo-recipes/recipes/esm2_peft_te/example_8m_checkpoint/esm_nv.py
- bionemo-recipes/vllm/Dockerfile
- bionemo-recipes/vllm/README.md
- bionemo-recipes/vllm/launch.sh
- bionemo-recipes/vllm/test_esm2_golden_values.py
bionemo-recipes/vllm/Dockerfile
Outdated
```dockerfile
# FROM nvcr.io/nvidia/vllm:26.01-py3
FROM gitlab-master.nvidia.com:5005/dl/dgx/vllm:main-py3.43005406-devel
# using this because we need vllm >= 0.14 to work with Transformers v5. no released nvidia version with this yet.

# The vLLM image has CUDA 13.1 runtime and nvcc, but missing dev headers (cusparse.h, nvtx, etc.)
# Install cuda-keyring to add NVIDIA's apt repo, then install the dev headers for transformer_engine
RUN apt-get update && apt-get install -y --no-install-recommends wget && \
    wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2404/x86_64/cuda-keyring_1.1-1_all.deb && \
    dpkg -i cuda-keyring_1.1-1_all.deb && \
    rm cuda-keyring_1.1-1_all.deb && \
    apt-get update && apt-get install -y --no-install-recommends \
    cuda-nvtx-13-1 \
    cuda-cupti-dev-13-1 \
    cuda-nvml-dev-13-1 \
    libcusparse-dev-13-1 \
    libcusolver-dev-13-1 \
    libcufft-dev-13-1 \
    libnvjitlink-dev-13-1 \
    libnvjpeg-dev-13-1 \
    libcublasmp0-dev-cuda-13 \
    libcudnn9-cuda-13 \
    && rm -rf /var/lib/apt/lists/*

# Install remaining dependencies
RUN --mount=type=cache,target=/root/.cache/pip \
    --mount=type=bind,source=requirements.txt,target=/requirements.txt \
    pip install -r /requirements.txt

# Install transformer_engine from source (force build for CUDA 13.1, not pre-built cu12 wheel)
RUN pip install --no-build-isolation transformer_engine[pytorch]

RUN pip install transformers[torch]==5.0.0

WORKDIR /workspace/bionemo
COPY . .
```
Container runs as root — add a USER directive.
No USER command is present, so all processes run as root inside the container. This is flagged by Trivy (DS-0002) and violates the principle of least privilege.
🔒 Proposed fix

```diff
 WORKDIR /workspace/bionemo
 COPY . .
+
+RUN useradd -m -u 1000 appuser && chown -R appuser /workspace/bionemo
+USER appuser
```

📝 Committable suggestion
‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.
```dockerfile
# FROM nvcr.io/nvidia/vllm:26.01-py3
FROM gitlab-master.nvidia.com:5005/dl/dgx/vllm:main-py3.43005406-devel
# using this because we need vllm >= 0.14 to work with Transformers v5. no released nvidia version with this yet.

# The vLLM image has CUDA 13.1 runtime and nvcc, but missing dev headers (cusparse.h, nvtx, etc.)
# Install cuda-keyring to add NVIDIA's apt repo, then install the dev headers for transformer_engine
RUN apt-get update && apt-get install -y --no-install-recommends wget && \
    wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2404/x86_64/cuda-keyring_1.1-1_all.deb && \
    dpkg -i cuda-keyring_1.1-1_all.deb && \
    rm cuda-keyring_1.1-1_all.deb && \
    apt-get update && apt-get install -y --no-install-recommends \
    cuda-nvtx-13-1 \
    cuda-cupti-dev-13-1 \
    cuda-nvml-dev-13-1 \
    libcusparse-dev-13-1 \
    libcusolver-dev-13-1 \
    libcufft-dev-13-1 \
    libnvjitlink-dev-13-1 \
    libnvjpeg-dev-13-1 \
    libcublasmp0-dev-cuda-13 \
    libcudnn9-cuda-13 \
    && rm -rf /var/lib/apt/lists/*

# Install remaining dependencies
RUN --mount=type=cache,target=/root/.cache/pip \
    --mount=type=bind,source=requirements.txt,target=/requirements.txt \
    pip install -r /requirements.txt

# Install transformer_engine from source (force build for CUDA 13.1, not pre-built cu12 wheel)
RUN pip install --no-build-isolation transformer_engine[pytorch]

RUN pip install transformers[torch]==5.0.0

WORKDIR /workspace/bionemo
COPY . .

RUN useradd -m -u 1000 appuser && chown -R appuser /workspace/bionemo
USER appuser
```
🧰 Tools
🪛 Trivy (0.69.1)
[error] 1-1: Image user should not be 'root'
Specify at least 1 USER command in Dockerfile with non-root user as argument
Rule: DS-0002
(IaC/Dockerfile)
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@bionemo-recipes/vllm/Dockerfile` around lines 1 - 36, Add a non-root user and
switch to it in the Dockerfile so the container does not run as root; create a
user/group (e.g., bionemo), chown the application directory (/workspace/bionemo)
and any cache/venv paths to that user, and add a final USER bionemo line. Locate
the Dockerfile sections around WORKDIR /workspace/bionemo and COPY . . and
insert user creation (useradd/addgroup or groupadd) and chown before switching
context, then add USER bionemo at the end so subsequent runtime processes run
unprivileged.
bionemo-recipes/vllm/launch.sh
Outdated
```bash
if [ "$HEADLESS" = true ]; then
    DOCKER_CMD="docker run -itd --gpus all --network host --ipc=host -e HF_TOKEN --rm --name vllm_dev"
else
    DOCKER_CMD="docker run -it --gpus all --network host --ipc=host -e HF_TOKEN --rm --name vllm_dev"
fi
```
Hardcoded --name vllm_dev prevents running multiple containers and shadows the $CONTAINER argument's meaning.
The script's positional argument (`$CONTAINER`) is the Docker image name, but the running container name is always `vllm_dev`. This has two problems:

- Starting a second instance (e.g., a second headless container) will fail with `docker: Error response from daemon: Conflict. The container name "/vllm_dev" is already in use.`
- The usage comment says `<container_name>`, but it's really `<image_name>`, which is confusing.
Consider using $CONTAINER as both the image name and the container name (or derive the container name from it):
🔧 Proposed fix

```diff
-DOCKER_CMD="docker run -itd --gpus all --network host --ipc=host -e HF_TOKEN --rm --name vllm_dev"
+DOCKER_CMD="docker run -itd --gpus all --network host --ipc=host -e HF_TOKEN --rm --name ${CONTAINER}_dev"
-DOCKER_CMD="docker run -it --gpus all --network host --ipc=host -e HF_TOKEN --rm --name vllm_dev"
+DOCKER_CMD="docker run -it --gpus all --network host --ipc=host -e HF_TOKEN --rm --name ${CONTAINER}_dev"
```

Also update the usage comment on line 4:

```diff
-# Usage: ./launch.sh <container_name> [--mount_dir] [--headless]
+# Usage: ./launch.sh <image_name> [--mount_dir] [--headless]
```

📝 Committable suggestion
‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.
```bash
if [ "$HEADLESS" = true ]; then
    DOCKER_CMD="docker run -itd --gpus all --network host --ipc=host -e HF_TOKEN --rm --name ${CONTAINER}_dev"
else
    DOCKER_CMD="docker run -it --gpus all --network host --ipc=host -e HF_TOKEN --rm --name ${CONTAINER}_dev"
fi
```
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@bionemo-recipes/vllm/launch.sh` around lines 36 - 40, The script hardcodes
"--name vllm_dev" in DOCKER_CMD which prevents multiple instances and
misrepresents the positional $CONTAINER argument; update the usage comment to
clarify that the positional argument is the image name (or image_name) and
change the DOCKER_CMD assignments in the HEADLESS branch to use a container name
derived from $CONTAINER (for example --name "$CONTAINER" or --name
"${CONTAINER}_dev") instead of "vllm_dev" so the image argument is also used as
the container name and avoids name collisions.
bionemo-recipes/vllm/README.md
Outdated
```bash
docker run -it --gpus all --network host --ipc=host -e HF_TOKEN --rm -v ${PWD}:/workspace/bionemo vllm /bin/bash
```

or use `launch.sh`.
launch.sh mounts to a different container path than the manual docker run command — the README's test instruction will break after using launch.sh --mount_dir.
The manual command (line 14) mounts ${PWD} → /workspace/bionemo, which is the container's WORKDIR, so python test_esm2_golden_values.py works directly. However, launch.sh --mount_dir mounts the project root (two levels up) to /workspace/bionemo-framework, leaving /workspace/bionemo populated only by the image's COPY . . layer. Running python test_esm2_golden_values.py from WORKDIR after using --mount_dir will fail unless the user navigates to the correct subdirectory. The "or use launch.sh" phrasing implies equivalence — either clarify the path difference or update launch.sh to mount to /workspace/bionemo instead of /workspace/bionemo-framework.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@bionemo-recipes/vllm/README.md` at line 17, The README's "or use `launch.sh`"
is misleading because `launch.sh --mount_dir` mounts the repo to
`/workspace/bionemo-framework` while the manual `docker run` mounts to
`/workspace/bionemo`, causing `python test_esm2_golden_values.py` to fail when
run from WORKDIR; fix by either (A) update `launch.sh` to mount the project root
into `/workspace/bionemo` instead of `/workspace/bionemo-framework` (adjust
mount target and any downstream path references in launch.sh), or (B) update
README.md to explicitly document that `launch.sh --mount_dir` mounts at
`/workspace/bionemo-framework` and instruct users to cd into the correct
subdirectory (where `test_esm2_golden_values.py` lives) before running the test;
mention the affected files `launch.sh`, `README.md`, and
`test_esm2_golden_values.py` so reviewers can locate the changes.
bionemo-recipes/vllm/README.md
Outdated
```python
python test_esm2_golden_values.py
```
Wrong code fence language identifier — should be bash, not python.
The python test_esm2_golden_values.py command is a shell invocation, not Python source code. Using ```python causes syntax highlighters to misparse it.
📝 Proposed fix

```diff
-```python
+```bash
 python test_esm2_golden_values.py
```

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@bionemo-recipes/vllm/README.md` around lines 21-23, change the fenced code block that currently uses the wrong language tag; locate the block containing the shell command "python test_esm2_golden_values.py" in the README and replace the opening fence language identifier from `python` to `bash` so the command is treated as a shell snippet and highlighted correctly.
```python
if __name__ == "__main__":
    print(f"GPUs: {torch.cuda.device_count()}")

    # Step 0: fresh export (facebook HF -> our TE format)
    print("\n[0/3] Exporting checkpoint ...")
    MODEL_ID = fresh_export()

    print(f"MODEL_ID: {MODEL_ID}")
    print(f"REFERENCE_MODEL_ID: {REFERENCE_MODEL_ID}")
    print(f"Sequences: {len(SEQUENCES)}")

    # 1) vLLM on exported model
    print("\n[1/3] vLLM inference (exported model) ...")
    emb_vllm = vllm_embed(MODEL_ID, SEQUENCES)

    # 2) HuggingFace on exported model
    print("\n[2/3] HuggingFace inference (exported model) ...")
    emb_hf_exported = hf_embed(MODEL_ID, SEQUENCES)

    # 3) HuggingFace on reference Hub model
    print("\n[3/3] HuggingFace inference (reference model) ...")
    emb_hf_reference = hf_embed(REFERENCE_MODEL_ID, SEQUENCES)

    # ---- Pairwise comparisons ----
    pairs = [
        ("vLLM (exported)", "HF (exported)", emb_vllm, emb_hf_exported),
        ("vLLM (exported)", "HF (reference)", emb_vllm, emb_hf_reference),
        ("HF (exported)", "HF (reference)", emb_hf_exported, emb_hf_reference),
    ]

    # ---- Summary table ----
    header = f"{'Pair':<35} {'max |diff|':>14} {'mean |diff|':>14} {'cos sim':>12} {'exact':>7}"
    sep = "-" * len(header)
    print(f"\n{sep}")
    print(header)
    print(sep)

    for name_a, name_b, a, b in pairs:
        diffs = np.abs(a.astype(np.float64) - b.astype(np.float64))
        label = f"{name_a} vs {name_b}"
        exact = np.array_equal(a, b)
        print(
            f"{label:<35} {diffs.max():>14.8e} {diffs.mean():>14.8e} "
            f"{cosine_sim(a, b):>12.10f} {'YES' if exact else 'NO':>7}"
        )

    print(sep)
    print(f"Tolerance: rtol={RTOL}, atol={ATOL} (0 = exact match required)")

    # Per-sequence breakdown
    short = {"vLLM (exported)": "vllm", "HF (exported)": "hf_exp", "HF (reference)": "hf_ref"}
    print("\nPer-sequence max |diff|:")
    for i in range(len(SEQUENCES)):
        row = f" seq {i}:"
        for name_a, name_b, a, b in pairs:
            d = float(np.abs(a[i].astype(np.float64) - b[i].astype(np.float64)).max())
            row += f" {short[name_a]}_vs_{short[name_b]}={d:.8e}"
        print(row)

    print(sep)

    # Cleanup
    if torch.distributed.is_initialized():
        torch.distributed.destroy_process_group()
```
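The summary table in the quoted script relies on a `cosine_sim` helper defined elsewhere in the file. A minimal sketch of what such a helper might look like, assuming 2-D float embedding arrays and a row-wise mean formulation (an assumption, not necessarily the recipe's actual implementation):

```python
import numpy as np


def cosine_sim(a: np.ndarray, b: np.ndarray) -> float:
    """Mean cosine similarity over rows, computed in float64 for stability."""
    a = a.astype(np.float64)
    b = b.astype(np.float64)
    num = (a * b).sum(axis=-1)
    denom = np.linalg.norm(a, axis=-1) * np.linalg.norm(b, axis=-1)
    return float((num / denom).mean())


embeddings = np.array([[1.0, 0.0], [0.0, 2.0]])
print(cosine_sim(embeddings, embeddings))  # 1.0
```

Because the metric is a mean over sequences, a single badly drifting sequence can be diluted; the per-sequence max-diff breakdown in the script compensates for that.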
Test script has no assertions — it will never fail.
This "test" prints a comparison table but never asserts that results are within tolerance. RTOL and ATOL (line 74) are defined but unused. A silent pass regardless of output drift defeats the purpose of a golden-value test.
Add assertions after the comparison loop, for example:
Suggested assertion block
```diff
+    all_passed = True
     for name_a, name_b, a, b in pairs:
         diffs = np.abs(a.astype(np.float64) - b.astype(np.float64))
         label = f"{name_a} vs {name_b}"
         exact = np.array_equal(a, b)
         print(
             f"{label:<35} {diffs.max():>14.8e} {diffs.mean():>14.8e} "
             f"{cosine_sim(a, b):>12.10f} {'YES' if exact else 'NO':>7}"
         )
+        if not np.allclose(a, b, rtol=RTOL, atol=ATOL):
+            all_passed = False
     print(sep)
-    print(f"Tolerance: rtol={RTOL}, atol={ATOL} (0 = exact match required)")
+    print(f"Tolerance: rtol={RTOL}, atol={ATOL}")
+
+    assert all_passed, "Golden value comparison failed — see table above for details."
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@bionemo-recipes/vllm/test_esm2_golden_values.py` around lines 147 - 210, The
script currently only prints comparisons and never fails; add assertions that
enforce the golden-value tolerances using RTOL and ATOL: for each pair in pairs
(refer to the pairs list and variables a, b), assert np.allclose(a, b,
rtol=RTOL, atol=ATOL) (or equivalently assert (np.abs(a-b) <= ATOL + RTOL *
np.abs(b)).all()) and fail the test if not, and also assert cosine_sim(a, b)
exceeds a sensible threshold or that exact is True when ATOL/RTOL are zero; add
per-sequence assertions inside the per-sequence loop to ensure each sequence
max-diff <= ATOL + RTOL * max(|b_i|) so the test fails on unacceptable drift.
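For reference, the `np.allclose` call in the suggested assertions applies the elementwise criterion `|a - b| <= atol + rtol * |b|`, the same inequality the per-sequence check spells out. A small self-contained illustration (the tolerance values here are illustrative, not the recipe's actual `RTOL`/`ATOL`):

```python
import numpy as np

RTOL, ATOL = 1e-5, 1e-8  # illustrative tolerances, not the recipe's actual values


def within_tolerance(a: np.ndarray, b: np.ndarray, rtol: float = RTOL, atol: float = ATOL) -> bool:
    # Same elementwise criterion np.allclose applies: |a - b| <= atol + rtol * |b|
    return bool((np.abs(a - b) <= atol + rtol * np.abs(b)).all())


a = np.array([1.0, 2.0, 3.0])
b = a + np.array([1e-9, 5e-6, 0.0])  # tiny drift, within tolerance
print(within_tolerance(a, b), np.allclose(a, b, rtol=RTOL, atol=ATOL))  # True True
```

Note the criterion is asymmetric in `a` and `b` (the relative term scales with `|b|`), so the comparison order matters slightly for borderline values.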
Caution: Failed to replace (edit) comment. This is likely due to insufficient permissions or the comment being deleted.

/ok to test c31c3e8

/ok to test ef612a3
wait, what? why doesn't this work by just adding vllm to requirements.txt?
To link it to an existing pytorch install (ie the one in our base image) you have to build it: https://docs.vllm.ai/en/latest/getting_started/installation/gpu/#use-an-existing-pytorch-installation
Btw that script is optional, the way i thought about it was someone might want to quickly build the image for training without waiting for vllm to install and then maybe decide to install it in which case they can use that script inside the container. or they can use the build args with the Dockerfile to install it on first build
/ok to test 60043cd

/ok to test c2d2351
Codecov Report

✅ All modified and coverable lines are covered by tests.

Additional details and impacted files

```
@@           Coverage Diff           @@
##             main    #1473   +/-   ##
=======================================
  Coverage   76.45%   76.45%
=======================================
  Files         102      102
  Lines        7952     7952
=======================================
  Hits         6080     6080
  Misses       1872     1872
```
can we flatten this directory? e.g. no esm2/ folder. we can put modeling_esm_te.py and modeling_llama_te.py right next to each other.
we could but i have a number of files for llama3 as well so it might get pretty messy: https://github.com/NVIDIA/bionemo-framework/pull/1501/changes. do we still want to go that way?
you're going to get the one CI job for the vllm_inference folder. So cding into esm2 here isn't going to scale well when you have multiple folders. You can always set up deeper packges here if you need to, but pip install and pytest are getting run from the root folder
so what i did to solve this in the llama3 repo was have it dive into the subfolders of vllm_inference to discover other recipe jobs: https://github.com/NVIDIA/bionemo-framework/pull/1501/changes#diff-f91d3e08cdd503116607af5517ca8357db1810406cb3c74c824a8f732061d0cb. should i do this here as well?
/ok to test b6a3bcf
Add a vLLM-based inference recipe for ESM2 models using TransformerEngine, including model conversion utilities, export scripts, and tests. Also updates NVEsmModel and related classes to support configurable add_pooling_layer via config and use self.model as the base_model_prefix for HuggingFace compatibility. Signed-off-by: Gagan Kaushik <gkaushik@nvidia.com>
0a57a07 to f2bfbce
/ok to test f2bfbce
Signed-off-by: Gagan Kaushik <gkaushik@nvidia.com>
/ok to test a9a6f75
those are only the copies from check_copied_files (originals are all checked by ruff anyway). ruff was giving a different ordering of imports for copied files and the original for the vllm recipe, so the two precommit checks were clashing. we can remove the non-vllm copied files from the ignore but i just thought it would be a good practice to avoid future issues like that
fixed with a .ruff.toml file
@coderabbitai full review
✅ Actions performed: Full review triggered.
Actionable comments posted: 12
🧹 Nitpick comments (8)
.pre-commit-config.yaml (1)
32-58: Consider DRY: use YAML anchors to avoid duplicating the exclude pattern. The identical exclude pattern is duplicated between the `ruff` and `ruff-format` hooks. YAML anchors can reduce this duplication and ensure both hooks stay in sync.

♻️ Proposed refactor using YAML anchors
```diff
   - id: ruff  # 1. Attempt to automatically fix any lint issues.
     args: ["--fix"]
     # Exclude check_copied_files destinations; they are verbatim copies
     # of source files and must not be reformatted independently.
-    exclude: |
+    exclude: &ruff_exclude |
       (?x)^(
         bionemo-recipes/recipes/esm2_native_te/example_8m_checkpoint/esm_nv\.py|
         ...
       )$
   - id: ruff-format
-    exclude: |
-      (?x)^(
-        bionemo-recipes/recipes/esm2_native_te/example_8m_checkpoint/esm_nv\.py|
-        ...
-      )$
+    exclude: *ruff_exclude
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In @.pre-commit-config.yaml around lines 32 - 58, The exclude regex is duplicated for the ruff and ruff-format hooks; create a single YAML anchor (e.g., &ruff_exclude) for that multiline exclude pattern and replace both exclude blocks with an alias (e.g., *ruff_exclude) so the pattern is defined once and referenced by the ruff and ruff-format hook entries in .pre-commit-config.yaml.

bionemo-recipes/recipes/esm2_native_te/tests/test_train.py (1)
296-309: Consider deduplicating FP8 log assertions into a shared helper. This block is effectively duplicated from the DDP FP8 stats test; extracting a helper would reduce maintenance cost for future log-path changes.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `bionemo-recipes/recipes/esm2_native_te/tests/test_train.py` around lines 296 - 309, Extract the repeated FP8 log assertions into a shared helper (e.g., assert_fp8_logs(fp8_log_dir: Path, rank: int = 0)) and replace the duplicated block in test_train.py with a call to that helper; the helper should verify fp8_log_dir exists, the "rank_<rank>" directory, the "nvdlfw_inspect_logs" and "nvdlfw_inspect_statistics_logs" subdirectories, and that the two files named like "nvdlfw_inspect_globalrank-<rank>.log" (metadata_log and stats_log) both exist and are non-empty. Ensure the helper uses the same Path names (fp8_log_dir, metadata_log, stats_log) so callers are straightforward to update and put the helper in a shared test utilities module (or conftest) so other tests (the DDP FP8 stats test) can import and call it.

bionemo-recipes/recipes/vllm_inference/.ci_build.sh (1)
2-2: Add error handling for the `cd` command. If the `esm2` directory doesn't exist, the script will continue in the wrong directory and fail unexpectedly. This was also flagged by static analysis (SC2164).

Proposed fix
```diff
 #!/bin/bash -x
-cd esm2
+cd esm2 || exit 1
 PIP_CONSTRAINT= pip install -r requirements.txt
 ./install_vllm.sh
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `bionemo-recipes/recipes/vllm_inference/.ci_build.sh` at line 2, The script .ci_build.sh currently runs the command `cd esm2` without checking its result; update the script to handle failure of the `cd esm2` step (e.g., test that the directory exists or check the exit status) and abort with a clear error message and non-zero exit code if the chdir fails so the build doesn't continue in the wrong directory.

bionemo-recipes/recipes/vllm_inference/esm2/Dockerfile (1)
1-24: Consider adding a non-root user for security. Trivy flagged that the image runs as root (DS-0002). While this may be acceptable for development containers, consider adding a non-root user for production deployments to follow security best practices.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `bionemo-recipes/recipes/vllm_inference/esm2/Dockerfile` around lines 1 - 24, Add a non-root user and switch to it before the final WORKDIR to avoid running the image as root: create a dedicated user/group in the Dockerfile (e.g., via RUN useradd/groupadd or adduser), chown the application directory (/workspace/bionemo) and any cache dirs used during build to that user, and set USER to that account before the final WORKDIR /workspace/bionemo. Ensure the conditional INSTALL_VLLM block still runs as root (keep it before creating/switching user) and only switch to the non-root user afterward so runtime processes no longer run as root.

bionemo-recipes/recipes/vllm_inference/esm2/install_vllm.sh (1)
10-14: Add error handling for the `cd` commands. If `/workspace` doesn't exist or `cd vllm` fails after cloning, the script will continue executing in the wrong directory, potentially causing unexpected behavior.

Proposed fix
```diff
-cd /workspace
+cd /workspace || { echo "ERROR: /workspace does not exist"; exit 1; }
 if [ ! -d vllm ]; then
     git clone --branch v0.15.1 --depth 1 https://github.com/vllm-project/vllm.git
 fi
-cd vllm
+cd vllm || { echo "ERROR: Failed to enter vllm directory"; exit 1; }
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `bionemo-recipes/recipes/vllm_inference/esm2/install_vllm.sh` around lines 10 - 14, The script currently runs `cd /workspace` and `cd vllm` without checking for errors; update the install_vllm.sh flow to fail fast on directory errors by verifying `cd /workspace` succeeds (create /workspace with `mkdir -p` or exit with an error message if it cannot be accessed) and after cloning verify `cd vllm` returned successfully (if not, print a clear error and exit non-zero). Apply these checks around the existing `cd /workspace` and `cd vllm` commands (or enable `set -euo pipefail` at the top) so the script doesn't continue in the wrong directory.

bionemo-recipes/recipes/vllm_inference/esm2/modeling_esm_te.py (3)
174-184: Mutating `self.config` may cause unintended side effects. Directly assigning to `self.config.layer_precision` modifies the config object, which could affect other components sharing the same config instance. Consider storing this as an instance attribute instead.

Suggested fix
```diff
         self.config = config
         self._fp8_recipe: transformer_engine.common.recipe.Recipe | None = fp8_recipe
         self._fp4_recipe: transformer_engine.common.recipe.Recipe | None = fp4_recipe
+        self._layer_precision = config.layer_precision

-        if self.config.layer_precision is None:
+        if self._layer_precision is None:
             if fp8_recipe is not None and fp4_recipe is not None:
                 raise RuntimeError("Both FP8 and FP4 recipes provided, but no layer precision provided.")
             if fp8_recipe is not None:
                 warnings.warn("No layer precision provided, using FP8 recipe for all layers.", UserWarning)
-                self.config.layer_precision = ["fp8"] * self.config.num_hidden_layers
+                self._layer_precision = ["fp8"] * self.config.num_hidden_layers
             elif fp4_recipe is not None:
                 raise RuntimeError(
                     "FP4 recipe provided but no layer_precision configured. "
                     "Set layer_precision explicitly when using FP4."
                 )

-        if self.config.layer_precision is not None and "fp4" in self.config.layer_precision and fp4_recipe is None:
+        if self._layer_precision is not None and "fp4" in self._layer_precision and fp4_recipe is None:
             raise RuntimeError("layer_precision contains 'fp4' entries but no fp4_recipe was provided.")
```
get_autocast_contextto useself._layer_precisioninstead ofself.config.layer_precision.🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@bionemo-recipes/recipes/vllm_inference/esm2/modeling_esm_te.py` around lines 174 - 184, Don't mutate the shared config by assigning to self.config.layer_precision; instead set an instance attribute (e.g., self._layer_precision) based on the existing logic (use ["fp8"] * self.config.num_hidden_layers when fp8_recipe is provided, raise for both recipes or for fp4 without explicit layer_precision), and then update get_autocast_context to read from self._layer_precision instead of self.config.layer_precision so other components sharing the config are not affected.
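The hazard described above, a model writing a derived value back into a config object it shares with other components, can be sketched generically. The class names below are stand-ins for illustration, not the actual NVEsm classes:

```python
class Config:
    """Stand-in for a HF-style config object shared across components."""

    def __init__(self, num_hidden_layers: int = 2, layer_precision=None):
        self.num_hidden_layers = num_hidden_layers
        self.layer_precision = layer_precision


class Model:
    def __init__(self, config: Config, fp8_recipe=None):
        self.config = config
        # Store the resolved value privately instead of writing it back to
        # config, so other holders of the same Config instance are unaffected.
        if config.layer_precision is None and fp8_recipe is not None:
            self._layer_precision = ["fp8"] * config.num_hidden_layers
        else:
            self._layer_precision = config.layer_precision


cfg = Config()
model = Model(cfg, fp8_recipe=object())
print(cfg.layer_precision)     # None: the shared config stays untouched
print(model._layer_precision)  # ['fp8', 'fp8']
```

Had the constructor assigned to `cfg.layer_precision` instead, a second model built from the same `cfg` would silently inherit the first model's resolved default.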
696-696: Extract duplicated magic number to a class constant. `mask_ratio_train = 0.15 * 0.8` is defined identically in both `_apply_token_dropout_bshd` and `_apply_token_dropout_thd`. Consider defining this as a class constant to follow DRY principles.

Suggested fix
```diff
 class NVEsmEmbeddings(nn.Module):
     """Modified version of EsmEmbeddings to support THD inputs."""

+    # Mask ratio used during ESM model training (15% masking * 80% replacement)
+    _MASK_RATIO_TRAIN = 0.15 * 0.8
+
     def __init__(self, config):
```

Then replace both occurrences:

```diff
-        mask_ratio_train = 0.15 * 0.8  # Hardcoded as the ratio used in all ESM model training runs
+        mask_ratio_train = self._MASK_RATIO_TRAIN
```

Also applies to: 717-717
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@bionemo-recipes/recipes/vllm_inference/esm2/modeling_esm_te.py` at line 696, The duplicated magic number (0.15 * 0.8) used for mask_ratio_train in _apply_token_dropout_bshd and _apply_token_dropout_thd should be extracted to a class-level constant (e.g., MASK_RATIO_TRAIN) on the model class (the class that defines _apply_token_dropout_bshd/_apply_token_dropout_thd) and both functions should reference that constant instead of recalculating the value inline; add the constant near other class constants, replace the inline expressions in _apply_token_dropout_bshd and _apply_token_dropout_thd with the new class constant, and ensure any imports/uses within the class access it as self.MASK_RATIO_TRAIN or ClassName.MASK_RATIO_TRAIN as appropriate.
80-80: Confusing default value for `padded_vocab_size`. The default of `64` contradicts the docstring, which says "If not provided, defaults to vocab_size." This will cause unexpected behavior for models with larger vocabularies. Consider using `None` as the default to match the documented behavior.

Suggested fix
```diff
-        padded_vocab_size: Optional[int] = 64,
+        padded_vocab_size: Optional[int] = None,
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@bionemo-recipes/recipes/vllm_inference/esm2/modeling_esm_te.py` at line 80, The parameter padded_vocab_size currently defaults to 64; change its signature to padded_vocab_size: Optional[int] = None (and update any type hints) and ensure the constructor/initialization logic in the class/function (where padded_vocab_size is used) treats None by setting padded_vocab_size = vocab_size so behavior matches the docstring; also update any downstream uses that assume an int to handle the None-to-vocab_size assignment.
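The `None`-default pattern recommended above, combined with the explicit `ValueError` check suggested for the same file, might look like this in isolation. The function name and values are illustrative, not taken from the recipe:

```python
from typing import Optional


def resolve_padded_vocab_size(vocab_size: int, padded_vocab_size: Optional[int] = None) -> int:
    # None means "default to vocab_size", matching the documented behavior.
    if padded_vocab_size is None:
        padded_vocab_size = vocab_size
    # Explicit runtime check instead of `assert`, which `python -O` would strip.
    if padded_vocab_size < vocab_size:
        raise ValueError(
            f"padded_vocab_size ({padded_vocab_size}) must be greater than "
            f"or equal to vocab_size ({vocab_size})"
        )
    return padded_vocab_size


print(resolve_padded_vocab_size(33))      # 33
print(resolve_padded_vocab_size(33, 64))  # 64
```

Padding the vocabulary to a multiple of a fixed block size is a common throughput optimization; making the default `None` keeps that an opt-in rather than a surprise for models whose vocab exceeds 64.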
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In @.pre-commit-config.yaml:
- Around line 44-45: The regex used to exclude files under
bionemo-recipes/models/(llama3|mixtral)/tests/common/ only matches the directory
itself, not files inside it; update the exclusion patterns in
.pre-commit-config.yaml for the ruff and ruff-format hooks (the entries
containing the string "bionemo-recipes/models/(llama3|mixtral)/tests/common/")
to append ".*" so they match files within that directory (e.g., change the
pattern to end with "/tests/common/.*").
In `@bionemo-recipes/models/esm2/modeling_esm_te.py`:
- Around line 414-416: The current fallback for add_pooling_layer in the model
init uses getattr(config, "add_pooling_layer", True) which conflicts with
NVEsmConfig's declared default False and the docstring; change the getattr
default to False so the fallback is getattr(config, "add_pooling_layer", False)
(keep the outer check if add_pooling_layer is None), and update the
function/class docstring where it states the default (currently "defaults to
True") to reflect the False default; reference symbols: add_pooling_layer,
config, NVEsmConfig, and the initializer in modeling_esm_te.py.
In `@bionemo-recipes/recipes/vllm_inference/.ci_test_env.sh`:
- Around line 1-2: The script lacks a shebang and blindly runs the cd command;
add a proper shebang (e.g., #!/usr/bin/env bash) at the top and guard the cd by
checking that the target directory exists and is a directory before changing
into it (for example test -d "esm2" || exit 1, or use if [ -d "esm2" ]; then cd
"esm2"; else echo "esm2 not found" >&2; exit 1; fi) so the script fails fast
instead of continuing in the wrong directory; update the file around the
existing cd esm2 invocation accordingly.
In `@bionemo-recipes/recipes/vllm_inference/esm2/convert.py`:
- Around line 110-129: The call to output_model.post_init() is redundant because
state.apply_transforms(...) already performs weight tying (see
state.apply_transforms) and related init work; remove the
output_model.post_init() call after apply_transforms, or if you only want to
ensure tied weights explicitly, replace it with a direct call to
output_model.tie_weights() instead of post_init; update the code around the
EsmForMaskedLM construction and the output_model variable accordingly.
In `@bionemo-recipes/recipes/vllm_inference/esm2/Dockerfile`:
- Around line 16-21: The Dockerfile runs the vLLM build steps (git clone vllm,
python use_existing_torch.py, uv pip install -r requirements/build.txt --system,
uv pip install --no-build-isolation -e . --system, pip install --upgrade
"transformers[torch]") but does not pass TORCH_CUDA_ARCH_LIST and MAX_JOBS like
install_vllm.sh; fix by exporting or prefixing TORCH_CUDA_ARCH_LIST and MAX_JOBS
with appropriate values (matching install_vllm.sh) for the uv pip install -r
requirements/build.txt --system and uv pip install --no-build-isolation -e .
--system commands (and the final pip install if needed) so the build targets the
correct CUDA arch and uses the desired parallelism.
In `@bionemo-recipes/recipes/vllm_inference/esm2/export.py`:
- Around line 70-121: The export_hf_checkpoint function loads local assets using
bare relative paths which break when the caller's CWD is different; change all
asset references in export_hf_checkpoint
(AutoTokenizer.from_pretrained("esm_fast_tokenizer"),
open("model_readme.template"), shutil.copy("modeling_esm_te.py", ...),
shutil.copy("LICENSE", ...)) to resolve them relative to the recipe file by
computing root = Path(__file__).resolve().parent and then using root /
"esm_fast_tokenizer", root / "model_readme.template", root /
"modeling_esm_te.py", and root / "LICENSE" so the tokenizer, template, and
copied files are always loaded from the recipe package rather than the current
working directory or the Hub.
In `@bionemo-recipes/recipes/vllm_inference/esm2/model_readme.template`:
- Around line 91-94: The "Runtime Engine(s):" section currently lists only
"Hugging Face Transformers" but this is a vllm_inference template and should
explicitly list vLLM (and include Transformers only if both are supported);
update the "Runtime Engine(s):" block in the vllm_inference
model_readme.template to include "vLLM" and, if the recipe supports both, list
"vLLM, Hugging Face Transformers" (apply the same change at the second
occurrence referenced in the comment).
- Around line 18-20: The description for "ESM-2" in model_readme.template
overstates its capability as directly performing protein structure prediction;
update the wording to say ESM-2 is a protein language/embedding model trained
with a masked LM objective that produces sequence embeddings useful for
downstream tasks (including structure-related analyses) rather than claiming it
predicts 3D structures directly; change the paragraph referencing "ESM-2" and
the similar sentence at the other occurrence (line ~44) to this softened
phrasing so the README correctly presents the model as an embedding/language
model for downstream use.
In `@bionemo-recipes/recipes/vllm_inference/esm2/modeling_esm_te.py`:
- Around line 140-143: The assertion checking padded_vocab_size vs vocab_size
should be converted to an explicit runtime check: replace the assert in the
block that references self.padded_vocab_size and self.vocab_size with an
if-condition that raises a ValueError when padded_vocab_size is less than
vocab_size, using the same descriptive message (e.g., "padded_vocab_size (...)
must be greater than or equal to vocab_size (...)"); update the check in the
method or initializer where these attributes are set (the code block containing
self.padded_vocab_size and self.vocab_size) so the validation cannot be bypassed
by Python optimizations.
In `@bionemo-recipes/recipes/vllm_inference/esm2/tests/conftest.py`:
- Around line 1-20: Add a top-level Google-style module docstring to conftest.py
describing the module's purpose (e.g., pytest configuration and test path setup)
to satisfy pydocstyle; place a triple-quoted docstring at the very top of the
file that briefly summarizes what conftest.py does (adjusts sys.path and
provides test fixtures/config) and include any relevant notes or authorship per
project conventions.
- Line 20: Replace the sys.path.append call in conftest.py with
sys.path.insert(0, Path(__file__).parent.parent.as_posix()) so the local esm2
package is prioritized during tests; this ensures the import in test_vllm.py
(from export import export_hf_checkpoint) resolves to the local source rather
than an installed package and you can locate the call by searching for
sys.path.append and Path(__file__).parent.parent.as_posix() in conftest.py.
In `@bionemo-recipes/recipes/vllm_inference/esm2/tests/test_vllm.py`:
- Around line 62-131: Add a module-level skip marker so all tests are skipped
when CUDA isn't available: detect CUDA availability (torch.cuda.is_available())
and set pytestmark = pytest.mark.skipif(not torch.cuda.is_available(),
reason="requires CUDA") at the top of the module; this prevents session-scoped
fixtures hf_exported_embeddings and hf_reference_embeddings from calling
_hf_embed (which .to("cuda")s models/tensors) on CPU-only runners and avoids
failures during fixture setup.
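Several of the prompts above (notably the export.py one) recommend resolving bundled assets relative to the module file rather than the caller's working directory. A minimal sketch of that pattern; the file and asset names are taken from the review text, and `asset_path` is a hypothetical helper:

```python
from pathlib import Path


def asset_path(module_file: str, name: str) -> Path:
    # Anchor lookups at the module's own directory so they survive CWD changes;
    # inside export.py this would be called as asset_path(__file__, "esm_fast_tokenizer").
    return Path(module_file).resolve().parent / name


p = asset_path("export.py", "model_readme.template")
print(p.is_absolute())  # True regardless of os.getcwd()
```

Bare relative strings like `"model_readme.template"` are resolved against `os.getcwd()`, which is why the original code only works when run from the recipe directory.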
---
Nitpick comments:
In @.pre-commit-config.yaml:
- Around line 32-58: The exclude regex is duplicated for the ruff and
ruff-format hooks; create a single YAML anchor (e.g., &ruff_exclude) for that
multiline exclude pattern and replace both exclude blocks with an alias (e.g.,
*ruff_exclude) so the pattern is defined once and referenced by the ruff and
ruff-format hook entries in .pre-commit-config.yaml.
In `@bionemo-recipes/recipes/esm2_native_te/tests/test_train.py`:
- Around line 296-309: Extract the repeated FP8 log assertions into a shared
helper (e.g., assert_fp8_logs(fp8_log_dir: Path, rank: int = 0)) and replace the
duplicated block in test_train.py with a call to that helper; the helper should
verify fp8_log_dir exists, the "rank_<rank>" directory, the
"nvdlfw_inspect_logs" and "nvdlfw_inspect_statistics_logs" subdirectories, and
that the two files named like "nvdlfw_inspect_globalrank-<rank>.log"
(metadata_log and stats_log) both exist and are non-empty. Ensure the helper
uses the same Path names (fp8_log_dir, metadata_log, stats_log) so callers are
straightforward to update and put the helper in a shared test utilities module
(or conftest) so other tests (the DDP FP8 stats test) can import and call it.
In `@bionemo-recipes/recipes/vllm_inference/.ci_build.sh`:
- Line 2: The script .ci_build.sh currently runs the command `cd esm2` without
checking its result; update the script to handle failure of the `cd esm2` step
(e.g., test that the directory exists or check the exit status) and abort with a
clear error message and non-zero exit code if the chdir fails so the build
doesn't continue in the wrong directory.
In `@bionemo-recipes/recipes/vllm_inference/esm2/Dockerfile`:
- Around line 1-24: Add a non-root user and switch to it before the final
WORKDIR to avoid running the image as root: create a dedicated user/group in the
Dockerfile (e.g., via RUN useradd/groupadd or adduser), chown the application
directory (/workspace/bionemo) and any cache dirs used during build to that
user, and set USER to that account before the final WORKDIR /workspace/bionemo.
Ensure the conditional INSTALL_VLLM block still runs as root (keep it before
creating/switching user) and only switch to the non-root user afterward so
runtime processes no longer run as root.
In `@bionemo-recipes/recipes/vllm_inference/esm2/install_vllm.sh`:
- Around line 10-14: The script currently runs `cd /workspace` and `cd vllm`
without checking for errors; update the install_vllm.sh flow to fail fast on
directory errors by verifying `cd /workspace` succeeds (create /workspace with
`mkdir -p` or exit with an error message if it cannot be accessed) and after
cloning verify `cd vllm` returned successfully (if not, print a clear error and
exit non‑zero). Apply these checks around the existing `cd /workspace` and `cd
vllm` commands (or enable `set -euo pipefail` at the top) so the script doesn’t
continue in the wrong directory.
In `@bionemo-recipes/recipes/vllm_inference/esm2/modeling_esm_te.py`:
- Around line 174-184: Don't mutate the shared config by assigning to
self.config.layer_precision; instead set an instance attribute (e.g.,
self._layer_precision) based on the existing logic (use ["fp8"] *
self.config.num_hidden_layers when fp8_recipe is provided, raise for both
recipes or for fp4 without explicit layer_precision), and then update
get_autocast_context to read from self._layer_precision instead of
self.config.layer_precision so other components sharing the config are not
affected.
- Line 696: The duplicated magic number (0.15 * 0.8) used for mask_ratio_train
in _apply_token_dropout_bshd and _apply_token_dropout_thd should be extracted to
a class-level constant (e.g., MASK_RATIO_TRAIN) on the model class (the class
that defines _apply_token_dropout_bshd/_apply_token_dropout_thd) and both
functions should reference that constant instead of recalculating the value
inline; add the constant near other class constants, replace the inline
expressions in _apply_token_dropout_bshd and _apply_token_dropout_thd with the
new class constant, and ensure any imports/uses within the class access it as
self.MASK_RATIO_TRAIN or ClassName.MASK_RATIO_TRAIN as appropriate.
- Line 80: The parameter padded_vocab_size currently defaults to 64; change its
signature to padded_vocab_size: Optional[int] = None (and update any type hints)
and ensure the constructor/initialization logic in the class/function (where
padded_vocab_size is used) treats None by setting padded_vocab_size = vocab_size
so behavior matches the docstring; also update any downstream uses that assume
an int to handle the None-to-vocab_size assignment.
ℹ️ Review info
⚙️ Run configuration
Configuration used: Path: .coderabbit.yaml
Review profile: CHILL
Plan: Pro
Run ID: e2413d6e-00ae-495d-aa4b-c6737be37029
📒 Files selected for processing (41)
- .pre-commit-config.yaml
- .secrets.baseline
- bionemo-recipes/models/esm2/README.md
- bionemo-recipes/models/esm2/convert.py
- bionemo-recipes/models/esm2/export.py
- bionemo-recipes/models/esm2/modeling_esm_te.py
- bionemo-recipes/models/esm2/tests/test_cp_bshd.py
- bionemo-recipes/models/esm2/tests/test_cp_thd.py
- bionemo-recipes/models/esm2/tests/test_distributed_fp8.py
- bionemo-recipes/models/esm2/tests/test_distributed_strategies.py
- bionemo-recipes/models/esm2/tests/test_modeling_esm_te.py
- bionemo-recipes/recipes/esm2_accelerate_te/example_8m_checkpoint/esm_nv.py
- bionemo-recipes/recipes/esm2_native_te/fp4_debugging_stats.yaml
- bionemo-recipes/recipes/esm2_native_te/modeling_esm_te.py
- bionemo-recipes/recipes/esm2_native_te/quantization.py
- bionemo-recipes/recipes/esm2_native_te/tests/test_quantization.py
- bionemo-recipes/recipes/esm2_native_te/tests/test_stop_and_go.py
- bionemo-recipes/recipes/esm2_native_te/tests/test_train.py
- bionemo-recipes/recipes/esm2_native_te/train_ddp.py
- bionemo-recipes/recipes/esm2_native_te/train_ddp_cp.py
- bionemo-recipes/recipes/esm2_native_te/train_fsdp2.py
- bionemo-recipes/recipes/esm2_native_te/train_fsdp2_cp.py
- bionemo-recipes/recipes/esm2_peft_te/example_8m_checkpoint/esm_nv.py
- bionemo-recipes/recipes/vllm_inference/.ci_build.sh
- bionemo-recipes/recipes/vllm_inference/.ci_test_env.sh
- bionemo-recipes/recipes/vllm_inference/esm2/Dockerfile
- bionemo-recipes/recipes/vllm_inference/esm2/LICENSE
- bionemo-recipes/recipes/vllm_inference/esm2/README.md
- bionemo-recipes/recipes/vllm_inference/esm2/convert.py
- bionemo-recipes/recipes/vllm_inference/esm2/esm_fast_tokenizer/special_tokens_map.json
- bionemo-recipes/recipes/vllm_inference/esm2/esm_fast_tokenizer/tokenizer.json
- bionemo-recipes/recipes/vllm_inference/esm2/esm_fast_tokenizer/tokenizer_config.json
- bionemo-recipes/recipes/vllm_inference/esm2/export.py
- bionemo-recipes/recipes/vllm_inference/esm2/install_vllm.sh
- bionemo-recipes/recipes/vllm_inference/esm2/model_readme.template
- bionemo-recipes/recipes/vllm_inference/esm2/modeling_esm_te.py
- bionemo-recipes/recipes/vllm_inference/esm2/requirements.txt
- bionemo-recipes/recipes/vllm_inference/esm2/state.py
- bionemo-recipes/recipes/vllm_inference/esm2/tests/conftest.py
- bionemo-recipes/recipes/vllm_inference/esm2/tests/test_vllm.py
- ci/scripts/check_copied_files.py
```python
with init_empty_weights():
    model_hf = EsmForMaskedLM(hf_config)

# Remove contact_head since it's not present in TE models
if hasattr(model_hf.esm, "contact_head"):
    delattr(model_hf.esm, "contact_head")

output_model = state.apply_transforms(
    model_te,
    model_hf,
    reverse_mapping,
    [_unpack_qkv_weight, _unpack_qkv_bias, _unpad_embeddings, _unpad_decoder_weights, _unpad_bias],
    state_dict_ignored_entries=[
        "lm_head.decoder.weight",
        "esm.contact_head.regression.weight",
        "esm.contact_head.regression.bias",
    ],
)

output_model.post_init()
```
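The `_unpack_qkv_weight` transform named above reverses TE's fused QKV packing when exporting back to the HF layout. A minimal sketch of the idea, assuming a plain `[Q; K; V]` row concatenation (the real transform in convert.py may use a per-head interleaved layout instead):

```python
# Toy sketch of an "_unpack_qkv_weight"-style transform: split a fused
# (3*hidden, hidden) projection back into separate Q, K, V matrices.
# Assumes plain [Q; K; V] concatenation along the output rows; this is an
# illustration, not the repository's actual implementation.

def unpack_qkv_weight(qkv_rows, hidden):
    """Split fused QKV rows into Q, K, V row blocks."""
    assert len(qkv_rows) == 3 * hidden, "expected 3*hidden rows"
    q = qkv_rows[:hidden]
    k = qkv_rows[hidden:2 * hidden]
    v = qkv_rows[2 * hidden:]
    return q, k, v

hidden = 2
q = [[1, 1], [1, 1]]
k = [[2, 2], [2, 2]]
v = [[3, 3], [3, 3]]
fused = q + k + v  # what the fused checkpoint stores
q2, k2, v2 = unpack_qkv_weight(fused, hidden)
assert (q2, k2, v2) == (q, k, v)
```

The companion `_unpack_qkv_bias` would apply the same split to the 1-D bias vector.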
There was a problem hiding this comment.
🧩 Analysis chain
🌐 Web query:
In the current Hugging Face Transformers version, what does PreTrainedModel.post_init() do? If it is called after parameters have been manually loaded into a model, does it reinitialize weights or only perform setup/tie_weights logic?
💡 Result:
In current Transformers (v5.x), PreTrainedModel.post_init() is the “end of __init__” hook. Its main job is to call the model’s weight-init/tie/pruning routine and to finalize some “static” model properties for composite models. Concretely, it calls self.init_weights() (and uses that moment to ensure things like tied-weights metadata are correct). [1]
`init_weights()`:
- prunes heads if `config.pruned_heads` is set, and
- runs initialization by recursively applying `_initialize_weights` only to modules that are not already marked as initialized (via a per-module `_is_hf_initialized` flag), and then
- calls `self.tie_weights()` to enforce configured weight tying. [2][3]
So, if you call post_init() after you manually loaded parameters, it typically will not “blindly reinitialize everything”—because modules that have already been initialized (and thus have _is_hf_initialized=True) are skipped by _initialize_weights. [3] However:
- it will (re)run `tie_weights()`, which can overwrite some parameters by design (e.g., tying output embeddings to input embeddings when `tie_word_embeddings=True`). [2][3]
- it can initialize newly-added / newly-resized modules that don't have `_is_hf_initialized` set yet (this is one reason people see "unexpected reinit" after resizing embeddings or adding heads). [3]
Rule of thumb: post_init() is not a “safe no-op setup call”; it’s “init + tie (and prune)”. After manual loading, it won’t usually reinit already-initialized modules, but it can initialize uninitialized/new modules and it will enforce tying. [1][2][3]
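The skip behavior described above can be illustrated with a toy stand-in (not the actual Transformers code; `ToyModule` and the flag handling are simplified for illustration):

```python
class ToyModule:
    """Minimal stand-in for an nn.Module to show the _is_hf_initialized
    skip logic; not the real Transformers implementation."""
    def __init__(self, name):
        self.name = name
        self.weight = None
        self._is_hf_initialized = False

def initialize_weights(modules):
    # Mirrors the described behavior: skip modules already flagged as
    # initialized, init the rest, then mark them.
    for m in modules:
        if getattr(m, "_is_hf_initialized", False):
            continue
        m.weight = 0.0  # placeholder for "random init"
        m._is_hf_initialized = True

loaded = ToyModule("loaded")
loaded.weight = 1.23              # manually loaded parameter
loaded._is_hf_initialized = True  # marked as initialized during loading
fresh = ToyModule("fresh")        # newly added head, never initialized

initialize_weights([loaded, fresh])
assert loaded.weight == 1.23  # untouched: already initialized
assert fresh.weight == 0.0    # newly initialized
```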
Sources
- HF docs (`post_init` description, v5.3.0 docs): https://huggingface.co/docs/transformers/main_classes/model
- HF docs (`init_weights`/`tie_weights` behavior, v5.3.0 docs): https://huggingface.co/docs/transformers/main_classes/model
- `modeling_utils.py` (shows `post_init -> init_weights`, and `_initialize_weights` skipping modules with `_is_hf_initialized`): https://huggingface.co/Yuvarraj/MASR/blob/main/transformers/src/transformers/modeling_utils.py
🏁 Script executed:

```shell
# Find the file and understand the context
cd bionemo-recipes || true
find . -type f -name "convert.py" | grep -E "esm2|vllm" | head -5
```

Repository: NVIDIA/bionemo-framework
Length of output: 133

🏁 Script executed:

```shell
# Search for ESM2 config setup and weight tying settings
rg -t py "tie_word_embeddings|tie_weights|EsmConfig" --max-count=20
```

Length of output: 19722

🏁 Script executed:

```shell
# Check what apply_transforms does and whether it handles tying
rg -t py "apply_transforms" -B 5 -A 10 --max-count=30
```

Length of output: 37790

🏁 Script executed:

```shell
# Check the actual convert.py file at the target lines
cd bionemo-recipes && head -150 recipes/vllm_inference/esm2/convert.py | tail -50
```

Length of output: 2046

🏁 Script executed:

```shell
# Compare with models/esm2/convert.py to see if they differ
diff -u models/esm2/convert.py recipes/vllm_inference/esm2/convert.py | head -100
```

Length of output: 195

🏁 Script executed:

```shell
# Check how other models handle post_init() after apply_transforms
rg -t py "post_init\(\)" --max-count=20
```

Length of output: 3627

🏁 Script executed:

```shell
# Check if ESM2 has weight tying configured
rg -t py "tie_word_embeddings" --max-count=30 | grep -i esm
```

Length of output: 50

🏁 Script executed:

```shell
# Look at what EsmForMaskedLM's post_init() would actually do
rg -A 20 "class EsmForMaskedLM" recipes/vllm_inference/esm2/modeling_esm_te.py | head -40
```

Length of output: 154

🏁 Script executed:

```shell
# Check the state.py apply_transforms function to see if it handles tying
cd bionemo-recipes && sed -n '1,200p' recipes/vllm_inference/esm2/state.py | grep -A 50 "def apply_transforms"
```

Length of output: 2814

🏁 Script executed:

```shell
# Check if state.py calls tie_weights() after apply_transforms
cd bionemo-recipes && cat recipes/vllm_inference/esm2/state.py | grep -A 5 "def apply_transforms" | tail -100
```

Length of output: 322

🏁 Script executed:

```shell
# Check the end of apply_transforms to see if it calls tie_weights
rg -A 100 "def apply_transforms" recipes/vllm_inference/esm2/state.py | tail -80
```

Length of output: 144

🏁 Script executed:

```shell
# Check the actual convert.py more completely around the post_init call
cd bionemo-recipes && cat -n recipes/vllm_inference/esm2/convert.py | sed -n '95,135p'
```

Length of output: 1972

🏁 Script executed:

```shell
# Look for any comments explaining why post_init is needed
rg -B 5 -A 5 "output_model.post_init()" recipes/vllm_inference/esm2/convert.py
```

Length of output: 146

🏁 Script executed:

```shell
# Find the actual path to the convert.py file
find . -name "convert.py" -path "*vllm_inference/esm2*"
```

Length of output: 124

🏁 Script executed:

```shell
# Check the full path and list the directory
git ls-files | grep -E "vllm_inference.*esm2.*convert"
```

Length of output: 122

🏁 Script executed:

```shell
# Look at the state.py end of apply_transforms function to see if it calls tie_weights
git ls-files | grep -E "vllm_inference.*esm2.*state"
```

Length of output: 120

🏁 Script executed:

```shell
# Check models/esm2/convert.py for comparison
git ls-files | grep -E "models/esm2.*convert"
```

Length of output: 106

🏁 Script executed:

```shell
# Now check the actual file with correct path
cat bionemo-recipes/recipes/vllm_inference/esm2/convert.py | grep -A 150 "def convert_esm_te_to_hf" | head -70
```

Length of output: 3196

🏁 Script executed:

```shell
# Check the state.py apply_transforms ending to see if tie_weights is called
tail -100 bionemo-recipes/recipes/vllm_inference/esm2/state.py | grep -B 20 "return target"
```

Length of output: 50

🏁 Script executed:

```shell
# Check models/esm2/convert.py for the same function and see if it also calls post_init
grep -A 20 "output_model.post_init()" bionemo-recipes/models/esm2/convert.py
```

Length of output: 895

🏁 Script executed:

```shell
# Check if there's a difference in how state.py handles tie_weights
grep -n "tie_weights" bionemo-recipes/recipes/vllm_inference/esm2/state.py
```

Length of output: 143
Remove redundant `post_init()` call after `state.apply_transforms()`.
`state.apply_transforms()` already calls `tie_weights()` at line 220 of state.py, so calling `output_model.post_init()` immediately after (line 129) redundantly re-runs `tie_weights()` and other initialization logic. Since all weights have already been loaded and tied by `apply_transforms()`, either remove the `post_init()` call or replace it with an explicit `output_model.tie_weights()` if only tying is needed.
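Why a redundant re-tie matters can be shown with a toy model (illustrative only, not the HF implementation): re-running `tie_weights()` silently replaces any separately loaded decoder weights with the input embedding.

```python
class TinyLM:
    """Toy illustration of weight tying (not the HF implementation):
    tie_weights() points the output decoder at the input embedding."""
    def __init__(self):
        self.input_embedding = [1.0, 2.0, 3.0]
        self.decoder = [0.0, 0.0, 0.0]

    def tie_weights(self):
        self.decoder = self.input_embedding  # share one "tensor"

m = TinyLM()
m.decoder = [9.0, 9.0, 9.0]  # suppose a transform loaded decoder weights
m.tie_weights()              # re-tying silently replaces them
assert m.decoder is m.input_embedding
assert m.decoder == [1.0, 2.0, 3.0]
```

This is benign only when the loaded decoder weights were already identical to the embeddings, which is why re-running the tie after a careful state-dict load is at best redundant.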
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@bionemo-recipes/recipes/vllm_inference/esm2/convert.py` around lines 110 -
129, The call to output_model.post_init() is redundant because
state.apply_transforms(...) already performs weight tying (see
state.apply_transforms) and related init work; remove the
output_model.post_init() call after apply_transforms, or if you only want to
ensure tied weights explicitly, replace it with a direct call to
output_model.tie_weights() instead of post_init; update the code around the
EsmForMaskedLM construction and the output_model variable accordingly.
There was a problem hiding this comment.
out of scope of PR
There was a problem hiding this comment.
@gagank1, understood — I'll note this for a follow-up. Would you like me to open a GitHub issue to track the potential removal of the redundant output_model.post_init() call in convert_esm_te_to_hf?
```markdown
**Runtime Engine(s):**

- Hugging Face Transformers
```
There was a problem hiding this comment.
Runtime engine section is inconsistent with this recipe context.
In a vllm_inference template, listing only Hugging Face Transformers can mislead users. Consider listing vLLM explicitly (and Transformers if both are supported).
Also applies to: 214-217
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@bionemo-recipes/recipes/vllm_inference/esm2/model_readme.template` around
lines 91 - 94, The "Runtime Engine(s):" section currently lists only "Hugging
Face Transformers" but this is a vllm_inference template and should explicitly
list vLLM (and include Transformers only if both are supported); update the
"Runtime Engine(s):" block in the vllm_inference model_readme.template to
include "vLLM" and, if the recipe supports both, list "vLLM, Hugging Face
Transformers" (apply the same change at the second occurrence referenced in the
comment).
There was a problem hiding this comment.
@gagank1, understood! I'll set this aside for now. If you'd like to track it for a future update, I can open a follow-up issue — just let me know.
```python
if self.padded_vocab_size is not None and self.vocab_size is not None:
    assert self.padded_vocab_size >= self.vocab_size, (
        f"padded_vocab_size ({self.padded_vocab_size}) must be greater than or equal to vocab_size ({self.vocab_size})"
    )
```
There was a problem hiding this comment.
Replace `assert` with an explicit exception for production code.
Using `assert` for validation can be disabled with Python's `-O` flag, bypassing this critical check. Use `raise ValueError` for robust validation.
Suggested fix

```diff
 # Ensure padded_vocab_size is at least as large as vocab_size
 if self.padded_vocab_size is not None and self.vocab_size is not None:
-    assert self.padded_vocab_size >= self.vocab_size, (
-        f"padded_vocab_size ({self.padded_vocab_size}) must be greater than or equal to vocab_size ({self.vocab_size})"
-    )
+    if self.padded_vocab_size < self.vocab_size:
+        raise ValueError(
+            f"padded_vocab_size ({self.padded_vocab_size}) must be >= vocab_size ({self.vocab_size})"
+        )
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@bionemo-recipes/recipes/vllm_inference/esm2/modeling_esm_te.py` around lines
140 - 143, The assertion checking padded_vocab_size vs vocab_size should be
converted to an explicit runtime check: replace the assert in the block that
references self.padded_vocab_size and self.vocab_size with an if-condition that
raises a ValueError when padded_vocab_size is less than vocab_size, using the
same descriptive message (e.g., "padded_vocab_size (...) must be greater than or
equal to vocab_size (...)"); update the check in the method or initializer where
these attributes are set (the code block containing self.padded_vocab_size and
self.vocab_size) so the validation cannot be bypassed by Python optimizations.
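A standalone sketch of the suggested check (the function name is made up for the example; the real validation lives in the model/config initializer):

```python
def validate_vocab_sizes(padded_vocab_size, vocab_size):
    """Explicit validation that survives `python -O`, unlike an assert.
    Hypothetical helper illustrating the suggested fix."""
    if padded_vocab_size is not None and vocab_size is not None:
        if padded_vocab_size < vocab_size:
            raise ValueError(
                f"padded_vocab_size ({padded_vocab_size}) must be >= vocab_size ({vocab_size})"
            )

validate_vocab_sizes(64, 33)   # OK: vocab padded up for TE-friendly sizes
validate_vocab_sizes(None, 33)  # OK: nothing to check when either is None
try:
    validate_vocab_sizes(32, 33)
except ValueError as e:
    print(e)  # padded_vocab_size (32) must be >= vocab_size (33)
```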
There was a problem hiding this comment.
@gagank1 Understood, I'll defer this to a future PR. No further action needed here.
```python
from pathlib import Path

sys.path.append(Path(__file__).parent.parent.as_posix())
```
There was a problem hiding this comment.
🧩 Analysis chain
🏁 Script executed:

```shell
# First, let's check if the file exists and view its contents
fd "conftest.py" bionemo-recipes/recipes/vllm_inference/esm2/tests/
```

Repository: NVIDIA/bionemo-framework
Length of output: 129

🏁 Script executed:

```shell
# Now let's examine the conftest.py file
cat -n bionemo-recipes/recipes/vllm_inference/esm2/tests/conftest.py
```

Length of output: 994

🏁 Script executed:

```shell
# Check the directory structure to understand the path resolution
tree -L 4 bionemo-recipes/recipes/vllm_inference/esm2/
```

Length of output: 469

🏁 Script executed:

```shell
# Look for pyproject.toml locations in the repository
fd "pyproject.toml" | head -20
```

Length of output: 1061

🏁 Script executed:

```shell
# Let's check what imports are in test_vllm.py that might depend on sys.path
cat -n bionemo-recipes/recipes/vllm_inference/esm2/tests/test_vllm.py | head -50
```

Length of output: 2119

🏁 Script executed:

```shell
# Check what files/modules are in the esm2 directory that might be imported
ls -la bionemo-recipes/recipes/vllm_inference/esm2/*.py
```

Length of output: 492

🏁 Script executed:

```shell
# Look at the full directory structure to understand what could be imported
find bionemo-recipes/recipes/vllm_inference/esm2/ -maxdepth 2 -type f -name "*.py" -o -type d
```

Length of output: 575

🏁 Script executed:

```shell
# Search for other conftest.py files to understand the patterns used in this repo
find . -name "conftest.py" -type f | head -20
```

Length of output: 1228

🏁 Script executed:

```shell
# Let's also check if there's any __init__.py in the esm2 directory or its parents
find bionemo-recipes/recipes/vllm_inference/ -name "__init__.py" -type f
```

Length of output: 50

🏁 Script executed:

```shell
# Let's trace what the proposed fix would actually do
# Starting from conftest location, where would it find pyproject.toml?
find bionemo-recipes/recipes/vllm_inference -name "pyproject.toml" -type f
```

Length of output: 50

🏁 Script executed:

```shell
# Check if there are any imports from parent packages in test files
grep -r "import\|from" bionemo-recipes/recipes/vllm_inference/esm2/tests/ --include="*.py"
```

Length of output: 1570

🏁 Script executed:

```shell
# Let's check a few other conftest files to see how they handle sys.path
head -30 bionemo-recipes/models/esm2/tests/conftest.py
```

Length of output: 1090

🏁 Script executed:

```shell
# Check the parent directory structure more carefully
ls -la bionemo-recipes/recipes/vllm_inference/esm2/
```

Length of output: 940

🏁 Script executed:

```shell
# Let's verify what the proposed fix would actually resolve to
# Starting from conftest, walk up to find pyproject.toml
python3 << 'EOF'
from pathlib import Path
conftest_path = Path("bionemo-recipes/recipes/vllm_inference/esm2/tests/conftest.py")
start = conftest_path.parent.resolve()
print(f"Starting from: {start}")
print(f"Parent: {start.parent}")
print(f"Parent.parent: {start.parent.parent}")
# Walk up looking for pyproject.toml
for parent in [start, *start.parents]:
    if (parent / "pyproject.toml").exists():
        print(f"Found pyproject.toml at: {parent}")
        break
# What current code does
current_append = start.parent.parent
print(f"\nCurrent sys.path.append would add: {current_append}")
print(f"Files in that directory: {list(current_append.glob('*.py'))}")
EOF
```

Length of output: 461
Use `sys.path.insert(0, ...)` instead of `append()`; installed packages should not take priority over local sources.
Using `append()` places the path at the end of `sys.path`, allowing installed packages to be found first. Since `from export import export_hf_checkpoint` in test_vllm.py expects the esm2 directory on the path, change this to `insert(0)` to prioritize local sources during tests.
🔧 Proposed fix

```diff
-sys.path.append(Path(__file__).parent.parent.as_posix())
+sys.path.insert(0, Path(__file__).parent.parent.as_posix())
```

📝 Committable suggestion
‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.
```python
sys.path.insert(0, Path(__file__).parent.parent.as_posix())
```
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@bionemo-recipes/recipes/vllm_inference/esm2/tests/conftest.py` at line 20,
Replace the sys.path.append call in conftest.py with sys.path.insert(0,
Path(__file__).parent.parent.as_posix()) so the local esm2 package is
prioritized during tests; this ensures the import in test_vllm.py (from export
import export_hf_checkpoint) resolves to the local source rather than an
installed package and you can locate the call by searching for sys.path.append
and Path(__file__).parent.parent.as_posix() in conftest.py.
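The ordering effect can be demonstrated in isolation (the module name `shadow_demo_mod` is invented for the demo):

```python
import sys
import tempfile
from pathlib import Path

# Two directories each providing a module named `shadow_demo_mod`;
# whichever directory appears earlier in sys.path wins the import.
first = Path(tempfile.mkdtemp())
second = Path(tempfile.mkdtemp())
(first / "shadow_demo_mod.py").write_text("ORIGIN = 'first'\n")
(second / "shadow_demo_mod.py").write_text("ORIGIN = 'second'\n")

sys.path.append(str(second))    # appended path: searched last
sys.path.insert(0, str(first))  # inserted path: searched first

import shadow_demo_mod
assert shadow_demo_mod.ORIGIN == "first"  # local copy shadows the other
```

With `append()` instead of `insert(0)`, an identically named installed package anywhere earlier on `sys.path` would be imported instead of the local source.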
There was a problem hiding this comment.
out of scope, follows other conftests
There was a problem hiding this comment.
@gagank1, understood — I'll defer to the existing project conventions for conftest files. I've noted this for future reviews.
(ᵔᴥᵔ)
✏️ Learnings added
Learnt from: gagank1
Repo: NVIDIA/bionemo-framework PR: 1473
File: bionemo-recipes/recipes/vllm_inference/esm2/tests/conftest.py:20-20
Timestamp: 2026-03-16T15:51:29.934Z
Learning: In bionemo-recipes conftest.py files (e.g., bionemo-recipes/recipes/vllm_inference/esm2/tests/conftest.py), using `sys.path.append(Path(__file__).parent.parent.as_posix())` is the accepted project pattern for adding parent directories to the Python path. Do not flag or suggest replacing it with `sys.path.insert(0, ...)`, as this follows the existing style of other conftest files in the repository.
Note: Learnings are effective only in the context of similar code segments. To apply general coding guidelines, please configure review instructions. You can manage existing learnings in the app.
🧠 Learnings used
Learnt from: gagank1
Repo: NVIDIA/bionemo-framework PR: 1473
File: bionemo-recipes/recipes/vllm_inference/esm2/tests/conftest.py:1-20
Timestamp: 2026-03-16T15:50:46.694Z
Learning: Do not require or flag a top-level module docstring in conftest.py files across the repository (e.g., bionemo-recipes/recipes/.../tests/conftest.py). This follows the project's existing style for conftest files. If CI enforces docstrings elsewhere, exceptions for conftest should remain isolated to this pattern.
Signed-off-by: Gagan Kaushik <gkaushik@nvidia.com>
/ok to test dcaa6c7
Signed-off-by: Gagan Kaushik <gkaushik@nvidia.com>
Signed-off-by: Gagan Kaushik <gkaushik@nvidia.com>
Signed-off-by: Gagan Kaushik <gkaushik@nvidia.com>
/ok to test 13046cb
```diff
-assert fp8_log_dir.exists()
-assert (fp8_log_dir / "rank_0" / "nvdlfw_inspect_logs" / "nvdlfw_inspect_globalrank-0.log").exists()
-assert (fp8_log_dir / "rank_0" / "nvdlfw_inspect_statistics_logs" / "nvdlfw_inspect_globalrank-0.log").exists()
+assert fp8_log_dir.exists(), "FP8 log directory was not created"
+assert (fp8_log_dir / "rank_0").exists(), "rank_0 directory was not created"
+assert (fp8_log_dir / "rank_0" / "nvdlfw_inspect_logs").exists(), "nvdlfw_inspect_logs directory was not created"
+assert (fp8_log_dir / "rank_0" / "nvdlfw_inspect_statistics_logs").exists(), (
+    "nvdlfw_inspect_statistics_logs directory was not created"
+)
+
+metadata_log = fp8_log_dir / "rank_0" / "nvdlfw_inspect_logs" / "nvdlfw_inspect_globalrank-0.log"
+stats_log = fp8_log_dir / "rank_0" / "nvdlfw_inspect_statistics_logs" / "nvdlfw_inspect_globalrank-0.log"
+
+assert metadata_log.exists(), "Metadata log file was not created"
+assert stats_log.exists(), "Statistics log file was not created"
+assert metadata_log.stat().st_size > 0, "Metadata log file is empty"
+assert stats_log.stat().st_size > 0, "Statistics log file is empty"
```
There was a problem hiding this comment.
Any reason you're changing these?
There was a problem hiding this comment.
strengthening the test - cursor suggested because it was only checking the existence of the files instead of checking that they're non-empty like above in test_sanity_ddp_fp8_stats_logging
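The existence-plus-size pattern discussed here can be captured in a small helper (a generic illustration, not the recipe's actual test code):

```python
import tempfile
from pathlib import Path

def check_log(path: Path):
    """Existence alone misses truncated or empty logs; also check size.
    Hypothetical helper for illustration."""
    assert path.exists(), f"{path.name} was not created"
    assert path.stat().st_size > 0, f"{path.name} is empty"

log_dir = Path(tempfile.mkdtemp())
good = log_dir / "stats.log"
good.write_text("step=1 amax=0.5\n")
check_log(good)  # passes: exists and non-empty

empty = log_dir / "empty.log"
empty.touch()
try:
    check_log(empty)
except AssertionError as e:
    print(e)  # empty.log is empty
```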
There was a problem hiding this comment.
Why do we need all these files? You should just need the modeling_esm file right?
There was a problem hiding this comment.
Is LICENSE not necessary? I saw it in some of the other recipes. The other files are needed since the tests for this recipe export/convert the original weights.
Description
This PR makes the ESM2 model compatible with vLLM. Primary issues were a naming incompatibility (vLLM expects a `model.` prefix and ESM2 uses `esm.`) and that `NVEsmModel` defaults to `add_pooling_layer=True` when loading the checkpoint even though it's exported without pooler weights.
Usage
Run `python test_esm2_golden_values.py` from inside the container; instructions to build and run it are provided.
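The embedding pooling this PR exposes (take the last non-padded token's hidden state and L2-normalize it) can be sketched in plain Python; this is a simplified stand-in for the model's tensor code, and the exact pooling details may differ:

```python
import math

def pool_last_token(hidden_states, attention_mask):
    """Take the hidden state of the last non-padded token and
    L2-normalize it. Plain-Python sketch of the pooling idea."""
    last = max(i for i, m in enumerate(attention_mask) if m == 1)
    vec = hidden_states[last]
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]

# One sequence of 4 positions with 2-dim hidden states; final position is padding.
hidden = [[1.0, 0.0], [0.0, 2.0], [3.0, 4.0], [9.0, 9.0]]
mask = [1, 1, 1, 0]
emb = pool_last_token(hidden, mask)
assert emb == [0.6, 0.8]                              # [3, 4] / ||[3, 4]||
assert abs(sum(x * x for x in emb) - 1.0) < 1e-9      # unit L2 norm
```

In the real recipe this step runs inside the vLLM pooling path, so clients simply receive already-normalized embeddings.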
Triggering Code Rabbit AI Review
To trigger a code review from code rabbit, comment on a pull request with one of these commands:
See https://docs.coderabbit.ai/reference/review-commands for a full list of commands.
Pre-submit Checklist
Summary by CodeRabbit
New Features
Bug Fixes
Documentation
Chores