fix(run-engine): sweeper LREMs the worker queue list when acking marked runs by d-cs · Pull Request #3527 · triggerdotdev/trigger.dev

d-cs · 2026-05-05T14:28:17Z

The concurrency sweeper's processMarkedRun calls acknowledgeMessage with removeFromWorkerQueue: false. The Lua script always DELs the message body but only LREMs the worker queue list when the flag is set. Result: any run that was pushed onto the worker queue list (fast-path enqueue or processQueueForWorkerQueue promotion) but not yet BLPOP'd when the sweeper finds it leaves a stale messageKey value sitting on the list. The next worker BLPOP returns the tombstone, GET messageKey returns nil, and the dequeue path logs Failed to dequeue message from worker queue.

The original assumption was that sweeper-acked runs had already been BLPOP'd off the list (entry already gone, nothing to LREM). That holds when the worker BLPOPs and then dies, but not when fast-path enqueue or processQueueForWorkerQueue adds the entry to the list and the run then ages past the sweeper's 10-minute completed-at threshold without ever being popped.

Switch to removeFromWorkerQueue: true. Cost is one extra LREM per swept run, O(N) over the worker queue list length. Sweeper acks are bounded by the cron schedule (every 5 minutes, max 100 marked runs per fire), so the added Redis work is negligible relative to baseline ack workload.

Test

internal-packages/run-engine/src/run-queue/tests/concurrencySweeper.test.ts adds a regression test that fast-path enqueues a message, runs the sweeper, then dequeues. Pre-fix the dequeue path logs the "Failed to dequeue message from worker queue" error; post-fix the worker queue list is left clean and the dequeue returns undefined silently.

changeset-bot · 2026-05-05T14:28:25Z

⚠️ No Changeset found

Latest commit: 0a1199a

Merging this PR will not cause a version bump for any packages. If these changes should not result in a new version, you're good to go. If these changes should result in a version bump, you need to add a changeset.

This PR includes no changesets

When changesets are added to this PR, you'll see the packages that this PR includes changesets for and the associated semver types

Click here to learn what changesets are, and how to add one.

Click here if you're a maintainer who wants to add a changeset to this PR

coderabbitai · 2026-05-05T14:28:48Z

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info

⚙️ Run configuration

Configuration used: Repository UI

Review profile: CHILL

Plan: Pro

Run ID: 5149dd01-6fda-4117-bfc5-726274b0117d

📥 Commits

Reviewing files that changed from the base of the PR and between eacd023 and 0a1199a.

📒 Files selected for processing (1)

internal-packages/run-engine/src/run-queue/tests/concurrencySweeper.test.ts

📜 Recent review details

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (28)

GitHub Check: units / webapp / 🧪 Unit Tests: Webapp (6, 8)
GitHub Check: units / webapp / 🧪 Unit Tests: Webapp (2, 8)
GitHub Check: units / webapp / 🧪 Unit Tests: Webapp (1, 8)
GitHub Check: units / webapp / 🧪 Unit Tests: Webapp (4, 8)
GitHub Check: units / webapp / 🧪 Unit Tests: Webapp (8, 8)
GitHub Check: units / webapp / 🧪 Unit Tests: Webapp (3, 8)
GitHub Check: units / internal / 🧪 Unit Tests: Internal (6, 8)
GitHub Check: units / webapp / 🧪 Unit Tests: Webapp (7, 8)
GitHub Check: units / internal / 🧪 Unit Tests: Internal (5, 8)
GitHub Check: units / webapp / 🧪 Unit Tests: Webapp (5, 8)
GitHub Check: units / internal / 🧪 Unit Tests: Internal (1, 8)
GitHub Check: units / internal / 🧪 Unit Tests: Internal (3, 8)
GitHub Check: units / internal / 🧪 Unit Tests: Internal (7, 8)
GitHub Check: units / internal / 🧪 Unit Tests: Internal (8, 8)
GitHub Check: units / internal / 🧪 Unit Tests: Internal (2, 8)
GitHub Check: units / internal / 🧪 Unit Tests: Internal (4, 8)
GitHub Check: units / packages / 🧪 Unit Tests: Packages (1, 1)
GitHub Check: units / e2e-webapp / 🧪 E2E Tests: Webapp
GitHub Check: typecheck / typecheck
GitHub Check: sdk-compat / Node.js 20.20 (ubuntu-latest)
GitHub Check: e2e / 🧪 CLI v3 tests (windows-latest - npm)
GitHub Check: sdk-compat / Node.js 22.12 (ubuntu-latest)
GitHub Check: e2e / 🧪 CLI v3 tests (windows-latest - pnpm)
GitHub Check: sdk-compat / Deno Runtime
GitHub Check: e2e / 🧪 CLI v3 tests (ubuntu-latest - npm)
GitHub Check: sdk-compat / Cloudflare Workers
GitHub Check: sdk-compat / Bun Runtime
GitHub Check: e2e / 🧪 CLI v3 tests (ubuntu-latest - pnpm)

🧰 Additional context used

📓 Path-based instructions (10)

**/*.{ts,tsx}

📄 CodeRabbit inference engine (.github/copilot-instructions.md)

**/*.{ts,tsx}: Use types over interfaces for TypeScript
Avoid using enums; prefer string unions or const objects instead

Files:

internal-packages/run-engine/src/run-queue/tests/concurrencySweeper.test.ts

**/*.{ts,tsx,js,jsx}

📄 CodeRabbit inference engine (.github/copilot-instructions.md)

Use function declarations instead of default exports

Files:

internal-packages/run-engine/src/run-queue/tests/concurrencySweeper.test.ts

**/*.{test,spec}.{ts,tsx}

📄 CodeRabbit inference engine (.github/copilot-instructions.md)

Use vitest for all tests in the Trigger.dev repository

Files:

internal-packages/run-engine/src/run-queue/tests/concurrencySweeper.test.ts

**/*.ts

📄 CodeRabbit inference engine (.cursor/rules/otel-metrics.mdc)

**/*.ts: When creating or editing OTEL metrics (counters, histograms, gauges), ensure metric attributes have low cardinality by using only enums, booleans, bounded error codes, or bounded shard IDs
Do not use high-cardinality attributes in OTEL metrics such as UUIDs/IDs (envId, userId, runId, projectId, organizationId), unbounded integers (itemCount, batchSize, retryCount), timestamps (createdAt, startTime), or free-form strings (errorMessage, taskName, queueName)
When exporting OTEL metrics via OTLP to Prometheus, be aware that the exporter automatically adds unit suffixes to metric names (e.g., 'my_duration_ms' becomes 'my_duration_ms_milliseconds', 'my_counter' becomes 'my_counter_total'). Account for these transformations when writing Grafana dashboards or Prometheus queries

Files:

internal-packages/run-engine/src/run-queue/tests/concurrencySweeper.test.ts

{apps,internal-packages}/**/*.{ts,tsx,js}