[codex] Dampen Airflow heartbeat flakes #339

Merged
redreceipt merged 1 commit into main from codex/dampen-airflow-heartbeat-flakes
May 20, 2026

@redreceipt
Member

Summary

  • dampen Better Stack heartbeat failures for transient Airflow unknown evaluations
  • keep real degraded fleet-health payloads reporting /fail immediately
  • add heartbeat unit coverage and document the new 3-strike unknown behavior
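The behavior in the bullets above can be sketched roughly as follows. This is an illustrative sketch, not the code merged in this PR: the `HeartbeatDampener` class, the `decide` method, the `"healthy"` and `"degraded"` status strings, and the choice to skip the ping while an unknown is being dampened are all assumptions. Only the `"unknown"` status and the 3-strike limit come from the description.

```python
from dataclasses import dataclass

# Assumed from the "3-strike unknown behavior" mentioned in the summary.
UNKNOWN_STRIKE_LIMIT = 3


@dataclass
class HeartbeatDampener:
    """Hypothetical helper that counts consecutive "unknown" evaluations."""

    unknown_strikes: int = 0

    def decide(self, status: str) -> str:
        """Return the heartbeat action to take: "ok", "fail", or "skip"."""
        if status == "unknown":
            self.unknown_strikes += 1
            # Only report /fail once unknowns have persisted for the limit.
            if self.unknown_strikes >= UNKNOWN_STRIKE_LIMIT:
                return "fail"
            return "skip"  # assumption: a dampened unknown sends no ping

        # Any definite evaluation ends the unknown streak.
        self.unknown_strikes = 0
        if status == "healthy":
            return "ok"
        # Degraded (or any other definite non-healthy status) fails immediately.
        return "fail"
```

With this shape, two transient unknowns followed by a healthy run never reach /fail, while a degraded payload still fails on its first occurrence.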

Root Cause

The worker converts any exception raised during Airflow API evaluation into {"status":"unknown"} with HTTP 503. A single RemoteDisconnected from the Astronomer /api/v2/dags endpoint was therefore enough to trigger the Better Stack heartbeat /fail, opening one-minute incidents that resolved on the next healthy worker run.
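For illustration, the conversion described above might look something like the sketch below. The `evaluate_safely` function and its callable argument are hypothetical names, and the 200 status on success is an assumption; only the `{"status": "unknown"}` payload, the 503 status, and the `RemoteDisconnected` exception are taken from the description.

```python
import http.client
from typing import Any, Callable


def evaluate_safely(
    evaluate: Callable[[], dict[str, Any]],
) -> tuple[int, dict[str, Any]]:
    """Run a fleet-health evaluation and map any exception to "unknown".

    Hypothetical wrapper: returns (http_status, payload).
    """
    try:
        return 200, evaluate()  # assumption: successful evaluations return 200
    except Exception:
        # A transient transport error such as http.client.RemoteDisconnected
        # lands here and is indistinguishable from any other failure.
        return 503, {"status": "unknown"}
```

Because every exception collapses into the same payload, a consumer that treats each 503 as a failure reacts to a single dropped connection, which is the flake this PR dampens.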

Proof

  • source .venv-ci/bin/activate && ruff check .
  • source .venv-ci/bin/activate && mypy .
  • source .venv-ci/bin/activate && python -m unittest discover -s tests -p 'test_*.py'

@redreceipt redreceipt temporarily deployed to bug-board-codex-dampen--5vg7t5 May 20, 2026 18:27 Inactive
@redreceipt redreceipt marked this pull request as ready for review May 20, 2026 18:28
@redreceipt redreceipt merged commit bed0add into main May 20, 2026
5 checks passed
@redreceipt redreceipt deleted the codex/dampen-airflow-heartbeat-flakes branch May 20, 2026 18:32