Skip to content

Add s3-graphs-zip workflow for zipping CE graphs#999

Draft
rsaksida wants to merge 2 commits intomasterfrom
feature/s3-graphs-zip
Draft

Add s3-graphs-zip workflow for zipping CE graphs#999
rsaksida wants to merge 2 commits intomasterfrom
feature/s3-graphs-zip

Conversation

@rsaksida
Copy link
Member

@rsaksida rsaksida commented Mar 6, 2026

  • Add s3-graphs-zip workflow for archiving CE graphs
  • Hook up legacy CER API to Argo workflows (WIP)

rsaksida added 2 commits March 6, 2026 16:58
- Add script to package CE graph JSON files into zip files in S3
  - Streams zip files directly to S3 with multipart upload support
  - Calls preconfigured webhook for notifications
- Add Dockerfile for Argo / container orchestration
- Add docker-compose.yml with LocalStack for integration test setup
- Add workflow template
@rohit-joy
Copy link
Contributor

Go ahead and deploy it in Sandbox. Then let @JWaltuch or @mparsons-ce know to test it. Then go to production. :)

resources:
requests:
cpu: "1000m"
memory: "2Gi"
Copy link
Contributor

@rohit-joy rohit-joy Mar 9, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

With the streaming ZIP operation, why does it need 2GB minimum memory minimum? I would think, at most the memory consumption would be the size of the ZIP metadata header. That is about 50MB for 500K files. So it should not be 2GB minimum. Reason this is concerning is because we are allocating 2GB up front, that means Kubernetes is going to assume that it needs to auto-scale more VMs to server other apps. I recommend reducing this number to 200MB or lower to see where it breaks. Then increase it by 100MB increments until it doesn't break.

Also, add a # comment here indicating the reason for the low memory allocation.

Same question for the CPU. 1000m is probably too high for this workflow.

Note that these workflows are very light weight. So we should keep to the minimum required memory allocations to reduce costs as we start to run unrelated workflows in parallel.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants