Skip to content

Use SLURM singleton dependency for GPU jobs to prevent conflicts#530

Merged
JRPan merged 1 commit intoaccel-sim:devfrom
purdue-aalp:dev-ci
Jan 28, 2026
Merged

Use SLURM singleton dependency for GPU jobs to prevent conflicts#530
JRPan merged 1 commit intoaccel-sim:devfrom
purdue-aalp:dev-ci

Conversation

@JRPan
Copy link
Collaborator

@JRPan JRPan commented Jan 25, 2026

Summary

  • Add srun --job-name=gpu-lock --dependency=singleton --partition=tgrogers-dgx -- wrapper to GPU tracer jobs in main.yml and weekly.yml
  • This ensures only one GPU job runs at a time across all CI runners, preventing GPU resource conflicts
  • Change Tracer-Tool and Tracer-Weekly jobs to run on tgrogers-raid since GPU work is now submitted via SLURM

Test plan

  • Verify CI workflow runs successfully
  • Confirm GPU jobs are properly serialized via SLURM singleton dependency

🤖 Generated with Claude Code

@JRPan JRPan merged commit bf35ab0 into accel-sim:dev Jan 28, 2026
9 of 14 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant