A fault-tolerant distributed job scheduler that delivers priority-based execution, tenant-aware fairness, and resilient checkpointed workloads with leader-elected high availability.
-
Updated
Feb 14, 2026 - Python
A fault-tolerant distributed job scheduler that delivers priority-based execution, tenant-aware fairness, and resilient checkpointed workloads with leader-elected high availability.
Add a description, image, and links to the auto-retries topic page so that developers can more easily learn about it.
To associate your repository with the auto-retries topic, visit your repo's landing page and select "manage topics."