# Dynamic Node Manager
A Python service that watches Kubernetes namespaces, estimates YuniKorn queue demand (running + pending pods), and dynamically converts Slurm batch nodes into Kubernetes nodes when demand exceeds configured queue headroom or when bin-pack deadlocks are detected. When nodes go idle long enough, it reverts them back to Slurm and reduces YuniKorn queue capacity accordingly.
This tool is designed for environments where:
- Slurm/batch nodes can be switched into a Kubernetes pool via an external `node-convert` script.
- YuniKorn is used for queue management and its capacities are controlled through a `queues.yaml` stored in a ConfigMap.
You configure a set of `monitored_namespaces`. For each namespace, you also configure a corresponding YuniKorn `queue_path`.
For each namespace’s queue path, it:
- Lists pods in that namespace.
- Filters pods to those assigned to the queue path (by annotation/label).
- Sums effective CPU/memory for:
- Running pods (counted as “used”)
- Pending pods (counted as “pending demand”)
- Applies default requests for Pending pods that declare no resources (so “invisible” pending work still triggers scaling).
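A minimal sketch of that per-namespace estimate, assuming the `kubernetes` Python client and the default queue annotation key; the helper below is illustrative, and the default-request fallback for requestless Pending pods is shown separately further down.

```python
from kubernetes import client
from kubernetes.utils.quantity import parse_quantity

QUEUE_KEY = "yunikorn.apache.org/queue"  # configurable via yunikorn_queue_annotation

def estimate_queue_demand(v1: client.CoreV1Api, namespace: str, queue_path: str):
    """Return (used_cpu_m, used_mem, pend_cpu_m, pend_mem) for one queue path."""
    used_cpu_m = used_mem = pend_cpu_m = pend_mem = 0
    for pod in v1.list_namespaced_pod(namespace).items:
        meta = pod.metadata
        assigned = (meta.annotations or {}).get(QUEUE_KEY) or (meta.labels or {}).get(QUEUE_KEY)
        if assigned != queue_path:
            continue
        cpu_m = mem = 0
        for c in pod.spec.containers:
            req = (c.resources.requests or {}) if c.resources else {}
            cpu_m += int(parse_quantity(req.get("cpu", "0")) * 1000)   # milliCPU
            mem += int(parse_quantity(req.get("memory", "0")))         # bytes
        if pod.status.phase == "Running":
            used_cpu_m, used_mem = used_cpu_m + cpu_m, used_mem + mem
        elif pod.status.phase == "Pending":
            pend_cpu_m, pend_mem = pend_cpu_m + cpu_m, pend_mem + mem
    return used_cpu_m, used_mem, pend_cpu_m, pend_mem
```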
Conversion is triggered when any of these is true:
- Total demand (running + pending) exceeds queue headroom for CPU or memory.
- Pending demand exceeds available capacity (after subtracting running usage).
- A “bin-pack blocked” condition is detected (see below).
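A minimal sketch of that decision, assuming the queue limits and the demand totals above are already computed (milliCPU/bytes); names are illustrative, and in this simplified form the first two checks coincide even though the real tool may compare against different baselines.

```python
def needs_conversion(max_cpu_m, max_mem,          # queue limits from queues.yaml
                     used_cpu_m, used_mem,        # running usage
                     pend_cpu_m, pend_mem,        # pending demand
                     binpack_blocked=False) -> bool:
    # 1) Total demand (running + pending) exceeds the queue headroom.
    if used_cpu_m + pend_cpu_m > max_cpu_m or used_mem + pend_mem > max_mem:
        return True
    # 2) Pending demand exceeds what is left after running usage.
    if pend_cpu_m > max_cpu_m - used_cpu_m or pend_mem > max_mem - used_mem:
        return True
    # 3) A bin-pack blocked condition was detected (see below).
    return binpack_blocked
```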
When scaling out is required, the actuator:
- Runs `/usr/site/rcac/sbin/node-convert --set k8s ... --namespace <ns>`
- Waits for new node(s) to appear in the Kubernetes API
- Records converted nodes in a local JSON file
- Increases YuniKorn queue capacity by `(nodes_added * node_cpu_capacity, nodes_added * node_memory_capacity)`, only after the nodes actually join the cluster
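A rough sketch of that sequence, assuming shell access to the converter; the extra `node-convert` arguments elided above (`...`) are site-specific and omitted here, and the registry/YuniKorn updates are left to the caller.

```python
import subprocess
import time

from kubernetes import client

NODE_CONVERT = "/usr/site/rcac/sbin/node-convert"

def convert_node_to_k8s(v1: client.CoreV1Api, namespace: str, timeout_s: int = 600):
    """Run the converter for one node and wait for it to join the cluster."""
    before = {n.metadata.name for n in v1.list_node().items}
    subprocess.run([NODE_CONVERT, "--set", "k8s", "--namespace", namespace], check=True)
    deadline = time.time() + timeout_s
    while time.time() < deadline:
        new_nodes = {n.metadata.name for n in v1.list_node().items} - before
        if new_nodes:
            # Caller records these in the converted-node registry and only then
            # bumps the YuniKorn queue by nodes_added * node_{cpu,memory}_capacity.
            return sorted(new_nodes)
        time.sleep(10)
    raise TimeoutError("converted node did not appear in the Kubernetes API")
```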
Periodically checks converted nodes:
- If a converted node has no active pods in its namespace, and it has been idle longer than `reversion_idle_seconds`, the tool:
  - runs `/usr/site/rcac/sbin/node-convert --set batch --node-name <node>`
  - waits for the node to disappear from the K8s API
  - decreases YuniKorn queue capacity accordingly
  - removes it from the converted node registry
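A rough sketch of the idle check and revert, assuming the caller tracks an idle-since timestamp per converted node; the function name and structure are illustrative.

```python
import subprocess
import time

from kubernetes import client

def maybe_revert(v1: client.CoreV1Api, node_name: str, namespace: str,
                 idle_since: float, reversion_idle_seconds: int) -> bool:
    """Revert one converted node to batch if it has been idle long enough."""
    pods = v1.list_namespaced_pod(
        namespace, field_selector=f"spec.nodeName={node_name}").items
    active = [p for p in pods if p.status.phase in ("Running", "Pending")]
    if active or time.time() - idle_since < reversion_idle_seconds:
        return False
    subprocess.run(["/usr/site/rcac/sbin/node-convert",
                    "--set", "batch", "--node-name", node_name], check=True)
    # Caller then waits for the node to leave the API, shrinks the YuniKorn
    # queue, and drops the node from the converted-node registry.
    return True
```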
This tool treats YuniKorn as the source of truth for “how much capacity a queue is allowed to consume”.
It:
- Reads `queues.yaml` (ConfigMap: `yunikorn-configs` by default)
- Locates the queue by `queue_path` (a dot-separated path like `root.platform.services.dev`)
- Adjusts:
  - `resources.max.{vcore,memory}`
  - `resources.guaranteed.{vcore,memory}`
- Ensures `guaranteed <= max` after changes
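A rough sketch of that ConfigMap edit, assuming the standard YuniKorn `partitions`/`queues` layout inside `queues.yaml`; unit handling and the `guaranteed <= max` clamp are simplified here.

```python
import yaml
from kubernetes import client
from kubernetes.utils.quantity import parse_quantity

def bump_queue_capacity(v1: client.CoreV1Api, queue_path: str,
                        add_vcore: int, add_memory: str = "256G",
                        cm_name: str = "yunikorn-configs", cm_ns: str = "yunikorn"):
    cm = v1.read_namespaced_config_map(cm_name, cm_ns)
    conf = yaml.safe_load(cm.data["queues.yaml"])
    # Walk the dot-separated path, e.g. root.platform.services.dev.
    parts = queue_path.split(".")
    node = next(q for q in conf["partitions"][0]["queues"] if q["name"] == parts[0])
    for name in parts[1:]:
        node = next(q for q in node["queues"] if q["name"] == name)
    res = node.setdefault("resources", {})
    for section in ("max", "guaranteed"):
        sec = res.setdefault(section, {})
        sec["vcore"] = int(sec.get("vcore", 0)) + add_vcore
        new_mem = parse_quantity(str(sec.get("memory", "0"))) + parse_quantity(add_memory)
        sec["memory"] = str(int(new_mem))  # real tool keeps units consistent and guaranteed <= max
    cm.data["queues.yaml"] = yaml.safe_dump(conf)
    v1.patch_namespaced_config_map(cm_name, cm_ns, cm)
```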
If a pod is Pending and has no declared requests/limits, this tool assigns defaults:
- CPU: `default_cpu_request_m` (milliCPU)
- Memory: `default_mem_request` (parsed via `kubernetes.utils.quantity.parse_quantity`)
This is critical to avoid false “no demand” conditions.
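A minimal sketch of that fallback; the helper is illustrative, but the config keys match the ones listed below.

```python
from kubernetes.utils.quantity import parse_quantity

def effective_request(declared_cpu_m: int, declared_mem_bytes: int,
                      default_cpu_request_m: int = 1000,
                      default_mem_request: str = "2Gi"):
    """Substitute configured defaults when a Pending pod declares no resources."""
    cpu_m = declared_cpu_m or default_cpu_request_m
    mem = declared_mem_bytes or int(parse_quantity(default_mem_request))
    return cpu_m, mem
```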
A common failure mode: you have available queue headroom on paper, but the cluster cannot schedule the next pending pod because it needs a fresh node (fragmentation/binpack).
This tool estimates:
- `max_pend_cpu`, `max_pend_mem`: largest pending pod “shape”
- `pods_per_node`: how many of that shape fit on a node
- `running_pods_est`: rough estimate of how many pods are already consuming the queue
If it detects a “perfectly filled” pattern with pending work waiting, it triggers conversion for at least one node.
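A minimal sketch of one such heuristic, assuming per-node capacity and the largest pending pod shape are known; the exact rule in the tool may differ.

```python
def binpack_blocked(max_pend_cpu_m, max_pend_mem,
                    node_cpu_m, node_mem,
                    running_pods_est, pending_pod_count) -> bool:
    """Detect a 'perfectly filled' pattern that leaves no room for pending work."""
    if not pending_pod_count or max_pend_cpu_m <= 0 or max_pend_mem <= 0:
        return False
    # How many pods of the largest pending shape fit on one node.
    pods_per_node = int(min(node_cpu_m // max_pend_cpu_m, node_mem // max_pend_mem))
    if pods_per_node == 0:
        return True  # the largest pending pod needs a fresh (empty) node
    # Running pods pack exactly into whole nodes, leaving no slack for the
    # next pending pod, so at least one node conversion is triggered.
    return running_pods_est > 0 and running_pods_est % pods_per_node == 0
```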
The service runs two daemon threads:
Evaluate loop, runs every `check_interval_seconds`:
- `trigger()`: computes needs per namespace and enqueues conversion tasks into a global queue.
- `check_converted_nodes()`: finds idle converted nodes and reverts them.
Actuator loop, runs continuously (see the sketch after this list):
- Checks global conversion cap
- Enforces cooldown
- Pops tasks from the global queue
- Performs node conversion (1 node per task)
- Updates YuniKorn capacity after nodes join
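A minimal sketch of this thread layout; `trigger()` and `check_converted_nodes()` are the functions named above, while `handle_task()` stands in for the actuator body (cap, cooldown, conversion, capacity bump).

```python
import queue
import threading
import time

conversion_queue: "queue.Queue[dict]" = queue.Queue()

def evaluate_loop(trigger, check_converted_nodes, check_interval_seconds: int):
    while True:
        trigger()                 # compute needs and enqueue conversion tasks
        check_converted_nodes()   # revert idle converted nodes
        time.sleep(check_interval_seconds)

def actuator_loop(handle_task):
    while True:
        task = conversion_queue.get()   # blocks until a conversion task arrives
        handle_task(task)               # cap + cooldown checks, convert one node, bump queue

def start_monitor(trigger, check_converted_nodes, handle_task, interval: int = 60):
    threading.Thread(target=evaluate_loop,
                     args=(trigger, check_converted_nodes, interval),
                     daemon=True).start()
    threading.Thread(target=actuator_loop, args=(handle_task,), daemon=True).start()
```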
Dependencies:
- `kubernetes` Python client
- `pyyaml`

Files:
- `/etc/dynamic-node/dynamic_node_config.ini`: main configuration.
- `/etc/dynamic-node/namespace_queue_paths.json`: map of namespace → YuniKorn queue path.
- `/usr/site/rcac/sbin/node-convert`: external script used to convert nodes between Slurm batch and Kubernetes.
Expected keys (section [settings]):
- `kubeconfig_path` (optional): if set, loads kubeconfig from this path; otherwise uses default kubeconfig loading behavior.
- `monitored_namespaces`: comma-separated namespaces to watch.
- `reversion_idle_seconds`: how long a converted node must be idle (no active pods) before revert.
- `check_interval_seconds`: evaluate loop interval.
- `max_converted_nodes`: global cap on how many nodes may be converted at any time.
- `yunikorn_cm_namespace` (default: `yunikorn`): namespace containing the YuniKorn ConfigMap.
- `yunikorn_cm_name` (default: `yunikorn-configs`): name of the YuniKorn ConfigMap containing `queues.yaml`.
- `yunikorn_queue_annotation` (default: `yunikorn.apache.org/queue`): key used to identify queue assignment on pods (annotation or label).
- `node_cpu_capacity` (default: `128`): vcores to add/subtract per converted node.
- `node_memory_capacity` (default: `256G`): memory to add/subtract per converted node.
- `global_cooldown_seconds` (default: `30`): cooldown between conversions (note: present but not currently wired to advance `next_conversion_allowed_at`).
- `default_cpu_request_m` (default: `1000`): used only when pending pods have no resource requests/limits.
- `default_mem_request` (default: `2Gi`): used only when pending pods have no resource requests/limits.
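An illustrative `/etc/dynamic-node/dynamic_node_config.ini`, using the documented defaults where they exist and made-up values elsewhere:

```ini
[settings]
kubeconfig_path = /etc/dynamic-node/kubeconfig
monitored_namespaces = workloads-dev,workloads-prod
reversion_idle_seconds = 900
check_interval_seconds = 60
max_converted_nodes = 4
yunikorn_cm_namespace = yunikorn
yunikorn_cm_name = yunikorn-configs
yunikorn_queue_annotation = yunikorn.apache.org/queue
node_cpu_capacity = 128
node_memory_capacity = 256G
global_cooldown_seconds = 30
default_cpu_request_m = 1000
default_mem_request = 2Gi
```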
A JSON object mapping each monitored namespace to a YuniKorn queue path:
```json
{
  "workloads-dev": "root.platform.services.dev",
  "workloads-prod": "root.platform.services.prod"
}
```

This service supports two modes:
- monitor: run the continuous evaluate + actuator loops (normal operation)
- test: create a synthetic load Deployment in a namespace/queue (validation)
- A valid kubeconfig accessible to the process (either:
  - set `kubeconfig_path` in `/etc/dynamic-node/dynamic_node_config.ini`, or
  - rely on default kubeconfig loading via `KUBECONFIG` / `~/.kube/config`)
- `/etc/dynamic-node/dynamic_node_config.ini` is present and readable
- `/etc/dynamic-node/namespace_queue_paths.json` is present and contains queue mappings for all monitored namespaces
- The process identity can:
  - list nodes and pods
  - read/patch the YuniKorn ConfigMap containing `queues.yaml`
  - (test mode) create/patch/delete Deployments in the target namespace
- The external converter exists and is executable: `/usr/site/rcac/sbin/node-convert`
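As a rough guide (an assumption, not something shipped with the tool), the permissions above map to RBAC rules along these lines:

```yaml
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRole
metadata:
  name: dynamic-node-manager
rules:
  - apiGroups: [""]
    resources: ["nodes", "pods"]
    verbs: ["get", "list", "watch"]
  - apiGroups: [""]
    resources: ["configmaps"]          # the YuniKorn queues.yaml ConfigMap
    verbs: ["get", "patch"]
  - apiGroups: ["apps"]
    resources: ["deployments"]         # test mode only
    verbs: ["get", "list", "create", "patch", "delete"]
```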
Monitor mode starts two background threads:
- Evaluate loop: periodically computes demand and enqueues conversions; checks for idle nodes to revert
- Actuator loop: drains the conversion queue and performs `node-convert` actions

```bash
python3 dynamic-node-manager.py --mode monitor
```

Test mode creates a Deployment that schedules pods annotated to a specific YuniKorn queue. Each pod runs a long sleep to hold resources.
```bash
python3 dynamic-node-manager.py \
  --mode test \
  --namespace workloads-dev \
  --queue root.platform.services.dev \
  --replicas 200
```

- Verify the manager is logging demand decisions and conversion actions via syslog.
- Confirm YuniKorn `queues.yaml` is being updated (capacity changes show as MAX/GUAR bumps in logs).
- Confirm nodes join/leave the Kubernetes API after `node-convert` runs.
- Confirm `/etc/dynamic-node/converted_nodes.json` is being updated as nodes are converted and reverted.