-
Notifications
You must be signed in to change notification settings - Fork 8
Open
Description
For workloads such as AI inference, we want to dynamically scale GPUs according to the load.
The current operator has the ability to attach multiple GPUs to a node when there are no GPU present, or to detach all GPUs at once when there are multiple GPUs attached to a node.
Therefore, I propose a feature that allows you to increase or decrease GPUs one by one.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels
