Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion helm_chart/HyperPodHelmChart/Chart.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -81,7 +81,7 @@ dependencies:
repository: "file://charts/team-role-and-bindings"
condition: team-role-and-bindings.enabled
- name: hyperpod-inference-operator
version: "1.2.0"
version: "1.3.0"
repository: "file://charts/inference-operator"
condition: inferenceOperators.enabled
- name: hyperpod-patching
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -15,11 +15,11 @@ type: application
# This is the chart version. This version number should be incremented each time you make changes
# to the chart and its templates, including the app version.
# Versions are expected to follow Semantic Versioning (https://semver.org/)
version: 1.2.0
version: 1.3.0

# This is the version number of the application being deployed. Keep this aligned
# with operator image MAJOR.MINOR version.
appVersion: "2.2"
appVersion: "2.3"

dependencies:
- name: aws-mountpoint-s3-csi-driver
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -351,9 +351,19 @@ spec:
pattern: ^$|^[a-zA-Z0-9](-*[a-zA-Z0-9]){0,62}$
type: string
instanceType:
description: Instance Type to deploy the model on
description: |-
Single instance type to deploy the model on.
This field is mutually exclusive with instanceTypes.
Use this when you want to deploy on a specific instance type.
pattern: ^ml\..*
type: string
instanceTypes:
description: |-
List of instance types to deploy the model on, in order of preference.
Instance types are selected based on the order specified, selecting the first available type.
items:
type: string
type: array
intelligentRoutingSpec:
description: |-
Configuration for intelligent routing
Expand Down Expand Up @@ -863,8 +873,9 @@ spec:
in a Container.
properties:
name:
description: Name of the environment variable. Must be a
C_IDENTIFIER.
description: |-
Name of the environment variable.
May consist of any printable ASCII characters except '='.
type: string
value:
description: |-
Expand Down Expand Up @@ -922,6 +933,43 @@ spec:
- fieldPath
type: object
x-kubernetes-map-type: atomic
fileKeyRef:
description: |-
FileKeyRef selects a key of the env file.
Requires the EnvFiles feature gate to be enabled.
properties:
key:
description: |-
The key within the env file. An invalid key will prevent the pod from starting.
The keys defined within a source may consist of any printable ASCII characters except '='.
During Alpha stage of the EnvFiles feature gate, the key size is limited to 128 characters.
type: string
optional:
default: false
description: |-
Specify whether the file or its key must be defined. If the file or key
does not exist, then the env var is not published.
If optional is set to true and the specified key does not exist,
the environment variable will not be set in the Pod's containers.

If optional is set to false and the specified key does not exist,
an error will be returned during Pod creation.
type: boolean
path:
description: |-
The path within the volume from which to select the file.
Must be relative and may not contain the '..' path or start with '..'.
type: string
volumeName:
description: The name of the volume mount containing
the env file.
type: string
required:
- key
- path
- volumeName
type: object
x-kubernetes-map-type: atomic
resourceFieldRef:
description: |-
Selects a resource of the container: only resources limits and requests
Expand Down Expand Up @@ -1495,7 +1543,7 @@ spec:
Claims lists the names of resources, defined in spec.resourceClaims,
that are used by this container.

This is an alpha field and requires enabling the
This field depends on the
DynamicResourceAllocation feature gate.

This field is immutable. It can only be set for containers.
Expand Down Expand Up @@ -1556,7 +1604,6 @@ spec:
- resources
type: object
required:
- instanceType
- modelName
- modelSourceConfig
- worker
Expand Down Expand Up @@ -1647,8 +1694,8 @@ spec:
description: Status of the Deployment Object
properties:
availableReplicas:
description: Total number of available pods (ready for at
least minReadySeconds) targeted by this deployment.
description: Total number of available non-terminating pods
(ready for at least minReadySeconds) targeted by this deployment.
format: int32
type: integer
collisionCount:
Expand Down Expand Up @@ -1701,15 +1748,23 @@ spec:
format: int64
type: integer
readyReplicas:
description: readyReplicas is the number of pods targeted
description: Total number of non-terminating pods targeted
by this Deployment with a Ready Condition.
format: int32
type: integer
replicas:
description: Total number of non-terminated pods targeted
description: Total number of non-terminating pods targeted
by this deployment (their labels match the selector).
format: int32
type: integer
terminatingReplicas:
description: |-
Total number of terminating pods targeted by this deployment. Terminating pods have a non-null
.metadata.deletionTimestamp and have not yet reached the Failed or Succeeded .status.phase.

This is an alpha field. Enable DeploymentReplicaSetTerminatingReplicas to be able to use this field.
format: int32
type: integer
unavailableReplicas:
description: |-
Total number of unavailable pods targeted by this deployment. This is the total number of
Expand All @@ -1718,7 +1773,7 @@ spec:
format: int32
type: integer
updatedReplicas:
description: Total number of non-terminated pods targeted
description: Total number of non-terminating pods targeted
by this deployment that have the desired template spec.
format: int32
type: integer
Expand Down Expand Up @@ -2216,8 +2271,9 @@ spec:
in a Container.
properties:
name:
description: Name of the environment variable. Must be a
C_IDENTIFIER.
description: |-
Name of the environment variable.
May consist of any printable ASCII characters except '='.
type: string
value:
description: |-
Expand Down Expand Up @@ -2275,6 +2331,43 @@ spec:
- fieldPath
type: object
x-kubernetes-map-type: atomic
fileKeyRef:
description: |-
FileKeyRef selects a key of the env file.
Requires the EnvFiles feature gate to be enabled.
properties:
key:
description: |-
The key within the env file. An invalid key will prevent the pod from starting.
The keys defined within a source may consist of any printable ASCII characters except '='.
During Alpha stage of the EnvFiles feature gate, the key size is limited to 128 characters.
type: string
optional:
default: false
description: |-
Specify whether the file or its key must be defined. If the file or key
does not exist, then the env var is not published.
If optional is set to true and the specified key does not exist,
the environment variable will not be set in the Pod's containers.

If optional is set to false and the specified key does not exist,
an error will be returned during Pod creation.
type: boolean
path:
description: |-
The path within the volume from which to select the file.
Must be relative and may not contain the '..' path or start with '..'.
type: string
volumeName:
description: The name of the volume mount containing
the env file.
type: string
required:
- key
- path
- volumeName
type: object
x-kubernetes-map-type: atomic
resourceFieldRef:
description: |-
Selects a resource of the container: only resources limits and requests
Expand Down Expand Up @@ -2380,7 +2473,7 @@ spec:
Claims lists the names of resources, defined in spec.resourceClaims,
that are used by this container.

This is an alpha field and requires enabling the
This field depends on the
DynamicResourceAllocation feature gate.

This field is immutable. It can only be set for containers.
Expand Down Expand Up @@ -2529,8 +2622,8 @@ spec:
description: Status of the Deployment Object
properties:
availableReplicas:
description: Total number of available pods (ready for at
least minReadySeconds) targeted by this deployment.
description: Total number of available non-terminating pods
(ready for at least minReadySeconds) targeted by this deployment.
format: int32
type: integer
collisionCount:
Expand Down Expand Up @@ -2583,15 +2676,23 @@ spec:
format: int64
type: integer
readyReplicas:
description: readyReplicas is the number of pods targeted
description: Total number of non-terminating pods targeted
by this Deployment with a Ready Condition.
format: int32
type: integer
replicas:
description: Total number of non-terminated pods targeted
description: Total number of non-terminating pods targeted
by this deployment (their labels match the selector).
format: int32
type: integer
terminatingReplicas:
description: |-
Total number of terminating pods targeted by this deployment. Terminating pods have a non-null
.metadata.deletionTimestamp and have not yet reached the Failed or Succeeded .status.phase.

This is an alpha field. Enable DeploymentReplicaSetTerminatingReplicas to be able to use this field.
format: int32
type: integer
unavailableReplicas:
description: |-
Total number of unavailable pods targeted by this deployment. This is the total number of
Expand All @@ -2600,7 +2701,7 @@ spec:
format: int32
type: integer
updatedReplicas:
description: Total number of non-terminated pods targeted
description: Total number of non-terminating pods targeted
by this deployment that have the desired template spec.
format: int32
type: integer
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -944,8 +944,8 @@ spec:
description: Status of the Deployment Object
properties:
availableReplicas:
description: Total number of available pods (ready for at
least minReadySeconds) targeted by this deployment.
description: Total number of available non-terminating pods
(ready for at least minReadySeconds) targeted by this deployment.
format: int32
type: integer
collisionCount:
Expand Down Expand Up @@ -998,15 +998,23 @@ spec:
format: int64
type: integer
readyReplicas:
description: readyReplicas is the number of pods targeted
description: Total number of non-terminating pods targeted
by this Deployment with a Ready Condition.
format: int32
type: integer
replicas:
description: Total number of non-terminated pods targeted
description: Total number of non-terminating pods targeted
by this deployment (their labels match the selector).
format: int32
type: integer
terminatingReplicas:
description: |-
Total number of terminating pods targeted by this deployment. Terminating pods have a non-null
.metadata.deletionTimestamp and have not yet reached the Failed or Succeeded .status.phase.

This is an alpha field. Enable DeploymentReplicaSetTerminatingReplicas to be able to use this field.
format: int32
type: integer
unavailableReplicas:
description: |-
Total number of unavailable pods targeted by this deployment. This is the total number of
Expand All @@ -1015,7 +1023,7 @@ spec:
format: int32
type: integer
updatedReplicas:
description: Total number of non-terminated pods targeted
description: Total number of non-terminating pods targeted
by this deployment that have the desired template spec.
format: int32
type: integer
Expand Down Expand Up @@ -1598,8 +1606,8 @@ spec:
description: Status of the Deployment Object
properties:
availableReplicas:
description: Total number of available pods (ready for at
least minReadySeconds) targeted by this deployment.
description: Total number of available non-terminating pods
(ready for at least minReadySeconds) targeted by this deployment.
format: int32
type: integer
collisionCount:
Expand Down Expand Up @@ -1652,15 +1660,23 @@ spec:
format: int64
type: integer
readyReplicas:
description: readyReplicas is the number of pods targeted
description: Total number of non-terminating pods targeted
by this Deployment with a Ready Condition.
format: int32
type: integer
replicas:
description: Total number of non-terminated pods targeted
description: Total number of non-terminating pods targeted
by this deployment (their labels match the selector).
format: int32
type: integer
terminatingReplicas:
description: |-
Total number of terminating pods targeted by this deployment. Terminating pods have a non-null
.metadata.deletionTimestamp and have not yet reached the Failed or Succeeded .status.phase.

This is an alpha field. Enable DeploymentReplicaSetTerminatingReplicas to be able to use this field.
format: int32
type: integer
unavailableReplicas:
description: |-
Total number of unavailable pods targeted by this deployment. This is the total number of
Expand All @@ -1669,7 +1685,7 @@ spec:
format: int32
type: integer
updatedReplicas:
description: Total number of non-terminated pods targeted
description: Total number of non-terminating pods targeted
by this deployment that have the desired template spec.
format: int32
type: integer
Expand Down
Loading