View Source GoogleApi.AIPlatform.V1.Model.GoogleCloudAiplatformV1ResourcePoolAutoscalingSpec (google_api_ai_platform v0.13.0)
The min/max number of replicas allowed if enabling autoscaling
Attributes
-
maxReplicaCount
(type:String.t
, default:nil
) - Optional. max replicas in the node pool, must be ≥ replica_count and > min_replica_count or will throw error -
minReplicaCount
(type:String.t
, default:nil
) - Optional. min replicas in the node pool, must be ≤ replica_count and < max_replica_count or will throw error. For autoscaling enabled Ray-on-Vertex, we allow min_replica_count of a resource_pool to be 0 to match the OSS Ray behavior(https://docs.ray.io/en/latest/cluster/vms/user-guides/configuring-autoscaling.html#cluster-config-parameters). As for Persistent Resource, the min_replica_count must be > 0, we added a corresponding validation inside CreatePersistentResourceRequestValidator.java.
Summary
Functions
Unwrap a decoded JSON object into its complex fields.
Types
Functions
Unwrap a decoded JSON object into its complex fields.