KubernetesOnlineDeployment Class
Definition
Important
Some information relates to prerelease product that may be substantially modified before it’s released. Microsoft makes no warranties, express or implied, with respect to the information provided here.
Properties specific to a KubernetesOnlineDeployment.
[System.ComponentModel.TypeConverter(typeof(Microsoft.Azure.PowerShell.Cmdlets.MachineLearningServices.Models.Api20220501.KubernetesOnlineDeploymentTypeConverter))]
public class KubernetesOnlineDeployment : Microsoft.Azure.PowerShell.Cmdlets.MachineLearningServices.Models.Api20220501.IKubernetesOnlineDeployment, Microsoft.Azure.PowerShell.Cmdlets.MachineLearningServices.Runtime.IValidates
[<System.ComponentModel.TypeConverter(typeof(Microsoft.Azure.PowerShell.Cmdlets.MachineLearningServices.Models.Api20220501.KubernetesOnlineDeploymentTypeConverter))>]
type KubernetesOnlineDeployment = class
interface IKubernetesOnlineDeployment
interface IJsonSerializable
interface IOnlineDeploymentProperties
interface IEndpointDeploymentPropertiesBase
interface IValidates
Public Class KubernetesOnlineDeployment
Implements IKubernetesOnlineDeployment, IValidates
- Inheritance
-
KubernetesOnlineDeployment
- Attributes
- Implements
Constructors
KubernetesOnlineDeployment() |
Creates an new KubernetesOnlineDeployment instance. |
Properties
AppInsightsEnabled |
If true, enables Application Insights logging. |
CodeConfiguration |
Code configuration for the endpoint deployment. |
CodeConfigurationCodeId |
ARM resource ID of the code asset. |
CodeConfigurationScoringScript |
[Required] The script to execute on startup. eg. "score.py" |
ContainerResourceLimitCpu |
Number of vCPUs request/limit for container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/ |
ContainerResourceLimitGpu |
Number of Nvidia GPU cards request/limit for container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/ |
ContainerResourceLimitMemory |
Memory size request/limit for container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/ |
ContainerResourceRequestCpu |
Number of vCPUs request/limit for container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/ |
ContainerResourceRequestGpu |
Number of Nvidia GPU cards request/limit for container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/ |
ContainerResourceRequestMemory |
Memory size request/limit for container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/ |
Description |
Description of the endpoint deployment. |
EndpointComputeType |
[Required] The compute type of the endpoint. |
EnvironmentId |
ARM resource ID or AssetId of the environment specification for the endpoint deployment. |
EnvironmentVariable |
Environment variables configuration for the deployment. |
InstanceType |
Compute instance type. |
LivenessProbe |
Liveness probe monitors the health of the container regularly. |
LivenessProbeFailureThreshold |
The number of failures to allow before returning an unhealthy status. |
LivenessProbeInitialDelay |
The delay before the first probe in ISO 8601 format. |
LivenessProbePeriod |
The length of time between probes in ISO 8601 format. |
LivenessProbeSuccessThreshold |
The number of successful probes before returning a healthy status. |
LivenessProbeTimeout |
The probe timeout in ISO 8601 format. |
Model |
The URI path to the model. |
ModelMountPath |
The path to mount the model in custom container. |
Property |
Property dictionary. Properties can be added, but not removed or altered. |
ProvisioningState |
Provisioning state for the endpoint deployment. |
ReadinessProbe |
Readiness probe validates if the container is ready to serve traffic. The properties and defaults are the same as liveness probe. |
ReadinessProbeFailureThreshold |
The number of failures to allow before returning an unhealthy status. |
ReadinessProbeInitialDelay |
The delay before the first probe in ISO 8601 format. |
ReadinessProbePeriod |
The length of time between probes in ISO 8601 format. |
ReadinessProbeSuccessThreshold |
The number of successful probes before returning a healthy status. |
ReadinessProbeTimeout |
The probe timeout in ISO 8601 format. |
RequestSetting |
Request settings for the deployment. |
RequestSettingMaxConcurrentRequestsPerInstance |
The number of maximum concurrent requests per node allowed per deployment. Defaults to 1. |
RequestSettingMaxQueueWait |
The maximum amount of time a request will stay in the queue in ISO 8601 format. Defaults to 500ms. |
RequestSettingRequestTimeout |
The scoring timeout in ISO 8601 format. Defaults to 5000ms. |
ScaleSetting |
Scale settings for the deployment. If it is null or not provided, it defaults to TargetUtilizationScaleSettings for KubernetesOnlineDeployment and to DefaultScaleSettings for ManagedOnlineDeployment. |
ScaleSettingScaleType |
[Required] Type of deployment scaling algorithm |
Methods
DeserializeFromDictionary(IDictionary) |
Deserializes a IDictionary into an instance of KubernetesOnlineDeployment. |
DeserializeFromPSObject(PSObject) |
Deserializes a PSObject into an instance of KubernetesOnlineDeployment. |
FromJson(JsonNode) |
Deserializes a JsonNode into an instance of Microsoft.Azure.PowerShell.Cmdlets.MachineLearningServices.Models.Api20220501.IKubernetesOnlineDeployment. |
FromJsonString(String) |
Creates a new instance of KubernetesOnlineDeployment, deserializing the content from a json string. |
ToJson(JsonObject, SerializationMode) |
Serializes this instance of KubernetesOnlineDeployment into a JsonNode. |
ToJsonString() |
Serializes this instance to a json string. |
ToString() | |
Validate(IEventListener) |
Validates that this object meets the validation criteria. |