AccelInferenceConfig Class

Model deployment config specific to accel model deployments.

On deployment, the config does one of the following:

  1. Deploy a converted model if it's passed one.

  2. Deploy a previously converted model version of the model passed, if one exists.

  3. Convert the passed model and deploy the converted model.
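The three cases above can be read as a fallback order. The following is a minimal sketch of that order, not the SDK's implementation: `ModelRef`, `resolve_model_to_deploy`, `converted_index`, and the `convert` callback are hypothetical names introduced only for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass(frozen=True)
class ModelRef:
    """Hypothetical stand-in for a registered model (not an SDK type)."""
    model_id: str
    is_converted: bool = False


def resolve_model_to_deploy(
    model: ModelRef,
    converted_index: Dict[str, ModelRef],
    convert: Callable[[ModelRef], ModelRef],
) -> ModelRef:
    """Pick the model to deploy, following the three documented cases."""
    # Case 1: the caller passed an already converted model.
    if model.is_converted:
        return model

    # Case 2: a previously converted version of this model exists.
    existing = converted_index.get(model.model_id)
    if existing is not None:
        return existing

    # Case 3: convert the model now and remember the result.
    converted = convert(model)
    converted_index[model.model_id] = converted
    return converted
```

In this sketch the conversion callback runs at most once per base model, because the second lookup finds the cached converted version.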

Inheritance
builtins.object
AccelInferenceConfig

Constructor

AccelInferenceConfig(description=None, input_tensor=None, output_tensor=None)

Parameters

description
str
default value: None

A description to give this image.

input_tensor
str
default value: None

The name of the input tensor to the model.

  • Only used if the model needs to be converted.

output_tensor
str
default value: None

The name of the output tensor to the model.

  • Only used if the model needs to be converted.
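As a rough illustration of the constructor shape, the sketch below uses a local stand-in class that only mirrors the documented signature, so it runs without the SDK installed. The class name `AccelInferenceConfigSketch`, the description text, and the tensor names are assumptions made for the example, not values taken from this reference.

```python
from typing import Optional


class AccelInferenceConfigSketch:
    """Stand-in that mirrors the documented constructor signature only."""

    def __init__(
        self,
        description: Optional[str] = None,
        input_tensor: Optional[str] = None,
        output_tensor: Optional[str] = None,
    ) -> None:
        self.description = description
        self.input_tensor = input_tensor
        self.output_tensor = output_tensor


# Tensor names are supplied for the case where the model still has to be
# converted. The names used here are hypothetical.
config = AccelInferenceConfigSketch(
    description="Accelerated image classifier",
    input_tensor="Placeholder:0",
    output_tensor="classifier/Softmax:0",
)

# All three parameters default to None, so an empty call is also valid.
default_config = AccelInferenceConfigSketch()
```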

Methods

build_create_payload

Build the creation payload for the Container image. For accelerated inference, the model id identifies the base model, which may be converted or used to select a previously converted model.

build_profile_payload

Build the profiling payload for the Model package.

validate_configuration

Check that the specified configuration values are valid.

Raises a WebserviceException if validation fails.

build_create_payload

Build the creation payload for the Container image. For accelerated inference, the model id identifies the base model, which may be converted or used to select a previously converted model.

build_create_payload(workspace, image_name, model_ids)

Parameters

workspace
Workspace
Required

The workspace object to create the image in

image_name
str
Required

The name of the image

model_ids
list[str]
Required

A list containing a single model id to determine which

Returns

Container image creation payload

build_profile_payload

Build the profiling payload for the Model package.

build_profile_payload(profile_name, input_data)

Parameters

profile_name
str
Required

The name of the profile

input_data
str
Required

The input data for profiling

Returns

Model profile payload

validate_configuration

Check that the specified configuration values are valid.

Raises a WebserviceException if validation fails.

validate_configuration()

Exceptions

WebserviceException
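Because validation signals failure by raising, callers typically wrap the call in a try/except. The sketch below shows that pattern with local stand-ins so it runs without the SDK: the `WebserviceException` class defined here is a placeholder for the SDK exception, `ConfigSketch` and `check` are hypothetical, and the "values must be strings" rule is an assumed check for illustration, since this reference does not list the actual validation rules.

```python
class WebserviceException(Exception):
    """Local placeholder for the SDK exception of the same name."""


class ConfigSketch:
    """Hypothetical config with the documented constructor parameters."""

    def __init__(self, description=None, input_tensor=None, output_tensor=None):
        self.description = description
        self.input_tensor = input_tensor
        self.output_tensor = output_tensor

    def validate_configuration(self):
        # Assumed rule for illustration only: each value, when given,
        # must be a string (the documented parameter type).
        for name in ("description", "input_tensor", "output_tensor"):
            value = getattr(self, name)
            if value is not None and not isinstance(value, str):
                raise WebserviceException(
                    f"{name} must be a str, got {type(value).__name__}"
                )


def check(config):
    """Return 'ok' or a readable message instead of letting the error escape."""
    try:
        config.validate_configuration()
    except WebserviceException as exc:
        return f"invalid configuration: {exc}"
    return "ok"
```

Validating before building a payload keeps configuration mistakes separate from failures that happen later in deployment.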