AccelInferenceConfig Class

Reference

Model deployment config specific to accel model deployments.

Will either:

Deploy a converted model if it's passed one.
Deploy a previously converted model version of the model passed, if one exists.
Convert the passed model and deploy the converted model.

Model deployment config specific to the accel model deployments.

Inheritance: builtins.object

AccelInferenceConfig

Constructor

AccelInferenceConfig(description=None, input_tensor=None, output_tensor=None)

Parameters

description: str

default value: None

A description to give this image

input_tensor: str

default value: None

The name of the input tensor to the model

only used if it needs to be converted.

output_tensor: str

default value: None

The name of the output tensor to the model

only used if it needs to be converted.

Methods

build_create_payload

Build the creation payload for the Container image. For accelerated inference, the model id will be the base model, and may be converted or used to select a previously converted model.

build_profile_payload

Build the profiling payload for the Model package.

validate_configuration

Check that the specified configuration values are valid.

Will raise a WebserviceException if validation fails.

build_create_payload

Build the creation payload for the Container image. For accelerated inference, the model id will be the base model, and may be converted or used to select a previously converted model.

build_create_payload(workspace, image_name, model_ids)

Parameters

workspace: Workspace

Required

The workspace object to create the image in

image_name: str

Required

The name of the image

model_ids: list[str]

Required

A list containing a single model id to determine which

Returns

Container image creation payload

Return type

dict

Exceptions

WebserviceException

build_profile_payload

Build the profiling payload for the Model package.

build_profile_payload(profile_name, input_data)

Parameters

profile_name: str

Required

The name of the profile

input_data: str

Required

The input data for profiling

Returns

Model profile payload

Return type

dict

Exceptions

WebserviceException

validate_configuration

Check that the specified configuration values are valid.

Will raise a WebserviceException if validation fails.

validate_configuration()

Exceptions

WebserviceException

AccelInferenceConfig Class

Constructor

Parameters

Methods

build_create_payload

Parameters

Returns

Return type

Exceptions

build_profile_payload

Parameters

Returns

Return type

Exceptions

validate_configuration

Exceptions

Feedback

Feedback

Additional resources