Azure.AI.Inference Namespace

Classes

AIInferenceModelFactory

Model factory for models.

ChatChoice

The representation of a single prompt completion as part of an overall chat completions request. Generally, n choices are generated per provided prompt with a default value of 1. Token limits and other settings may limit the number of choices generated.

ChatCompletions

Representation of the response data from a chat completions request. Completions support a wide variety of tasks and generate text that continues from or "completes" provided prompt data.

ChatCompletionsClient

The ChatCompletions service client.

ChatCompletionsClientOptions

Client options for ChatCompletionsClient.

ChatCompletionsFunctionToolCall

A tool call to a function tool, issued by the model in evaluation of a configured function tool, that represents a function invocation needed for a subsequent chat completions request to resolve.

ChatCompletionsFunctionToolDefinition

The definition information for a chat completions function tool that can call a function in response to a tool call.

ChatCompletionsFunctionToolSelection

A tool selection of a specific, named function tool that will limit chat completions to using the named function.

ChatCompletionsNamedFunctionToolSelection

A tool selection of a specific, named function tool that will limit chat completions to using the named function.

ChatCompletionsNamedToolSelection

An abstract representation of an explicit, named tool selection to use for a chat completions request. Please note ChatCompletionsNamedToolSelection is the base class. According to the scenario, a derived class of the base class might need to be assigned here, or this property needs to be casted to one of the possible derived classes. The available derived classes include ChatCompletionsNamedFunctionToolSelection.

ChatCompletionsOptions

The configuration information for a chat completions request. Completions support a wide variety of tasks and generate text that continues from or "completes" provided prompt data.

ChatCompletionsResponseFormat

Represents the format that the model must output. Use this to enable JSON mode instead of the default text mode. Note that to enable JSON mode, some AI models may also require you to instruct the model to produce JSON via a system or user message. Please note ChatCompletionsResponseFormat is the base class. According to the scenario, a derived class of the base class might need to be assigned here, or this property needs to be casted to one of the possible derived classes. The available derived classes include ChatCompletionsResponseFormatJSON and ChatCompletionsResponseFormatText.

ChatCompletionsResponseFormatJSON

A response format for Chat Completions that restricts responses to emitting valid JSON objects. Note that to enable JSON mode, some AI models may also require you to instruct the model to produce JSON via a system or user message.

ChatCompletionsResponseFormatText

A response format for Chat Completions that emits text responses. This is the default response format.

ChatCompletionsToolCall

An abstract representation of a tool call that must be resolved in a subsequent request to perform the requested chat completion. Please note ChatCompletionsToolCall is the base class. According to the scenario, a derived class of the base class might need to be assigned here, or this property needs to be casted to one of the possible derived classes. The available derived classes include ChatCompletionsFunctionToolCall.

ChatCompletionsToolChoice

Represents an optional control that specifies which, if any, tools may be called by the model while processing a chat completions request.

ChatCompletionsToolDefinition

An abstract representation of a tool that can be used by the model to improve a chat completions response. Please note ChatCompletionsToolDefinition is the base class. According to the scenario, a derived class of the base class might need to be assigned here, or this property needs to be casted to one of the possible derived classes. The available derived classes include ChatCompletionsFunctionToolDefinition.

ChatMessageContentItem

An abstract representation of a structured content item within a chat message. Please note ChatMessageContentItem is the base class. According to the scenario, a derived class of the base class might need to be assigned here, or this property needs to be casted to one of the possible derived classes. The available derived classes include ChatMessageImageContentItem and ChatMessageTextContentItem.

ChatMessageImageContentItem

A structured chat content item containing an image reference.

ChatMessageTextContentItem

A structured chat content item containing plain text.

ChatRequestAssistantMessage

A request chat message representing response or action from the assistant.

ChatRequestMessage

An abstract representation of a chat message as provided in a request. Please note ChatRequestMessage is the base class. According to the scenario, a derived class of the base class might need to be assigned here, or this property needs to be casted to one of the possible derived classes. The available derived classes include ChatRequestAssistantMessage, ChatRequestSystemMessage, ChatRequestToolMessage and ChatRequestUserMessage.

ChatRequestSystemMessage

A request chat message containing system instructions that influence how the model will generate a chat completions response.

ChatRequestToolMessage

A request chat message representing requested output from a configured tool.

ChatRequestUserMessage

A request chat message representing user input to the assistant.

ChatResponseMessage

A representation of a chat message as received in a response.

CompletionsUsage

Representation of the token counts processed for a completions request. Counts consider all tokens across prompts, choices, choice alternates, best_of generations, and other consumers.

FunctionCall

The name and arguments of a function that should be called, as generated by the model.

FunctionDefinition

The definition of a caller-specified function that chat completions may invoke in response to matching user input.

ModelInfo

Represents some basic information about the AI model.

StreamingChatCompletionsUpdate

Represents an incremental update to a streamed Chat Completions response.

StreamingFunctionToolCallUpdate

Represents an incremental update to a streaming function tool call that is part of a streaming chat completions choice.

StreamingResponse<T>

Represents an operation response with streaming content that can be deserialized and enumerated while the response is still being received.

StreamingToolCallUpdate

Represents an incremental update to a streaming tool call that is part of a streaming chat completions choice.

Structs

ChatCompletionsToolSelectionPreset

Represents a generic policy for how a chat completions tool may be selected.

ChatMessageImageDetailLevel

A representation of the possible image detail levels for image-based chat completions message content.

ChatRole

A description of the intended purpose of a message within a chat completions interaction.

CompletionsFinishReason

Representation of the manner in which a completions response concluded.

ExtraParameters

Controls what happens if extra parameters, undefined by the REST API, are passed in the JSON request payload.

ModelType

The type of AI model.

Enums

ChatCompletionsClientOptions.ServiceVersion

The version of the service to use.