| Ai21LabsConfig |
| AiGatewayConfig |
| AiGatewayGuardrailParameters |
| AiGatewayGuardrailPiiBehavior |
| AiGatewayGuardrailPiiBehaviorBehavior |
| AiGatewayGuardrails |
| AiGatewayInferenceTableConfig |
| AiGatewayRateLimit |
| AiGatewayRateLimitKey |
| AiGatewayRateLimitRenewalPeriod |
| AiGatewayUsageTrackingConfig |
| AmazonBedrockConfig |
| AmazonBedrockConfigBedrockProvider |
| AnthropicConfig |
| ApiKeyAuth |
| AutoCaptureConfigInput |
| AutoCaptureConfigOutput |
| AutoCaptureState |
| BearerTokenAuth |
| BuildLogsRequest
Get build logs for a served model
|
| BuildLogsResponse |
| ChatMessage |
| ChatMessageRole
The role of the message.
|
| CohereConfig |
| CreatePtEndpointRequest |
| CreateServingEndpoint |
| CustomProviderConfig
Configs needed to create a custom provider model route.
|
| DatabricksModelServingConfig |
| DataframeSplitInput |
| DataPlaneInfo
Details necessary to query this object's API through the DataPlane APIs.
|
| DeleteServingEndpointRequest
Delete a serving endpoint
|
| EmbeddingsV1ResponseEmbeddingElement |
| EmbeddingsV1ResponseEmbeddingElementObject
This will always be 'embedding'.
|
| EndpointCoreConfigInput |
| EndpointCoreConfigOutput |
| EndpointCoreConfigSummary |
| EndpointPendingConfig |
| EndpointState |
| EndpointStateConfigUpdate |
| EndpointStateReady |
| EndpointTag |
| EndpointTags |
| ExportMetricsRequest
Get metrics of a serving endpoint
|
| ExportMetricsResponse |
| ExternalFunctionRequest
Simple Proto message for testing
|
| ExternalFunctionRequestHttpMethod |
| ExternalModel |
| ExternalModelProvider |
| ExternalModelUsageElement |
| FallbackConfig |
| FoundationModel
All fields are not sensitive as they are hard-coded in the system and made available to
customers.
|
| GetOpenApiRequest
Get the schema for a serving endpoint
|
| GetOpenApiResponse |
| GetServingEndpointPermissionLevelsRequest
Get serving endpoint permission levels
|
| GetServingEndpointPermissionLevelsResponse |
| GetServingEndpointPermissionsRequest
Get serving endpoint permissions
|
| GetServingEndpointRequest
Get a single serving endpoint
|
| GoogleCloudVertexAiConfig |
| HttpRequestResponse |
| ListEndpointsResponse |
| LogsRequest
Get the latest logs for a served model
|
| ModelDataPlaneInfo
A representation of all DataPlaneInfo for operations that can be done on a model through Data
Plane APIs.
|
| OpenAiConfig
Configs needed to create an OpenAI model route.
|
| PaLmConfig |
| PatchServingEndpointTags |
| PayloadTable |
| PtEndpointCoreConfig |
| PtServedModel |
| PutAiGatewayRequest |
| PutAiGatewayResponse |
| PutRequest |
| PutResponse |
| QueryEndpointInput |
| QueryEndpointResponse |
| QueryEndpointResponseObject
The type of object returned by the __external/foundation model__ serving endpoint, one of
[text_completion, chat.completion, list (of embeddings)].
|
| RateLimit |
| RateLimitKey |
| RateLimitRenewalPeriod |
| Route |
| ServedEntityInput |
| ServedEntityOutput |
| ServedEntitySpec |
| ServedModelInput |
| ServedModelInputWorkloadType
Please keep this in sync with with workload types in InferenceEndpointEntities.scala
|
| ServedModelOutput |
| ServedModelSpec |
| ServedModelState |
| ServedModelStateDeployment |
| ServerLogsResponse |
| ServingEndpoint |
| ServingEndpointAccessControlRequest |
| ServingEndpointAccessControlResponse |
| ServingEndpointDetailed |
| ServingEndpointDetailedPermissionLevel |
| ServingEndpointPermission |
| ServingEndpointPermissionLevel
Permission level
|
| ServingEndpointPermissions |
| ServingEndpointPermissionsDescription |
| ServingEndpointPermissionsRequest |
| ServingEndpointsAPI
The Serving Endpoints API allows you to create, update, and delete model serving endpoints.
|
| ServingEndpointsDataPlaneService
Serving endpoints DataPlane provides a set of operations to interact with data plane endpoints
for Serving endpoints service.
|
| ServingEndpointsService
The Serving Endpoints API allows you to create, update, and delete model serving endpoints.
|
| ServingModelWorkloadType
Please keep this in sync with with workload types in InferenceEndpointEntities.scala
|
| TrafficConfig |
| UpdateProvisionedThroughputEndpointConfigRequest |
| V1ResponseChoiceElement |