InferenceEngineDetail
Inference Engine Object
id string
The unique identifier for the inference engine
Example:
engine-12345
name string
Example:
test-inference-engine
status string
Possible values: [Creating, Active, Inactive]
Example:
Active
base_model string
Example:
Llama-3.2-3B-Instruct
task_type string
Example:
text-generation
server_type string
Example:
on-demand
deployments object[]
Array [
id string
The unique identifier for the deployment
Example:
dp-12345
name string
The name of the deployment
Example:
deployment-1
status string
The current status of the deployment
Possible values: [Pending, Deploying, Deployed, Failed, Cancelled, Terminated, Deactivated, Reactivating, TimedOut]
Example:
Deployed
created_at integer
The timestamp when the deployment was created
Example:
1633036800
]
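The deployments array can be filtered by status on the client side. The following is a minimal Python sketch, not part of the API itself: the helper names `deployments_with_status` and `created_at_utc` are hypothetical, and reading `created_at` as Unix epoch seconds is an assumption based on the example value.

```python
from datetime import datetime, timezone


def deployments_with_status(deployments, status):
    """Return the deployment objects whose `status` equals the given value."""
    return [d for d in deployments if d["status"] == status]


def created_at_utc(deployment):
    """Interpret `created_at` as Unix epoch seconds (an assumption) in UTC."""
    return datetime.fromtimestamp(deployment["created_at"], tz=timezone.utc)


# Sample data copied from the example values in the schema.
deployments = [
    {
        "id": "dp-12345",
        "name": "deployment-1",
        "status": "Deployed",
        "created_at": 1633036800,
    },
]

live = deployments_with_status(deployments, "Deployed")
print([d["id"] for d in live])
print(created_at_utc(live[0]).isoformat())
```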
base_model_option string
Example:
pre-trained
last_access_timestamp integer
Example:
1633036800
resource_management_config object
oneOf
- Inactive Timeout
- Schedule
management_type string
Example:
inactive_timeout
management_config object
inactive_duration integer
Period of inactivity
Example:
3600
management_type string
Example:
schedule
management_config object
timezone string
The IANA timezone for the schedule
Example:
America/Los_Angeles
start_time string
The start time for the schedule in HH:MM format
Example:
10:00
end_time string
The end time for the schedule in HH:MM format
Example:
18:00
days string[]
The days of the week for the schedule
Possible values: [monday, tuesday, wednesday, thursday, friday, saturday, sunday]
Example:
["monday","tuesday","wednesday","thursday","friday","saturday","sunday"]
created_at integer
The timestamp when the inference engine was created
Example:
1633036800
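To illustrate how the Schedule variant of `resource_management_config` might be interpreted, here is a hedged Python sketch that checks whether a given moment falls inside the configured window. The function name `in_schedule` is hypothetical, and several semantics are assumptions not stated in the schema: the start time is treated as inclusive, the end time as exclusive, and the window is assumed not to cross midnight.

```python
from datetime import datetime, time, timezone
from zoneinfo import ZoneInfo

# Index matches datetime.weekday(): Monday is 0, Sunday is 6.
WEEKDAYS = ["monday", "tuesday", "wednesday", "thursday",
            "friday", "saturday", "sunday"]


def in_schedule(management_config, now):
    """Return True if the timezone-aware datetime `now` is inside the window.

    Assumes start is inclusive, end is exclusive, and start_time < end_time.
    """
    local = now.astimezone(ZoneInfo(management_config["timezone"]))
    start = time.fromisoformat(management_config["start_time"])
    end = time.fromisoformat(management_config["end_time"])
    day_name = WEEKDAYS[local.weekday()]
    return day_name in management_config["days"] and start <= local.time() < end


# Values taken from the schema examples above.
schedule = {
    "timezone": "America/Los_Angeles",
    "start_time": "10:00",
    "end_time": "18:00",
    "days": ["monday", "tuesday", "wednesday", "thursday",
             "friday", "saturday", "sunday"],
}

# 18:00 UTC on 2021-10-01 is 11:00 in Los Angeles (daylight time).
midday = datetime(2021, 10, 1, 18, 0, tzinfo=timezone.utc)
print(in_schedule(schedule, midday))
```

Converting to the configured IANA timezone before comparing keeps the check correct across daylight saving changes, since `start_time` and `end_time` are given as local wall-clock times.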
InferenceEngineDetail
{
  "id": "engine-12345",
  "name": "test-inference-engine",
  "status": "Active",
  "base_model": "Llama-3.2-3B-Instruct",
  "task_type": "text-generation",
  "server_type": "on-demand",
  "deployments": [
    {
      "id": "dp-12345",
      "name": "deployment-1",
      "status": "Deployed",
      "created_at": 1633036800
    }
  ],
  "base_model_option": "pre-trained",
  "last_access_timestamp": 1633036800,
  "resource_management_config": {
    "management_type": "inactive_timeout",
    "management_config": {
      "inactive_duration": 3600
    }
  },
  "created_at": 1633036800
}
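A client could load this example into typed objects. The sketch below is one possible approach in Python, assuming every field shown in the example is present in the response (the schema does not state which fields are required). The class layout and the `parse_engine` helper are illustrative, not part of any official SDK.

```python
import json
from dataclasses import dataclass
from typing import Any


@dataclass
class Deployment:
    id: str
    name: str
    status: str
    created_at: int


@dataclass
class InferenceEngineDetail:
    id: str
    name: str
    status: str
    base_model: str
    task_type: str
    server_type: str
    deployments: list[Deployment]
    base_model_option: str
    last_access_timestamp: int
    # Left as a plain dict because its shape depends on `management_type`.
    resource_management_config: dict[str, Any]
    created_at: int


def parse_engine(payload: str) -> InferenceEngineDetail:
    """Parse a JSON string into an InferenceEngineDetail (hypothetical helper)."""
    data = json.loads(payload)
    data["deployments"] = [Deployment(**d) for d in data["deployments"]]
    return InferenceEngineDetail(**data)


EXAMPLE = """
{
  "id": "engine-12345",
  "name": "test-inference-engine",
  "status": "Active",
  "base_model": "Llama-3.2-3B-Instruct",
  "task_type": "text-generation",
  "server_type": "on-demand",
  "deployments": [
    {"id": "dp-12345", "name": "deployment-1",
     "status": "Deployed", "created_at": 1633036800}
  ],
  "base_model_option": "pre-trained",
  "last_access_timestamp": 1633036800,
  "resource_management_config": {
    "management_type": "inactive_timeout",
    "management_config": {"inactive_duration": 3600}
  },
  "created_at": 1633036800
}
"""

engine = parse_engine(EXAMPLE)
print(engine.name, engine.status, len(engine.deployments))
```

Because the dataclass constructors reject unknown keyword arguments, this sketch fails with a `TypeError` if the response contains fields not listed here; a more tolerant client would filter the keys first.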