InferenceEngineDetail
Inference Engine Object
id string
The unique identifier for the inference engine
Example: engine-12345
name string
Example: test-inference-engine
status string
Possible values: [Creating, Active, Inactive]
Example: Active
base_model string
Example: Llama-3.2-3B-Instruct
task_type string
Example: text-generation
server_type string
Example: on-demand
deployments object[]
Array [
  id string
  The unique identifier for the deployment
  Example: dp-12345
  name string
  The name of the deployment
  Example: deployment-1
  status string
  The current status of the deployment
  Possible values: [Pending, Deploying, Deployed, Failed, Cancelled, Terminated, Deactivated, Reactivating, TimedOut]
  Example: Deployed
  created_at integer
  The timestamp when the deployment was created
  Example: 1633036800
]
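As an illustration of how a client might read the deployments array, here is a minimal Python sketch. It assumes `created_at` is a Unix timestamp in seconds (the schema only says integer), and the helper name `deployed_since` is hypothetical, not part of any SDK.

```python
from datetime import datetime, timezone

# Status values copied from the "Possible values" list for a deployment.
DEPLOYMENT_STATUSES = {
    "Pending", "Deploying", "Deployed", "Failed", "Cancelled",
    "Terminated", "Deactivated", "Reactivating", "TimedOut",
}

def deployed_since(deployments):
    """Return (name, creation time in UTC) for every deployment with status Deployed."""
    result = []
    for deployment in deployments:
        status = deployment["status"]
        if status not in DEPLOYMENT_STATUSES:
            raise ValueError(f"unexpected deployment status: {status!r}")
        if status == "Deployed":
            # Assumption: created_at counts seconds since the Unix epoch.
            created = datetime.fromtimestamp(deployment["created_at"], tz=timezone.utc)
            result.append((deployment["name"], created))
    return result

sample = [
    {"id": "dp-12345", "name": "deployment-1", "status": "Deployed", "created_at": 1633036800},
    {"id": "dp-67890", "name": "deployment-2", "status": "Failed", "created_at": 1633036800},
]
print(deployed_since(sample))
```

Rejecting unknown status strings early is a design choice for the sketch; a client that must tolerate newly added statuses could skip them instead.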
base_model_option string
Example: pre-trained
last_access_timestamp integer
Example: 1633036800
resource_management_config object
oneOf
- Inactive Timeout
  management_type string
  Example: inactive_timeout
  management_config object
    inactive_duration integer
    Period of inactivity
    Example: 3600
- Schedule
  management_type string
  Example: schedule
  management_config object
    timezone string
    The IANA timezone for the schedule
    Example: America/Los_Angeles
    start_time string
    The start time for the schedule in HH:MM format
    Example: 10:00
    end_time string
    The end time for the schedule in HH:MM format
    Example: 18:00
    days string[]
    The days of the week for the schedule
    Possible values: [monday, tuesday, wednesday, thursday, friday, saturday, sunday]
    Example: ["monday","tuesday","wednesday","thursday","friday","saturday","sunday"]
created_at integer
The timestamp when the inference engine was created
Example: 1633036800
InferenceEngineDetail
{
  "id": "engine-12345",
  "name": "test-inference-engine",
  "status": "Active",
  "base_model": "Llama-3.2-3B-Instruct",
  "task_type": "text-generation",
  "server_type": "on-demand",
  "deployments": [
    {
      "id": "dp-12345",
      "name": "deployment-1",
      "status": "Deployed",
      "created_at": 1633036800
    }
  ],
  "base_model_option": "pre-trained",
  "last_access_timestamp": 1633036800,
  "resource_management_config": {
    "management_type": "inactive_timeout",
    "management_config": {
      "inactive_duration": 3600
    }
  },
  "created_at": 1633036800
}
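
A minimal sketch of how a client might branch on the two `resource_management_config` variants in the example response above. The function name `describe_resource_management` is hypothetical, and treating `inactive_duration` as seconds is an assumption; the schema does not state the unit.

```python
import json

def describe_resource_management(config: dict) -> str:
    """Summarize a resource_management_config object as one line of text."""
    kind = config["management_type"]
    settings = config["management_config"]
    if kind == "inactive_timeout":
        # Assumption: inactive_duration is expressed in seconds.
        return f"inactive timeout after {settings['inactive_duration']} s"
    if kind == "schedule":
        days = ", ".join(settings["days"])
        return (
            f"schedule {settings['start_time']}-{settings['end_time']} "
            f"({settings['timezone']}) on {days}"
        )
    raise ValueError(f"unknown management_type: {kind!r}")

# Trimmed copy of the example response, keeping only the fields used here.
engine = json.loads("""
{
  "id": "engine-12345",
  "resource_management_config": {
    "management_type": "inactive_timeout",
    "management_config": {"inactive_duration": 3600}
  }
}
""")

summary = describe_resource_management(engine["resource_management_config"])
print(summary)  # inactive timeout after 3600 s
```

Switching on `management_type` before touching `management_config` keeps the two variants separate, since their `management_config` objects share no fields.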