🔒 Permission Denied — Role Viewer has limited access. Some actions are disabled.
llm-prod-2 Running
View inference service metrics, scaling, and configuration.
Status: Running Model: llama-3.1-70b Runtime: Custom Replicas: 1/5 RPS: 3149
Back to List
Service ID
is-4138
Name
llm-prod-2
Status
Running
Model
llama-3.1-70b
Runtime
Custom
Replicas
1/5
Requests/sec
3149
P50 Latency
170ms
P99 Latency
2019ms
GPU Type
A10-24G
Created
2026-06-17

Resource Metrics

CPU Usage Memory

Activity Timeline

Event 1: Resource was created 1h ago
Event 2: Resource was updated 2h ago
Event 3: Resource was accessed 3h ago
Event 4: Resource was scaled 4h ago
Event 5: Resource was restarted 5h ago