🔒 Permission Denied — Role Viewer has limited access. Some actions are disabled.
llm-prod-1 Running
View inference service metrics, scaling, and configuration.
Status: Running Model: llama-3.1-70b Runtime: TGI Replicas: 1/5 RPS: 3551
Back to List
Service ID
is-4707
Name
llm-prod-1
Status
Running
Model
llama-3.1-70b
Runtime
TGI
Replicas
1/5
Requests/sec
3551
P50 Latency
451ms
P99 Latency
1185ms
GPU Type
A100-80G
Created
2026-04-08

Resource Metrics

CPU Usage Memory

Activity Timeline

Event 1: Resource was created 1h ago
Event 2: Resource was updated 2h ago
Event 3: Resource was accessed 3h ago
Event 4: Resource was scaled 4h ago
Event 5: Resource was restarted 5h ago