🔒 Permission Denied — Role Viewer has limited access. Some actions are disabled.
llm-staging-12 Running
View inference service metrics, scaling, and configuration.
Status: Running Model: llama-3.1-70b Runtime: Custom Replicas: 2/4 RPS: 2938
Back to List
Service ID
is-1760
Name
llm-staging-12
Status
Running
Model
llama-3.1-70b
Runtime
Custom
Replicas
2/4
Requests/sec
2938
P50 Latency
278ms
P99 Latency
334ms
GPU Type
H100-80G
Created
2026-06-14

Resource Metrics

CPU Usage Memory

Activity Timeline

Event 1: Resource was created 1h ago
Event 2: Resource was updated 2h ago
Event 3: Resource was accessed 3h ago
Event 4: Resource was scaled 4h ago
Event 5: Resource was restarted 5h ago