🔒 Permission Denied — Role Viewer has limited access. Some actions are disabled.
speech-staging-15 Deploying
View inference service metrics, scaling, and configuration.
Status: Deploying Model: llama-3.1-70b Runtime: vLLM Replicas: 1/2 RPS: 180
Back to List
Service ID
is-5364
Name
speech-staging-15
Status
Deploying
Model
llama-3.1-70b
Runtime
vLLM
Replicas
1/2
Requests/sec
180
P50 Latency
62ms
P99 Latency
925ms
GPU Type
A100-80G
Created
2026-06-09

Resource Metrics

CPU Usage Memory

Activity Timeline

Event 1: Resource was created 1h ago
Event 2: Resource was updated 2h ago
Event 3: Resource was accessed 3h ago
Event 4: Resource was scaled 4h ago
Event 5: Resource was restarted 5h ago