🔒 Permission Denied — Role Viewer has limited access. Some actions are disabled.
speech-staging-2 Scaling
View inference service metrics, scaling, and configuration.
Status: Scaling Model: whisper-large Runtime: vLLM Replicas: 2/6 RPS: 136
Back to List
Service ID
is-7037
Name
speech-staging-2
Status
Scaling
Model
whisper-large
Runtime
vLLM
Replicas
2/6
Requests/sec
136
P50 Latency
62ms
P99 Latency
2643ms
GPU Type
H100-80G
Created
2026-06-07

Resource Metrics

CPU Usage Memory

Activity Timeline

Event 1: Resource was created 1h ago
Event 2: Resource was updated 2h ago
Event 3: Resource was accessed 3h ago
Event 4: Resource was scaled 4h ago
Event 5: Resource was restarted 5h ago