🔒 Permission Denied — Role Viewer has limited access. Some actions are disabled.
embed-prod-7 Deploying
View inference service metrics, scaling, and configuration.
Status: Deploying Model: llama-3.1-70b Runtime: vLLM Replicas: 2/8 RPS: 2349
Back to List
Service ID
is-1007
Name
embed-prod-7
Status
Deploying
Model
llama-3.1-70b
Runtime
vLLM
Replicas
2/8
Requests/sec
2349
P50 Latency
479ms
P99 Latency
1122ms
GPU Type
A100-80G
Created
2026-05-26

Resource Metrics

CPU Usage Memory

Activity Timeline

Event 1: Resource was created 1h ago
Event 2: Resource was updated 2h ago
Event 3: Resource was accessed 3h ago
Event 4: Resource was scaled 4h ago
Event 5: Resource was restarted 5h ago