llm-finetune-12 Failed
View training progress, metrics, and GPU utilization.
Status: Failed Framework: DeepSpeed GPU: A100-80G GPUs: 2 Owner: henry.zhao
Job ID: tj-2907
Name: llm-finetune-12
Status: Failed
Framework: DeepSpeed
GPU Type: A100-80G
GPU Count: 2
Epoch Progress: 65/154
Current Loss: 1.4989
Duration: 35h 27m
Owner: henry.zhao
Project: Project Alpha
Created: 2026-02-17

Resource Metrics

Charts: CPU Usage, Memory

Activity Timeline

Event 1: Resource was created 1h ago
Event 2: Resource was updated 2h ago
Event 3: Resource was accessed 3h ago
Event 4: Resource was scaled 4h ago
Event 5: Resource was restarted 5h ago