🔒 Permission Denied — Role Viewer has limited access. Some actions are disabled.
Training Jobs
Submit and monitor model training jobs.
🏋️
4,425
Total
▶️
1,094
Running
✅
8,096
Completed
❌
9,735
Failed
| Name | Status | Model | GPU Type | GPUs | Epoch | Loss | Duration | Owner | |
|---|---|---|---|---|---|---|---|---|---|
| embedding-170 | Queued | A10-24G | 32 | 34/71 | 0.9104 | 12h 50m | bob.zhang | ||
| embedding-6 | Queued | H100-80G | 1 | 32/55 | 0.3011 | 46h 57m | emma.wu | ||
| embedding-137 | Failed | H100-80G | 2 | 19/153 | 1.7977 | 16h 41m | alice.liu | ||
| rl-agent-69 | Cancelled | A10-24G | 1 | 80/110 | 0.2452 | 46h 4m | alice.liu | ||
| bert-pretrain-106 | Completed | A10-24G | 16 | 45/199 | 1.9537 | 2h 40m | henry.zhao | ||
| rl-agent-38 | Completed | A10-24G | 16 | 32/80 | 0.8738 | 17h 48m | emma.wu | ||
| img-classify-161 | Queued | H100-80G | 2 | 55/51 | 1.4001 | 13h 5m | emma.wu | ||
| diffusion-112 | Completed | A10-24G | 8 | 1/168 | 0.2123 | 6h 26m | emma.wu | ||
| diffusion-116 | Failed | H100-80G | 1 | 64/73 | 1.8740 | 42h 16m | emma.wu | ||
| diffusion-130 | Failed | H100-80G | 32 | 58/197 | 0.5349 | 12h 21m | henry.zhao |