🔒 Permission Denied — Role Viewer has limited access. Some actions are disabled.
Training Jobs
Submit and monitor model training jobs.
🏋️
3,849
Total
▶️
9,449
Running
✅
5,593
Completed
❌
2,039
Failed
| Name | Status | Model | GPU Type | GPUs | Epoch | Loss | Duration | Owner | |
|---|---|---|---|---|---|---|---|---|---|
| bert-pretrain-10 | Failed | H100-80G | 16 | 70/131 | 1.7822 | 3h 19m | emma.wu | ||
| img-classify-42 | Queued | H100-80G | 2 | 19/116 | 0.8323 | 23h 11m | emma.wu | ||
| bert-pretrain-144 | Completed | A100-80G | 16 | 52/108 | 1.3309 | 43h 9m | alice.liu | ||
| rl-agent-72 | Running | A10-24G | 32 | 15/127 | 0.1165 | 28h 2m | henry.zhao | ||
| img-classify-163 | Running | A100-80G | 8 | 53/105 | 1.2731 | 27h 42m | henry.zhao | ||
| img-classify-110 | Running | A10-24G | 2 | 71/197 | 0.7290 | 40h 15m | henry.zhao | ||
| img-classify-140 | Completed | A10-24G | 32 | 14/94 | 1.9123 | 31h 25m | henry.zhao | ||
| diffusion-99 | Cancelled | A10-24G | 16 | 54/51 | 1.3134 | 38h 58m | emma.wu | ||
| llm-finetune-189 | Failed | A10-24G | 32 | 99/196 | 1.2405 | 27h 12m | bob.zhang | ||
| rl-agent-100 | Queued | A100-80G | 4 | 22/165 | 0.9957 | 26h 54m | henry.zhao |