PLAN
Goal
Train a 50k-step IMF baseline with the original ResNet vision backbone, using only the front camera as image conditioning.
Fixed comparison contract
- Same as the active top/front run except image input is reduced to [front]
- Agent: resnet_imf_attnres
- Vision backbone mode: resnet
- pred_horizon=16, num_action_steps=8
- n_emb=384, n_layer=12, n_head=1, n_kv_head=1
- inference_steps=1
- batch_size=80, lr=2.5e-4, cosine, warmup=2000
- dataset: /home/droid/sim_dataset/sim_transfer
- cameras: [front] only
- rollout every 5 epochs with 5 episodes, headless
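One way to keep the contract honest is to diff the planned settings against the reference run and confirm that only the camera list changed. The sketch below is illustrative only: the flat key names (for example vision_backbone_mode and lr_schedule) and the helper diff_contract are hypothetical, and the reference camera list [top, front] is an assumption read off the run name, not a documented config value.

```python
# Settings copied from the plan. Key names are hypothetical, not real config paths.
CONTRACT = {
    "agent": "resnet_imf_attnres",
    "vision_backbone_mode": "resnet",
    "pred_horizon": 16,
    "num_action_steps": 8,
    "n_emb": 384,
    "n_layer": 12,
    "n_head": 1,
    "n_kv_head": 1,
    "inference_steps": 1,
    "batch_size": 80,
    "lr": 2.5e-4,
    "lr_schedule": "cosine",
    "warmup": 2000,
    "dataset": "/home/droid/sim_dataset/sim_transfer",
    "cameras": ["front"],
}

# Assumed reference: identical settings, but with both cameras enabled.
REFERENCE = dict(CONTRACT, cameras=["top", "front"])


def diff_contract(reference, candidate, allowed=("cameras",)):
    """Return the keys that differ between two runs, ignoring the allowed ones."""
    keys = set(reference) | set(candidate)
    return sorted(
        k for k in keys
        if k not in allowed and reference.get(k) != candidate.get(k)
    )


# An empty list means the comparison is fair: only the camera set changed.
unexpected = diff_contract(REFERENCE, CONTRACT)
```

If unexpected is non-empty, the listed keys have drifted from the reference run and the comparison no longer isolates the camera change.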
Resource plan
- Host: 100.119.99.14
- GPU: 0
Important dimension override
- Single-camera visual cond dim = 64 + 16 = 80, so override agent.head.cond_dim=80 and agent.num_cams=1.
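The arithmetic behind the override can be written down as a small check. This is a minimal sketch: the plan does not say what the 64 and 16 terms represent, so they are treated as opaque summands, and the helper name single_camera_overrides is hypothetical. The key=value form simply mirrors how the plan writes the overrides.

```python
def single_camera_overrides(visual_terms=(64, 16), num_cams=1):
    """Build the two override strings named in the plan.

    visual_terms holds the two summands from the plan (64 and 16); their
    individual meaning is not stated there, so they are only added up here.
    """
    cond_dim = sum(visual_terms)
    return [
        f"agent.head.cond_dim={cond_dim}",
        f"agent.num_cams={num_cams}",
    ]


overrides = single_camera_overrides()
print(" ".join(overrides))  # agent.head.cond_dim=80 agent.num_cams=1
```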
Execution path
- 2-step smoke test on remote GPU0.
- If smoke passes, launch 50k main run with SwanLab.
- Record pid / run_dir / log / URL locally.
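The three steps above could be gated and logged roughly as follows. This is a sketch under stated assumptions, not the actual launch script: the plan gives no training command, so smoke_cmd and main_cmd are caller-supplied placeholders, the function names are hypothetical, the SwanLab URL is passed in by hand because the plan does not say how it is obtained, and running on the remote host is not shown.

```python
import json
import subprocess
from pathlib import Path


def smoke_then_launch(smoke_cmd, main_cmd, log_path):
    """Run the smoke test; start the main run only if it exits with code 0."""
    smoke = subprocess.run(smoke_cmd)
    if smoke.returncode != 0:
        return None  # smoke failed, so the main run is not started
    # The child process inherits the log handle, so the parent can close it.
    with open(log_path, "w") as log_file:
        proc = subprocess.Popen(
            main_cmd, stdout=log_file, stderr=subprocess.STDOUT
        )
    return proc


def record_run(record_path, pid, run_dir, log_path, url=None):
    """Write pid / run_dir / log / URL to a local JSON file."""
    record = {
        "pid": pid,
        "run_dir": str(run_dir),
        "log": str(log_path),
        "url": url,  # fill in once the experiment URL is known
    }
    Path(record_path).write_text(json.dumps(record, indent=2))
    return record
```

Both commands are plain argument lists, so the same helper works for a 2-step smoke configuration and the 50k-step configuration by changing only the arguments passed in.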