paper

Motivation: 之前的工作(pi0, pi0.5)过度依赖特定任务的fine-tune, 需要对base model进行中等规模的fine-tune之后才能在benchmark中得到一个较好的分数.

因此pi0.6提出:

与pi0.5一样, 基于Flow Matching和FAST, 同时有discrete和continuous的action loss. 但是VLM的backbone使用了Gemma 4B, Action Expert用了Gemma 860M, 扩大了参数.

Knowledge Base

Explorer