RoboGene Boosting VLA Pre-training via Diversity-Driven Agentic Framework for Real-World Task Generation

Title: RoboGene: Boosting VLA Pre-training via Diversity-Driven Agentic Framework for Real-World Task Generation
Authors: Yixue Zhang et al.
arXiv: https://arxiv.org/abs/2602.16444

Problem framing

VLA 预训练短板在于真实机器人任务数据稀缺且昂贵；手工任务不可扩展，大模型直生任务又常物理不可执行。

Core method

RoboGene 以 agentic pipeline 自动任务生成：diversity-driven sampling + self-reflection 物理约束 + human-in-the-loop 迭代修正。

Key equations and mechanisms

可概括为多目标优化：

T max α D (T) + βF (T) + γ Q (T)

其中 $D / F / Q$ 对应多样性、可行性、训练价值。

Experiment reading guide

先看任务质量指标，再看下游 VLA success/generalization 与 GPT-4o、Gemini 2.5 Pro 基线对比。

Limitations

仍需人工反馈闭环；指标可能偏向易评估任务；依赖先验规则覆盖度。

Future work

扩展到长时序复合任务与多机器人协作，构建在线数据 flywheel。

Replication angle

在固定数据预算下比较“任务质量提升”而非“仅增数据量”。

图链接（可直链渲染）：https://arxiv.org/html/2602.16444v1/x1.png

Graph: Paper Node 2602.16444