VLA & World Models

VLA (Vision-Language-Action): a model that maps visual and language inputs to actions

World Model

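When an end-to-end model cannot do the job, split the system into layers such as visuomotor feedback, visual recognition, and language recognition.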
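
The layered split can be sketched as three small functions chained together. This is only a toy illustration: every name here (`language_layer`, `vision_layer`, `visuomotor_layer`, `run_pipeline`) is hypothetical, the "detections" dictionary stands in for a real detector's output, and the proportional controller is just one simple choice of feedback law.

```python
from typing import Dict, Tuple

Point = Tuple[float, float]  # 2D position in an assumed shared image/workspace frame


def language_layer(instruction: str) -> str:
    """Toy language recognition: treat the last word as the target label."""
    return instruction.strip().rstrip(".").split()[-1].lower()


def vision_layer(detections: Dict[str, Point], target: str) -> Point:
    """Toy visual recognition: look the target up among labelled detections."""
    if target not in detections:
        raise KeyError(f"target {target!r} is not visible")
    return detections[target]


def visuomotor_layer(gripper: Point, goal: Point, gain: float = 0.5) -> Point:
    """Proportional feedback: move a fixed fraction of the remaining error."""
    return (
        gripper[0] + gain * (goal[0] - gripper[0]),
        gripper[1] + gain * (goal[1] - gripper[1]),
    )


def run_pipeline(
    instruction: str,
    detections: Dict[str, Point],
    gripper: Point,
    steps: int = 20,
) -> Point:
    """Chain the layers: language -> vision -> closed-loop motion."""
    target = language_layer(instruction)
    for _ in range(steps):
        # Re-query vision on every step so the loop is driven by feedback.
        goal = vision_layer(detections, target)
        gripper = visuomotor_layer(gripper, goal)
    return gripper


if __name__ == "__main__":
    final = run_pipeline("pick up the cup", {"cup": (4.0, 2.0)}, (0.0, 0.0))
    print(final)
```

Each layer can be swapped or tested on its own, which is the main appeal of this decomposition over a single end-to-end model.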
