Abstract
Learning in simulation and transferring the learned policy to the real world has the potential to enable generalist robots. The key challenge of this approach is to address simulation-to-reality (sim-to-real) gaps. Previous methods often require domain-specific knowledge a priori. We argue that a straightforward way to obtain such knowledge is by asking humans to observe and assist robot policy execution in the real world. The robots can then learn from humans to close various sim-to-real gaps. We propose TRANSIC, a data-driven approach to enable successful sim-to-real transfer based on a human-in-the-loop framework. TRANSIC allows humans to augment simulation policies to overcome various unmodeled sim-to-real gaps holistically through intervention and online correction. Residual policies can be learned from human corrections and integrated with simulation policies for autonomous execution. We show that our approach can achieve successful sim-to-real transfer in complex and contact-rich manipulation tasks such as furniture assembly. Through synergistic integration of policies learned in simulation and from humans, TRANSIC is effective as a holistic approach to addressing various, often coexisting sim-to-real gaps. It displays attractive properties such as scaling with human effort. Videos and code are available at this https URL
Abstract (translated)
在模拟中学习和将学到的策略应用于现实世界具有实现通用机器人的潜力。这种方法的关键挑战是解决模拟与现实之间的差距(sim-to-real gaps)。之前的方法通常需要先验的知识领域特定知识。我们认为,获得这种知识的最直接方法是让人类在现实生活中观察和辅助机器人策略执行。机器人可以从人类那里学习以填补各种模拟与现实之间的差距。我们提出了TRANSIC,一种基于人类在环框架的数据驱动方法,以实现基于人类在环的模拟与现实之间的成功转移。TRANSIC允许人类通过干预和在线纠错来通过各种未建模的模拟与现实之间的差距来扩展模拟策略。残余策略可以从人类的纠正中学习,并将其与模拟策略集成以实现自主执行。我们证明了,在我们的方法下,可以实现成功的模拟与现实之间的转移,特别是在复杂的接触操作任务中,如家具组装。通过模拟策略和学习人类策略的协同作用,TRANSIC是一种有效的全面方法来解决各种经常存在的模拟与现实之间的差距。它具有可扩展 human effort 的特点。视频和代码可以通过这个链接https://www.youtube.com/watch?v=获取:
URL
https://arxiv.org/abs/2405.10315