Incorporating Task Progress Knowledge for Subgoal Generation in Robotic Manipulation through Image Edits

October 14, 2024 · Declared Dead · 🏛 IEEE Workshop/Winter Conference on Applications of Computer Vision

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Xuhui Kang, Yen-Ling Kuo arXiv ID 2410.11013 Category cs.RO: Robotics Citations 9 Venue IEEE Workshop/Winter Conference on Applications of Computer Vision Last Checked 3 months ago

Abstract

Understanding the progress of a task allows humans to not only track what has been done but also to better plan for future goals. We demonstrate TaKSIE, a novel framework that incorporates task progress knowledge into visual subgoal generation for robotic manipulation tasks. We jointly train a recurrent network with a latent diffusion model to generate the next visual subgoal based on the robot's current observation and the input language command. At execution time, the robot leverages a visual progress representation to monitor the task progress and adaptively samples the next visual subgoal from the model to guide the manipulation policy. We train and validate our model in simulated and real-world robotic tasks, achieving state-of-the-art performance on the CALVIN manipulation benchmark. We find that the inclusion of task progress knowledge can improve the robustness of trained policy for different initial robot poses or various movement speeds during demonstrations. The project website can be found at https://live-robotics-uva.github.io/TaKSIE/ .

📄 View on arXiv 🌐 View on ar5iv 📑 PDF 🎉 Report Code Found

Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

📜 Similar Papers

In the same crypt — Robotics

R.I.P. 👻 Ghosted

Past, Present, and Future of Simultaneous Localization And Mapping: Towards the Robust-Perception Age

Cesar Cadena, Luca Carlone, ... (+6 more)

cs.RO 🏛 IEEE TRO 📚 3.2K cites 10 years ago

R.I.P. 👻 Ghosted

AirSim: High-Fidelity Visual and Physical Simulation for Autonomous Vehicles

Shital Shah, Debadeepta Dey, ... (+2 more)

cs.RO 🏛 ICFSR 📚 2.3K cites 9 years ago

📚 📚 The Cartographer

A Survey of Motion Planning and Control Techniques for Self-driving Urban Vehicles

Brian Paden, Michal Cap, ... (+3 more)

cs.RO 🏛 IEEE TIV 📚 2.3K cites 10 years ago

📚 📚 The Cartographer

Unmanned Aerial Vehicles: A Survey on Civil Applications and Key Research Challenges

Hazim Shakhatreh, Ahmad Sawalmeh, ... (+7 more)

cs.RO 🏛 arXiv 📚 1.8K cites 8 years ago

📚 📚 The Cartographer

A Survey of Autonomous Driving: Common Practices and Emerging Technologies

Ekim Yurtsever, Jacob Lambert, ... (+2 more)

cs.RO 🏛 IEEE Access 📚 1.7K cites 7 years ago

R.I.P. 👻 Ghosted

Learning agile and dynamic motor skills for legged robots

Jemin Hwangbo, Joonho Lee, ... (+5 more)

cs.RO 🏛 Sci. Robot. 📚 1.6K cites 7 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Federated Learning: Strategies for Improving Communication Efficiency

Jakub Konečný, H. Brendan McMahan, ... (+4 more)

cs.LG 🏛 arXiv 📚 5.2K cites 9 years ago

R.I.P. 👻 Ghosted

In-Datacenter Performance Analysis of a Tensor Processing Unit

Norman P. Jouppi, Cliff Young, ... (+73 more)

cs.AR 🏛 ISCA 📚 5.1K cites 9 years ago

R.I.P. 👻 Ghosted

Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning

Hoo-Chang Shin, Holger R. Roth, ... (+7 more)

cs.CV 🏛 IEEE TMI 📚 4.9K cites 10 years ago

R.I.P. 👻 Ghosted

Explanation in Artificial Intelligence: Insights from the Social Sciences

Tim Miller

cs.AI 🏛 AI 📚 4.9K cites 8 years ago