Putting People in Their Place: Affordance-Aware Human Insertion into Scenes

April 27, 2023 · Entered Twilight · 🏛 Computer Vision and Pattern Recognition

"No code URL or promise found in abstract"
"Derived repo from GitHub Pages (backfill)"

Evidence collected by the PWNC Scanner

Repo contents: index.html, juxtapose, static

Authors Sumith Kulal, Tim Brooks, Alex Aiken, Jiajun Wu, Jimei Yang, Jingwan Lu, Alexei A. Efros, Krishna Kumar Singh arXiv ID 2304.14406 Category cs.CV: Computer Vision Cross-listed cs.AI, cs.GR, cs.LG Citations 57 Venue Computer Vision and Pattern Recognition Repository https://github.com/sumith1896/affordance-insertion Last Checked 9 days ago

Abstract

We study the problem of inferring scene affordances by presenting a method for realistically inserting people into scenes. Given a scene image with a marked region and an image of a person, we insert the person into the scene while respecting the scene affordances. Our model can infer the set of realistic poses given the scene context, re-pose the reference person, and harmonize the composition. We set up the task in a self-supervised fashion by learning to re-pose humans in video clips. We train a large-scale diffusion model on a dataset of 2.4M video clips that produces diverse plausible poses while respecting the scene context. Given the learned human-scene composition, our model can also hallucinate realistic people and scenes when prompted without conditioning and also enables interactive editing. A quantitative evaluation shows that our method synthesizes more realistic human appearance and more natural human-scene interactions than prior work.