🌅
🌅
Old Age
Modeling Context Between Objects for Referring Expression Understanding
August 01, 2016 · 🏛 European Conference on Computer Vision
"No code URL or promise found in abstract"
"HuggingFace models found (backfill)"
Evidence collected by the PWNC Scanner
Authors
Varun K. Nagaraja, Vlad I. Morariu, Larry S. Davis
arXiv ID
1608.00525
Category
cs.CV: Computer Vision
Citations
526
Venue
European Conference on Computer Vision
Repository
https://huggingface.co/linhuixiao/Awesome-Visual-Grounding
Last Checked
9 days ago
Abstract
Referring expressions usually describe an object using properties of the object and relationships of the object with other objects. We propose a technique that integrates context between objects to understand referring expressions. Our approach uses an LSTM to learn the probability of a referring expression, with input features from a region and a context region. The context regions are discovered using multiple-instance learning (MIL) since annotations for context objects are generally not available for training. We utilize max-margin based MIL objective functions for training the LSTM. Experiments on the Google RefExp and UNC RefExp datasets show that modeling context between objects provides better performance than modeling only object properties. We also qualitatively show that our technique can ground a referring expression to its referred region along with the supporting context region.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
📜 Similar Papers
In the same crypt — Computer Vision
🌅
🌅
Old Age
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
🌅
🌅
Old Age
SSD: Single Shot MultiBox Detector
🌅
🌅
Old Age
Squeeze-and-Excitation Networks
🌅
🌅
Old Age
Fast R-CNN
🌅
🌅
Old Age