Constructing a Visual Relationship Authenticity Dataset

October 11, 2020 · Declared Dead · 🏛 arXiv.org

🦴 CAUSE OF DEATH: Skeleton Repo
Boilerplate only, no real code

Repo contents: LICENSE, README.md, testSplit_AMT.csv, testSplit_parse.csv, trainSplit_AMT.csv, trainSplit_parse.csv, valSplit_AMT.csv, valSplit_parse.csv
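The six CSV names listed above follow a `<split>Split_<suffix>.csv` pattern. As a minimal sketch, the snippet below groups them by split using only that naming pattern. It makes no assumption about what the `AMT` and `parse` suffixes mean or about the columns inside the files, and the helper name `group_by_split` is hypothetical.

```python
from collections import defaultdict

# File names copied from the repo contents list; nothing is read from disk.
FILES = [
    "testSplit_AMT.csv", "testSplit_parse.csv",
    "trainSplit_AMT.csv", "trainSplit_parse.csv",
    "valSplit_AMT.csv", "valSplit_parse.csv",
]


def group_by_split(filenames):
    """Map split name -> {suffix: filename}, based purely on the file name."""
    groups = defaultdict(dict)
    for name in filenames:
        stem = name[:-len(".csv")]                # e.g. "trainSplit_AMT"
        split_part, suffix = stem.split("_", 1)   # "trainSplit", "AMT"
        split = split_part[:-len("Split")]        # "train"
        groups[split][suffix] = name
    return dict(groups)


splits = group_by_split(FILES)
for split, files in sorted(splits.items()):
    print(split, files)
```

Each resulting entry pairs the two files that belong to the same split, which may be a convenient starting point before inspecting the actual column layout.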

Authors: Chenhui Chu, Yuto Takebayashi, Mishra Vipul, Yuta Nakashima
arXiv ID: 2010.05185
Category: cs.CV (Computer Vision)
Cross-listed: cs.AI, cs.CL
Citations: 0
Venue: arXiv.org
Repository: https://github.com/codecreator2053/VR_ClassifiedDataset
Last Checked: 1 month ago

Abstract
A visual relationship denotes a relationship between two objects in an image, which can be represented as a triplet of (subject; predicate; object). Visual relationship detection is crucial for scene understanding in images. Existing visual relationship detection datasets only contain true relationships that correctly describe the content of an image. However, distinguishing false visual relationships from true ones is also crucial for image understanding and grounded natural language processing. In this paper, we construct a visual relationship authenticity dataset, in which both true and false relationships among all objects that appear in the captions of the Flickr30k Entities image caption dataset are annotated. The dataset is available at https://github.com/codecreator2053/VR_ClassifiedDataset. We hope that this dataset can promote the study of both vision and language understanding.
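To make the triplet representation concrete, here is a minimal sketch of how a (subject; predicate; object) relationship with an authenticity label might be modeled. The class name, field names, and example triplets are all hypothetical and chosen for illustration; they are not the schema of the released CSV files.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VisualRelationship:
    """One (subject; predicate; object) triplet plus an authenticity label."""
    subject: str
    predicate: str
    obj: str        # the triplet's "object"; renamed to avoid confusion with Python's built-in
    is_true: bool   # hypothetical field: True if the triplet correctly describes the image


def split_by_authenticity(relationships):
    """Return (true_relationships, false_relationships) as two lists."""
    true_rels = [r for r in relationships if r.is_true]
    false_rels = [r for r in relationships if not r.is_true]
    return true_rels, false_rels


# Made-up examples for illustration only; not taken from the dataset.
examples = [
    VisualRelationship("man", "riding", "horse", True),
    VisualRelationship("horse", "riding", "man", False),
]
true_rels, false_rels = split_by_authenticity(examples)
```

The second example swaps subject and object, which shows why authenticity has to be judged per triplet: the same words in a different order can describe a relationship that does not hold in the image.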

📜 Similar Papers

In the same crypt: Computer Vision

Died the same way: 🦴 Skeleton Repo

R.I.P. 🦴 Skeleton Repo

Neural Style Transfer: A Review

Yongcheng Jing, Yezhou Yang, ... (+4 more)

cs.CV · 🏛 IEEE TVCG · 📚 828 cites · 8 years ago