Personalized Showcases: Generating Multi-Modal Explanations for Recommendations

June 30, 2022 · Declared Dead · 🏛 Annual International ACM SIGIR Conference on Research and Development in Information Retrieval

Authors An Yan, Zhankui He, Jiacheng Li, Tianyang Zhang, Julian McAuley arXiv ID 2207.00422 Category cs.IR: Information Retrieval Cross-listed cs.AI, cs.CV Citations 58 Venue Annual International ACM SIGIR Conference on Research and Development in Information Retrieval Repository https://github.com/zzxslp/Gest ⭐ 11 Last Checked 1 month ago

Abstract

Existing explanation models generate only text for recommendations but still struggle to produce diverse contents. In this paper, to further enrich explanations, we propose a new task named personalized showcases, in which we provide both textual and visual information to explain our recommendations. Specifically, we first select a personalized image set that is the most relevant to a user's interest toward a recommended item. Then, natural language explanations are generated accordingly given our selected images. For this new task, we collect a large-scale dataset from Google Local (i.e.,~maps) and construct a high-quality subset for generating multi-modal explanations. We propose a personalized multi-modal framework which can generate diverse and visually-aligned explanations via contrastive learning. Experiments show that our framework benefits from different modalities as inputs, and is able to produce more diverse and expressive explanations compared to previous methods on a variety of evaluation metrics.

📄 View on arXiv 🌐 View on ar5iv 📑 PDF 💻 Repository 🎉 Report Code Found

Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

📜 Similar Papers

In the same crypt — Information Retrieval

R.I.P. 👻 Ghosted

Neural Collaborative Filtering

Xiangnan He, Lizi Liao, ... (+4 more)

cs.IR 🏛 WWW 📚 6.8K cites 8 years ago

R.I.P. 👻 Ghosted

LightGCN: Simplifying and Powering Graph Convolution Network for Recommendation

Xiangnan He, Kuan Deng, ... (+4 more)

cs.IR 🏛 SIGIR 📚 4.7K cites 6 years ago

R.I.P. 👻 Ghosted

Graph Convolutional Neural Networks for Web-Scale Recommender Systems

Rex Ying, Ruining He, ... (+4 more)

cs.IR 🏛 KDD 📚 4.0K cites 7 years ago

🌅 🌅 Old Age

Neural Graph Collaborative Filtering

Xiang Wang, Xiangnan He, ... (+3 more)

cs.IR 🏛 SIGIR 📚 3.6K cites 6 years ago

R.I.P. 👻 Ghosted

Self-Attentive Sequential Recommendation

Wang-Cheng Kang, Julian McAuley

cs.IR 🏛 ICDM 📚 3.3K cites 7 years ago

R.I.P. 👻 Ghosted

DeepFM: A Factorization-Machine based Neural Network for CTR Prediction

Huifeng Guo, Ruiming Tang, ... (+3 more)

cs.IR 🏛 IJCAI 📚 3.0K cites 9 years ago

Died the same way — ⚰️ The Empty Tomb

R.I.P. ⚰️ The Empty Tomb

DSFD: Dual Shot Face Detector

Jian Li, Yabiao Wang, ... (+7 more)

cs.CV 🏛 CVPR 📚 462 cites 7 years ago

R.I.P. ⚰️ The Empty Tomb

InstanceCut: from Edges to Instances with MultiCut

Alexander Kirillov, Evgeny Levinkov, ... (+3 more)

cs.CV 🏛 CVPR 📚 261 cites 9 years ago

R.I.P. ⚰️ The Empty Tomb

FLNet: Landmark Driven Fetching and Learning Network for Faithful Talking Facial Animation Synthesis

Kuangxiao Gu, Yuqian Zhou, Thomas Huang

cs.CV 🏛 AAAI 📚 62 cites 6 years ago

R.I.P. ⚰️ The Empty Tomb

XRICL: Cross-lingual Retrieval-Augmented In-Context Learning for Cross-lingual Text-to-SQL Semantic Parsing

Peng Shi, Rui Zhang, ... (+2 more)

cs.CL 🏛 EMNLP 📚 55 cites 3 years ago