Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking

May 13, 2024 · Declared Dead · 🏛 European Conference on Information Retrieval

Repo contents: README.md

Authors: Ferdinand Schlatt, Maik Fröbe, Harrisen Scells, Shengyao Zhuang, Bevan Koopman, Guido Zuccon, Benno Stein, Martin Potthast, Matthias Hagen
arXiv ID: 2405.07920
Category: cs.IR (Information Retrieval)
Citations: 12
Venue: European Conference on Information Retrieval
Repository: https://github.com/webis-de/ECIR-25
Last Checked: 1 month ago

Abstract

Cross-encoders distilled from large language models (LLMs) are often more effective re-rankers than cross-encoders fine-tuned on manually labeled data. However, distilled models do not match the effectiveness of their teacher LLMs. We hypothesize that this effectiveness gap exists because previous work has not applied the best-suited methods for fine-tuning cross-encoders on manually labeled data (e.g., hard-negative sampling, deep sampling, and listwise loss functions). To close this gap, we create a new dataset, Rank-DistiLLM. Cross-encoders trained on Rank-DistiLLM achieve the effectiveness of LLMs while being up to 173 times faster and 24 times more memory efficient. Our code and data are available at https://github.com/webis-de/ECIR-25.
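
The abstract names loss functions computed over a whole ranked list as one ingredient of distillation, but does not spell one out. As a rough illustration only, the sketch below computes a RankNet-style pairwise logistic loss over every pair in a single teacher-ranked list. The function name, the argument layout, and the choice of RankNet are assumptions made for this example; the abstract does not confirm that this is the loss the authors use.

```python
import math


def ranknet_distillation_loss(student_scores, teacher_ranking):
    """RankNet-style loss for one query (illustrative sketch, not the paper's code).

    student_scores: list of floats, one relevance score per candidate passage.
    teacher_ranking: list of passage indices ordered best-first by the teacher.
    Returns the mean of log(1 + exp(-(s_better - s_worse))) over all ordered pairs.
    """
    total = 0.0
    pairs = 0
    for pos, better in enumerate(teacher_ranking):
        # Every passage ranked after `better` should receive a lower student score.
        for worse in teacher_ranking[pos + 1:]:
            margin = student_scores[better] - student_scores[worse]
            # Numerically stable softplus(-margin) = log(1 + exp(-margin)).
            total += max(-margin, 0.0) + math.log1p(math.exp(-abs(margin)))
            pairs += 1
    return total / pairs if pairs else 0.0


if __name__ == "__main__":
    scores = [3.0, 1.0, 2.0]  # hypothetical student scores for passages 0, 1, 2
    print(ranknet_distillation_loss(scores, [0, 2, 1]))  # teacher order agrees
    print(ranknet_distillation_loss(scores, [1, 2, 0]))  # teacher order reversed
```

The loss is small when the student's scores already follow the teacher's order and grows when they contradict it, so minimizing it pushes the cross-encoder toward the teacher's ranking.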

📜 Similar Papers

In the same crypt — Information Retrieval

R.I.P. 👻 Ghosted

Neural Collaborative Filtering

Xiangnan He, Lizi Liao, ... (+4 more)

cs.IR 🏛 WWW 📚 6.8K cites 8 years ago

R.I.P. 👻 Ghosted

LightGCN: Simplifying and Powering Graph Convolution Network for Recommendation

Xiangnan He, Kuan Deng, ... (+4 more)

cs.IR 🏛 SIGIR 📚 4.7K cites 6 years ago

R.I.P. 👻 Ghosted

Graph Convolutional Neural Networks for Web-Scale Recommender Systems

Rex Ying, Ruining He, ... (+4 more)

cs.IR 🏛 KDD 📚 4.0K cites 7 years ago

🌅 🌅 Old Age

Neural Graph Collaborative Filtering

Xiang Wang, Xiangnan He, ... (+3 more)

cs.IR 🏛 SIGIR 📚 3.6K cites 6 years ago

R.I.P. 👻 Ghosted

Self-Attentive Sequential Recommendation

Wang-Cheng Kang, Julian McAuley

cs.IR 🏛 ICDM 📚 3.3K cites 7 years ago

R.I.P. 👻 Ghosted

DeepFM: A Factorization-Machine based Neural Network for CTR Prediction

Huifeng Guo, Ruiming Tang, ... (+3 more)

cs.IR 🏛 IJCAI 📚 3.0K cites 9 years ago

Died the same way — 📜 Death by README

R.I.P. 📜 Death by README

Momentum Contrast for Unsupervised Visual Representation Learning

Kaiming He, Haoqi Fan, ... (+3 more)

cs.CV 🏛 CVPR 📚 14.3K cites 6 years ago

R.I.P. 📜 Death by README

LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model

Peng Gao, Jiaming Han, ... (+10 more)

cs.CV 🏛 arXiv 📚 716 cites 2 years ago

R.I.P. 📜 Death by README

Revisiting Graph based Collaborative Filtering: A Linear Residual Graph Convolutional Network Approach

Lei Chen, Le Wu, ... (+3 more)

cs.IR 🏛 AAAI 📚 609 cites 6 years ago

R.I.P. 📜 Death by README

Diffusion Models for Medical Image Analysis: A Comprehensive Survey

Amirhossein Kazerouni, Ehsan Khodapanah Aghdam, ... (+5 more)

eess.IV 🏛 MedIA 📚 599 cites 3 years ago