R.I.P.
๐ป
Ghosted
IncDSI: Incrementally Updatable Document Retrieval
July 19, 2023 ยท Entered Twilight ยท ๐ International Conference on Machine Learning
Repo contents: BertModel.py, LICENSE, README.md, incdsi.py, incdsi.sh, save_embeddings.py, save_embeddings.sh, train_initial_model.py, train_initial_model.sh, utils.py
Authors
Varsha Kishore, Chao Wan, Justin Lovelace, Yoav Artzi, Kilian Q. Weinberger
arXiv ID
2307.10323
Category
cs.IR: Information Retrieval
Cross-listed
cs.CL,
cs.LG
Citations
19
Venue
International Conference on Machine Learning
Repository
https://github.com/varshakishore/IncDSI
โญ 11
Last Checked
1 month ago
Abstract
Differentiable Search Index is a recently proposed paradigm for document retrieval, that encodes information about a corpus of documents within the parameters of a neural network and directly maps queries to corresponding documents. These models have achieved state-of-the-art performances for document retrieval across many benchmarks. These kinds of models have a significant limitation: it is not easy to add new documents after a model is trained. We propose IncDSI, a method to add documents in real time (about 20-50ms per document), without retraining the model on the entire dataset (or even parts thereof). Instead we formulate the addition of documents as a constrained optimization problem that makes minimal changes to the network parameters. Although orders of magnitude faster, our approach is competitive with re-training the model on the whole dataset and enables the development of document retrieval systems that can be updated with new information in real-time. Our code for IncDSI is available at https://github.com/varshakishore/IncDSI.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Information Retrieval
R.I.P.
๐ป
Ghosted
LightGCN: Simplifying and Powering Graph Convolution Network for Recommendation
R.I.P.
๐ป
Ghosted
Graph Convolutional Neural Networks for Web-Scale Recommender Systems
๐
๐
Old Age
Neural Graph Collaborative Filtering
R.I.P.
๐ป
Ghosted
Self-Attentive Sequential Recommendation
R.I.P.
๐ป
Ghosted