Differentiable Product Quantization for End-to-End Embedding Compression

August 26, 2019 · Entered Twilight · 🏛 International Conference on Machine Learning

"Last commit was 6.0 years ago (≥5 year threshold)"

Evidence collected by the PWNC Scanner

Repo contents: .gitignore, LICENSE, README.md, core, lm, nmt, text_classification

Authors Ting Chen, Lala Li, Yizhou Sun arXiv ID 1908.09756 Category cs.LG: Machine Learning Cross-listed cs.AI, cs.CL, stat.ML Citations 77 Venue International Conference on Machine Learning Repository https://github.com/chentingpc/dpq_embedding_compression ⭐ 64 Last Checked 1 month ago

Abstract

Embedding layers are commonly used to map discrete symbols into continuous embedding vectors that reflect their semantic meanings. Despite their effectiveness, the number of parameters in an embedding layer increases linearly with the number of symbols and poses a critical challenge on memory and storage constraints. In this work, we propose a generic and end-to-end learnable compression framework termed differentiable product quantization (DPQ). We present two instantiations of DPQ that leverage different approximation techniques to enable differentiability in end-to-end learning. Our method can readily serve as a drop-in alternative for any existing embedding layer. Empirically, DPQ offers significant compression ratios (14-238$\times$) at negligible or no performance cost on 10 datasets across three different language tasks.