Old Age
Sentence Representation Learning with Generative Objective rather than Contrastive Objective
October 16, 2022 · Entered Twilight · Conference on Empirical Methods in Natural Language Processing
Repo contents: LICENSE, README.md, SentEval, data, eval.sh, evaluation.py, paser, requirements.txt, scripts, simcse_to_huggingface.py, train.py
Authors
Bohong Wu, Hai Zhao
arXiv ID
2210.08474
Category
cs.CL: Computation & Language
Citations
13
Venue
Conference on Empirical Methods in Natural Language Processing
Repository
https://github.com/chengzhipanpan/PaSeR
⭐ 10
Last Checked
1 month ago
Abstract
Though offering impressive contextualized token-level representations, current pre-trained language models pay less attention to accurately acquiring sentence-level representations during their self-supervised pre-training. Moreover, the contrastive objectives that dominate current sentence representation learning bring little linguistic interpretability and no performance guarantee on downstream semantic tasks. We instead propose a novel generative self-supervised learning objective based on phrase reconstruction. To overcome the drawbacks of previous generative methods, we carefully model intra-sentence structure by breaking one sentence down into pieces of important phrases. Empirical studies show that our generative learning achieves a strong performance improvement and outperforms the current state-of-the-art contrastive methods not only on the STS benchmarks, but also on downstream semantic retrieval and reranking tasks. Our code is available at https://github.com/chengzhipanpan/PaSeR.
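For intuition, here is a minimal sketch of what a phrase-reconstruction objective can look like: a sentence is encoded into a single vector, and that vector must assign high probability to the tokens of an important phrase extracted from the sentence. Everything below is an illustrative assumption, not the authors' PaSeR implementation: the class name, the tiny Transformer encoder, mean pooling, and the bag-of-words reconstruction head (the paper's actual decoder may differ).

```python
# Hypothetical sketch of a generative phrase-reconstruction objective.
# Not the PaSeR implementation; a toy model for intuition only.
import torch
import torch.nn as nn


class PhraseReconstructionSketch(nn.Module):
    """Encode a sentence to one vector, then score how well that vector
    reconstructs the tokens of a salient phrase from the same sentence."""

    def __init__(self, vocab_size=30522, dim=256):
        super().__init__()
        self.tok_emb = nn.Embedding(vocab_size, dim)
        enc_layer = nn.TransformerEncoderLayer(dim, nhead=4, batch_first=True)
        self.encoder = nn.TransformerEncoder(enc_layer, num_layers=2)
        # Bag-of-words head: predicts phrase tokens from the sentence vector.
        self.decoder = nn.Linear(dim, vocab_size)

    def forward(self, sent_ids, phrase_ids):
        # sent_ids: (B, L) token ids of each sentence.
        # phrase_ids: (B, P) token ids of an "important phrase" from it.
        h = self.encoder(self.tok_emb(sent_ids))   # (B, L, D)
        sent_vec = h.mean(dim=1)                   # (B, D) sentence embedding
        logp = torch.log_softmax(self.decoder(sent_vec), dim=-1)  # (B, V)
        # Generative loss: negative log-likelihood of the phrase tokens
        # under the distribution induced by the sentence vector alone.
        return -logp.gather(1, phrase_ids).mean()


# Toy usage: batch of 2 sentences (length 8) with 3-token phrases.
model = PhraseReconstructionSketch()
sent = torch.randint(0, 30522, (2, 8))
phrase = torch.randint(0, 30522, (2, 3))
loss = model(sent, phrase)
loss.backward()
```

Unlike a contrastive objective, a reconstruction loss of this kind needs no negative samples or augmented sentence pairs: the supervision comes entirely from the sentence's own phrases.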
Similar Papers
In the same crypt · Computation & Language
Old Age
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
R.I.P. · 👻 Ghosted
Language Models are Few-Shot Learners
R.I.P. · 👻 Ghosted
RoBERTa: A Robustly Optimized BERT Pretraining Approach
R.I.P. · 👻 Ghosted
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension