TSGP: Two-Stage Generative Prompting for Unsupervised Commonsense Question Answering

November 24, 2022 · Declared Dead · 🏛 Conference on Empirical Methods in Natural Language Processing

Authors: Yueqing Sun, Yu Zhang, Le Qi, Qi Shi
arXiv ID: 2211.13515
Category: cs.CL: Computation & Language
Cross-listed: cs.AI
Citations: 7
Venue: Conference on Empirical Methods in Natural Language Processing
Repository: https://github.com/Yueqing-Sun/TSGP ⭐ 1
Last Checked: 1 month ago

Abstract

Unsupervised commonsense question answering requires mining effective commonsense knowledge without relying on labeled task data. Previous methods typically retrieved knowledge from traditional knowledge bases or used pre-trained language models (PrLMs) to generate fixed types of knowledge, both of which generalize poorly. In this paper, we aim to address this limitation by leveraging the implicit knowledge stored in PrLMs, and we propose a two-stage prompt-based unsupervised commonsense question answering framework (TSGP). Specifically, we first use knowledge generation prompts to generate the knowledge required for questions with unlimited types and possible candidate answers independent of specified choices. Then, we further utilize answer generation prompts to generate possible candidate answers independent of specified choices. Experimental results and analysis on three different commonsense reasoning tasks, CommonsenseQA, OpenBookQA, and SocialIQA, demonstrate that TSGP significantly improves the reasoning ability of language models in unsupervised settings. Our code is available at: https://github.com/Yueqing-Sun/TSGP.
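
The two-stage flow described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the prompt templates, the function names, the `toy_lm` stand-in for a PrLM, and the token-overlap step used to map a generated answer back to one of the given choices are all assumptions made for the example.

```python
from typing import Callable, List

# Hypothetical type: any function that maps a prompt string to generated text.
Generator = Callable[[str], str]


def generate_knowledge(question: str, lm: Generator) -> str:
    """Stage 1: prompt the model for free-form knowledge about the question."""
    prompt = f"Question: {question}\nRelevant knowledge:"  # assumed template
    return lm(prompt).strip()


def generate_answer(question: str, knowledge: str, lm: Generator) -> str:
    """Stage 2: prompt for a candidate answer without showing the choices."""
    prompt = f"Knowledge: {knowledge}\nQuestion: {question}\nAnswer:"  # assumed template
    return lm(prompt).strip()


def token_overlap(a: str, b: str) -> float:
    """Jaccard overlap of lowercase tokens (an assumed stand-in for a similarity model)."""
    tokens_a, tokens_b = set(a.lower().split()), set(b.lower().split())
    union = tokens_a | tokens_b
    return len(tokens_a & tokens_b) / len(union) if union else 0.0


def answer_question(question: str, choices: List[str], lm: Generator) -> str:
    """Run both stages, then pick the choice closest to the generated answer."""
    knowledge = generate_knowledge(question, lm)
    candidate = generate_answer(question, knowledge, lm)
    return max(choices, key=lambda choice: token_overlap(candidate, choice))


def toy_lm(prompt: str) -> str:
    """Canned generator used only so the sketch runs without a real model."""
    if prompt.endswith("Answer:"):
        return "a bank"
    return "People keep money in banks."


if __name__ == "__main__":
    print(answer_question("Where do people keep money?", ["bank", "forest", "river"], toy_lm))
```

In practice `toy_lm` would be replaced by a call to a real generative model, and the overlap score by whatever matching method the framework actually uses; the sketch only shows that neither generation stage sees the answer choices.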


📜 Similar Papers

In the same crypt — Computation & Language

🌅 🌅 Old Age

Attention Is All You Need

Ashish Vaswani, Noam Shazeer, ... (+6 more)

cs.CL 🏛 NeurIPS 📚 166.0K cites 8 years ago

🌅 🌅 Old Age

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Jacob Devlin, Ming-Wei Chang, ... (+2 more)

cs.CL 🏛 NAACL 📚 110.2K cites 7 years ago

R.I.P. 👻 Ghosted

Language Models are Few-Shot Learners

Tom B. Brown, Benjamin Mann, ... (+29 more)

cs.CL 🏛 NeurIPS 📚 54.2K cites 5 years ago

R.I.P. 👻 Ghosted

RoBERTa: A Robustly Optimized BERT Pretraining Approach

Yinhan Liu, Myle Ott, ... (+8 more)

cs.CL 🏛 arXiv 📚 28.4K cites 6 years ago

R.I.P. 👻 Ghosted

BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension

Mike Lewis, Yinhan Liu, ... (+6 more)

cs.CL 🏛 ACL 📚 12.3K cites 6 years ago

R.I.P. 👻 Ghosted

Deep contextualized word representations

Matthew E. Peters, Mark Neumann, ... (+5 more)

cs.CL 🏛 NAACL 📚 12.0K cites 8 years ago

Died the same way — ⚰️ The Empty Tomb

R.I.P. ⚰️ The Empty Tomb

DSFD: Dual Shot Face Detector

Jian Li, Yabiao Wang, ... (+7 more)

cs.CV 🏛 CVPR 📚 462 cites 7 years ago

R.I.P. ⚰️ The Empty Tomb

InstanceCut: from Edges to Instances with MultiCut

Alexander Kirillov, Evgeny Levinkov, ... (+3 more)

cs.CV 🏛 CVPR 📚 261 cites 9 years ago

R.I.P. ⚰️ The Empty Tomb

FLNet: Landmark Driven Fetching and Learning Network for Faithful Talking Facial Animation Synthesis

Kuangxiao Gu, Yuqian Zhou, Thomas Huang

cs.CV 🏛 AAAI 📚 62 cites 6 years ago

R.I.P. ⚰️ The Empty Tomb

Personalized Showcases: Generating Multi-Modal Explanations for Recommendations

An Yan, Zhankui He, ... (+3 more)

cs.IR 🏛 SIGIR 📚 58 cites 3 years ago