| 1 |
SQuAD: 100,000+ Questions for Machine Comprehension of Text
Pranav Rajpurkar, Jian Zhang, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
9.1K |
9 years ago |
| 2 |
Effective Approaches to Attention-based Neural Machine Translation
Minh-Thang Luong, Hieu Pham, Christopher D. Manning
|
👻
Ghosted
|
cs.CL
|
8.3K |
10 years ago |
| 3 |
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Wang, Amanpreet Singh, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
8.2K |
7 years ago |
| 4 |
A large annotated corpus for learning natural language inference
Samuel R. Bowman, Gabor Angeli, ... (+2 more)
|
🌅
Old Age
|
cs.CL
|
4.6K |
10 years ago |
| 5 |
HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering
Zhilin Yang, Peng Qi, ... (+5 more)
|
👻
Ghosted
|
cs.CL
|
3.9K |
7 years ago |
| 6 |
SciBERT: A Pretrained Language Model for Scientific Text
Iz Beltagy, Kyle Lo, Arman Cohan
|
🌅
Old Age
|
cs.CL
|
3.5K |
7 years ago |
| 7 |
LXMERT: Learning Cross-Modality Encoder Representations from Transformers
Hao Tan, Mohit Bansal
|
🌅
Old Age
|
cs.CL
|
2.8K |
6 years ago |
| 8 |
A Neural Attention Model for Abstractive Sentence Summarization
Alexander M. Rush, Sumit Chopra, Jason Weston
|
👻
Ghosted
|
cs.CL
|
2.8K |
10 years ago |
| 9 |
Supervised Learning of Universal Sentence Representations from Natural Language Inference Data
Alexis Conneau, Douwe Kiela, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
2.2K |
8 years ago |
| 10 |
Can a Suit of Armor Conduct Electricity? A New Dataset for Open Book Question Answering
Todor Mihaylov, Peter Clark, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
2.1K |
7 years ago |
| 11 |
Don't Give Me the Details, Just the Summary! Topic-Aware Convolutional Neural Networks for Extreme Summarization
Shashi Narayan, Shay B. Cohen, Mirella Lapata
|
👻
Ghosted
|
cs.CL
|
1.9K |
7 years ago |
| 12 |
Adversarial Examples for Evaluating Reading Comprehension Systems
Robin Jia, Percy Liang
|
👻
Ghosted
|
cs.CL
|
1.7K |
8 years ago |
| 13 |
Tensor Fusion Network for Multimodal Sentiment Analysis
Amir Zadeh, Minghai Chen, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
1.6K |
8 years ago |
| 14 |
Text Summarization with Pretrained Encoders
Yang Liu, Mirella Lapata
|
🌅
Old Age
|
cs.CL
|
1.6K |
6 years ago |
| 15 |
RACE: Large-scale ReAding Comprehension Dataset From Examinations
Guokun Lai, Qizhe Xie, ... (+3 more)
|
💀
404 Not Found
|
cs.CL
|
1.6K |
8 years ago |
| 16 |
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Akira Fukui, Dong Huk Park, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
1.6K |
9 years ago |
| 17 |
XNLI: Evaluating Cross-lingual Sentence Representations
Alexis Conneau, Guillaume Lample, ... (+5 more)
|
👻
Ghosted
|
cs.CL
|
1.5K |
7 years ago |
| 18 |
MultiWOZ -- A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue Modelling
Paweł Budzianowski, Tsung-Hsien Wen, ... (+5 more)
|
👻
Ghosted
|
cs.CL
|
1.4K |
7 years ago |
| 19 |
COMET: A Neural Framework for MT Evaluation
Ricardo Rei, Craig Stewart, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
1.4K |
5 years ago |
| 20 |
A Decomposable Attention Model for Natural Language Inference
Ankur P. Parikh, Oscar Täckström, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
1.4K |
9 years ago |
| 21 |
Deep Reinforcement Learning for Dialogue Generation
Jiwei Li, Will Monroe, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
1.4K |
9 years ago |
| 22 |
How NOT To Evaluate Your Dialogue System: An Empirical Study of Unsupervised Evaluation Metrics for Dialogue Response Generation
Chia-Wei Liu, Ryan Lowe, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
1.4K |
10 years ago |
| 23 |
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Bin Lin, Yang Ye, ... (+5 more)
|
💀
404 Not Found
|
cs.CV
|
1.3K |
2 years ago |
| 24 |
Sequence-Level Knowledge Distillation
Yoon Kim, Alexander M. Rush
|
👻
Ghosted
|
cs.CL
|
1.2K |
9 years ago |
| 25 |
Transformer Feed-Forward Layers Are Key-Value Memories
Mor Geva, Roei Schuster, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
1.2K |
5 years ago |
| 26 |
Long Short-Term Memory-Networks for Machine Reading
Jianpeng Cheng, Li Dong, Mirella Lapata
|
👻
Ghosted
|
cs.CL
|
1.2K |
10 years ago |
| 27 |
How Contextual are Contextualized Word Representations? Comparing the Geometry of BERT, ELMo, and GPT-2 Embeddings
Kawin Ethayarajh
|
👻
Ghosted
|
cs.CL
|
1.1K |
6 years ago |
| 28 |
Attention is not not Explanation
Sarah Wiegreffe, Yuval Pinter
|
👻
Ghosted
|
cs.CL
|
1.1K |
6 years ago |
| 29 |
Men Also Like Shopping: Reducing Gender Bias Amplification using Corpus-level Constraints
Jieyu Zhao, Tianlu Wang, ... (+3 more)
|
👻
Ghosted
|
cs.AI
|
1.0K |
8 years ago |
| 30 |
Key-Value Memory Networks for Directly Reading Documents
Alexander Miller, Adam Fisch, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
1.0K |
9 years ago |
| 31 |
Measuring and Narrowing the Compositionality Gap in Language Models
Ofir Press, Muru Zhang, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
1.0K |
3 years ago |
| 32 |
Universal Adversarial Triggers for Attacking and Analyzing NLP
Eric Wallace, Shi Feng, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
1.0K |
6 years ago |
| 33 |
Generating Natural Language Adversarial Examples
Moustafa Alzantot, Yash Sharma, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
1.0K |
7 years ago |
| 34 |
Aspect Level Sentiment Classification with Deep Memory Network
Duyu Tang, Bing Qin, Ting Liu
|
👻
Ghosted
|
cs.CL
|
967 |
9 years ago |
| 35 |
Semantically Conditioned LSTM-based Natural Language Generation for Spoken Dialogue Systems
Tsung-Hsien Wen, Milica Gasic, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
967 |
10 years ago |
| 36 |
End-to-end Neural Coreference Resolution
Kenton Lee, Luheng He, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
941 |
8 years ago |
| 37 |
Patient Knowledge Distillation for BERT Model Compression
Siqi Sun, Yu Cheng, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
932 |
6 years ago |
| 38 |
Red Teaming Language Models with Language Models
Ethan Perez, Saffron Huang, ... (+7 more)
|
👻
Ghosted
|
cs.CL
|
916 |
4 years ago |
| 39 |
Adversarial Learning for Neural Dialogue Generation
Jiwei Li, Will Monroe, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
914 |
9 years ago |
| 40 |
A Survey on In-context Learning
Qingxiu Dong, Lei Li, ... (+12 more)
|
👻
Ghosted
|
cs.CL
|
911 |
3 years ago |
| 41 |
Transfer Learning for Low-Resource Neural Machine Translation
Barret Zoph, Deniz Yuret, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
894 |
9 years ago |
| 42 |
Evaluating the Factual Consistency of Abstractive Text Summarization
Wojciech Kryściński, Bryan McCann, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
874 |
6 years ago |
| 43 |
Encoding Sentences with Graph Convolutional Networks for Semantic Role Labeling
Diego Marcheggiani, Ivan Titov
|
👻
Ghosted
|
cs.CL
|
869 |
9 years ago |
| 44 |
CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Masked Language Models
Nikita Nangia, Clara Vania, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
867 |
5 years ago |
| 45 |
Rationalizing Neural Predictions
Tao Lei, Regina Barzilay, Tommi Jaakkola
|
👻
Ghosted
|
cs.CL
|
857 |
9 years ago |
| 46 |
Sparse Communication for Distributed Gradient Descent
Alham Fikri Aji, Kenneth Heafield
|
👻
Ghosted
|
cs.CL
|
830 |
8 years ago |
| 47 |
DeepPath: A Reinforcement Learning Method for Knowledge Graph Reasoning
Wenhan Xiong, Thien Hoang, William Yang Wang
|
👻
Ghosted
|
cs.CL
|
813 |
8 years ago |
| 48 |
Using millions of emoji occurrences to learn any-domain representations for detecting sentiment, emotion and sarcasm
Bjarke Felbo, Alan Mislove, ... (+3 more)
|
👻
Ghosted
|
stat.ML
|
787 |
8 years ago |
| 49 |
Large Language Models Can Self-Improve
Jiaxin Huang, Shixiang Shane Gu, ... (+5 more)
|
👻
Ghosted
|
cs.CL
|
784 |
3 years ago |
| 50 |
Graph Convolution over Pruned Dependency Trees Improves Relation Extraction
Yuhao Zhang, Peng Qi, Christopher D. Manning
|
🌅
Old Age
|
cs.CL
|
782 |
7 years ago |