| 1 |
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension
Mike Lewis, Yinhan Liu, ... (+6 more)
|
👻
Ghosted
|
cs.CL
|
12.3K |
6 years ago |
| 2 |
Enriching Word Vectors with Subword Information
Piotr Bojanowski, Edouard Grave, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
10.5K |
9 years ago |
| 3 |
Neural Machine Translation of Rare Words with Subword Units
Rico Sennrich, Barry Haddow, Alexandra Birch
|
👻
Ghosted
|
cs.CL
|
8.5K |
10 years ago |
| 4 |
Unsupervised Cross-lingual Representation Learning at Scale
Alexis Conneau, Kartikay Khandelwal, ... (+8 more)
|
👻
Ghosted
|
cs.CL
|
7.9K |
6 years ago |
| 5 |
Get To The Point: Summarization with Pointer-Generator Networks
Abigail See, Peter J. Liu, Christopher D. Manning
|
👻
Ghosted
|
cs.CL
|
4.3K |
8 years ago |
| 6 |
HellaSwag: Can a Machine Really Finish Your Sentence?
Rowan Zellers, Ari Holtzman, ... (+3 more)
|
🌅
Old Age
|
cs.CL
|
3.7K |
6 years ago |
| 7 |
Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks
Kai Sheng Tai, Richard Socher, Christopher D. Manning
|
👻
Ghosted
|
cs.CL
|
3.2K |
11 years ago |
| 8 |
Know What You Don't Know: Unanswerable Questions for SQuAD
Pranav Rajpurkar, Robin Jia, Percy Liang
|
👻
Ghosted
|
cs.CL
|
3.2K |
7 years ago |
| 9 |
Energy and Policy Considerations for Deep Learning in NLP
Emma Strubell, Ananya Ganesh, Andrew McCallum
|
👻
Ghosted
|
cs.CL
|
3.1K |
6 years ago |
| 10 |
Improving Neural Machine Translation Models with Monolingual Data
Rico Sennrich, Barry Haddow, Alexandra Birch
|
👻
Ghosted
|
cs.CL
|
2.9K |
10 years ago |
| 11 |
End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF
Xuezhe Ma, Eduard Hovy
|
👻
Ghosted
|
cs.LG
|
2.8K |
10 years ago |
| 12 |
Google's Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation
Melvin Johnson, Mike Schuster, ... (+10 more)
|
👻
Ghosted
|
cs.CL
|
2.2K |
9 years ago |
| 13 |
Reading Wikipedia to Answer Open-Domain Questions
Danqi Chen, Adam Fisch, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
2.2K |
8 years ago |
| 14 |
SpanBERT: Improving Pre-training by Representing and Predicting Spans
Mandar Joshi, Danqi Chen, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
2.1K |
6 years ago |
| 15 |
Named Entity Recognition with Bidirectional LSTM-CNNs
Jason P. C. Chiu, Eric Nichols
|
👻
Ghosted
|
cs.CL
|
2.0K |
10 years ago |
| 16 |
OpenNMT: Open-source Toolkit for Neural Machine Translation
Guillaume Klein, Yoon Kim, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
1.9K |
8 years ago |
| 17 |
Hierarchical Neural Story Generation
Angela Fan, Mike Lewis, Yann Dauphin
|
👻
Ghosted
|
cs.CL
|
1.9K |
7 years ago |
| 18 |
Multimodal Transformer for Unaligned Multimodal Language Sequences
Yao-Hung Hubert Tsai, Shaojie Bai, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
1.9K |
6 years ago |
| 19 |
BERT Rediscovers the Classical NLP Pipeline
Ian Tenney, Dipanjan Das, Ellie Pavlick
|
👻
Ghosted
|
cs.CL
|
1.7K |
6 years ago |
| 20 |
DialoGPT: Large-Scale Generative Pre-training for Conversational Response Generation
Yizhe Zhang, Siqi Sun, ... (+7 more)
|
👻
Ghosted
|
cs.CL
|
1.7K |
6 years ago |
| 21 |
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
Mirac Suzgun, Nathan Scales, ... (+9 more)
|
💤
Eternal Rest
|
cs.CL
|
1.7K |
3 years ago |
| 22 |
Personalizing Dialogue Agents: I have a dog, do you have pets too?
Saizheng Zhang, Emily Dinan, ... (+4 more)
|
👻
Ghosted
|
cs.AI
|
1.6K |
8 years ago |
| 23 |
How multilingual is Multilingual BERT?
Telmo Pires, Eva Schlinger, Dan Garrette
|
👻
Ghosted
|
cs.CL
|
1.6K |
6 years ago |
| 24 |
"Liar, Liar Pants on Fire": A New Benchmark Dataset for Fake News Detection
William Yang Wang
|
👻
Ghosted
|
cs.CL
|
1.6K |
8 years ago |
| 25 |
Incorporating Copying Mechanism in Sequence-to-Sequence Learning
Jiatao Gu, Zhengdong Lu, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
1.6K |
10 years ago |
| 26 |
Neural Network Acceptability Judgments
Alex Warstadt, Amanpreet Singh, Samuel R. Bowman
|
👻
Ghosted
|
cs.CL
|
1.6K |
7 years ago |
| 27 |
Language (Technology) is Power: A Critical Survey of "Bias" in NLP
Su Lin Blodgett, Solon Barocas, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
1.5K |
5 years ago |
| 28 |
MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversations
Soujanya Poria, Devamanyu Hazarika, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
1.4K |
7 years ago |
| 29 |
Analyzing Multi-Head Self-Attention: Specialized Heads Do the Heavy Lifting, the Rest Can Be Pruned
Elena Voita, David Talbot, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
1.4K |
6 years ago |
| 30 |
CoQA: A Conversational Question Answering Challenge
Siva Reddy, Danqi Chen, Christopher D. Manning
|
🌅
Old Age
|
cs.CL
|
1.3K |
7 years ago |
| 31 |
Subword Regularization: Improving Neural Network Translation Models with Multiple Subword Candidates
Taku Kudo
|
👻
Ghosted
|
cs.CL
|
1.3K |
7 years ago |
| 32 |
Beyond Accuracy: Behavioral Testing of NLP models with CheckList
Marco Tulio Ribeiro, Tongshuang Wu, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
1.3K |
5 years ago |
| 33 |
End-to-End Relation Extraction using LSTMs on Sequences and Tree Structures
Makoto Miwa, Mohit Bansal
|
👻
Ghosted
|
cs.CL
|
1.2K |
10 years ago |
| 34 |
Enhanced LSTM for Natural Language Inference
Qian Chen, Xiaodan Zhu, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
1.2K |
9 years ago |
| 35 |
Neural Responding Machine for Short-Text Conversation
Lifeng Shang, Zhengdong Lu, Hang Li
|
👻
Ghosted
|
cs.CL
|
1.2K |
11 years ago |
| 36 |
Language-agnostic BERT Sentence Embedding
Fangxiaoyu Feng, Yinfei Yang, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
1.2K |
5 years ago |
| 37 |
Adversarial NLI: A New Benchmark for Natural Language Understanding
Yixin Nie, Adina Williams, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
1.2K |
6 years ago |
| 38 |
Latent Retrieval for Weakly Supervised Open Domain Question Answering
Kenton Lee, Ming-Wei Chang, Kristina Toutanova
|
👻
Ghosted
|
cs.CL
|
1.1K |
6 years ago |
| 39 |
A Persona-Based Neural Conversation Model
Jiwei Li, Michel Galley, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
1.1K |
10 years ago |
| 40 |
CamemBERT: a Tasty French Language Model
Louis Martin, Benjamin Muller, ... (+6 more)
|
👻
Ghosted
|
cs.CL
|
1.1K |
6 years ago |
| 41 |
COMET: Commonsense Transformers for Automatic Knowledge Graph Construction
Antoine Bosselut, Hannah Rashkin, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
997 |
6 years ago |
| 42 |
Diachronic Word Embeddings Reveal Statistical Laws of Semantic Change
William L. Hamilton, Jure Leskovec, Dan Jurafsky
|
👻
Ghosted
|
cs.CL
|
977 |
9 years ago |
| 43 |
Assessing the Ability of LSTMs to Learn Syntax-Sensitive Dependencies
Tal Linzen, Emmanuel Dupoux, Yoav Goldberg
|
👻
Ghosted
|
cs.CL
|
965 |
9 years ago |
| 44 |
ABCNN: Attention-Based Convolutional Neural Network for Modeling Sentence Pairs
Wenpeng Yin, Hinrich Schütze, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
965 |
10 years ago |
| 45 |
What you can cram into a single vector: Probing sentence embeddings for linguistic properties
Alexis Conneau, German Kruszewski, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
960 |
7 years ago |
| 46 |
The NarrativeQA Reading Comprehension Challenge
Tomáš Kočiský, Jonathan Schwarz, ... (+5 more)
|
👻
Ghosted
|
cs.CL
|
954 |
8 years ago |
| 47 |
The LAMBADA dataset: Word prediction requiring a broad discourse context
Denis Paperno, Germán Kruszewski, ... (+7 more)
|
👻
Ghosted
|
cs.CL
|
949 |
9 years ago |
| 48 |
Compositional Semantic Parsing on Semi-Structured Tables
Panupong Pasupat, Percy Liang
|
👻
Ghosted
|
cs.CL
|
938 |
10 years ago |
| 49 |
On the Cross-lingual Transferability of Monolingual Representations
Mikel Artetxe, Sebastian Ruder, Dani Yogatama
|
👻
Ghosted
|
cs.CL
|
905 |
6 years ago |
| 50 |
Program Induction by Rationale Generation : Learning to Solve and Explain Algebraic Word Problems
Wang Ling, Dani Yogatama, ... (+2 more)
|
👻
Ghosted
|
cs.AI
|
898 |
8 years ago |