| 1 |
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin, Ming-Wei Chang, ... (+2 more)
|
🌅
Old Age
|
cs.CL
|
110.2K |
7 years ago |
| 2 |
HellaSwag: Can a Machine Really Finish Your Sentence?
Rowan Zellers, Ari Holtzman, ... (+3 more)
|
🌅
Old Age
|
cs.CL
|
3.7K |
6 years ago |
| 3 |
Attention is not Explanation
Sarthak Jain, Byron C. Wallace
|
🌅
Old Age
|
cs.CL
|
1.6K |
7 years ago |
| 4 |
CoQA: A Conversational Question Answering Challenge
Siva Reddy, Danqi Chen, Christopher D. Manning
|
🌅
Old Age
|
cs.CL
|
1.3K |
7 years ago |
| 5 |
MathQA: Towards Interpretable Math Word Problem Solving with Operation-Based Formalisms
Aida Amini, Saadia Gabriel, ... (+4 more)
|
🌅
Old Age
|
cs.CL
|
774 |
6 years ago |
| 6 |
ERASER: A Benchmark to Evaluate Rationalized NLP Models
Jay DeYoung, Sarthak Jain, ... (+5 more)
|
🌅
Old Age
|
cs.CL
|
736 |
6 years ago |
| 7 |
A robust self-learning method for fully unsupervised cross-lingual mappings of word embeddings
Mikel Artetxe, Gorka Labaka, Eneko Agirre
|
🌅
Old Age
|
cs.CL
|
613 |
7 years ago |
| 8 |
Gated-Attention Readers for Text Comprehension
Bhuwan Dhingra, Hanxiao Liu, ... (+3 more)
|
🌅
Old Age
|
cs.CL
|
429 |
9 years ago |
| 9 |
Joint Embedding of Words and Labels for Text Classification
Guoyin Wang, Chunyuan Li, ... (+6 more)
|
🌅
Old Age
|
cs.CL
|
417 |
7 years ago |
| 10 |
Cyclical Annealing Schedule: A Simple Approach to Mitigating KL Vanishing
Hao Fu, Chunyuan Li, ... (+4 more)
|
🌅
Old Age
|
cs.LG
|
417 |
7 years ago |
| 11 |
Improving Topic Models with Latent Feature Word Representations
Dat Quoc Nguyen, Richard Billingsley, ... (+2 more)
|
🌅
Old Age
|
cs.CL
|
354 |
7 years ago |
| 12 |
Baseline Needs More Love: On Simple Word-Embedding-Based Models and Associated Pooling Mechanisms
Dinghan Shen, Guoyin Wang, ... (+7 more)
|
🌅
Old Age
|
cs.CL
|
342 |
7 years ago |
| 13 |
Towards End-to-End Reinforcement Learning of Dialogue Agents for Information Access
Bhuwan Dhingra, Lihong Li, ... (+5 more)
|
🌅
Old Age
|
cs.CL
|
309 |
9 years ago |
| 14 |
Discourse-Aware Neural Extractive Text Summarization
Jiacheng Xu, Zhe Gan, ... (+2 more)
|
🌅
Old Age
|
cs.CL
|
293 |
6 years ago |
| 15 |
Structural Scaffolds for Citation Intent Classification in Scientific Publications
Arman Cohan, Waleed Ammar, ... (+2 more)
|
🌅
Old Age
|
cs.CL
|
285 |
7 years ago |
| 16 |
Zero-Shot Entity Linking by Reading Entity Descriptions
Lajanugen Logeswaran, Ming-Wei Chang, ... (+4 more)
|
🌅
Old Age
|
cs.CL
|
276 |
6 years ago |
| 17 |
Chains of Reasoning over Entities, Relations, and Text using Recurrent Neural Networks
Rajarshi Das, Arvind Neelakantan, ... (+2 more)
|
🌅
Old Age
|
cs.CL
|
274 |
9 years ago |
| 18 |
SKEP: Sentiment Knowledge Enhanced Pre-training for Sentiment Analysis
Hao Tian, Can Gao, ... (+6 more)
|
🌅
Old Age
|
cs.CL
|
272 |
5 years ago |
| 19 |
Multi-hop Reading Comprehension through Question Decomposition and Rescoring
Sewon Min, Victor Zhong, ... (+2 more)
|
🌅
Old Age
|
cs.CL
|
263 |
6 years ago |
| 20 |
A Unified Model for Extractive and Abstractive Summarization using Inconsistency Loss
Wan-Ting Hsu, Chieh-Kai Lin, ... (+4 more)
|
🌅
Old Age
|
cs.CL
|
248 |
7 years ago |
| 21 |
Joint Slot Filling and Intent Detection via Capsule Neural Networks
Chenwei Zhang, Yaliang Li, ... (+3 more)
|
🌅
Old Age
|
cs.CL
|
246 |
7 years ago |
| 22 |
Towards Topic-Guided Conversational Recommender System
Kun Zhou, Yuanhang Zhou, ... (+3 more)
|
🌅
Old Age
|
cs.CL
|
235 |
5 years ago |
| 23 |
Improving Grammatical Error Correction via Pre-Training a Copy-Augmented Architecture with Unlabeled Data
Wei Zhao, Liang Wang, ... (+3 more)
|
🌅
Old Age
|
cs.CL
|
227 |
7 years ago |
| 24 |
SParC: Cross-Domain Semantic Parsing in Context
Tao Yu, Rui Zhang, ... (+17 more)
|
🌅
Old Age
|
cs.CL
|
222 |
6 years ago |
| 25 |
MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
Jie Lei, Liwei Wang, ... (+4 more)
|
🌅
Old Age
|
cs.CL
|
202 |
5 years ago |
| 26 |
SciREX: A Challenge Dataset for Document-Level Information Extraction
Sarthak Jain, Madeleine van Zuylen, ... (+2 more)
|
🌅
Old Age
|
cs.CL
|
186 |
5 years ago |
| 27 |
Analyzing Polarization in Social Media: Method and Application to Tweets on 21 Mass Shootings
Dorottya Demszky, Nikhil Garg, ... (+5 more)
|
🌅
Old Age
|
cs.CL
|
180 |
7 years ago |
| 28 |
Compositional Questions Do Not Necessitate Multi-hop Reasoning
Sewon Min, Eric Wallace, ... (+4 more)
|
🌅
Old Age
|
cs.CL
|
160 |
6 years ago |
| 29 |
Improving Knowledge Graph Embedding Using Simple Constraints
Boyang Ding, Quan Wang, ... (+2 more)
|
🌅
Old Age
|
cs.AI
|
151 |
7 years ago |
| 30 |
Deep Multitask Learning for Semantic Dependency Parsing
Hao Peng, Sam Thomson, Noah A. Smith
|
🌅
Old Age
|
cs.CL
|
148 |
8 years ago |
| 31 |
Question Answering on Knowledge Bases and Text using Universal Schema and Memory Networks
Rajarshi Das, Manzil Zaheer, ... (+2 more)
|
🌅
Old Age
|
cs.CL
|
143 |
8 years ago |
| 32 |
Target-Guided Open-Domain Conversation
Jianheng Tang, Tiancheng Zhao, ... (+4 more)
|
🌅
Old Age
|
cs.CL
|
141 |
6 years ago |
| 33 |
Tracking State Changes in Procedural Text: A Challenge Dataset and Models for Process Paragraph Comprehension
Bhavana Dalvi Mishra, Lifu Huang, ... (+3 more)
|
🌅
Old Age
|
cs.CL
|
140 |
7 years ago |
| 34 |
Explainable Automated Fact-Checking: A Survey
Neema Kotonya, Francesca Toni
|
🌅
Old Age
|
cs.CL
|
139 |
5 years ago |
| 35 |
SUPERT: Towards New Frontiers in Unsupervised Evaluation Metrics for Multi-Document Summarization
Yang Gao, Wei Zhao, Steffen Eger
|
🌅
Old Age
|
cs.CL
|
136 |
5 years ago |
| 36 |
Playing Text-Adventure Games with Graph-Based Deep Reinforcement Learning
Prithviraj Ammanabrolu, Mark O. Riedl
|
🌅
Old Age
|
cs.CL
|
131 |
7 years ago |
| 37 |
CharBERT: Character-aware Pre-trained Language Model
Wentao Ma, Yiming Cui, ... (+4 more)
|
🌅
Old Age
|
cs.CL
|
123 |
5 years ago |
| 38 |
Question Answering through Transfer Learning from Large Fine-grained Supervision Data
Sewon Min, Minjoon Seo, Hannaneh Hajishirzi
|
🌅
Old Age
|
cs.CL
|
121 |
9 years ago |
| 39 |
Improving Machine Reading Comprehension with General Reading Strategies
Kai Sun, Dian Yu, ... (+2 more)
|
🌅
Old Age
|
cs.CL
|
118 |
7 years ago |
| 40 |
Argument Mining with Structured SVMs and RNNs
Vlad Niculae, Joonsuk Park, Claire Cardie
|
🌅
Old Age
|
cs.CL
|
116 |
8 years ago |
| 41 |
Length-Adaptive Transformer: Train Once with Length Drop, Use Anytime with Search
Gyuwan Kim, Kyunghyun Cho
|
🌅
Old Age
|
cs.CL
|
107 |
5 years ago |
| 42 |
A Qualitative Comparison of CoQA, SQuAD 2.0 and QuAC
Mark Yatskar
|
🌅
Old Age
|
cs.CL
|
102 |
7 years ago |
| 43 |
Top-down Tree Long Short-Term Memory Networks
Xingxing Zhang, Liang Lu, Mirella Lapata
|
🌅
Old Age
|
cs.CL
|
101 |
10 years ago |
| 44 |
GCDT: A Global Context Enhanced Deep Transition Architecture for Sequence Labeling
Yijin Liu, Fandong Meng, ... (+4 more)
|
🌅
Old Age
|
cs.CL
|
96 |
6 years ago |
| 45 |
Dependency Parsing as Head Selection
Xingxing Zhang, Jianpeng Cheng, Mirella Lapata
|
🌅
Old Age
|
cs.CL
|
95 |
9 years ago |
| 46 |
Learning Multilingual Word Embeddings in Latent Metric Space: A Geometric Approach
Pratik Jawanpuria, Arjun Balgovind, ... (+2 more)
|
🌅
Old Age
|
cs.LG
|
80 |
7 years ago |
| 47 |
Merge and Label: A novel neural network architecture for nested NER
Joseph Fisher, Andreas Vlachos
|
🌅
Old Age
|
cs.CL
|
76 |
6 years ago |
| 48 |
Speak to your Parser: Interactive Text-to-SQL with Natural Language Feedback
Ahmed Elgohary, Saghar Hosseini, Ahmed Hassan Awadallah
|
🌅
Old Age
|
cs.CL
|
75 |
5 years ago |
| 49 |
Representation Learning for Grounded Spatial Reasoning
Michael Janner, Karthik Narasimhan, Regina Barzilay
|
🌅
Old Age
|
cs.CL
|
73 |
8 years ago |
| 50 |
Cross-lingual Distillation for Text Classification
Ruochen Xu, Yiming Yang
|
🌅
Old Age
|
cs.CL
|
72 |
8 years ago |