| 1 |
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin, Ming-Wei Chang, ... (+2 more)
|
🌅
Old Age
|
cs.CL
|
110.2K |
7 years ago |
| 2 |
"Why Should I Trust You?": Explaining the Predictions of Any Classifier
Marco Tulio Ribeiro, Sameer Singh, Carlos Guestrin
|
👻
Ghosted
|
cs.LG
|
20.3K |
10 years ago |
| 3 |
Deep contextualized word representations
Matthew E. Peters, Mark Neumann, ... (+5 more)
|
👻
Ghosted
|
cs.CL
|
12.0K |
8 years ago |
| 4 |
A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference
Adina Williams, Nikita Nangia, Samuel R. Bowman
|
👻
Ghosted
|
cs.CL
|
4.9K |
8 years ago |
| 5 |
Neural Architectures for Named Entity Recognition
Guillaume Lample, Miguel Ballesteros, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
4.2K |
10 years ago |
| 6 |
fairseq: A Fast, Extensible Toolkit for Sequence Modeling
Myle Ott, Sergey Edunov, ... (+6 more)
|
👻
Ghosted
|
cs.CL
|
3.3K |
6 years ago |
| 7 |
mT5: A massively multilingual pre-trained text-to-text transformer
Linting Xue, Noah Constant, ... (+6 more)
|
👻
Ghosted
|
cs.CL
|
3.0K |
5 years ago |
| 8 |
Self-Attention with Relative Position Representations
Peter Shaw, Jakob Uszkoreit, Ashish Vaswani
|
👻
Ghosted
|
cs.CL
|
2.7K |
8 years ago |
| 9 |
A Diversity-Promoting Objective Function for Neural Conversation Models
Jiwei Li, Michel Galley, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
2.6K |
10 years ago |
| 10 |
CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge
Alon Talmor, Jonathan Herzig, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
2.2K |
7 years ago |
| 11 |
BoolQ: Exploring the Surprising Difficulty of Natural Yes/No Questions
Christopher Clark, Kenton Lee, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
2.1K |
6 years ago |
| 12 |
FEVER: a large-scale dataset for Fact Extraction and VERification
James Thorne, Andreas Vlachos, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
2.0K |
8 years ago |
| 13 |
Attention is not Explanation
Sarthak Jain, Byron C. Wallace
|
🌅
Old Age
|
cs.CL
|
1.6K |
7 years ago |
| 14 |
Annotation Artifacts in Natural Language Inference Data
Suchin Gururangan, Swabha Swayamdipta, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
1.2K |
8 years ago |
| 15 |
DROP: A Reading Comprehension Benchmark Requiring Discrete Reasoning Over Paragraphs
Dheeru Dua, Yizhong Wang, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
1.2K |
7 years ago |
| 16 |
Gender Bias in Coreference Resolution: Evaluation and Debiasing Methods
Jieyu Zhao, Tianlu Wang, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
1.1K |
7 years ago |
| 17 |
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners
Timo Schick, Hinrich Schütze
|
👻
Ghosted
|
cs.CL
|
1.1K |
5 years ago |
| 18 |
A Neural Network Approach to Context-Sensitive Generation of Conversational Responses
Alessandro Sordoni, Michel Galley, ... (+7 more)
|
👻
Ghosted
|
cs.CL
|
913 |
10 years ago |
| 19 |
A Discourse-Aware Attention Model for Abstractive Summarization of Long Documents
Arman Cohan, Franck Dernoncourt, ... (+5 more)
|
👻
Ghosted
|
cs.CL
|
866 |
7 years ago |
| 20 |
A Novel Embedding Model for Knowledge Base Completion Based on Convolutional Neural Network
Dai Quoc Nguyen, Tu Dinh Nguyen, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
801 |
8 years ago |
| 21 |
Linguistic Knowledge and Transferability of Contextual Representations
Nelson F. Liu, Matt Gardner, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
797 |
7 years ago |
| 22 |
A Corpus and Evaluation Framework for Deeper Understanding of Commonsense Stories
Nasrin Mostafazadeh, Nathanael Chambers, ... (+6 more)
|
👻
Ghosted
|
cs.CL
|
780 |
9 years ago |
| 23 |
MathQA: Towards Interpretable Math Word Problem Solving with Operation-Based Formalisms
Aida Amini, Saadia Gabriel, ... (+4 more)
|
🌅
Old Age
|
cs.CL
|
774 |
6 years ago |
| 24 |
Adversarial Example Generation with Syntactically Controlled Paraphrase Networks
Mohit Iyyer, John Wieting, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
768 |
7 years ago |
| 25 |
BERT Post-Training for Review Reading Comprehension and Aspect-based Sentiment Analysis
Hu Xu, Bing Liu, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
763 |
6 years ago |
| 26 |
Visualizing and Understanding Neural Models in NLP
Jiwei Li, Xinlei Chen, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
736 |
10 years ago |
| 27 |
Unsupervised Learning of Sentence Embeddings using Compositional n-Gram Features
Matteo Pagliardini, Prakhar Gupta, Martin Jaggi
|
👻
Ghosted
|
cs.CL
|
724 |
9 years ago |
| 28 |
The Web as a Knowledge-base for Answering Complex Questions
Alon Talmor, Jonathan Berant
|
👻
Ghosted
|
cs.CL
|
718 |
8 years ago |
| 29 |
Pretrained Transformers for Text Ranking: BERT and Beyond
Jimmy Lin, Rodrigo Nogueira, Andrew Yates
|
👻
Ghosted
|
cs.IR
|
709 |
5 years ago |
| 30 |
Gender Bias in Coreference Resolution
Rachel Rudinger, Jason Naradowsky, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
708 |
7 years ago |
| 31 |
RocketQA: An Optimized Training Approach to Dense Passage Retrieval for Open-Domain Question Answering
Yingqi Qu, Yuchen Ding, ... (+7 more)
|
👻
Ghosted
|
cs.CL
|
696 |
5 years ago |
| 32 |
Utilizing BERT for Aspect-Based Sentiment Analysis via Constructing Auxiliary Sentence
Chi Sun, Luyao Huang, Xipeng Qiu
|
👻
Ghosted
|
cs.CL
|
694 |
7 years ago |
| 33 |
On Measuring Social Biases in Sentence Encoders
Chandler May, Alex Wang, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
691 |
7 years ago |
| 34 |
Contextual Augmentation: Data Augmentation by Words with Paradigmatic Relations
Sosuke Kobayashi
|
👻
Ghosted
|
cs.CL
|
663 |
7 years ago |
| 35 |
Explainable Prediction of Medical Codes from Clinical Text
James Mullenbach, Sarah Wiegreffe, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
636 |
8 years ago |
| 36 |
Multi-Way, Multilingual Neural Machine Translation with a Shared Attention Mechanism
Orhan Firat, Kyunghyun Cho, Yoshua Bengio
|
👻
Ghosted
|
cs.CL
|
634 |
10 years ago |
| 37 |
Lipstick on a Pig: Debiasing Methods Cover up Systematic Gender Biases in Word Embeddings But do not Remove Them
Hila Gonen, Yoav Goldberg
|
👻
Ghosted
|
cs.CL
|
605 |
7 years ago |
| 38 |
Newsroom: A Dataset of 1.3 Million Summaries with Diverse Extractive Strategies
Max Grusky, Mor Naaman, Yoav Artzi
|
👻
Ghosted
|
cs.CL
|
603 |
7 years ago |
| 39 |
Learning Distributed Representations of Sentences from Unlabelled Data
Felix Hill, Kyunghyun Cho, Anna Korhonen
|
👻
Ghosted
|
cs.CL
|
589 |
10 years ago |
| 40 |
PAWS: Paraphrase Adversaries from Word Scrambling
Yuan Zhang, Jason Baldridge, Luheng He
|
👻
Ghosted
|
cs.CL
|
580 |
6 years ago |
| 41 |
Learning to Compose Neural Networks for Question Answering
Jacob Andreas, Marcus Rohrbach, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
580 |
10 years ago |
| 42 |
Ranking Sentences for Extractive Summarization with Reinforcement Learning
Shashi Narayan, Shay B. Cohen, Mirella Lapata
|
👻
Ghosted
|
cs.CL
|
579 |
8 years ago |
| 43 |
Delete, Retrieve, Generate: A Simple Approach to Sentiment and Style Transfer
Juncen Li, Robin Jia, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
577 |
7 years ago |
| 44 |
WiC: the Word-in-Context Dataset for Evaluating Context-Sensitive Meaning Representations
Mohammad Taher Pilehvar, Jose Camacho-Collados
|
👻
Ghosted
|
cs.CL
|
565 |
7 years ago |
| 45 |
Recurrent Neural Network Grammars
Chris Dyer, Adhiguna Kuncoro, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
540 |
10 years ago |
| 46 |
Colorless green recurrent networks dream hierarchically
Kristina Gulordava, Piotr Bojanowski, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
533 |
8 years ago |
| 47 |
Visual Storytelling
Ting-Hao, Huang, ... (+14 more)
|
👻
Ghosted
|
cs.CL
|
532 |
9 years ago |
| 48 |
Massively Multilingual Neural Machine Translation
Roee Aharoni, Melvin Johnson, Orhan Firat
|
👻
Ghosted
|
cs.CL
|
526 |
7 years ago |
| 49 |
End-to-End Open-Domain Question Answering with BERTserini
Wei Yang, Yuqing Xie, ... (+6 more)
|
👻
Ghosted
|
cs.CL
|
517 |
7 years ago |
| 50 |
Counter-fitting Word Vectors to Linguistic Constraints
Nikola Mrkšić, Diarmuid Ó Séaghdha, ... (+7 more)
|
👻
Ghosted
|
cs.CL
|
505 |
10 years ago |