| 1 | Attention Is All You Need<br>Ashish Vaswani, Noam Shazeer, ... (+6 more) | 🌅 Old Age | cs.CL | 166.0K | 8 years ago |
| 2 | Language Models are Few-Shot Learners<br>Tom B. Brown, Benjamin Mann, ... (+29 more) | 👻 Ghosted | cs.CL | 54.2K | 5 years ago |
| 3 | PyTorch: An Imperative Style, High-Performance Deep Learning Library<br>Adam Paszke, Sam Gross, ... (+19 more) | 👻 Ghosted | cs.LG | 49.7K | 6 years ago |
| 4 | A Unified Approach to Interpreting Model Predictions<br>Scott Lundberg, Su-In Lee | 👻 Ghosted | cs.AI | 30.8K | 8 years ago |
| 5 | Inductive Representation Learning on Large Graphs<br>William L. Hamilton, Rex Ying, Jure Leskovec | 👻 Ghosted | cs.SI | 18.5K | 8 years ago |
| 6 | PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space<br>Charles R. Qi, Li Yi, ... (+2 more) | 👻 Ghosted | cs.CV | 13.3K | 8 years ago |
| 7 | Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks<br>Patrick Lewis, Ethan Perez, ... (+10 more) | 👻 Ghosted | cs.CL | 11.3K | 5 years ago |
| 8 | Improved Training of Wasserstein GANs<br>Ishaan Gulrajani, Faruk Ahmed, ... (+3 more) | 👻 Ghosted | cs.LG | 10.5K | 9 years ago |
| 9 | Improved Techniques for Training GANs<br>Tim Salimans, Ian Goodfellow, ... (+4 more) | 👻 Ghosted | cs.LG | 10.0K | 9 years ago |
| 10 | Prototypical Networks for Few-shot Learning<br>Jake Snell, Kevin Swersky, Richard S. Zemel | 👻 Ghosted | cs.LG | 9.5K | 9 years ago |
| 11 | Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting<br>Xingjian Shi, Zhourong Chen, ... (+4 more) | 👻 Ghosted | cs.CV | 9.2K | 10 years ago |
| 12 | XLNet: Generalized Autoregressive Pretraining for Language Understanding<br>Zhilin Yang, Zihang Dai, ... (+4 more) | 🌅 Old Age | cs.CL | 9.2K | 6 years ago |
| 13 | Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering<br>Michaël Defferrard, Xavier Bresson, Pierre Vandergheynst | 👻 Ghosted | cs.LG | 8.3K | 9 years ago |
| 14 | Matching Networks for One Shot Learning<br>Oriol Vinyals, Charles Blundell, ... (+3 more) | 👻 Ghosted | cs.LG | 8.1K | 9 years ago |
| 15 | Spatial Transformer Networks<br>Max Jaderberg, Karen Simonyan, ... (+2 more) | 👻 Ghosted | cs.CV | 7.9K | 10 years ago |
| 16 | wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations<br>Alexei Baevski, Henry Zhou, ... (+2 more) | 👻 Ghosted | cs.CL | 7.6K | 5 years ago |
| 17 | Learning both Weights and Connections for Efficient Neural Networks<br>Song Han, Jeff Pool, ... (+2 more) | 👻 Ghosted | cs.NE | 7.4K | 10 years ago |
| 18 | Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles<br>Balaji Lakshminarayanan, Alexander Pritzel, Charles Blundell | 👻 Ghosted | stat.ML | 7.0K | 9 years ago |
| 19 | Character-level Convolutional Networks for Text Classification<br>Xiang Zhang, Junbo Zhao, Yann LeCun | 👻 Ghosted | cs.LG | 6.8K | 10 years ago |
| 20 | Neural Discrete Representation Learning<br>Aaron van den Oord, Oriol Vinyals, Koray Kavukcuoglu | 👻 Ghosted | cs.LG | 6.6K | 8 years ago |
| 21 | Neural Ordinary Differential Equations<br>Ricky T. Q. Chen, Yulia Rubanova, ... (+2 more) | 👻 Ghosted | cs.LG | 6.4K | 7 years ago |
| 22 | R-FCN: Object Detection via Region-based Fully Convolutional Networks<br>Jifeng Dai, Yi Li, ... (+2 more) | 🌅 Old Age | cs.CV | 6.0K | 9 years ago |
| 23 | What Uncertainties Do We Need in Bayesian Deep Learning for Computer Vision?<br>Alex Kendall, Yarin Gal | 👻 Ghosted | cs.CV | 5.5K | 9 years ago |
| 24 | Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments<br>Ryan Lowe, Yi Wu, ... (+4 more) | 👻 Ghosted | cs.LG | 5.4K | 8 years ago |
| 25 | Dynamic Routing Between Capsules<br>Sara Sabour, Nicholas Frosst, Geoffrey E Hinton | 👻 Ghosted | cs.CV | 5.0K | 8 years ago |
| 26 | Equality of Opportunity in Supervised Learning<br>Moritz Hardt, Eric Price, Nathan Srebro | 👻 Ghosted | cs.LG | 4.9K | 9 years ago |
| 27 | LAION-5B: An open large-scale dataset for training next generation image-text models<br>Christoph Schuhmann, Romain Beaumont, ... (+14 more) | 👻 Ghosted | cs.CV | 4.7K | 3 years ago |
| 28 | Deep reinforcement learning from human preferences<br>Paul Christiano, Jan Leike, ... (+4 more) | 👻 Ghosted | stat.ML | 4.6K | 8 years ago |
| 29 | InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets<br>Xi Chen, Yan Duan, ... (+4 more) | 👻 Ghosted | cs.LG | 4.4K | 9 years ago |
| 30 | ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks<br>Jiasen Lu, Dhruv Batra, ... (+2 more) | 👻 Ghosted | cs.CV | 4.3K | 6 years ago |
| 31 | Teaching Machines to Read and Comprehend<br>Karl Moritz Hermann, Tomáš Kočiský, ... (+5 more) | 👻 Ghosted | cs.CL | 3.8K | 10 years ago |
| 32 | Neural Tangent Kernel: Convergence and Generalization in Neural Networks<br>Arthur Jacot, Franck Gabriel, Clément Hongler | 👻 Ghosted | cs.LG | 3.7K | 7 years ago |
| 33 | Convolutional Networks on Graphs for Learning Molecular Fingerprints<br>David Duvenaud, Dougal Maclaurin, ... (+5 more) | 👻 Ghosted | cs.LG | 3.6K | 10 years ago |
| 34 | Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings<br>Tolga Bolukbasi, Kai-Wei Chang, ... (+3 more) | 👻 Ghosted | cs.CL | 3.5K | 9 years ago |
| 35 | Generative Adversarial Imitation Learning<br>Jonathan Ho, Stefano Ermon | 👻 Ghosted | cs.LG | 3.5K | 9 years ago |
| 36 | Glow: Generative Flow with Invertible 1x1 Convolutions<br>Diederik P. Kingma, Prafulla Dhariwal | 🌅 Old Age | stat.ML | 3.5K | 7 years ago |
| 37 | MixMatch: A Holistic Approach to Semi-Supervised Learning<br>David Berthelot, Nicholas Carlini, ... (+4 more) | 👻 Ghosted | cs.LG | 3.4K | 6 years ago |
| 38 | Pointer Networks<br>Oriol Vinyals, Meire Fortunato, Navdeep Jaitly | 👻 Ghosted | stat.ML | 3.3K | 10 years ago |
| 39 | MS MARCO: A Human Generated MAchine Reading COmprehension Dataset<br>Payal Bajaj, Daniel Campos, ... (+13 more) | 👻 Ghosted | cs.CL | 3.2K | 9 years ago |
| 40 | Gradient Episodic Memory for Continual Learning<br>David Lopez-Paz, Marc'Aurelio Ranzato | 👻 Ghosted | cs.LG | 3.2K | 8 years ago |
| 41 | BinaryConnect: Training Deep Neural Networks with binary weights during propagations<br>Matthieu Courbariaux, Yoshua Bengio, Jean-Pierre David | 👻 Ghosted | cs.LG | 3.2K | 10 years ago |
| 42 | Elucidating the Design Space of Diffusion-Based Generative Models<br>Tero Karras, Miika Aittala, ... (+2 more) | 👻 Ghosted | cs.CV | 2.9K | 3 years ago |
| 43 | Learning to summarize from human feedback<br>Nisan Stiennon, Long Ouyang, ... (+7 more) | 👻 Ghosted | cs.CL | 2.9K | 5 years ago |
| 44 | Attention-Based Models for Speech Recognition<br>Jan Chorowski, Dzmitry Bahdanau, ... (+3 more) | 👻 Ghosted | cs.CL | 2.7K | 10 years ago |
| 45 | Deep Leakage from Gradients<br>Ligeng Zhu, Zhijian Liu, Song Han | 👻 Ghosted | cs.LG | 2.7K | 6 years ago |
| 46 | Conditional Image Generation with PixelCNN Decoders<br>Aaron van den Oord, Nal Kalchbrenner, ... (+4 more) | 👻 Ghosted | cs.CV | 2.7K | 9 years ago |
| 47 | SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems<br>Alex Wang, Yada Pruksachatkun, ... (+6 more) | 👻 Ghosted | cs.CL | 2.7K | 6 years ago |
| 48 | Big Bird: Transformers for Longer Sequences<br>Manzil Zaheer, Guru Guruganesh, ... (+9 more) | 👻 Ghosted | cs.LG | 2.6K | 5 years ago |
| 49 | Hindsight Experience Replay<br>Marcin Andrychowicz, Filip Wolski, ... (+8 more) | 👻 Ghosted | cs.LG | 2.6K | 8 years ago |
| 50 | Unsupervised Data Augmentation for Consistency Training<br>Qizhe Xie, Zihang Dai, ... (+3 more) | 🌅 Old Age | cs.LG | 2.6K | 6 years ago |