| 51 |
TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems
Martín Abadi, Ashish Agarwal, ... (+38 more)
|
👻
Ghosted
|
cs.DC
|
11.6K |
10 years ago |
| 52 |
Aggregated Residual Transformations for Deep Neural Networks
Saining Xie, Ross Girshick, ... (+3 more)
|
🌅
Old Age
|
cs.CV
|
11.4K |
9 years ago |
| 53 |
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
Patrick Lewis, Ethan Perez, ... (+10 more)
|
👻
Ghosted
|
cs.CL
|
11.3K |
5 years ago |
| 54 |
mixup: Beyond Empirical Risk Minimization
Hongyi Zhang, Moustapha Cisse, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
11.3K |
8 years ago |
| 55 |
Perceptual Losses for Real-Time Style Transfer and Super-Resolution
Justin Johnson, Alexandre Alahi, Li Fei-Fei
|
👻
Ghosted
|
cs.CV
|
11.2K |
10 years ago |
| 56 |
Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning
Yarin Gal, Zoubin Ghahramani
|
👻
Ghosted
|
stat.ML
|
11.0K |
10 years ago |
| 57 |
Identity Mappings in Deep Residual Networks
Kaiming He, Xiangyu Zhang, ... (+2 more)
|
🌅
Old Age
|
cs.CV
|
11.0K |
10 years ago |
| 58 |
Domain-Adversarial Training of Neural Networks
Yaroslav Ganin, Evgeniya Ustinova, ... (+6 more)
|
👻
Ghosted
|
stat.ML
|
10.8K |
10 years ago |
| 59 |
Denoising Diffusion Implicit Models
Jiaming Song, Chenlin Meng, Stefano Ermon
|
👻
Ghosted
|
cs.LG
|
10.7K |
5 years ago |
| 60 |
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
Kelvin Xu, Jimmy Ba, ... (+6 more)
|
👻
Ghosted
|
cs.LG
|
10.6K |
11 years ago |
| 61 |
Enriching Word Vectors with Subword Information
Piotr Bojanowski, Edouard Grave, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
10.5K |
9 years ago |
| 62 |
Improved Training of Wasserstein GANs
Ishaan Gulrajani, Faruk Ahmed, ... (+3 more)
|
👻
Ghosted
|
cs.LG
|
10.5K |
9 years ago |
| 63 |
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja, Aurick Zhou, ... (+2 more)
|
🌅
Old Age
|
cs.LG
|
10.4K |
8 years ago |
| 64 |
Learning Deep Features for Discriminative Localization
Bolei Zhou, Aditya Khosla, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
10.3K |
10 years ago |
| 65 |
V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation
Fausto Milletari, Nassir Navab, Seyed-Ahmad Ahmadi
|
👻
Ghosted
|
cs.CV
|
10.2K |
9 years ago |
| 66 |
Improved Techniques for Training GANs
Tim Salimans, Ian Goodfellow, ... (+4 more)
|
👻
Ghosted
|
cs.LG
|
10.0K |
9 years ago |
| 67 |
Non-local Neural Networks
Xiaolong Wang, Ross Girshick, ... (+2 more)
|
🌅
Old Age
|
cs.CV
|
9.9K |
8 years ago |
| 68 |
SGDR: Stochastic Gradient Descent with Warm Restarts
Ilya Loshchilov, Frank Hutter
|
🌅
Old Age
|
cs.LG
|
9.8K |
9 years ago |
| 69 |
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih, Adrià Puigdomènech Badia, ... (+6 more)
|
👻
Ghosted
|
cs.LG
|
9.7K |
10 years ago |
| 70 |
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
Song Han, Huizi Mao, William J. Dally
|
👻
Ghosted
|
cs.CV
|
9.7K |
10 years ago |
| 71 |
Rethinking Atrous Convolution for Semantic Image Segmentation
Liang-Chieh Chen, George Papandreou, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
9.6K |
8 years ago |
| 72 |
Towards Evaluating the Robustness of Neural Networks
Nicholas Carlini, David Wagner
|
👻
Ghosted
|
cs.CR
|
9.5K |
9 years ago |
| 73 |
Prototypical Networks for Few-shot Learning
Jake Snell, Kevin Swersky, Richard S. Zemel
|
👻
Ghosted
|
cs.LG
|
9.5K |
9 years ago |
| 74 |
Score-Based Generative Modeling through Stochastic Differential Equations
Yang Song, Jascha Sohl-Dickstein, ... (+4 more)
|
👻
Ghosted
|
cs.LG
|
9.3K |
5 years ago |
| 75 |
How Powerful are Graph Neural Networks?
Keyulu Xu, Weihua Hu, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
9.3K |
7 years ago |
| 76 |
Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting
Xingjian Shi, Zhourong Chen, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
9.2K |
10 years ago |
| 77 |
XLNet: Generalized Autoregressive Pretraining for Language Understanding
Zhilin Yang, Zihang Dai, ... (+4 more)
|
🌅
Old Age
|
cs.CL
|
9.2K |
6 years ago |
| 78 |
Multi-Scale Context Aggregation by Dilated Convolutions
Fisher Yu, Vladlen Koltun
|
👻
Ghosted
|
cs.CV
|
9.2K |
10 years ago |
| 79 |
Deep Unsupervised Learning using Nonequilibrium Thermodynamics
Jascha Sohl-Dickstein, Eric A. Weiss, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
9.2K |
11 years ago |
| 80 |
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
Joao Carreira, Andrew Zisserman
|
👻
Ghosted
|
cs.CV
|
9.1K |
8 years ago |
| 81 |
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter
Victor Sanh, Lysandre Debut, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
9.1K |
6 years ago |
| 82 |
SQuAD: 100,000+ Questions for Machine Comprehension of Text
Pranav Rajpurkar, Jian Zhang, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
9.1K |
9 years ago |
| 83 |
Overcoming catastrophic forgetting in neural networks
James Kirkpatrick, Razvan Pascanu, ... (+12 more)
|
👻
Ghosted
|
cs.LG
|
9.1K |
9 years ago |
| 84 |
Deep Reinforcement Learning with Double Q-learning
Hado van Hasselt, Arthur Guez, David Silver
|
👻
Ghosted
|
cs.LG
|
8.7K |
10 years ago |
| 85 |
Wide Residual Networks
Sergey Zagoruyko, Nikos Komodakis
|
🌅
Old Age
|
cs.CV
|
8.7K |
9 years ago |
| 86 |
Neural Message Passing for Quantum Chemistry
Justin Gilmer, Samuel S. Schoenholz, ... (+3 more)
|
👻
Ghosted
|
cs.LG
|
8.6K |
8 years ago |
| 87 |
Training data-efficient image transformers & distillation through attention
Hugo Touvron, Matthieu Cord, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
8.5K |
5 years ago |
| 88 |
Neural Machine Translation of Rare Words with Subword Units
Rico Sennrich, Barry Haddow, Alexandra Birch
|
👻
Ghosted
|
cs.CL
|
8.5K |
10 years ago |
| 89 |
Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering
Michaël Defferrard, Xavier Bresson, Pierre Vandergheynst
|
👻
Ghosted
|
cs.LG
|
8.3K |
9 years ago |
| 90 |
Effective Approaches to Attention-based Neural Machine Translation
Minh-Thang Luong, Hieu Pham, Christopher D. Manning
|
👻
Ghosted
|
cs.CL
|
8.3K |
10 years ago |
| 91 |
Progressive Growing of GANs for Improved Quality, Stability, and Variation
Tero Karras, Timo Aila, ... (+2 more)
|
👻
Ghosted
|
cs.NE
|
8.2K |
8 years ago |
| 92 |
SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size
Forrest N. Iandola, Song Han, ... (+4 more)
|
🌅
Old Age
|
cs.CV
|
8.2K |
10 years ago |
| 93 |
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Wang, Amanpreet Singh, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
8.2K |
7 years ago |
| 94 |
Matching Networks for One Shot Learning
Oriol Vinyals, Charles Blundell, ... (+3 more)
|
👻
Ghosted
|
cs.LG
|
8.1K |
9 years ago |
| 95 |
UNet++: A Nested U-Net Architecture for Medical Image Segmentation
Zongwei Zhou, Md Mahfuzur Rahman Siddiquee, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
8.1K |
7 years ago |
| 96 |
ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices
Xiangyu Zhang, Xinyu Zhou, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
8.0K |
8 years ago |
| 97 |
WaveNet: A Generative Model for Raw Audio
Aaron van den Oord, Sander Dieleman, ... (+7 more)
|
👻
Ghosted
|
cs.SD
|
8.0K |
9 years ago |
| 98 |
Beyond a Gaussian Denoiser: Residual Learning of Deep CNN for Image Denoising
Kai Zhang, Wangmeng Zuo, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
7.9K |
9 years ago |
| 99 |
Spatial Transformer Networks
Max Jaderberg, Karen Simonyan, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
7.9K |
10 years ago |
| 100 |
Unsupervised Cross-lingual Representation Learning at Scale
Alexis Conneau, Kartikay Khandelwal, ... (+8 more)
|
👻
Ghosted
|
cs.CL
|
7.9K |
6 years ago |