| 1 | Attention Is All You Need | Ashish Vaswani, Noam Shazeer, ... (+6 more) | 🌅 Old Age | cs.CL | 166.0K | 8 years ago |
| 2 | XLNet: Generalized Autoregressive Pretraining for Language Understanding | Zhilin Yang, Zihang Dai, ... (+4 more) | 🌅 Old Age | cs.CL | 9.2K | 6 years ago |
| 3 | R-FCN: Object Detection via Region-based Fully Convolutional Networks | Jifeng Dai, Yi Li, ... (+2 more) | 🌅 Old Age | cs.CV | 6.0K | 9 years ago |
| 4 | Glow: Generative Flow with Invertible 1x1 Convolutions | Diederik P. Kingma, Prafulla Dhariwal | 🌅 Old Age | stat.ML | 3.5K | 7 years ago |
| 5 | Unsupervised Data Augmentation for Consistency Training | Qizhe Xie, Zihang Dai, ... (+3 more) | 🌅 Old Age | cs.LG | 2.6K | 6 years ago |
| 6 | Learning Structured Sparsity in Deep Neural Networks | Wei Wen, Chunpeng Wu, ... (+3 more) | 🌅 Old Age | cs.NE | 2.5K | 9 years ago |
| 7 | Visualizing the Loss Landscape of Neural Nets | Hao Li, Zheng Xu, ... (+3 more) | 🌅 Old Age | cs.LG | 2.2K | 8 years ago |
| 8 | Gradient Surgery for Multi-Task Learning | Tianhe Yu, Saurabh Kumar, ... (+4 more) | 🌅 Old Age | cs.LG | 1.6K | 6 years ago |
| 9 | Toward Multimodal Image-to-Image Translation | Jun-Yan Zhu, Richard Zhang, ... (+5 more) | 🌅 Old Age | cs.CV | 1.4K | 8 years ago |
| 10 | Adversarial Training for Free! | Ali Shafahi, Mahyar Najibi, ... (+7 more) | 🌅 Old Age | cs.LG | 1.4K | 6 years ago |
| 11 | Defending Against Neural Fake News | Rowan Zellers, Ari Holtzman, ... (+5 more) | 🌅 Old Age | cs.CL | 1.2K | 6 years ago |
| 12 | Dynamic Network Surgery for Efficient DNNs | Yiwen Guo, Anbang Yao, Yurong Chen | 🌅 Old Age | cs.NE | 1.1K | 9 years ago |
| 13 | Video-to-Video Synthesis | Ting-Chun Wang, Ming-Yu Liu, ... (+5 more) | 🌅 Old Age | cs.CV | 1.1K | 7 years ago |
| 14 | On Exact Computation with an Infinitely Wide Neural Net | Sanjeev Arora, Simon S. Du, ... (+4 more) | 🌅 Old Age | cs.LG | 1.0K | 6 years ago |
| 15 | Dual Path Networks | Yunpeng Chen, Jianan Li, ... (+4 more) | 🌅 Old Age | cs.CV | 884 | 8 years ago |
| 16 | Lookahead Optimizer: k steps forward, 1 step back | Michael R. Zhang, James Lucas, ... (+2 more) | 🌅 Old Age | cs.LG | 825 | 6 years ago |
| 17 | Neural Architecture Optimization | Renqian Luo, Fei Tian, ... (+3 more) | 🌅 Old Age | cs.LG | 697 | 7 years ago |
| 18 | Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language Understanding | Kexin Yi, Jiajun Wu, ... (+4 more) | 🌅 Old Age | cs.AI | 669 | 7 years ago |
| 19 | Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation | Yuhuai Wu, Elman Mansimov, ... (+3 more) | 🌅 Old Age | cs.LG | 665 | 8 years ago |
| 20 | Using Trusted Data to Train Deep Networks on Labels Corrupted by Severe Noise | Dan Hendrycks, Mantas Mazeika, ... (+2 more) | 🌅 Old Age | cs.LG | 597 | 8 years ago |
| 21 | Provably Robust Deep Learning via Adversarially Trained Smoothed Classifiers | Hadi Salman, Greg Yang, ... (+5 more) | 🌅 Old Age | cs.LG | 595 | 6 years ago |
| 22 | MarrNet: 3D Shape Reconstruction via 2.5D Sketches | Jiajun Wu, Yifan Wang, ... (+4 more) | 🌅 Old Age | cs.CV | 436 | 8 years ago |
| 23 | Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model | Alex X. Lee, Anusha Nagabandi, ... (+2 more) | 🌅 Old Age | cs.LG | 413 | 6 years ago |
| 24 | Controllable Text-to-Image Generation | Bowen Li, Xiaojuan Qi, ... (+2 more) | 🌅 Old Age | cs.CV | 410 | 6 years ago |
| 25 | Hierarchical Neural Architecture Search for Deep Stereo Matching | Xuelian Cheng, Yiran Zhong, ... (+6 more) | 🌅 Old Age | cs.CV | 406 | 5 years ago |
| 26 | The Lottery Ticket Hypothesis for Pre-trained BERT Networks | Tianlong Chen, Jonathan Frankle, ... (+5 more) | 🌅 Old Age | cs.LG | 405 | 5 years ago |
| 27 | Understanding Attention and Generalization in Graph Neural Networks | Boris Knyazev, Graham W. Taylor, Mohamed R. Amer | 🌅 Old Age | cs.LG | 381 | 6 years ago |
| 28 | Certified Adversarial Robustness with Additive Noise | Bai Li, Changyou Chen, ... (+2 more) | 🌅 Old Age | cs.LG | 371 | 7 years ago |
| 29 | Neural Nearest Neighbors Networks | Tobias Plötz, Stefan Roth | 🌅 Old Age | cs.CV | 365 | 7 years ago |
| 30 | Distribution Matching for Crowd Counting | Boyu Wang, Huidong Liu, ... (+2 more) | 🌅 Old Age | cs.CV | 363 | 5 years ago |
| 31 | Meta-Learning Representations for Continual Learning | Khurram Javed, Martha White | 🌅 Old Age | cs.LG | 360 | 6 years ago |
| 32 | FD-GAN: Pose-guided Feature Distilling GAN for Robust Person Re-identification | Yixiao Ge, Zhuowan Li, ... (+5 more) | 🌅 Old Age | cs.CV | 354 | 7 years ago |
| 33 | Dilated Recurrent Neural Networks | Shiyu Chang, Yang Zhang, ... (+8 more) | 🌅 Old Age | cs.AI | 339 | 8 years ago |
| 34 | Learning to Navigate in Cities Without a Map | Piotr Mirowski, Matthew Koichi Grimes, ... (+8 more) | 🌅 Old Age | cs.AI | 339 | 7 years ago |
| 35 | Attentional Pooling for Action Recognition | Rohit Girdhar, Deva Ramanan | 🌅 Old Age | cs.CV | 330 | 8 years ago |
| 36 | One-Sided Unsupervised Domain Mapping | Sagie Benaim, Lior Wolf | 🌅 Old Age | cs.CV | 327 | 8 years ago |
| 37 | Brain-Like Object Recognition with High-Performing Shallow Recurrent ANNs | Jonas Kubilius, Martin Schrimpf, ... (+12 more) | 🌅 Old Age | cs.CV | 307 | 6 years ago |
| 38 | Graph Neural Tangent Kernel: Fusing Graph Neural Networks with Graph Kernels | Simon S. Du, Kangcheng Hou, ... (+4 more) | 🌅 Old Age | cs.LG | 300 | 6 years ago |
| 39 | AttentionXML: Label Tree-based Attention-Aware Deep Model for High-Performance Extreme Multi-Label Text Classification | Ronghui You, Zihan Zhang, ... (+4 more) | 🌅 Old Age | cs.CL | 290 | 7 years ago |
| 40 | Gate Decorator: Global Filter Pruning Method for Accelerating Deep Convolutional Neural Networks | Zhonghui You, Kun Yan, ... (+3 more) | 🌅 Old Age | cs.CV | 283 | 6 years ago |
| 41 | A Convex Relaxation Barrier to Tight Robustness Verification of Neural Networks | Hadi Salman, Greg Yang, ... (+3 more) | 🌅 Old Age | cs.LG | 282 | 7 years ago |
| 42 | Theoretical Linear Convergence of Unfolded ISTA and its Practical Weights and Thresholds | Xiaohan Chen, Jialin Liu, ... (+2 more) | 🌅 Old Age | cs.LG | 275 | 7 years ago |
| 43 | MetaSDF: Meta-learning Signed Distance Functions | Vincent Sitzmann, Eric R. Chan, ... (+3 more) | 🌅 Old Age | cs.CV | 274 | 5 years ago |
| 44 | Visual Object Networks: Image Generation with Disentangled 3D Representation | Jun-Yan Zhu, Zhoutong Zhang, ... (+5 more) | 🌅 Old Age | cs.CV | 267 | 7 years ago |
| 45 | Unsupervised Learning of Object Landmarks through Conditional Image Generation | Tomas Jakab, Ankush Gupta, ... (+2 more) | 🌅 Old Age | cs.CV | 265 | 7 years ago |
| 46 | Funnel-Transformer: Filtering out Sequential Redundancy for Efficient Language Processing | Zihang Dai, Guokun Lai, ... (+2 more) | 🌅 Old Age | cs.LG | 260 | 5 years ago |
| 47 | Learning Conditioned Graph Structures for Interpretable Visual Question Answering | Will Norcliffe-Brown, Efstathios Vafeias, Sarah Parisot | 🌅 Old Age | cs.CV | 253 | 7 years ago |
| 48 | Pixels to Graphs by Associative Embedding | Alejandro Newell, Jia Deng | 🌅 Old Age | cs.CV | 238 | 8 years ago |
| 49 | Learning to Pivot with Adversarial Networks | Gilles Louppe, Michael Kagan, Kyle Cranmer | 🌅 Old Age | stat.ML | 236 | 9 years ago |
| 50 | Tensor Programs I: Wide Feedforward or Recurrent Neural Networks of Any Architecture are Gaussian Processes | Greg Yang | 🌅 Old Age | cs.NE | 226 | 6 years ago |