| 1 | SGDR: Stochastic Gradient Descent with Warm Restarts<br>Ilya Loshchilov, Frank Hutter | 🌅 Old Age | cs.LG | 9.8K | 9 years ago |
| 2 | Deformable DETR: Deformable Transformers for End-to-End Object Detection<br>Xizhou Zhu, Weijie Su, ... (+4 more) | 🌅 Old Age | cs.CV | 6.8K | 5 years ago |
| 3 | DARTS: Differentiable Architecture Search<br>Hanxiao Liu, Karen Simonyan, Yiming Yang | 🌅 Old Age | cs.LG | 4.8K | 7 years ago |
| 4 | Paying More Attention to Attention: Improving the Performance of Convolutional Neural Networks via Attention Transfer<br>Sergey Zagoruyko, Nikos Komodakis | 🌅 Old Age | cs.CV | 2.9K | 9 years ago |
| 5 | AugMix: A Simple Data Processing Method to Improve Robustness and Uncertainty<br>Dan Hendrycks, Norman Mu, ... (+4 more) | 🌅 Old Age | stat.ML | 1.5K | 6 years ago |
| 6 | Decoupling Representation and Classifier for Long-Tailed Recognition<br>Bingyi Kang, Saining Xie, ... (+5 more) | 🌅 Old Age | cs.CV | 1.4K | 6 years ago |
| 7 | Mitigating Adversarial Effects Through Randomization<br>Cihang Xie, Jianyu Wang, ... (+3 more) | 🌅 Old Age | cs.CV | 1.2K | 8 years ago |
| 8 | Incremental Network Quantization: Towards Lossless CNNs with Low-Precision Weights<br>Aojun Zhou, Anbang Yao, ... (+3 more) | 🌅 Old Age | cs.CV | 1.1K | 9 years ago |
| 9 | The relativistic discriminator: a key element missing from standard GAN<br>Alexia Jolicoeur-Martineau | 🌅 Old Age | cs.LG | 1.1K | 7 years ago |
| 10 | SNAS: Stochastic Neural Architecture Search<br>Sirui Xie, Hehui Zheng, ... (+2 more) | 🌅 Old Age | cs.LG | 986 | 7 years ago |
| 11 | Deep Predictive Coding Networks for Video Prediction and Unsupervised Learning<br>William Lotter, Gabriel Kreiman, David Cox | 🌅 Old Age | cs.LG | 978 | 9 years ago |
| 12 | The Neuro-Symbolic Concept Learner: Interpreting Scenes, Words, and Sentences From Natural Supervision<br>Jiayuan Mao, Chuang Gan, ... (+3 more) | 🌅 Old Age | cs.CV | 797 | 6 years ago |
| 13 | SMASH: One-Shot Model Architecture Search through HyperNetworks<br>Andrew Brock, Theodore Lim, ... (+2 more) | 🌅 Old Age | cs.LG | 795 | 8 years ago |
| 14 | Learning to Propagate Labels: Transductive Propagation Network for Few-shot Learning<br>Yanbin Liu, Juho Lee, ... (+5 more) | 🌅 Old Age | cs.LG | 730 | 7 years ago |
| 15 | Slimmable Neural Networks<br>Jiahui Yu, Linjie Yang, ... (+3 more) | 🌅 Old Age | cs.CV | 620 | 7 years ago |
| 16 | CLEVRER: CoLlision Events for Video REpresentation and Reasoning<br>Kexin Yi, Chuang Gan, ... (+5 more) | 🌅 Old Age | cs.CV | 551 | 6 years ago |
| 17 | FreeLB: Enhanced Adversarial Training for Natural Language Understanding<br>Chen Zhu, Yu Cheng, ... (+4 more) | 🌅 Old Age | cs.CL | 493 | 6 years ago |
| 18 | Revisiting Few-sample BERT Fine-tuning<br>Tianyi Zhang, Felix Wu, ... (+3 more) | 🌅 Old Age | cs.CL | 491 | 5 years ago |
| 19 | Learning Particle Dynamics for Manipulating Rigid Bodies, Deformable Objects, and Fluids<br>Yunzhu Li, Jiajun Wu, ... (+3 more) | 🌅 Old Age | cs.LG | 444 | 7 years ago |
| 20 | Prior Convictions: Black-Box Adversarial Attacks with Bandits and Priors<br>Andrew Ilyas, Logan Engstrom, Aleksander Madry | 🌅 Old Age | stat.ML | 413 | 7 years ago |
| 21 | Very Deep VAEs Generalize Autoregressive Models and Can Outperform Them on Images<br>Rewon Child | 🌅 Old Age | cs.LG | 386 | 5 years ago |
| 22 | Building Generalizable Agents with a Realistic and Rich 3D Environment<br>Yi Wu, Yuxin Wu, ... (+2 more) | 🌅 Old Age | cs.LG | 355 | 8 years ago |
| 23 | PseudoSeg: Designing Pseudo Labels for Semantic Segmentation<br>Yuliang Zou, Zizhao Zhang, ... (+5 more) | 🌅 Old Age | cs.CV | 354 | 5 years ago |
| 24 | Combining Label Propagation and Simple Models Out-performs Graph Neural Networks<br>Qian Huang, Horace He, ... (+3 more) | 🌅 Old Age | cs.LG | 311 | 5 years ago |
| 25 | MMA Training: Direct Input Space Margin Maximization through Adversarial Training<br>Gavin Weiguang Ding, Yash Sharma, ... (+2 more) | 🌅 Old Age | cs.LG | 299 | 7 years ago |
| 26 | DeepV2D: Video to Depth with Differentiable Structure from Motion<br>Zachary Teed, Jia Deng | 🌅 Old Age | cs.CV | 298 | 7 years ago |
| 27 | Reinforcement Learning through Asynchronous Advantage Actor-Critic on a GPU<br>Mohammad Babaeizadeh, Iuri Frosio, ... (+3 more) | 🌅 Old Age | cs.LG | 291 | 9 years ago |
| 28 | Episodic Curiosity through Reachability<br>Nikolay Savinov, Anton Raichuk, ... (+5 more) | 🌅 Old Age | cs.LG | 287 | 7 years ago |
| 29 | LAMOL: LAnguage MOdeling for Lifelong Language Learning<br>Fan-Keng Sun, Cheng-Hao Ho, Hung-Yi Lee | 🌅 Old Age | cs.CL | 243 | 6 years ago |
| 30 | Skip RNN: Learning to Skip State Updates in Recurrent Neural Networks<br>Victor Campos, Brendan Jou, ... (+3 more) | 🌅 Old Age | cs.AI | 227 | 8 years ago |
| 31 | On the Quantitative Analysis of Decoder-Based Generative Models<br>Yuhuai Wu, Yuri Burda, ... (+2 more) | 🌅 Old Age | cs.LG | 226 | 9 years ago |
| 32 | Deep Probabilistic Programming<br>Dustin Tran, Matthew D. Hoffman, ... (+4 more) | 🌅 Old Age | stat.ML | 201 | 9 years ago |
| 33 | CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning<br>Rohit Girdhar, Deva Ramanan | 🌅 Old Age | cs.CV | 195 | 6 years ago |
| 34 | Prediction Poisoning: Towards Defenses Against DNN Model Stealing Attacks<br>Tribhuvanesh Orekondy, Bernt Schiele, Mario Fritz | 🌅 Old Age | cs.LG | 191 | 6 years ago |
| 35 | Certified Defenses for Adversarial Patches<br>Ping-Yeh Chiang, Renkun Ni, ... (+4 more) | 🌅 Old Age | cs.CR | 189 | 6 years ago |
| 36 | Self-Supervised Policy Adaptation during Deployment<br>Nicklas Hansen, Rishabh Jangir, ... (+6 more) | 🌅 Old Age | cs.LG | 183 | 5 years ago |
| 37 | Adv-BNN: Improved Adversarial Defense through Robust Bayesian Neural Network<br>Xuanqing Liu, Yao Li, ... (+2 more) | 🌅 Old Age | cs.LG | 182 | 7 years ago |
| 38 | DARTS-: Robustly Stepping out of Performance Collapse Without Indicators<br>Xiangxiang Chu, Xiaoxing Wang, ... (+4 more) | 🌅 Old Age | cs.LG | 182 | 5 years ago |
| 39 | Trellis Networks for Sequence Modeling<br>Shaojie Bai, J. Zico Kolter, Vladlen Koltun | 🌅 Old Age | cs.LG | 162 | 7 years ago |
| 40 | Exploring Model-based Planning with Policy Networks<br>Tingwu Wang, Jimmy Ba | 🌅 Old Age | cs.LG | 162 | 6 years ago |
| 41 | Learning to Infer and Execute 3D Shape Programs<br>Yonglong Tian, Andrew Luo, ... (+5 more) | 🌅 Old Age | cs.CV | 154 | 7 years ago |
| 42 | R-GAP: Recursive Gradient Attack on Privacy<br>Junyi Zhu, Matthew Blaschko | 🌅 Old Age | cs.LG | 154 | 5 years ago |
| 43 | Deep Encoder, Shallow Decoder: Reevaluating Non-autoregressive Machine Translation<br>Jungo Kasai, Nikolaos Pappas, ... (+3 more) | 🌅 Old Age | cs.CL | 151 | 5 years ago |
| 44 | Generalizable Adversarial Training via Spectral Normalization<br>Farzan Farnia, Jesse M. Zhang, David Tse | 🌅 Old Age | cs.LG | 148 | 7 years ago |
| 45 | Learning Intrinsic Sparse Structures within Long Short-Term Memory<br>Wei Wen, Yuxiong He, ... (+7 more) | 🌅 Old Age | cs.LG | 142 | 8 years ago |
| 46 | AtomNAS: Fine-Grained End-to-End Neural Architecture Search<br>Jieru Mei, Yingwei Li, ... (+5 more) | 🌅 Old Age | cs.CV | 110 | 6 years ago |
| 47 | Big-Little Net: An Efficient Multi-Scale Feature Representation for Visual and Speech Recognition<br>Chun-Fu Chen, Quanfu Fan, ... (+3 more) | 🌅 Old Age | cs.CV | 100 | 7 years ago |
| 48 | Memory-Based Graph Networks<br>Amir Hosein Khasahmadi, Kaveh Hassani, ... (+3 more) | 🌅 Old Age | cs.LG | 99 | 6 years ago |
| 49 | DNA-GAN: Learning Disentangled Representations from Multi-Attribute Images<br>Taihong Xiao, Jiapeng Hong, Jinwen Ma | 🌅 Old Age | cs.CV | 92 | 8 years ago |
| 50 | Stochastic Security: Adversarial Defense Using Long-Run Dynamics of Energy-Based Models<br>Mitch Hill, Jonathan Mitchell, Song-Chun Zhu | 🌅 Old Age | stat.ML | 87 | 5 years ago |