| 251 | Learned Optimizers that Scale and Generalize<br>Olga Wichrowska, Niru Maheswaranathan, ... (+5 more) | 👻 Ghosted | cs.LG | 302 | 9 years ago |
| 252 | Tighter Problem-Dependent Regret Bounds in Reinforcement Learning without Domain Knowledge using Value Function Bounds<br>Andrea Zanette, Emma Brunskill | 👻 Ghosted | cs.LG | 302 | 7 years ago |
| 253 | Unsupervised Learning by Predicting Noise<br>Piotr Bojanowski, Armand Joulin | 👻 Ghosted | stat.ML | 299 | 9 years ago |
| 254 | The Mechanics of n-Player Differentiable Games<br>David Balduzzi, Sebastien Racaniere, ... (+4 more) | 👻 Ghosted | cs.LG | 299 | 8 years ago |
| 255 | Gromov-Wasserstein Learning for Graph Matching and Node Embedding<br>Hongteng Xu, Dixin Luo, ... (+2 more) | 👻 Ghosted | cs.LG | 299 | 7 years ago |
| 256 | Online Meta-Learning<br>Chelsea Finn, Aravind Rajeswaran, ... (+2 more) | 👻 Ghosted | cs.LG | 299 | 7 years ago |
| 257 | Skew-Fit: State-Covering Self-Supervised Reinforcement Learning<br>Vitchyr H. Pong, Murtaza Dalal, ... (+4 more) | 👻 Ghosted | cs.LG | 299 | 7 years ago |
| 258 | Provably Efficient Exploration in Policy Optimization<br>Qi Cai, Zhuoran Yang, ... (+2 more) | 👻 Ghosted | cs.LG | 299 | 6 years ago |
| 259 | The case for 4-bit precision: k-bit Inference Scaling Laws<br>Tim Dettmers, Luke Zettlemoyer | 👻 Ghosted | cs.LG | 299 | 3 years ago |
| 260 | Variational inference for Monte Carlo objectives<br>Andriy Mnih, Danilo J. Rezende | 👻 Ghosted | cs.LG | 298 | 10 years ago |
| 261 | The loss surface of deep and wide neural networks<br>Quynh Nguyen, Matthias Hein | 👻 Ghosted | cs.LG | 297 | 9 years ago |
| 262 | AdaNet: Adaptive Structural Learning of Artificial Neural Networks<br>Corinna Cortes, Xavi Gonzalvo, ... (+3 more) | 👻 Ghosted | cs.LG | 295 | 9 years ago |
| 263 | Understanding the impact of entropy on policy optimization<br>Zafarali Ahmed, Nicolas Le Roux, ... (+2 more) | 👻 Ghosted | cs.LG | 295 | 7 years ago |
| 264 | Cascading Bandits: Learning to Rank in the Cascade Model<br>Branislav Kveton, Csaba Szepesvari, ... (+2 more) | 👻 Ghosted | cs.LG | 294 | 11 years ago |
| 265 | Learning to Exploit Long-term Relational Dependencies in Knowledge Graphs<br>Lingbing Guo, Zequn Sun, Wei Hu | 👻 Ghosted | cs.AI | 294 | 7 years ago |
| 266 | A Kronecker-factored approximate Fisher matrix for convolution layers<br>Roger Grosse, James Martens | 👻 Ghosted | stat.ML | 292 | 10 years ago |
| 267 | Compositional Fairness Constraints for Graph Embeddings<br>Avishek Joey Bose, William L. Hamilton | 👻 Ghosted | cs.LG | 291 | 7 years ago |
| 268 | Training Neural Networks Without Gradients: A Scalable ADMM Approach<br>Gavin Taylor, Ryan Burmeister, ... (+4 more) | 👻 Ghosted | cs.LG | 289 | 10 years ago |
| 269 | Analyzing Uncertainty in Neural Machine Translation<br>Myle Ott, Michael Auli, ... (+2 more) | 👻 Ghosted | cs.CL | 288 | 8 years ago |
| 270 | Poisoning Language Models During Instruction Tuning<br>Alexander Wan, Eric Wallace, ... (+2 more) | 👻 Ghosted | cs.CL | 285 | 3 years ago |
| 271 | On the Power of Over-parametrization in Neural Networks with Quadratic Activation<br>Simon S. Du, Jason D. Lee | 👻 Ghosted | cs.LG | 284 | 8 years ago |
| 272 | Fast and Scalable Bayesian Deep Learning by Weight-Perturbation in Adam<br>Mohammad Emtiyaz Khan, Didrik Nielsen, ... (+4 more) | 👻 Ghosted | stat.ML | 284 | 7 years ago |
| 273 | Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning<br>Junhyuk Oh, Satinder Singh, ... (+2 more) | 👻 Ghosted | cs.AI | 282 | 8 years ago |
| 274 | Rademacher Complexity for Adversarially Robust Generalization<br>Dong Yin, Kannan Ramchandran, Peter Bartlett | 👻 Ghosted | cs.LG | 282 | 7 years ago |
| 275 | Graying the black box: Understanding DQNs<br>Tom Zahavy, Nir Ben Zrihem, Shie Mannor | 👻 Ghosted | cs.LG | 281 | 10 years ago |
| 276 | Equivariance Through Parameter-Sharing<br>Siamak Ravanbakhsh, Jeff Schneider, Barnabas Poczos | 👻 Ghosted | stat.ML | 281 | 9 years ago |
| 277 | More Robust Doubly Robust Off-policy Evaluation<br>Mehrdad Farajtabar, Yinlam Chow, Mohammad Ghavamzadeh | 👻 Ghosted | cs.AI | 280 | 8 years ago |
| 278 | Spurious Local Minima are Common in Two-Layer ReLU Neural Networks<br>Itay Safran, Ohad Shamir | 👻 Ghosted | cs.LG | 279 | 8 years ago |
| 279 | Self-Imitation Learning<br>Junhyuk Oh, Yijie Guo, ... (+2 more) | 👻 Ghosted | cs.LG | 279 | 7 years ago |
| 280 | Why is Posterior Sampling Better than Optimism for Reinforcement Learning?<br>Ian Osband, Benjamin Van Roy | 👻 Ghosted | stat.ML | 278 | 9 years ago |
| 281 | A Laplacian Framework for Option Discovery in Reinforcement Learning<br>Marlos C. Machado, Marc G. Bellemare, Michael Bowling | 👻 Ghosted | cs.LG | 278 | 9 years ago |
| 282 | Bounding and Counting Linear Regions of Deep Neural Networks<br>Thiago Serra, Christian Tjandraatmadja, Srikumar Ramalingam | 👻 Ghosted | cs.LG | 278 | 8 years ago |
| 283 | Memory-Efficient Pipeline-Parallel DNN Training<br>Deepak Narayanan, Amar Phanishayee, ... (+3 more) | 👻 Ghosted | cs.LG | 274 | 5 years ago |
| 284 | Online and Linear-Time Attention by Enforcing Monotonic Alignments<br>Colin Raffel, Minh-Thang Luong, ... (+3 more) | 👻 Ghosted | cs.LG | 273 | 9 years ago |
| 285 | Low Latency Privacy Preserving Inference<br>Alon Brutzkus, Oren Elisha, Ran Gilad-Bachrach | 👻 Ghosted | cs.LG | 272 | 7 years ago |
| 286 | Set Transformer: A Framework for Attention-based Permutation-Invariant Neural Networks<br>Juho Lee, Yoonho Lee, ... (+4 more) | 👻 Ghosted | cs.LG | 270 | 7 years ago |
| 287 | Variants of RMSProp and Adagrad with Logarithmic Regret Bounds<br>Mahesh Chandra Mukkamala, Matthias Hein | 👻 Ghosted | cs.LG | 269 | 8 years ago |
| 288 | Structured and Efficient Variational Deep Learning with Matrix Gaussian Posteriors<br>Christos Louizos, Max Welling | 👻 Ghosted | stat.ML | 268 | 10 years ago |
| 289 | NATTACK: Learning the Distributions of Adversarial Examples for an Improved Black-Box Attack on Deep Neural Networks<br>Yandong Li, Lijun Li, ... (+3 more) | 👻 Ghosted | cs.LG | 266 | 7 years ago |
| 290 | Being Robust (in High Dimensions) Can Be Practical<br>Ilias Diakonikolas, Gautam Kamath, ... (+4 more) | 👻 Ghosted | cs.LG | 265 | 9 years ago |
| 291 | Tensor-Train Recurrent Neural Networks for Video Classification<br>Yinchong Yang, Denis Krompass, Volker Tresp | 👻 Ghosted | cs.CV | 264 | 8 years ago |
| 292 | DRACO: Byzantine-resilient Distributed Training via Redundant Gradients<br>Lingjiao Chen, Hongyi Wang, ... (+2 more) | 👻 Ghosted | stat.ML | 264 | 8 years ago |
| 293 | Image-to-Markup Generation with Coarse-to-Fine Attention<br>Yuntian Deng, Anssi Kanervisto, ... (+2 more) | 👻 Ghosted | cs.CV | 263 | 9 years ago |
| 294 | Supervised and Semi-Supervised Text Categorization using LSTM for Region Embeddings<br>Rie Johnson, Tong Zhang | 👻 Ghosted | stat.ML | 260 | 10 years ago |
| 295 | AutoML-Zero: Evolving Machine Learning Algorithms From Scratch<br>Esteban Real, Chen Liang, ... (+2 more) | 👻 Ghosted | cs.LG | 260 | 6 years ago |
| 296 | One-Shot Generalization in Deep Generative Models<br>Danilo Jimenez Rezende, Shakir Mohamed, ... (+3 more) | 👻 Ghosted | stat.ML | 259 | 10 years ago |
| 297 | On orthogonality and learning recurrent networks with long term dependencies<br>Eugene Vorontsov, Chiheb Trabelsi, ... (+2 more) | 👻 Ghosted | cs.LG | 256 | 9 years ago |
| 298 | Semi-Amortized Variational Autoencoders<br>Yoon Kim, Sam Wiseman, ... (+3 more) | 👻 Ghosted | stat.ML | 256 | 8 years ago |
| 299 | Privacy for Free: Posterior Sampling and Stochastic Gradient Monte Carlo<br>Yu-Xiang Wang, Stephen E. Fienberg, Alex Smola | 👻 Ghosted | stat.ML | 252 | 11 years ago |
| 300 | A Unified View of Multi-Label Performance Measures<br>Xi-Zhu Wu, Zhi-Hua Zhou | 👻 Ghosted | cs.LG | 251 | 9 years ago |