| 51 |
Prune-then-Quantize or Quantize-then-Prune? Understanding the Impact of Compression Order in Joint Model Compression
Minjun Kim, Jaehyeon Choi, ... (+4 more)
|
|
cs.AI
|
0 |
2 months ago |
| 52 |
SynQ: Accurate Zero-shot Quantization by Synthesis-aware Fine-tuning
Minjun Kim, Jongjin Kim, U Kang
|
|
cs.CV
|
0 |
2 months ago |
| 53 |
Sharpness-Aware Minimization in Logit Space Efficiently Enhances Direct Preference Optimization
Haocheng Luo, Zehang Deng, ... (+4 more)
|
|
cs.LG
|
0 |
2 months ago |
| 54 |
R2-Dreamer: Redundancy-Reduced World Models without Decoders or Augmentation
Naoki Morihira, Amal Nahar, ... (+4 more)
|
|
cs.LG
|
0 |
2 months ago |
| 55 |
CARE: Covariance-Aware and Rank-Enhanced Decomposition for Enabling Multi-Head Latent Attention
Zhongzhu Zhou, Fengxiang Bie, ... (+7 more)
|
|
cs.LG
|
0 |
2 months ago |
| 56 |
Can Blindfolded LLMs Still Trade? An Anonymization-First Framework for Portfolio Optimization
Joohyoung Jeon, Hongchul Lee
|
|
cs.LG
|
0 |
2 months ago |
| 57 |
Noise-Response Calibration: A Causal Intervention Protocol for LLM-Judges
Maxim Khomiakov, Jes Frellsen
|
|
cs.LG
|
0 |
2 months ago |
| 58 |
CircuitBuilder: From Polynomials to Circuits via Reinforcement Learning
Weikun K. Zhang, Rohan Pandey, ... (+6 more)
|
|
cs.LG
|
0 |
2 months ago |
| 59 |
Me, Myself, and $π$ : Evaluating and Explaining LLM Introspection
Atharv Naphade, Samarth Bhargav, ... (+2 more)
|
|
cs.AI
|
0 |
2 months ago |
| 60 |
Execution-Grounded Credit Assignment for GRPO in Code Generation
Abhijit Kumar, Natalya Kumar, Shikhar Gupta
|
|
cs.LG
|
0 |
2 months ago |
| 61 |
Pre-training LLM without Learning Rate Decay Enhances Supervised Fine-Tuning
Kazuki Yano, Shun Kiyono, ... (+3 more)
|
|
cs.CL
|
0 |
2 months ago |
| 62 |
NANOZK: Layerwise Zero-Knowledge Proofs for Verifiable Large Language Model Inference
Zhaohui Geoffrey Wang
|
|
cs.LG
|
0 |
2 months ago |
| 63 |
GASP: Guided Asymmetric Self-Play For Coding LLMs
Swadesh Jana, Cansu Sancaktar, ... (+4 more)
|
|
cs.LG
|
0 |
2 months ago |
| 64 |
Learning to Recall with Transformers Beyond Orthogonal Embeddings
Nuri Mert Vural, Alberto Bietti, ... (+2 more)
|
|
stat.ML
|
0 |
2 months ago |
| 65 |
Regularized Latent Dynamics Prediction is a Strong Baseline For Behavioral Foundation Models
Pranaya Jajoo, Harshit Sikchi, ... (+4 more)
|
|
cs.AI
|
0 |
2 months ago |
| 66 |
When Stability Fails: Hidden Failure Modes Of LLMS in Data-Constrained Scientific Decision-Making
Nazia Riasat
|
|
cs.LG
|
0 |
2 months ago |
| 67 |
Exact Certification of Neural Networks and Partition Aggregation Ensembles against Label Poisoning
Ajinkya Mohgaonkar, Lukas Gosch, ... (+3 more)
|
|
cs.LG
|
0 |
1 month ago |
| 68 |
Panoptic Pairwise Distortion Graph
Muhammad Kamran Janjua, Abdul Wahab, Bahador Rashidi
|
|
cs.CV
|
0 |
1 month ago |
| 69 |
Mitigating Privacy Risk via Forget Set-Free Unlearning
Aviraj Newatia, Michael Cooper, ... (+2 more)
|
|
cs.LG
|
0 |
1 month ago |
| 70 |
WaveMoE: A Wavelet-Enhanced Mixture-of-Experts Foundation Model for Time Series Forecasting
Shunyu Wu, Jiawei Huang, ... (+7 more)
|
|
cs.LG
|
0 |
1 month ago |
| 71 |
PepBenchmark: A Standardized Benchmark for Peptide Machine Learning
Jiahui Zhang, Rouyi Wang, ... (+5 more)
|
|
cs.LG
|
0 |
1 month ago |
| 72 |
Shuffling the Data, Stretching the Step-size: Sharper Bias in constant step-size SGD
Konstantinos Emmanouilidis, Emmanouil-Vasileios Vlatakis-Gkaragkounis, Rene Vidal
|
|
math.OC
|
0 |
2 months ago |
| 73 |
TimeSeriesExamAgent: Creating Time Series Reasoning Benchmarks at Scale
Malgorzata Gwiazda, Yifu Cai, ... (+3 more)
|
|
cs.AI
|
0 |
2 months ago |
| 74 |
Relational Probing: LM-to-Graph Adaptation for Financial Prediction
Yingjie Niu, Changhong Jin, ... (+2 more)
|
|
cs.CL
|
0 |
2 months ago |
| 75 |
Credit-Budgeted ICPC-Style Coding: When Agents Must Pay for Every Decision
Lingfeng Zhou, Junhao Shi, ... (+2 more)
|
|
cs.AI
|
0 |
2 months ago |
| 76 |
Learning Hierarchical and Geometry-Aware Graph Representations for Text-to-CAD
Shengjie Gong, Wenjie Peng, ... (+6 more)
|
|
cs.AI
|
0 |
2 months ago |
| 77 |
SinkTrack: Attention Sink based Context Anchoring for Large Language Models
Xu Liu, Guikun Chen, Wenguan Wang
|
|
cs.CV
|
0 |
2 months ago |
| 78 |
New Hybrid Fine-Tuning Paradigm for LLMs: Algorithm Design and Convergence Analysis Framework
Shaocong Ma, Peiran Yu, Heng Huang
|
|
cs.AI
|
0 |
2 months ago |
| 79 |
PAS: Estimating the target accuracy before domain adaptation
Raphaella Diniz, Jackson de Faria, Martin Ester
|
|
cs.CV
|
0 |
2 months ago |
| 80 |
MathNet: a Global Multimodal Benchmark for Mathematical Reasoning and Retrieval
Shaden Alshammari, Kevin Wen, ... (+6 more)
|
|
cs.AI
|
0 |
1 month ago |
| 81 |
Revisiting Active Sequential Prediction-Powered Mean Estimation
Maria-Eleni Sfyraki, Jun-Kun Wang
|
|
stat.ML
|
0 |
1 month ago |
| 82 |
NI Sampling: Accelerating Discrete Diffusion Sampling by Token Order Optimization
Enshu Liu, Xuefei Ning, ... (+2 more)
|
|
cs.LG
|
0 |
1 month ago |
| 83 |
The implicated scientist: on the role of AI researchers in the development of weapons systems
Alexandra Volokhova, Alex Hernandez-Garcia
|
|
cs.AI
|
0 |
1 month ago |
| 84 |
EAST: Early Action Prediction Sampling Strategy with Token Masking
Iva Sović, Ivan Martinović, Marin Oršić
|
|
cs.CV
|
0 |
1 month ago |
| 85 |
Long-Text-to-Image Generation via Compositional Prompt Decomposition
Jen-Yuan Huang, Tong Lin, Yilun Du
|
|
cs.CV
|
0 |
1 month ago |
| 86 |
DiffuSAM: Diffusion Guided Zero-Shot Object Grounding for Remote Sensing Imagery
Geet Sethi, Panav Shah, ... (+2 more)
|
|
cs.CV
|
0 |
1 month ago |
| 87 |
Does "Do Differentiable Simulators Give Better Policy Gradients?'' Give Better Policy Gradients?
Ku Onoda, Paavo Parmas, ... (+2 more)
|
|
cs.LG
|
0 |
1 month ago |
| 88 |
Latent Fourier Transform
Mason Wang, Cheng-Zhi Anna Huang
|
|
cs.SD
|
0 |
1 month ago |
| 89 |
Adversarial Arena: Crowdsourcing Data Generation through Interactive Competition
Prasoon Goyal, Sattvik Sahai, ... (+15 more)
|
|
cs.AI
|
0 |
1 month ago |
| 90 |
Diverse Dictionary Learning
Yujia Zheng, Zijian Li, ... (+3 more)
|
|
cs.LG
|
0 |
1 month ago |
| 91 |
Contraction and Hourglass Persistence for Learning on Graphs, Simplices, and Cells
Mattie Ji, Indradyumna Roy, Vikas Garg
|
|
cs.LG
|
0 |
1 month ago |
| 92 |
LASER: Low-Rank Activation SVD for Efficient Recursion
Ege Çakar, Ketan Ali Raghu, Lia Zheng
|
|
cs.LG
|
0 |
1 month ago |
| 93 |
Graph-of-Agents: A Graph-based Framework for Multi-Agent LLM Collaboration
Sukwon Yun, Jie Peng, ... (+6 more)
|
|
cs.AI
|
0 |
1 month ago |
| 94 |
Noise-Adaptive Diffusion Sampling for Inverse Problems Without Task-Specific Tuning
Yingzhi Xia, Setthakorn Tanomkiattikun, ... (+2 more)
|
|
cs.LG
|
0 |
1 month ago |
| 95 |
FairNVT: Improving Fairness via Noise Injection in Vision Transformers
Qiaoyue Tang, Sepidehsadat Hosseini, ... (+3 more)
|
|
cs.CV
|
0 |
1 month ago |
| 96 |
SAVE: A Generalizable Framework for Multi-Condition Single-Cell Generation with Gene Block Attention
Jiahao Li, Jiayi Dong, ... (+4 more)
|
|
cs.AI
|
0 |
1 month ago |
| 97 |
Active World-Model with 4D-informed Retrieval for Exploration and Awareness
Elaheh Vaezpour, Amirhosein Javadi, Tara Javidi
|
|
cs.CV
|
0 |
1 month ago |
| 98 |
Structured Abductive-Deductive-Inductive Reasoning for LLMs via Algebraic Invariants
Sankalp Gilda, Shlok Gilda
|
|
cs.AI
|
0 |
1 month ago |
| 99 |
Majority Voting for Code Generation
Tim Launer, Jonas Hübotter, ... (+3 more)
|
|
cs.LG
|
0 |
1 month ago |
| 100 |
ProtoTTA: Prototype-Guided Test-Time Adaptation
Mohammad Mahdi Abootorabi, Parvin Mousavi, ... (+2 more)
|
|
cs.LG
|
0 |
1 month ago |