| 1 |
Beyond Benchmark Islands: Toward Representative Trustworthiness Evaluation for Agentic AI
Jinhu Qi, Yifan Li, ... (+5 more)
|
|
cs.CL
|
0 |
2 months ago |
| 2 |
Beyond Interleaving: Causal Attention Reformulations for Generative Recommender Systems
Hailing Cheng
|
|
cs.IR
|
0 |
2 months ago |
| 3 |
Meta-RL with Shared Representations Enables Fast Adaptation in Energy Systems
Théo Zangato, Aomar Osmani, Pegah Alizadeh
|
|
cs.LG
|
0 |
2 months ago |
| 4 |
Learning Hierarchical Knowledge in Text-Rich Networks with Taxonomy-Informed Representation Learning
Yunhui Liu, Yongchao Liu, ... (+4 more)
|
|
cs.LG
|
0 |
2 months ago |
| 5 |
SciZoom: A Large-scale Benchmark for Hierarchical Scientific Summarization across the LLM Era
Han Jang, Junhyeok Lee, Kyu Sung Choi
|
|
cs.CL
|
0 |
2 months ago |
| 6 |
EviCare: Enhancing Diagnosis Prediction with Deep Model-Guided Evidence for In-Context Reasoning
Hengyu Zhang, Xuyun Zhang, ... (+6 more)
|
|
cs.CL
|
0 |
1 month ago |
| 7 |
SEPTQ: A Simple and Effective Post-Training Quantization Paradigm for Large Language Models
Han Liu, Haotian Gao, ... (+6 more)
|
|
cs.CL
|
0 |
1 month ago |
| 8 |
When Does Data Augmentation Help? Evaluating LLM and Back-Translation Methods for Hausa and Fongbe NLP
Mahounan Pericles Adjovi, Roald Eiselen, Prasenjit Mitra
|
|
cs.CL
|
0 |
1 month ago |
| 9 |
End-to-End Learning for Partially-Observed Time Series with PyPOTS
Wenjie Du, Yiyuan Yang, ... (+2 more)
|
|
cs.LG
|
0 |
1 month ago |
| 10 |
On Reasoning Behind Next Occupation Recommendation
Shan Dong, Palakorn Achananuparp, ... (+4 more)
|
|
cs.CL
|
0 |
1 month ago |
| 11 |
DialToM: A Theory of Mind Benchmark for Forecasting State-Driven Dialogue Trajectories
Neemesh Yadav, Palakorn Achananuparp, ... (+2 more)
|
|
cs.CL
|
0 |
1 month ago |
| 12 |
SENSE: Satellite-based ENergy Synthesis for Sustainable Environment
Kailai Sun, Mingyi He, ... (+6 more)
|
|
cs.CV
|
0 |
9 days ago |
| 13 |
TeleCom-Bench: How Far Are Large Language Models from Industrial Telecommunication Applications?
Jieting Xiao, Yun Lin, ... (+11 more)
|
|
cs.AI
|
0 |
9 days ago |
| 14 |
BacktestBench: Benchmarking Large Language Models for Automated Quantitative Strategy Backtesting
Zhensheng Wang, Wenmian Yang, ... (+4 more)
|
|
cs.CL
|
0 |
9 days ago |
| 15 |
Uncertainty-Calibrated Recommendations for Low-Active Users
Bob Junyi Zou, Sai Li, ... (+3 more)
|
|
cs.IR
|
0 |
9 days ago |
| 16 |
Text-Guided Visual Representation Learning for Robust Multimodal E-Commerce Recommendation
Yufei Guo, Jing Ma, ... (+6 more)
|
|
cs.IR
|
0 |
10 days ago |
| 17 |
Rethinking Weak Supervision in Anomaly Detection: A Comprehensive Benchmark
Xu Yao, Siyuan Zhou, ... (+7 more)
|
|
cs.LG
|
0 |
2 days ago |
| 18 |
Causal methods for LLM development and evaluation
Dennis Frauen, Marie Brockschmidt, ... (+11 more)
|
|
cs.LG
|
0 |
2 days ago |
| 19 |
NPSolver: Neural Poisson Solver with Iterative Physics Supervision
Bocheng Zeng, Rui Zhang, ... (+6 more)
|
|
cs.LG
|
0 |
2 days ago |
| 20 |
DeGRe: Dense-supervised Generative Reranking for Recommendation
Chaotian Song, Jingyao Zhang, ... (+7 more)
|
|
cs.IR
|
0 |
2 days ago |
| 21 |
Learning Latent Dynamical Causal Processes for Single-Cell Perturbation Prediction
Wenkang Jiang, Yuhang Liu, ... (+4 more)
|
|
cs.LG
|
0 |
2 days ago |
| 22 |
MindAdapter: Few-Shot Parameter-Efficient Residual Calibration of Cross-Subject Brain-to-Visual Decoding Models
Jiaxiang Liu, Jiawei Du, ... (+5 more)
|
|
cs.CV
|
0 |
4 days ago |
| 23 |
VaaWIT: Visual-Aware Adaptation of Large Language Models for Multilingual Web Image Translation
Bo Li, Ronghao Chen, ... (+4 more)
|
|
cs.CV
|
0 |
4 days ago |
| 24 |
Treatment Effect Estimation with Differentiated Networked Effect on Graph Data
Xiaofeng Lin, Han Bao, Hisashi Kashima
|
|
cs.LG
|
0 |
4 days ago |