| 1 |
Personalized Cell Segmentation: Benchmark and Framework for Reference-Guided Cell Type Segmentation
Bisheng Wang, Jaime S. Cardoso, Lin Wu
|
|
cs.CV
|
0 |
2 months ago |
| 2 |
SemiTooth: a Generalizable Semi-supervised Framework for Multi-Source Tooth Segmentation
Muyi Sun, Yifan Gao, ... (+5 more)
|
|
cs.CV
|
0 |
2 months ago |
| 3 |
Multimodal Self-Attention Network with Temporal Alignment for Audio-Visual Emotion Recognition
Inyong Koo, yeeun Seong, ... (+3 more)
|
|
cs.MM
|
0 |
2 months ago |
| 4 |
V2A-DPO: Omni-Preference Optimization for Video-to-Audio Generation
Nolan Chan, Timmy Gang, ... (+3 more)
|
|
cs.SD
|
0 |
2 months ago |
| 5 |
PRoADS: Provably Secure and Robust Audio Diffusion Steganography with latent optimization and backward Euler Inversion
YongPeng Yan, Yanan Li, ... (+2 more)
|
|
cs.CR
|
0 |
2 months ago |
| 6 |
Robust Provably Secure Image Steganography via Latent Iterative Optimization
Yanan Li, Zixuan Wang, ... (+2 more)
|
|
cs.CR
|
0 |
2 months ago |
| 7 |
QUSR: Quality-Aware and Uncertainty-Guided Image Super-Resolution Diffusion Model
Junjie Yin, Jiaju Li, Hanfa Xing
|
|
cs.CV
|
0 |
2 months ago |
| 8 |
Fast Low-light Enhancement and Deblurring for 3D Dark Scenes
Feng Zhang, Jinglong Wang, ... (+5 more)
|
|
cs.CV
|
0 |
2 months ago |
| 9 |
Enhancing Document-Level Machine Translation via Filtered Synthetic Corpora and Two-Stage LLM Adaptation
Ireh Kim, Tesia Sker, Chanwoo Kim
|
|
cs.CL
|
0 |
2 months ago |
| 10 |
Dual-Space Knowledge Distillation with Key-Query Matching for Large Language Models with Vocabulary Mismatch
Stella Eva Tsiapali, Cong-Thanh Do, Kate Knill
|
|
cs.CL
|
0 |
2 months ago |
| 11 |
AnimalCLAP: Taxonomy-Aware Language-Audio Pretraining for Species Recognition and Trait Inference
Risa Shinoda, Kaede Shiohara, ... (+3 more)
|
|
cs.SD
|
0 |
2 months ago |
| 12 |
Look, Listen and Segment: Towards Weakly Supervised Audio-visual Semantic Segmentation
Chengzhi Li, Heyan Huang, ... (+2 more)
|
|
cs.MM
|
0 |
2 months ago |
| 13 |
Ara-Best-RQ: Multi Dialectal Arabic SSL
Haroun Elleuch, Ryan Whetten, ... (+3 more)
|
|
cs.CL
|
0 |
2 months ago |
| 14 |
LipsAM: Lipschitz-Continuous Amplitude Modifier for Audio Signal Processing and its Application to Plug-and-Play Dereverberation
Kazuki Matsumoto, Ren Uchida, Kohei Yatabe
|
|
cs.SD
|
0 |
2 months ago |
| 15 |
mmWave-Diffusion:A Novel Framework for Respiration Sensing Using Observation-Anchored Conditional Diffusion Model
Yong Wang, Qifan Shen, ... (+5 more)
|
|
eess.IV
|
0 |
2 months ago |
| 16 |
Evaluating Test-Time Adaptation For Facial Expression Recognition Under Natural Cross-Dataset Distribution Shifts
John Turnbull, Shivam Grover, ... (+2 more)
|
|
cs.CV
|
0 |
2 months ago |
| 17 |
Uncertainty-aware Prototype Learning with Variational Inference for Few-shot Point Cloud Segmentation
Yifei Zhao, Fanyu Zhao, Yinsheng Li
|
|
cs.CV
|
0 |
2 months ago |
| 18 |
DaPT: A Dual-Path Framework for Multilingual Multi-hop Question Answering
Yilin Wang, Yuchun Fan, ... (+6 more)
|
|
cs.CL
|
0 |
2 months ago |
| 19 |
Shared Representation Learning for Reference-Guided Targeted Sound Detection
Shubham Gupta, Adarsh Arigala, ... (+2 more)
|
|
eess.AS
|
0 |
2 months ago |
| 20 |
TCATSeg: A Tooth Center-Wise Attention Network for 3D Dental Model Semantic Segmentation
Qiang He, Wentian Qu, ... (+10 more)
|
|
cs.CV
|
0 |
2 months ago |
| 21 |
Video-based Heart Rate Estimation with Angle-guided ROI Optimization and Graph Signal Denoising
Gan Pei, Junhao Ning, ... (+3 more)
|
|
cs.CV
|
0 |
1 month ago |
| 22 |
LDEPrompt: Layer-importance guided Dual Expandable Prompt Pool for Pre-trained Model-based Class-Incremental Learning
Linjie Li, Zhenyu Wu, ... (+2 more)
|
|
cs.CV
|
0 |
1 month ago |
| 23 |
Cross-Cultural Bias in Mel-Scale Representations: Evidence and Alternatives from Speech and Music
Shivam Chauhan, Ajay Pundhir
|
|
cs.SD
|
0 |
1 month ago |
| 24 |
Aligning Language Models for Lyric-to-Melody Generation with Rule-Based Musical Constraints
Hao Meng, Siyuan Zheng, ... (+3 more)
|
|
cs.SD
|
0 |
1 month ago |
| 25 |
Incremental learning for audio classification with Hebbian Deep Neural Networks
Riccardo Casciotti, Francesco De Santis, ... (+2 more)
|
|
eess.AS
|
0 |
1 month ago |
| 26 |
DIRCR: Dual-Inference Rule-Contrastive Reasoning for Solving RAVENs
Jiachen Zhang, Chengtai Li, ... (+4 more)
|
|
cs.AI
|
0 |
1 month ago |
| 27 |
AutoVQA-G: Self-Improving Agentic Framework for Automated Visual Question Answering and Grounding Annotation
Rongsheng Hu, Runwei Guan, ... (+3 more)
|
|
cs.CV
|
0 |
1 month ago |
| 28 |
Towards Generalizable Deepfake Image Detection with Vision Transformers
Kaliki V Srinanda, M Manvith Prabhu, ... (+5 more)
|
|
cs.CV
|
0 |
1 month ago |
| 29 |
Frequency-guided Multi-level Reasoning for Scene Graph Generation in Video
Chenxing Li, Yiping Duan, Xiaoming Tao
|
|
cs.CV
|
0 |
1 month ago |
| 30 |
Motion-Guided Semantic Alignment with Negative Prompts for Zero-Shot Video Action Recognition
Yiming Wang, Frederick W. B. Li, Jingyun Wang
|
|
cs.CV
|
0 |
1 month ago |
| 31 |
SCHK-HTC: Sibling Contrastive Learning with Hierarchical Knowledge-Aware Prompt Tuning for Hierarchical Text Classification
Ke Xiong, Qian Wu, ... (+3 more)
|
|
cs.CL
|
0 |
1 month ago |
| 32 |
Learning Affine-Equivariant Proximal Operators
Oriel Savir, Zhenghan Fang, Jeremias Sulam
|
|
cs.LG
|
0 |
1 month ago |
| 33 |
SmoGVLM: A Small, Graph-enhanced Vision-Language Model
Debjyoti Mondal, Rituraj Singh, Subhadarshi Panda
|
|
cs.CV
|
0 |
1 month ago |
| 34 |
Capacity Analysis of OFDM Systems with a Swarm of Network-Controlled Repeaters
Doğa Evgür, Ozan Alp Topal, Özlem Tuğfe Demir
|
|
eess.SP
|
0 |
1 month ago |
| 35 |
Network-Controlled Repeaters Under Power Amplifier Non-linearities
Özlem Tuğfe Demir, Emil Björnson
|
|
eess.SP
|
0 |
1 month ago |
| 36 |
Robust Grounding with MLLMs against Occlusion and Small Objects via Language-guided Semantic Cues
Beomchan Park, Seongho Kim, ... (+3 more)
|
|
cs.CV
|
0 |
1 month ago |
| 37 |
Exploring Audio Hallucination in Egocentric Video Understanding
Ashish Seth, Xinhao Mei, ... (+10 more)
|
|
cs.CV
|
0 |
1 month ago |
| 38 |
Agri-CPJ: A Training-Free Explainable Framework for Agricultural Pest Diagnosis Using Caption-Prompt-Judge and LLM-as-a-Judge
Wentao Zhang, Qi Zhang, ... (+7 more)
|
|
cs.CL
|
0 |
1 month ago |
| 39 |
DARC-CLIP: Dynamic Adaptive Refinement with Cross-Attention for Meme Understanding
Qiyuan Jin
|
|
cs.CL
|
0 |
1 month ago |
| 40 |
KD-CVG: A Knowledge-Driven Approach for Creative Video Generation
Linkai Liu, Wei Feng, ... (+10 more)
|
|
cs.CV
|
0 |
1 month ago |
| 41 |
Differentially Private Clustered Federated Learning with Privacy-Preserving Initialization and Normality-Driven Aggregation
Jie Xu, Haaris Mehmood, ... (+3 more)
|
|
cs.LG
|
0 |
1 month ago |
| 42 |
Unleashing Vision Transformer Potential In Image Quality Assessment via Global-Local Adaptive Interaction
Yu Li, Puchao Zhou, ... (+4 more)
|
|
cs.CV
|
0 |
9 days ago |
| 43 |
t-gems: text-guided exit modules for decreasing clip image encoder
Alberto Presta, Grzegorz Stefanski, ... (+2 more)
|
|
cs.LG
|
0 |
10 days ago |
| 44 |
A Distribution Matching Approach to Neural Piano Transcription with Optimal Transport
Weixing Wei, Raynaldi Lalang, ... (+2 more)
|
|
cs.SD
|
0 |
10 days ago |
| 45 |
Learning Fill-in Reduction Ordering via Graph Policy Optimization for Sparse Matrices
Ziwei Li, Shuzi Niu, ... (+3 more)
|
|
cs.LG
|
0 |
10 days ago |
| 46 |
Taming Audio VAEs via Target-KL Regularization
Prem Seetharaman, Rithesh Kumar
|
|
cs.SD
|
0 |
11 days ago |
| 47 |
QuChaTeR: A Hybrid Quantum-Chaotic Temporal Framework for Earthquake Prediction
Emir Kaan Özdemir
|
|
cs.LG
|
0 |
13 days ago |