| 101 |
CARE-Edit: Condition-Aware Routing of Experts for Contextual Image Editing
Yucheng Wang, Zedong Wang, ... (+3 more)
|
|
cs.CV
|
0 |
2 months ago |
| 102 |
GraphVLM: Benchmarking Vision Language Models for Multimodal Graph Learning
Jiajin Liu, Dongzhe Fan, ... (+3 more)
|
|
cs.CV
|
0 |
2 months ago |
| 103 |
All Vehicles Can Lie: Efficient Adversarial Defense in Fully Untrusted-Vehicle Collaborative Perception via Pseudo-Random Bayesian Inference
Yi Yu, Libing Wu, ... (+4 more)
|
|
cs.CV
|
0 |
2 months ago |
| 104 |
X-AVDT: Audio-Visual Cross-Attention for Robust Deepfake Detection
Youngseo Kim, Kwan Yun, ... (+4 more)
|
|
cs.CV
|
0 |
2 months ago |
| 105 |
Concept-Guided Fine-Tuning: Steering ViTs away from Spurious Correlations to Improve Robustness
Yehonatan Elisha, Oren Barkan, Noam Koenigstein
|
|
cs.CV
|
0 |
2 months ago |
| 106 |
Prototype-Guided Concept Erasure in Diffusion Models
Yuze Cai, Jiahao Lu, ... (+3 more)
|
|
cs.CV
|
0 |
2 months ago |
| 107 |
WaDi: Weight Direction-aware Distillation for One-step Image Synthesis
Lei Wang, Yang Cheng, ... (+4 more)
|
|
cs.CV
|
0 |
2 months ago |
| 108 |
SAVE: Speech-Aware Video Representation Learning for Video-Text Retrieval
Ruixiang Zhao, Zhihao Xu, ... (+4 more)
|
|
cs.CV
|
0 |
2 months ago |
| 109 |
TALON: Test-time Adaptive Learning for On-the-Fly Category Discovery
Yanan Wu, Yuhan Yan, ... (+6 more)
|
|
cs.CV
|
0 |
2 months ago |
| 110 |
Speed3R: Sparse Feed-forward 3D Reconstruction Models
Weining Ren, Xiao Tan, Kai Han
|
|
cs.CV
|
0 |
2 months ago |
| 111 |
Missing No More: Dictionary-Guided Cross-Modal Image Fusion under Missing Infrared
Yafei Zhang, Meng Ma, ... (+2 more)
|
|
cs.CV
|
0 |
2 months ago |
| 112 |
It's Time to Get It Right: Improving Analog Clock Reading and Clock-Hand Spatial Reasoning in Vision-Language Models
Jaeha Choi, Jin Won Lee, ... (+2 more)
|
|
cs.CV
|
0 |
2 months ago |
| 113 |
TeamHOI: Learning a Unified Policy for Cooperative Human-Object Interactions with Any Team Size
Stefan Lionar, Gim Hee Lee
|
|
cs.CV
|
0 |
2 months ago |
| 114 |
On the Feasibility and Opportunity of Autoregressive 3D Object Detection
Zanming Huang, Jinsu Yoo, ... (+7 more)
|
|
cs.CV
|
0 |
2 months ago |
| 115 |
VisualAD: Language-Free Zero-Shot Anomaly Detection via Vision Transformer
Yanning Hou, Peiyuan Li, ... (+5 more)
|
|
cs.CV
|
0 |
2 months ago |
| 116 |
Beyond Heuristic Prompting: A Concept-Guided Bayesian Framework for Zero-Shot Image Recognition
Hui Liu, Kecheng Chen, ... (+4 more)
|
|
cs.CV
|
0 |
2 months ago |
| 117 |
Revisiting Unknowns: Towards Effective and Efficient Open-Set Active Learning
Chen-Chen Zong, Yu-Qi Chi, ... (+3 more)
|
|
cs.CV
|
0 |
2 months ago |
| 118 |
DUO-VSR: Dual-Stream Distillation for One-Step Video Super-Resolution
Zhengyao Lv, Menghan Xia, ... (+2 more)
|
|
cs.CV
|
0 |
2 months ago |
| 119 |
UniDex: A Robot Foundation Suite for Universal Dexterous Hand Control from Egocentric Human Videos
Gu Zhang, Qicheng Xu, ... (+17 more)
|
|
cs.RO
|
0 |
2 months ago |
| 120 |
PAM: A Pose-Appearance-Motion Engine for Sim-to-Real HOI Video Generation
Mingju Gao, Kaisen Yang, ... (+13 more)
|
|
cs.CV
|
0 |
2 months ago |
| 121 |
Seeing is Improving: Visual Feedback for Iterative Text Layout Refinement
Junrong Guo, Shancheng Fang, ... (+2 more)
|
|
cs.CV
|
0 |
2 months ago |
| 122 |
DA-VAE: Plug-in Latent Compression for Diffusion via Detail Alignment
Xin Cai, Zhiyuan You, ... (+2 more)
|
|
cs.CV
|
0 |
2 months ago |
| 123 |
FreeArtGS: Articulated Gaussian Splatting Under Free-moving Scenario
Hang Dai, Hongwei Fan, ... (+4 more)
|
|
cs.CV
|
0 |
2 months ago |
| 124 |
Principled Steering via Null-space Projection for Jailbreak Defense in Vision-Language Models
Xingyu Zhu, Beier Zhu, ... (+5 more)
|
|
cs.CV
|
0 |
2 months ago |
| 125 |
Adapting Point Cloud Analysis via Multimodal Bayesian Distribution Learning
Xingyu Zhu, Liang Yi, ... (+5 more)
|
|
cs.CV
|
0 |
2 months ago |
| 126 |
FontCrafter: High-Fidelity Element-Driven Artistic Font Creation with Visual In-Context Generation
Wuyang Luo, Chengkai Tan, ... (+4 more)
|
|
cs.CV
|
0 |
2 months ago |
| 127 |
GeoFusion-CAD: Structure-Aware Diffusion with Geometric State Space for Parametric 3D Design
Xiaolei Zhou, Chuangjie Fang, ... (+4 more)
|
|
cs.CV
|
0 |
2 months ago |
| 128 |
Unified Spatiotemporal Token Compression for Video-LLMs at Ultra-Low Retention
Junhao Du, Jialong Xue, ... (+3 more)
|
|
cs.CV
|
0 |
2 months ago |
| 129 |
GeoFlow: Real-Time Fine-Grained Cross-View Geolocalization via Iterative Flow Prediction
Ayesh Abu Lehyeh, Xiaohan Zhang, ... (+4 more)
|
|
cs.CV
|
0 |
2 months ago |
| 130 |
Cross-Instance Gaussian Splatting Registration via Geometry-Aware Feature-Guided Alignment
Roy Amoyal, Oren Freifeld, Chaim Baskin
|
|
cs.CV
|
0 |
2 months ago |
| 131 |
The Golden Subspace: Where Efficiency Meets Generalization in Continual Test-Time Adaptation
Guannan Lai, Da-Wei Zhou, ... (+2 more)
|
|
cs.CV
|
0 |
2 months ago |
| 132 |
SteelDefectX: A Coarse-to-Fine Vision-Language Dataset and Benchmark for Generalizable Steel Surface Defect Detection
Shuxian Zhao, Jie Gui, ... (+3 more)
|
|
cs.CV
|
0 |
2 months ago |
| 133 |
The Universal Normal Embedding
Chen Tasker, Roy Betser, ... (+3 more)
|
|
cs.CV
|
0 |
2 months ago |
| 134 |
BiPreManip: Learning Affordance-Based Bimanual Preparatory Manipulation through Anticipatory Collaboration
Yan Shen, Feng Jiang, ... (+6 more)
|
|
cs.RO
|
0 |
2 months ago |
| 135 |
OmniFM: Toward Modality-Robust and Task-Agnostic Federated Learning for Heterogeneous Medical Imaging
Meilin Liu, Jiaying Wang, Jing Shan
|
|
cs.CV
|
0 |
2 months ago |
| 136 |
PGR-Net: Prior-Guided ROI Reasoning Network for Brain Tumor MRI Segmentation
Jiacheng Lu, Hui Ding, ... (+2 more)
|
|
cs.CV
|
0 |
2 months ago |
| 137 |
PEARL: Geometry Aligns Semantics for Training-Free Open-Vocabulary Semantic Segmentation
Gensheng Pei, Xiruo Jiang, ... (+4 more)
|
|
cs.CV
|
0 |
2 months ago |
| 138 |
Back to Point: Exploring Point-Language Models for Zero-Shot 3D Anomaly Detection
Kaiqiang Li, Gang Li, ... (+4 more)
|
|
cs.CV
|
0 |
2 months ago |
| 139 |
Parameter-efficient Prompt Tuning and Hierarchical Textual Guidance for Few-shot Whole Slide Image Classification
Jayanie Bogahawatte, Sachith Seneviratne, Saman Halgamuge
|
|
cs.CV
|
0 |
2 months ago |
| 140 |
Which Concepts to Forget and How to Refuse? Decomposing Concepts for Continual Unlearning in Large Vision-Language Models
Hyundong Jin, Dongyoon Han, Eunwoo Kim
|
|
cs.CV
|
0 |
2 months ago |
| 141 |
Uncertainty-Aware Knowledge Distillation for Multimodal Large Language Models
Jingchen Sun, Shaobo Han, ... (+4 more)
|
|
cs.CV
|
0 |
2 months ago |
| 142 |
FluidGaussian: Propagating Simulation-Based Uncertainty Toward Functionally-Intelligent 3D Reconstruction
Yuqiu Liu, Jialin Song, ... (+6 more)
|
|
cs.CV
|
0 |
2 months ago |
| 143 |
EmoTaG: Emotion-Aware Talking Head Synthesis on Gaussian Splatting with Few-Shot Personalization
Haolan Xu, Keli Cheng, ... (+3 more)
|
|
cs.CV
|
0 |
2 months ago |
| 144 |
Text-Image Conditioned 3D Generation
Jiazhong Cen, Jiemin Fang, ... (+9 more)
|
|
cs.CV
|
0 |
2 months ago |
| 145 |
Plant Taxonomy Meets Plant Counting: A Fine-Grained, Taxonomic Dataset for Counting Hundreds of Plant Species
Jinyu Xu, Tianqi Hu, ... (+5 more)
|
|
cs.CV
|
0 |
2 months ago |
| 146 |
Reframing Long-Tailed Learning via Loss Landscape Geometry
Shenghan Chen, Yiming Liu, ... (+3 more)
|
|
cs.CV
|
0 |
2 months ago |
| 147 |
Frequency Switching Mechanism for Parameter-Efficient Multi-Task Learning
Shih-Wen Liu, Yen-Chang Chen, ... (+3 more)
|
|
cs.CV
|
0 |
2 months ago |
| 148 |
Taming Sampling Perturbations with Variance Expansion Loss for Latent Diffusion Models
Qifan Li, Xingyu Zhou, ... (+3 more)
|
|
cs.CV
|
0 |
2 months ago |
| 149 |
CoVFT: Context-aware Visual Fine-tuning for Multimodal Large Language Models
Nan Zhou, Huiqun Wang, ... (+2 more)
|
|
cs.CV
|
0 |
2 months ago |
| 150 |
CTFS: Collaborative Teacher Framework for Forward-Looking Sonar Image Semantic Segmentation with Extremely Limited Labels
Ping Guo, Chengzhou Li, ... (+7 more)
|
|
cs.CV
|
0 |
2 months ago |