| 51 |
Real-World Point Tracking with Verifier-Guided Pseudo-Labeling
Görkay Aydemir, Fatma Güney, Weidi Xie
|
|
cs.CV
|
0 |
2 months ago |
| 52 |
SaPaVe: Towards Active Perception and Manipulation in Vision-Language-Action Models for Robotics
Mengzhen Liu, Enshen Zhou, ... (+7 more)
|
|
cs.RO
|
0 |
2 months ago |
| 53 |
HATS: Hardness-Aware Trajectory Synthesis for GUI Agents
Rui Shao, Ruize Gao, ... (+6 more)
|
|
cs.CV
|
0 |
2 months ago |
| 54 |
Towards Universal Computational Aberration Correction in Photographic Cameras: A Comprehensive Benchmark Analysis
Xiaolong Qian, Qi Jiang, ... (+6 more)
|
|
cs.CV
|
0 |
2 months ago |
| 55 |
Node-RF: Learning Generalized Continuous Space-Time Scene Dynamics with Neural ODE-based NeRFs
Hiran Sarkar, Liming Kuang, ... (+2 more)
|
|
cs.CV
|
0 |
2 months ago |
| 56 |
Intrinsic Concept Extraction Based on Compositional Interpretability
Hanyu Shi, Hong Tao, ... (+6 more)
|
|
cs.CV
|
0 |
2 months ago |
| 57 |
UCAN: Unified Convolutional Attention Network for Expansive Receptive Fields in Lightweight Super-Resolution
Cao Thien Tan, Phan Thi Thu Trang, ... (+4 more)
|
|
cs.CV
|
0 |
2 months ago |
| 58 |
Stable Spike: Dual Consistency Optimization via Bitwise AND Operations for Spiking Neural Networks
Yongqi Ding, Kunshan Yang, ... (+4 more)
|
|
cs.NE
|
0 |
2 months ago |
| 59 |
PROMO: Promptable Outfitting for Efficient High-Fidelity Virtual Try-On
Haohua Chen, Tianze Zhou, ... (+9 more)
|
|
cs.CV
|
0 |
2 months ago |
| 60 |
Tokenization Allows Multimodal Large Language Models to Understand, Generate and Edit Architectural Floor Plans
Sizhong Qin, Ramon Elias Weber, Xinzheng Lu
|
|
cs.CV
|
0 |
2 months ago |
| 61 |
Shape-of-You: Fused Gromov-Wasserstein Optimal Transport for Semantic Correspondence in-the-Wild
Jiin Im, Sisung Liu, Je Hyeong Hong
|
|
cs.CV
|
0 |
2 months ago |
| 62 |
LaMoGen: Language to Motion Generation Through LLM-Guided Symbolic Inference
Junkun Jiang, Ho Yin Au, ... (+2 more)
|
|
cs.CV
|
0 |
2 months ago |
| 63 |
R4Det: 4D Radar-Camera Fusion for High-Performance 3D Object Detection
Zhongyu Xia, Yousen Tang, ... (+3 more)
|
|
cs.CV
|
0 |
2 months ago |
| 64 |
EReCu: Pseudo-label Evolution Fusion and Refinement with Multi-Cue Learning for Unsupervised Camouflage Detection
Shuo Jiang, Gaojia Zhang, ... (+3 more)
|
|
cs.CV
|
0 |
2 months ago |
| 65 |
SPEGC: Continual Test-Time Adaptation via Semantic-Prompt-Enhanced Graph Clustering for Medical Image Segmentation
Xiaogang Du, Jiawei Zhang, ... (+3 more)
|
|
cs.CV
|
0 |
2 months ago |
| 66 |
Follow the Saliency: Supervised Saliency for Retrieval-augmented Dense Video Captioning
Seung hee Choi, MinJu Jeon, ... (+3 more)
|
|
cs.CV
|
0 |
2 months ago |
| 67 |
Stay in your Lane: Role Specific Queries with Overlap Suppression Loss for Dense Video Captioning
Seung Hyup Baek, Jimin Lee, ... (+2 more)
|
|
cs.CV
|
0 |
2 months ago |
| 68 |
Learning to Assist: Physics-Grounded Human-Human Control via Multi-Agent Reinforcement Learning
Yuto Shibata, Kashu Yamazaki, ... (+4 more)
|
|
cs.CV
|
0 |
2 months ago |
| 69 |
GGPT: Geometry Grounded Point Transformer
Yutong Chen, Yiming Wang, ... (+3 more)
|
|
cs.CV
|
0 |
2 months ago |
| 70 |
Lifelong Imitation Learning with Multimodal Latent Replay and Incremental Adjustment
Fanqi Yu, Matteo Tiezzi, ... (+3 more)
|
|
cs.CV
|
0 |
2 months ago |
| 71 |
Bilevel Layer-Positioning LoRA for Real Image Dehazing
Yan Zhang, Long Ma, ... (+4 more)
|
|
cs.CV
|
0 |
2 months ago |
| 72 |
Guiding Diffusion Models with Semantically Degraded Conditions
Shilong Han, Yuming Zhang, Hongxia Wang
|
|
cs.CV
|
0 |
2 months ago |
| 73 |
COT-FM: Cluster-wise Optimal Transport Flow Matching
Chiensheng Chiang, Kuan-Hsun Tu, ... (+3 more)
|
|
cs.CV
|
0 |
2 months ago |
| 74 |
Recover to Predict: Progressive Retrospective Learning for Variable-Length Trajectory Prediction
Hao Zhou, Lu Qi, ... (+6 more)
|
|
cs.RO
|
0 |
2 months ago |
| 75 |
Attribution as Retrieval: Model-Agnostic AI-Generated Image Attribution
Hongsong Wang, Renxi Cheng, ... (+2 more)
|
|
cs.CV
|
0 |
2 months ago |
| 76 |
DSFlash: Comprehensive Panoptic Scene Graph Generation in Realtime
Julian Lorenz, Vladyslav Kovganko, ... (+4 more)
|
|
cs.CV
|
0 |
2 months ago |
| 77 |
Sparse Task Vector Mixup with Hypernetworks for Efficient Knowledge Transfer in Whole-Slide Image Prognosis
Pei Liu, Xiangxiang Zeng, ... (+4 more)
|
|
cs.CV
|
0 |
2 months ago |
| 78 |
Fighting Hallucinations with Counterfactuals: Diffusion-Guided Perturbations for LVLM Hallucination Suppression
Hamidreza Dastmalchi, Aijun An, ... (+2 more)
|
|
cs.CV
|
0 |
2 months ago |
| 79 |
Federated Active Learning Under Extreme Non-IID and Global Class Imbalance
Chen-Chen Zong, Sheng-Jun Huang
|
|
cs.LG
|
0 |
2 months ago |
| 80 |
HG-Lane: High-Fidelity Generation of Lane Scenes under Adverse Weather and Lighting Conditions without Re-annotation
Daichao Zhao, Qiupu Chen, ... (+3 more)
|
|
cs.CV
|
0 |
2 months ago |
| 81 |
4DEquine: Disentangling Motion and Appearance for 4D Equine Reconstruction from Monocular Video
Jin Lyu, Liang An, ... (+3 more)
|
|
cs.CV
|
0 |
2 months ago |
| 82 |
VLM-Loc: Localization in Point Cloud Maps via Vision-Language Models
Shuhao Kang, Youqi Liao, ... (+6 more)
|
|
cs.CV
|
0 |
2 months ago |
| 83 |
Test-time Ego-Exo-centric Adaptation for Action Anticipation via Multi-Label Prototype Growing and Dual-Clue Consistency
Zhaofeng Shi, Heqian Qiu, ... (+5 more)
|
|
cs.CV
|
0 |
2 months ago |
| 84 |
VarSplat: Uncertainty-aware 3D Gaussian Splatting for Robust RGB-D SLAM
Anh Thuan Tran, Jana Kosecka
|
|
cs.CV
|
0 |
2 months ago |
| 85 |
ParTY: Part-Guidance for Expressive Text-to-Motion Synthesis
KunHo Heo, SuYeon Kim, ... (+3 more)
|
|
cs.CV
|
0 |
2 months ago |
| 86 |
BinaryAttention: One-Bit QK-Attention for Vision and Diffusion Transformers
Chaodong Xiao, Zhengqiang Zhang, Lei Zhang
|
|
cs.CV
|
0 |
2 months ago |
| 87 |
More than the Sum: Panorama-Language Models for Adverse Omni-Scenes
Weijia Fan, Ruiping Liu, ... (+8 more)
|
|
cs.CV
|
0 |
2 months ago |
| 88 |
Context-Nav: Context-Driven Exploration and Viewpoint-Aware 3D Spatial Reasoning for Instance Navigation
Won Shik Jang, Ue-Hwan Kim
|
|
cs.CV
|
0 |
2 months ago |
| 89 |
CIGPose: Causal Intervention Graph Neural Network for Whole-Body Pose Estimation
Bohao Li, Zhicheng Cao, ... (+2 more)
|
|
cs.CV
|
0 |
2 months ago |
| 90 |
Reviving ConvNeXt for Efficient Convolutional Diffusion Models
Taesung Kwon, Lorenzo Bianchi, ... (+6 more)
|
|
cs.CV
|
0 |
2 months ago |
| 91 |
OddGridBench: Exposing the Lack of Fine-Grained Visual Discrepancy Sensitivity in Multimodal Large Language Models
Tengjin Weng, Wenhao Jiang, ... (+4 more)
|
|
cs.CV
|
0 |
2 months ago |
| 92 |
Speeding Up the Learning of 3D Gaussians with Much Shorter Gaussian Lists
Jiaqi Liu, Zhizhong Han
|
|
cs.CV
|
0 |
2 months ago |
| 93 |
ForgeDreamer: Industrial Text-to-3D Generation with Multi-Expert LoRA and Cross-View Hypergraph
Junhao Cai, Deyu Zeng, ... (+4 more)
|
|
cs.CV
|
0 |
2 months ago |
| 94 |
Implicit Geometry Representations for Vision-and-Language Navigation from Web Videos
Mingfei Han, Haihong Hao, ... (+7 more)
|
|
cs.CV
|
0 |
2 months ago |
| 95 |
Training-free Motion Factorization for Compositional Video Generation
Zixuan Wang, Ziqin Zhou, ... (+5 more)
|
|
cs.CV
|
0 |
2 months ago |
| 96 |
Chain of Event-Centric Causal Thought for Physically Plausible Video Generation
Zixuan Wang, Yixin Hu, ... (+5 more)
|
|
cs.CV
|
0 |
2 months ago |
| 97 |
Vision-Language Models Encode Clinical Guidelines for Concept-Based Medical Reasoning
Mohamed Harmanani, Bining Long, ... (+7 more)
|
|
cs.CV
|
0 |
2 months ago |
| 98 |
Where, What, Why: Toward Explainable 3D-GS Watermarking
Mingshu Cai, Jiajun Li, ... (+3 more)
|
|
cs.CV
|
0 |
2 months ago |
| 99 |
Talking Together: Synthesizing Co-Located 3D Conversations from Audio
Mengyi Shan, Shouchieh Chang, ... (+7 more)
|
|
cs.CV
|
0 |
2 months ago |
| 100 |
StreamReady: Learning What to Answer and When in Long Streaming Videos
Shehreen Azad, Vibhav Vineet, Yogesh Singh Rawat
|
|
cs.CV
|
0 |
2 months ago |