| 601 |
Learning from Concealed Labels
Zhongnian Li, Meng Wei, ... (+3 more)
|
👻
Ghosted
|
cs.LG
|
1 |
1 year ago |
| 602 |
Compositional Zero-Shot Learning with Contextualized Cues and Adaptive Contrastive Training
Yun Li, Zhe Liu, Lina Yao
|
👻
Ghosted
|
cs.CV
|
1 |
1 year ago |
| 603 |
Enhancing Modality Representation and Alignment for Multimodal Cold-start Active Learning
Meng Shen, Yake Wei, ... (+4 more)
|
👻
Ghosted
|
cs.MM
|
1 |
1 year ago |
| 604 |
Generative Ghost: Investigating Ranking Bias Hidden in AI-Generated Videos
Haowen Gao, Liang Pang, ... (+5 more)
|
👻
Ghosted
|
cs.IR
|
1 |
1 year ago |
| 605 |
Towards Explainable Partial-AIGC Image Quality Assessment
Jiaying Qian, Ziheng Jia, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
1 |
1 year ago |
| 606 |
InterAnimate: Taming Region-aware Diffusion Model for Realistic Human Interaction Animation
Yukang Lin, Yan Hong, ... (+11 more)
|
👻
Ghosted
|
cs.CV
|
1 |
1 year ago |
| 607 |
Casual3DHDR: Deblurring High Dynamic Range 3D Gaussian Splatting from Casually Captured Videos
Shucheng Gong, Lingzhe Zhao, ... (+5 more)
|
👻
Ghosted
|
cs.CV
|
1 |
1 year ago |
| 608 |
Small Stickers, Big Meanings: A Multilingual Sticker Semantic Understanding Dataset with a Gamified Approach
Heng Er Metilda Chee, Jiayin Wang, ... (+3 more)
|
👻
Ghosted
|
cs.MM
|
1 |
1 year ago |
| 609 |
EyeNavGS: A 6-DoF Navigation Dataset and Record-n-Replay Software for Real-World 3DGS Scenes in VR
Zihao Ding, Cheng-Tse Lee, ... (+5 more)
|
👻
Ghosted
|
cs.MM
|
1 |
1 year ago |
| 610 |
Argus Inspection: Do Multimodal Large Language Models Possess the Eye of Panoptes?
Yang Yao, Lingyu Li, ... (+9 more)
|
👻
Ghosted
|
cs.CV
|
1 |
1 year ago |
| 611 |
Let Your Video Listen to Your Music!
Xinyu Zhang, Dong Gong, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
1 |
12 months ago |
| 612 |
HeLo: Heterogeneous Multi-Modal Fusion with Label Correlation for Emotion Distribution Learning
Chuhang Zheng, Chunwei Tian, ... (+3 more)
|
👻
Ghosted
|
cs.LG
|
1 |
11 months ago |
| 613 |
Wavelet-GS: 3D Gaussian Splatting with Wavelet Decomposition
Beizhen Zhao, Yifan Zhou, ... (+3 more)
|
👻
Ghosted
|
cs.GR
|
1 |
11 months ago |
| 614 |
SpA2V: Harnessing Spatial Auditory Cues for Audio-driven Spatially-aware Video Generation
Kien T. Pham, Yingqing He, ... (+3 more)
|
👻
Ghosted
|
cs.GR
|
1 |
10 months ago |
| 615 |
OpenMap: Instruction Grounding via Open-Vocabulary Visual-Language Mapping
Danyang Li, Zenghui Yang, ... (+5 more)
|
👻
Ghosted
|
cs.RO
|
1 |
10 months ago |
| 616 |
T2UE: Generating Unlearnable Examples from Text Descriptions
Xingjun Ma, Hanxun Huang, ... (+4 more)
|
👻
Ghosted
|
cs.AI
|
1 |
10 months ago |
| 617 |
I$^3$-MRec: Invariant Learning with Information Bottleneck for Incomplete Modality Recommendation
Huilin Chen, Miaomiao Cai, ... (+4 more)
|
👻
Ghosted
|
cs.IR
|
1 |
10 months ago |
| 618 |
Universally Unfiltered and Unseen:Input-Agnostic Multimodal Jailbreaks against Text-to-Image Model Safeguards
Song Yan, Hui Wei, ... (+4 more)
|
👻
Ghosted
|
cs.CR
|
1 |
10 months ago |
| 619 |
SimViews: An Interactive Multi-Agent System Simulating Visitor-to-Visitor Conversational Patterns to Present Diverse Perspectives of Artifacts in Virtual Museums
Mingyang Su, Chao Liu, ... (+3 more)
|
👻
Ghosted
|
cs.HC
|
1 |
10 months ago |
| 620 |
Advancing 3D Scene Understanding with MV-ScanQA Multi-View Reasoning Evaluation and TripAlign Pre-training Dataset
Wentao Mo, Qingchao Chen, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
1 |
10 months ago |
| 621 |
INDS: Incremental Named Data Streaming for Real-Time Point Cloud Video
Ruonan Chai, Yixiang Zhu, ... (+4 more)
|
👻
Ghosted
|
cs.MM
|
1 |
10 months ago |
| 622 |
Hierarchical Vision-Language Reasoning for Multimodal Multiple-Choice Question Answering
Ao Zhou, Zebo Gu, ... (+7 more)
|
👻
Ghosted
|
cs.IR
|
1 |
10 months ago |
| 623 |
MoTAS: MoE-Guided Feature Selection from TTS-Augmented Speech for Enhanced Multimodal Alzheimer's Early Screening
Yongqi Shao, Binxin Mei, ... (+3 more)
|
👻
Ghosted
|
cs.SD
|
1 |
9 months ago |
| 624 |
PRINTER:Deformation-Aware Adversarial Learning for Virtual IHC Staining with In Situ Fidelity
Yizhe Yuan, Bingsen Xue, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
1 |
9 months ago |
| 625 |
Multi-level SSL Feature Gating for Audio Deepfake Detection
Hoan My Tran, Damien Lolive, ... (+4 more)
|
👻
Ghosted
|
cs.SD
|
1 |
9 months ago |
| 626 |
PG-Agent: An Agent Powered by Page Graph
Weizhi Chen, Ziwei Wang, ... (+6 more)
|
👻
Ghosted
|
cs.AI
|
1 |
9 months ago |
| 627 |
Tackling Device Data Distribution Real-time Shift via Prototype-based Parameter Editing
Zheqi Lv, Wenqiao Zhang, ... (+7 more)
|
👻
Ghosted
|
cs.LG
|
1 |
9 months ago |
| 628 |
A New Dataset and Benchmark for Grounding Multimodal Misinformation
Bingjian Yang, Danni Xu, ... (+4 more)
|
👻
Ghosted
|
cs.SI
|
1 |
9 months ago |
| 629 |
SemanticGarment: Semantic-Controlled Generation and Editing of 3D Gaussian Garments
Ruiyan Wang, Zhengxue Cheng, ... (+6 more)
|
👻
Ghosted
|
cs.GR
|
1 |
9 months ago |
| 630 |
Harnessing Multimodal Large Language Models for Personalized Product Search with Query-aware Refinement
Beibei Zhang, Yanan Lu, ... (+5 more)
|
👻
Ghosted
|
cs.MM
|
1 |
8 months ago |
| 631 |
ReSSFormer: A Recursive Sparse Structured Transformer for Scalable and Long-Context Reasoning
Haochen You, Baojing Liu
|
👻
Ghosted
|
cs.CL
|
1 |
8 months ago |
| 632 |
ReactDiff: Fundamental Multiple Appropriate Facial Reaction Diffusion Model
Luo Cheng, Song Siyang, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
1 |
8 months ago |
| 633 |
One Size Fits All? A Modular Adaptive Sanitization Kit (MASK) for Customizable Privacy-Preserving Phone Scam Detection
Kangzhong Wang, Zitong Shen, ... (+5 more)
|
👻
Ghosted
|
cs.CR
|
1 |
8 months ago |
| 634 |
Reasoning Like Experts: Leveraging Multimodal Large Language Models for Drawing-based Psychoanalysis
Xueqi Ma, Yanbei Jiang, ... (+5 more)
|
👻
Ghosted
|
cs.CV
|
1 |
7 months ago |
| 635 |
EventFormer: A Node-graph Hierarchical Attention Transformer for Action-centric Video Event Prediction
Qile Su, Shoutai Zhu, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
1 |
8 months ago |
| 636 |
PESTalk: Speech-Driven 3D Facial Animation with Personalized Emotional Styles
Tianshun Han, Benjia Zhou, ... (+5 more)
|
👻
Ghosted
|
cs.GR
|
1 |
8 months ago |
| 637 |
Towards Global Optimization in Display Advertising by Integrating Multimedia Metrics with Real-Time Bidding
Xiang Chen
|
👻
Ghosted
|
cs.GT
|
0 |
8 years ago |
| 638 |
DrawMon: A Distributed System for Detection of Atypical Sketch Content in Concurrent Pictionary Games
Nikhil Bansal, Kartik Gupta, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
0 |
3 years ago |
| 639 |
DiVa: An Iterative Framework to Harvest More Diverse and Valid Labels from User Comments for Music
Hongru Liang, Jingyao Liu, ... (+5 more)
|
👻
Ghosted
|
cs.IR
|
0 |
2 years ago |
| 640 |
Detecting Multimodal Situations with Insufficient Context and Abstaining from Baseless Predictions
Junzhang Liu, Zhecan Wang, ... (+6 more)
|
👻
Ghosted
|
cs.CV
|
0 |
2 years ago |
| 641 |
Protecting Copyright of Medical Pre-trained Language Models: Training-Free Backdoor Model Watermarking
Cong Kong, Rui Xu, ... (+3 more)
|
👻
Ghosted
|
cs.LG
|
0 |
1 year ago |
| 642 |
Evaluating the Evaluators: Towards Human-aligned Metrics for Missing Markers Reconstruction
Taras Kucherenko, Derek Peristy, Judith Bütepage
|
👻
Ghosted
|
cs.CV
|
0 |
1 year ago |
| 643 |
Block based Adaptive Compressive Sensing with Sampling Rate Control
Kosuke Iwama, Ryugo Morita, Jinjia Zhou
|
👻
Ghosted
|
cs.CV
|
0 |
1 year ago |
| 644 |
Seeing the Undefined: Chain-of-Action for Generative Semantic Labels
Meng Wei, Zhongnian Li, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
0 |
1 year ago |
| 645 |
CFSSeg: Closed-Form Solution for Class-Incremental Semantic Segmentation of 2D Images and 3D Point Clouds
Jiaxu Li, Rui Li, ... (+9 more)
|
👻
Ghosted
|
cs.CV
|
0 |
1 year ago |
| 646 |
SafeCFG: Controlling Harmful Features with Dynamic Safe Guidance for Safe Generation
Jiadong Pan, Liang Li, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
0 |
1 year ago |
| 647 |
MUDI: A Multimodal Biomedical Dataset for Understanding Pharmacodynamic Drug-Drug Interactions
Tung-Lam Ngo, Ba-Hoang Tran, ... (+4 more)
|
👻
Ghosted
|
cs.LG
|
0 |
1 year ago |
| 648 |
HER2 Expression Prediction with Flexible Multi-Modal Inputs via Dynamic Bidirectional Reconstruction
Jie Qin, Wei Yang, ... (+6 more)
|
👻
Ghosted
|
cs.MM
|
0 |
1 year ago |
| 649 |
MEGC2025: Micro-Expression Grand Challenge on Spot Then Recognize and Visual Question Answering
Xinqi Fan, Jingting Li, ... (+7 more)
|
👻
Ghosted
|
cs.CV
|
0 |
1 year ago |
| 650 |
MuteSwap: Visual-informed Silent Video Identity Conversion
Yifan Liu, Yu Fang, Zhouhan Lin
|
👻
Ghosted
|
cs.SD
|
0 |
11 months ago |