| 601 |
Scalable Compression of Deep Neural Networks
Xing Wang, Jie Liang
|
👻
Ghosted
|
cs.CV
|
4 |
9 years ago |
| 602 |
MixedGaussianAvatar: Realistically and Geometrically Accurate Head Avatar via Mixed 2D-3D Gaussians
Peng Chen, Xiaobao Wei, ... (+4 more)
|
⏳
Coming Soon™
|
cs.CV
|
4 |
1 year ago |
| 603 |
DGNS: Deformable Gaussian Splatting and Dynamic Neural Surface for Monocular Dynamic 3D Reconstruction
Xuesong Li, Jinguang Tong, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
4 |
1 year ago |
| 604 |
From Pixels to Tokens: Revisiting Object Hallucinations in Large Vision-Language Models
Yuying Shang, Xinyi Zeng, ... (+7 more)
|
👻
Ghosted
|
cs.CL
|
4 |
1 year ago |
| 605 |
Achieving Resolution-Agnostic DNN-based Image Watermarking: A Novel Perspective of Implicit Neural Representation
Yuchen Wang, Xingyu Zhu, ... (+3 more)
|
👻
Ghosted
|
cs.CR
|
4 |
2 years ago |
| 606 |
CoheDancers: Enhancing Interactive Group Dance Generation through Music-Driven Coherence Decomposition
Kaixing Yang, Xulong Tang, ... (+5 more)
|
⏳
Coming Soon™
|
cs.SD
|
4 |
1 year ago |
| 607 |
Regularized Contrastive Partial Multi-view Outlier Detection
Yijia Wang, Qianqian Xu, ... (+3 more)
|
👻
Ghosted
|
cs.MM
|
4 |
1 year ago |
| 608 |
An Inverse Partial Optimal Transport Framework for Music-guided Movie Trailer Generation
Yutong Wang, Sidan Zhu, ... (+2 more)
|
👻
Ghosted
|
cs.MM
|
4 |
1 year ago |
| 609 |
HSVLT: Hierarchical Scale-Aware Vision-Language Transformer for Multi-Label Image Classification
Shuyi Ouyang, Hongyi Wang, ... (+7 more)
|
👻
Ghosted
|
cs.CV
|
4 |
1 year ago |
| 610 |
TeViS:Translating Text Synopses to Video Storyboards
Xu Gu, Yuchong Sun, ... (+6 more)
|
👻
Ghosted
|
cs.CV
|
4 |
3 years ago |
| 611 |
JPS: Jailbreak Multimodal Large Language Models with Collaborative Visual Perturbation and Textual Steering
Renmiao Chen, Shiyao Cui, ... (+8 more)
|
💀
404 Not Found
|
cs.MM
|
3 |
10 months ago |
| 612 |
MaXsive: High-Capacity and Robust Training-Free Generative Image Watermarking in Diffusion Models
Po-Yuan Mao, Cheng-Chang Tsai, Chun-Shien Lu
|
👻
Ghosted
|
cs.CR
|
3 |
10 months ago |
| 613 |
StePO-Rec: Towards Personalized Outfit Styling Assistant via Knowledge-Guided Multi-Step Reasoning
Yuxi Bi, Yunfan Gao, Haofen Wang
|
👻
Ghosted
|
cs.IR
|
3 |
1 year ago |
| 614 |
ERR@HRI 2.0 Challenge: Multimodal Detection of Errors and Failures in Human-Robot Conversations
Shiye Cao, Maia Stiber, ... (+6 more)
|
👻
Ghosted
|
cs.RO
|
3 |
10 months ago |
| 615 |
Do Existing Testing Tools Really Uncover Gender Bias in Text-to-Image Models?
Yunbo Lyu, Zhou Yang, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
3 |
1 year ago |
| 616 |
VLMPlanner: Integrating Visual Language Models with Motion Planning
Zhipeng Tang, Sha Zhang, ... (+6 more)
|
⏳
Coming Soon™
|
cs.AI
|
3 |
10 months ago |
| 617 |
A Satellite-Ground Synergistic Large Vision-Language Model System for Earth Observation
Yuxin Zhang, Jiahao Yang, ... (+4 more)
|
👻
Ghosted
|
cs.NI
|
3 |
11 months ago |
| 618 |
ChoreoMuse: Robust Music-to-Dance Video Generation with Style Transfer and Beat-Adherent Motion
Xuanchen Wang, Heng Wang, Weidong Cai
|
👻
Ghosted
|
cs.GR
|
3 |
10 months ago |
| 619 |
RTR-GS: 3D Gaussian Splatting for Inverse Rendering with Radiance Transfer and Reflection
Yongyang Zhou, Fang-Lue Zhang, ... (+2 more)
|
👻
Ghosted
|
cs.GR
|
3 |
11 months ago |
| 620 |
FLAP: Fully-controllable Audio-driven Portrait Video Generation through 3D head conditioned diffusion model
Lingzhou Mu, Baiji Liu, ... (+5 more)
|
👻
Ghosted
|
cs.GR
|
3 |
1 year ago |
| 621 |
Controllable Video-to-Music Generation with Multiple Time-Varying Conditions
Junxian Wu, Weitao You, ... (+4 more)
|
👻
Ghosted
|
cs.MM
|
3 |
10 months ago |
| 622 |
KEN: Knowledge Augmentation and Emotion Guidance Network for Multimodal Fake News Detection
Peican Zhu, Yubo Jing, ... (+3 more)
|
👻
Ghosted
|
cs.MM
|
3 |
11 months ago |
| 623 |
Residual Prior-driven Frequency-aware Network for Image Fusion
Guan Zheng, Xue Wang, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
3 |
11 months ago |
| 624 |
HKD4VLM: A Progressive Hybrid Knowledge Distillation Framework for Robust Multimodal Hallucination and Factuality Detection in VLMs
Zijian Zhang, Xuecheng Wu, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
3 |
11 months ago |
| 625 |
Multiverse Through Deepfakes: The MultiFakeVerse Dataset of Person-Centric Visual and Conceptual Manipulations
Parul Gupta, Shreya Ghosh, ... (+3 more)
|
💀
404 Not Found
|
cs.MM
|
3 |
1 year ago |
| 626 |
SD-VSum: A Method and Dataset for Script-Driven Video Summarization
Manolis Mylonas, Evlampios Apostolidis, Vasileios Mezaris
|
👻
Ghosted
|
cs.CV
|
3 |
1 year ago |
| 627 |
MusFlow: Multimodal Music Generation via Conditional Flow Matching
Jiahao Song, Yuzhao Wang
|
👻
Ghosted
|
cs.SD
|
3 |
1 year ago |
| 628 |
CGCOD: Class-Guided Camouflaged Object Detection
Chenxi Zhang, Qing Zhang, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
3 |
1 year ago |
| 629 |
MetaDragonBoat: Exploring Paddling Techniques of Virtual Dragon Boating in a Metaverse Campus
Wei He, Xiang Li, ... (+5 more)
|
👻
Ghosted
|
cs.MM
|
3 |
1 year ago |
| 630 |
MATK: The Meme Analytical Tool Kit
Ming Shan Hee, Aditi Kumaresan, ... (+4 more)
|
💀
404 Not Found
|
cs.CL
|
3 |
2 years ago |
| 631 |
Restoration of Analog Videos Using Swin-UNet
Lorenzo Agnolucci, Leonardo Galteri, ... (+2 more)
|
💤
Eternal Rest
|
cs.CV
|
3 |
2 years ago |
| 632 |
Exploiting Diverse Feature for Multimodal Sentiment Analysis
Jia Li, Wei Qian, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
3 |
2 years ago |
| 633 |
Semantics Preserving Hierarchy based Retrieval of Indian heritage monuments
Ronak Gupta, Prerana Mukherjee, ... (+2 more)
|
👻
Ghosted
|
cs.MM
|
3 |
5 years ago |
| 634 |
Flavour Enhanced Food Recommendation
Nitish Nag, Aditya Bharadwaj, ... (+7 more)
|
👻
Ghosted
|
cs.SI
|
3 |
7 years ago |
| 635 |
Analyzing structural characteristics of object category representations from their semantic-part distributions
Ravi Kiran Sarvadevabhatla, Venkatesh Babu R
|
👻
Ghosted
|
cs.CV
|
3 |
10 years ago |
| 636 |
Individual Content and Motion Dynamics Preserved Pruning for Video Diffusion Models
Yiming Wu, Zhenghao Chen, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
3 |
1 year ago |
| 637 |
LinkThief: Combining Generalized Structure Knowledge with Node Similarity for Link Stealing Attack against GNN
Yuxing Zhang, Siyuan Meng, ... (+4 more)
|
👻
Ghosted
|
cs.CR
|
3 |
1 year ago |
| 638 |
Personalized Federated Learning via Backbone Self-Distillation
Pengju Wang, Bochao Liu, ... (+3 more)
|
👻
Ghosted
|
cs.LG
|
3 |
1 year ago |
| 639 |
3D Reconstruction and New View Synthesis of Indoor Environments based on a Dual Neural Radiance Field
Zhenyu Bao, Guibiao Liao, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
3 |
2 years ago |
| 640 |
AdaCoder: Adaptive Prompt Compression for Programmatic Visual Question Answering
Mahiro Ukai, Shuhei Kurita, ... (+3 more)
|
👻
Ghosted
|
cs.AI
|
3 |
1 year ago |
| 641 |
Towards Fast and Stable Federated Learning: Confronting Heterogeneity via Knowledge Anchor
Jinqian Chen, Jihua Zhu, Qinghai Zheng
|
👻
Ghosted
|
cs.LG
|
3 |
2 years ago |
| 642 |
Adaptive Multi-Modality Prompt Learning
Zongqian Wu, Yujing Liu, ... (+4 more)
|
👻
Ghosted
|
cs.LG
|
3 |
2 years ago |
| 643 |
Enhancing HOI Detection with Contextual Cues from Large Vision-Language Models
Yu-Wei Zhan, Fan Liu, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
3 |
2 years ago |
| 644 |
Queryable 3D Scene Representation: A Multi-Modal Framework for Semantic Reasoning and Robotic Task Planning
Xun Li, Rodrigo Santa Cruz, ... (+10 more)
|
👻
Ghosted
|
cs.RO
|
2 |
8 months ago |
| 645 |
Refining Contrastive Learning and Homography Relations for Multi-Modal Recommendation
Shouxing Ma, Yawen Zeng, ... (+2 more)
|
💀
404 Not Found
|
cs.IR
|
2 |
9 months ago |
| 646 |
LSC-ADL: An Activity of Daily Living (ADL)-Annotated Lifelog Dataset Generated via Semi-Automatic Clustering
Minh-Quan Ho-Le, Duy-Khang Ho, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
2 |
1 year ago |
| 647 |
Leveraging Weak Cross-Modal Guidance for Coherence Modelling via Iterative Learning
Yi Bin, Junrong Liao, ... (+5 more)
|
💀
404 Not Found
|
cs.MM
|
2 |
1 year ago |
| 648 |
Deformable NeRF using Recursively Subdivided Tetrahedra
Zherui Qiu, Chenqu Ren, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
2 |
1 year ago |
| 649 |
Perspective from a Higher Dimension: Can 3D Geometric Priors Help Visual Floorplan Localization?
Bolei Chen, Jiaxu Kang, ... (+3 more)
|
⏳
Coming Soon™
|
cs.CV
|
2 |
10 months ago |
| 650 |
BEAM: Bridging Physically-based Rendering and Gaussian Modeling for Relightable Volumetric Video
Yu Hong, Yize Wu, ... (+6 more)
|
👻
Ghosted
|
cs.GR
|
2 |
1 year ago |