| 101 |
Pro-Cap: Leveraging a Frozen Vision-Language Model for Hateful Meme Detection
Rui Cao, Ming Shan Hee, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
77 |
2 years ago |
| 102 |
DEPA: Self-Supervised Audio Embedding for Depression Detection
Pingyue Zhang, Mengyue Wu, ... (+2 more)
|
👻
Ghosted
|
cs.HC
|
74 |
6 years ago |
| 103 |
Semi-supervised Multi-modal Emotion Recognition with Cross-Modal Distribution Matching
Jingjun Liang, Ruichen Li, Qin Jin
|
👻
Ghosted
|
eess.AS
|
73 |
5 years ago |
| 104 |
Sync-DRAW: Automatic Video Generation using Deep Recurrent Attentive Architectures
Gaurav Mittal, Tanya Marwah, Vineeth N. Balasubramanian
|
👻
Ghosted
|
cs.CV
|
72 |
9 years ago |
| 105 |
Modeling the Resource Requirements of Convolutional Neural Networks on Mobile Devices
Zongqing Lu, Swati Rallapalli, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
72 |
8 years ago |
| 106 |
Attentive Crowd Flow Machines
Lingbo Liu, Ruimao Zhang, ... (+4 more)
|
👻
Ghosted
|
cs.LG
|
72 |
7 years ago |
| 107 |
SepMark: Deep Separable Watermarking for Unified Source Tracing and Deepfake Detection
Xiaoshuai Wu, Xin Liao, Bo Ou
|
👻
Ghosted
|
cs.CV
|
72 |
3 years ago |
| 108 |
Video Captioning with Guidance of Multimodal Latent Topics
Shizhe Chen, Jia Chen, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
71 |
8 years ago |
| 109 |
Explainable Video Action Reasoning via Prior Knowledge and State Transitions
Tao Zhuo, Zhiyong Cheng, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
71 |
6 years ago |
| 110 |
Eye in the Sky: Drone-Based Object Tracking and 3D Localization
Haotian Zhang, Gaoang Wang, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
71 |
6 years ago |
| 111 |
LinesToFacePhoto: Face Photo Generation from Lines with Conditional Self-Attention Generative Adversarial Network
Yuhang Li, Xuejin Chen, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
70 |
6 years ago |
| 112 |
Heterogeneous Domain Adaptation via Soft Transfer Network
Yuan Yao, Yu Zhang, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
69 |
6 years ago |
| 113 |
Bidirectional Long-Short Term Memory for Video Description
Yi Bin, Yang Yang, ... (+4 more)
|
👻
Ghosted
|
cs.MM
|
67 |
10 years ago |
| 114 |
Guided Attention Network for Object Detection and Counting on Drones
Yuanqiang Cai, Dawei Du, ... (+5 more)
|
👻
Ghosted
|
cs.CV
|
67 |
6 years ago |
| 115 |
Knowledge-guided Pairwise Reconstruction Network for Weakly Supervised Referring Expression Grounding
Xuejing Liu, Liang Li, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
66 |
6 years ago |
| 116 |
Selective Deep Convolutional Features for Image Retrieval
Tuan Hoang, Thanh-Toan Do, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
65 |
8 years ago |
| 117 |
A Single-Shot Arbitrarily-Shaped Text Detector based on Context Attended Multi-Task Learning
Pengfei Wang, Chengquan Zhang, ... (+7 more)
|
👻
Ghosted
|
cs.CV
|
65 |
6 years ago |
| 118 |
Personalized Hashtag Recommendation for Micro-videos
Yinwei Wei, Zhiyong Cheng, ... (+4 more)
|
👻
Ghosted
|
cs.MM
|
65 |
6 years ago |
| 119 |
DAE-Talker: High Fidelity Speech-Driven Talking Face Generation with Diffusion Autoencoder
Chenpeng Du, Qi Chen, ... (+6 more)
|
👻
Ghosted
|
cs.CV
|
65 |
3 years ago |
| 120 |
The MuSe 2022 Multimodal Sentiment Analysis Challenge: Humor, Emotional Reactions, and Stress
Lukas Christ, Shahin Amiriparian, ... (+10 more)
|
👻
Ghosted
|
cs.LG
|
64 |
3 years ago |
| 121 |
Deep Multimodal Image-Repurposing Detection
Ekraam Sabir, Wael AbdAlmageed, ... (+2 more)
|
👻
Ghosted
|
cs.MM
|
62 |
7 years ago |
| 122 |
Describe What to Change: A Text-guided Unsupervised Image-to-Image Translation Approach
Yahui Liu, Marco De Nadai, ... (+5 more)
|
👻
Ghosted
|
cs.CV
|
62 |
5 years ago |
| 123 |
Auto-captions on GIF: A Large-scale Video-sentence Dataset for Vision-language Pre-training
Yingwei Pan, Yehao Li, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
61 |
5 years ago |
| 124 |
Reinforcement Learning for Weakly Supervised Temporal Grounding of Natural Language in Untrimmed Videos
Jie Wu, Guanbin Li, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
61 |
5 years ago |
| 125 |
DDGHM: Dual Dynamic Graph with Hybrid Metric Training for Cross-Domain Sequential Recommendation
Xiaolin Zheng, Jiajie Su, ... (+2 more)
|
👻
Ghosted
|
cs.IR
|
61 |
3 years ago |
| 126 |
Control3D: Towards Controllable Text-to-3D Generation
Yang Chen, Yingwei Pan, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
61 |
2 years ago |
| 127 |
ControlStyle: Text-Driven Stylized Image Generation Using Diffusion Priors
Jingwen Chen, Yingwei Pan, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
60 |
2 years ago |
| 128 |
Unconstrained Fashion Landmark Detection via Hierarchical Recurrent Transformer Networks
Sijie Yan, Ziwei Liu, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
59 |
8 years ago |
| 129 |
GNAS: A Greedy Neural Architecture Search Method for Multi-Attribute Learning
Siyu Huang, Xi Li, ... (+3 more)
|
👻
Ghosted
|
cs.NE
|
59 |
8 years ago |
| 130 |
Deep Multimodal Speaker Naming
Yongtao Hu, Jimmy Ren, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
57 |
10 years ago |
| 131 |
Weakly Supervised Video Summarization by Hierarchical Reinforcement Learning
Yiyan Chen, Li Tao, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
57 |
6 years ago |
| 132 |
Target-Guided Composed Image Retrieval
Haokun Wen, Xian Zhang, ... (+3 more)
|
👻
Ghosted
|
cs.MM
|
57 |
2 years ago |
| 133 |
Adversarial Bipartite Graph Learning for Video Domain Adaptation
Yadan Luo, Zi Huang, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
56 |
5 years ago |
| 134 |
CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model
Zhen Ye, Wei Xue, ... (+4 more)
|
👻
Ghosted
|
cs.SD
|
56 |
3 years ago |
| 135 |
ChainerCV: a Library for Deep Learning in Computer Vision
Yusuke Niitani, Toru Ogawa, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
54 |
8 years ago |
| 136 |
Counterfactual Reasoning for Out-of-distribution Multimodal Sentiment Analysis
Teng Sun, Wenjie Wang, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
53 |
3 years ago |
| 137 |
Context-Dependent Diffusion Network for Visual Relationship Detection
Zhen Cui, Chunyan Xu, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
52 |
7 years ago |
| 138 |
Who, Where, and What to Wear? Extracting Fashion Knowledge from Social Media
Yunshan Ma, Xun Yang, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
52 |
6 years ago |
| 139 |
Video Imagination from a Single Image with Transformation Generation
Baoyang Chen, Wenmin Wang, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
51 |
9 years ago |
| 140 |
Multimedia Semantic Integrity Assessment Using Joint Embedding Of Images And Text
Ayush Jaiswal, Ekraam Sabir, ... (+2 more)
|
👻
Ghosted
|
cs.MM
|
51 |
8 years ago |
| 141 |
FakingRecipe: Detecting Fake News on Short Video Platforms from the Perspective of Creative Process
Yuyan Bu, Qiang Sheng, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
51 |
1 year ago |
| 142 |
Deeply-Supervised Recurrent Convolutional Neural Network for Saliency Detection
Youbao Tang, Xiangqian Wu, Wei Bu
|
👻
Ghosted
|
cs.CV
|
50 |
9 years ago |
| 143 |
Outfit Compatibility Prediction and Diagnosis with Multi-Layered Comparison Network
Xin Wang, Bo Wu, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
50 |
6 years ago |
| 144 |
Learning a Target Sample Re-Generator for Cross-Database Micro-Expression Recognition
Yuan Zong, Xiaohua Huang, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
49 |
8 years ago |
| 145 |
Fine-grained Discriminative Localization via Saliency-guided Faster R-CNN
Xiangteng He, Yuxin Peng, Junjie Zhao
|
👻
Ghosted
|
cs.CV
|
49 |
8 years ago |
| 146 |
Amora: Black-box Adversarial Morphing Attack
Run Wang, Felix Juefei-Xu, ... (+5 more)
|
👻
Ghosted
|
cs.CV
|
48 |
6 years ago |
| 147 |
Salvage Reusable Samples from Noisy Data for Robust Learning
Zeren Sun, Xian-Sheng Hua, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
48 |
5 years ago |
| 148 |
Temporally Guided Music-to-Body-Movement Generation
Hsuan-Kai Kao, Li Su
|
👻
Ghosted
|
cs.MM
|
48 |
5 years ago |
| 149 |
Leveraging Contextual Cues for Generating Basketball Highlights
Vinay Bettadapura, Caroline Pantofaru, Irfan Essa
|
👻
Ghosted
|
cs.MM
|
47 |
9 years ago |
| 150 |
Attention Transfer from Web Images for Video Recognition
Junnan Li, Yongkang Wong, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
47 |
8 years ago |