| 451 |
Sign language segmentation with temporal convolutional networks
Katrin Renz, Nicolaj C. Stache, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
33 |
5 years ago |
| 452 |
Adaptive Bi-directional Attention: Exploring Multi-Granularity Representations for Machine Reading Comprehension
Nuo Chen, Fenglin Liu, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
33 |
5 years ago |
| 453 |
REDAT: Accent-Invariant Representation for End-to-End ASR by Domain Adversarial Training with Relabeling
Hu Hu, Xuesong Yang, ... (+7 more)
|
👻
Ghosted
|
eess.AS
|
33 |
5 years ago |
| 454 |
Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer
Suyoun Kim, Yuan Shangguan, ... (+5 more)
|
👻
Ghosted
|
cs.CL
|
33 |
5 years ago |
| 455 |
Using VAEs and Normalizing Flows for One-shot Text-To-Speech Synthesis of Expressive Speech
Vatsal Aggarwal, Marius Cotescu, ... (+3 more)
|
👻
Ghosted
|
cs.LG
|
33 |
6 years ago |
| 456 |
Towards Pose-invariant Lip-Reading
Shiyang Cheng, Pingchuan Ma, ... (+5 more)
|
👻
Ghosted
|
cs.CV
|
33 |
6 years ago |
| 457 |
Semi-supervised and Transfer learning approaches for low resource sentiment classification
Rahul Gupta, Saurabh Sahu, ... (+2 more)
|
👻
Ghosted
|
cs.IR
|
33 |
7 years ago |
| 458 |
MF-AED-AEC: Speech Emotion Recognition by Leveraging Multimodal Fusion, Asr Error Detection, and Asr Error Correction
Jiajun He, Xiaohan Shi, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
33 |
2 years ago |
| 459 |
Stethoscope-guided Supervised Contrastive Learning for Cross-domain Adaptation on Respiratory Sound Classification
June-Woo Kim, Sangmin Bae, ... (+3 more)
|
👻
Ghosted
|
cs.SD
|
33 |
2 years ago |
| 460 |
Music Mixing Style Transfer: A Contrastive Learning Approach to Disentangle Audio Effects
Junghyun Koo, Marco A. Martínez-Ramírez, ... (+4 more)
|
👻
Ghosted
|
eess.AS
|
33 |
3 years ago |
| 461 |
Textless Direct Speech-to-Speech Translation with Discrete Speech Representation
Xinjian Li, Ye Jia, Chung-Cheng Chiu
|
👻
Ghosted
|
cs.CL
|
33 |
3 years ago |
| 462 |
Target Speaker Voice Activity Detection with Transformers and Its Integration with End-to-End Neural Diarization
Dongmei Wang, Xiong Xiao, ... (+3 more)
|
👻
Ghosted
|
eess.AS
|
33 |
3 years ago |
| 463 |
LOGO-Former: Local-Global Spatio-Temporal Transformer for Dynamic Facial Expression Recognition
Fuyan Ma, Bin Sun, Shutao Li
|
👻
Ghosted
|
cs.CV
|
32 |
2 years ago |
| 464 |
CIF-based Collaborative Decoding for End-to-end Contextual Speech Recognition
Minglun Han, Linhao Dong, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
32 |
5 years ago |
| 465 |
Single channel voice separation for unknown number of speakers under reverberant and noisy settings
Shlomo E. Chazan, Lior Wolf, ... (+2 more)
|
👻
Ghosted
|
cs.SD
|
32 |
5 years ago |
| 466 |
An Embarrassingly Simple Model for Dialogue Relation Extraction
Fuzhao Xue, Aixin Sun, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
32 |
5 years ago |
| 467 |
A Simple but Effective BERT Model for Dialog State Tracking on Resource-Limited Systems
Tuan Manh Lai, Quan Hung Tran, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
32 |
6 years ago |
| 468 |
Accurate and Scalable Version Identification Using Musically-Motivated Embeddings
Furkan Yesiler, Joan Serrà, Emilia Gómez
|
👻
Ghosted
|
cs.SD
|
32 |
6 years ago |
| 469 |
What does a network layer hear? Analyzing hidden representations of end-to-end ASR through speech synthesis
Chung-Yi Li, Pei-Chieh Yuan, Hung-Yi Lee
|
👻
Ghosted
|
cs.CL
|
32 |
6 years ago |
| 470 |
Encrypted Speech Recognition using Deep Polynomial Networks
Shi-Xiong Zhang, Yifan Gong, Dong Yu
|
👻
Ghosted
|
cs.CR
|
32 |
6 years ago |
| 471 |
Deep factorization for speech signal
Lantian Li, Dong Wang, ... (+4 more)
|
👻
Ghosted
|
eess.AS
|
32 |
8 years ago |
| 472 |
Why Do Neural Dialog Systems Generate Short and Meaningless Replies? A Comparison between Dialog and Translation
Bolin Wei, Shuai Lu, ... (+5 more)
|
👻
Ghosted
|
cs.CL
|
32 |
8 years ago |
| 473 |
Graph learning under sparsity priors
Hermina Petric Maretic, Dorina Thanou, Pascal Frossard
|
👻
Ghosted
|
cs.LG
|
32 |
8 years ago |
| 474 |
Multi-centrality Graph Spectral Decompositions and their Application to Cyber Intrusion Detection
Pin-Yu Chen, Sutanay Choudhury, Alfred O. Hero
|
👻
Ghosted
|
cs.SI
|
32 |
10 years ago |
| 475 |
VoiceLDM: Text-to-Speech with Environmental Context
Yeonghyeon Lee, Inmo Yeon, ... (+2 more)
|
👻
Ghosted
|
eess.AS
|
32 |
2 years ago |
| 476 |
From English to More Languages: Parameter-Efficient Model Reprogramming for Cross-Lingual Speech Recognition
Chao-Han Huck Yang, Bo Li, ... (+5 more)
|
👻
Ghosted
|
cs.SD
|
32 |
3 years ago |
| 477 |
ShaDocNet: Learning Spatial-Aware Tokens in Transformer for Document Shadow Removal
Xuhang Chen, Xiaodong Cun, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
32 |
3 years ago |
| 478 |
Underwater Image Restoration via Polymorphic Large Kernel CNNs
Xiaojiao Guo, Yihang Dong, ... (+5 more)
|
💀
404 Not Found
|
cs.CV
|
31 |
1 year ago |
| 479 |
Fine-grained Disentangled Representation Learning for Multimodal Emotion Recognition
Haoqin Sun, Shiwan Zhao, ... (+4 more)
|
👻
Ghosted
|
cs.SD
|
31 |
2 years ago |
| 480 |
SyncFusion: Multimodal Onset-synchronized Video-to-Audio Foley Synthesis
Marco Comunità, Riccardo F. Gramaccioni, ... (+4 more)
|
👻
Ghosted
|
cs.SD
|
31 |
2 years ago |
| 481 |
A Further Study of Unsupervised Pre-training for Transformer Based Speech Recognition
Dongwei Jiang, Wubo Li, ... (+6 more)
|
👻
Ghosted
|
eess.AS
|
31 |
5 years ago |
| 482 |
CNN-based Analog CSI Feedback in FDD MIMO-OFDM Systems
Mahdi Boloursaz Mashhadi, Qianqian Yang, Deniz Gunduz
|
👻
Ghosted
|
cs.IT
|
31 |
6 years ago |
| 483 |
Towards Generating Ambisonics Using Audio-Visual Cue for Virtual Reality
Aakanksha Rana, Cagri Ozcinar, Aljoscha Smolic
|
👻
Ghosted
|
cs.SD
|
31 |
6 years ago |
| 484 |
Towards Unsupervised Single-Channel Blind Source Separation using Adversarial Pair Unmix-and-Remix
Yedid Hoshen
|
👻
Ghosted
|
eess.SP
|
31 |
7 years ago |
| 485 |
Dialog Context Language Modeling with Recurrent Neural Networks
Bing Liu, Ian Lane
|
👻
Ghosted
|
cs.CL
|
31 |
9 years ago |
| 486 |
BER Analysis of the box relaxation for BPSK Signal Recovery
Christos Thrampoulidis, Ehsan Abbasi, ... (+2 more)
|
👻
Ghosted
|
cs.IT
|
31 |
10 years ago |
| 487 |
Self-Supervised Learning for Anomalous Sound Detection
Kevin Wilkinghoff
|
👻
Ghosted
|
eess.AS
|
31 |
2 years ago |
| 488 |
Paralinguistics-Enhanced Large Language Modeling of Spoken Dialogue
Guan-Ting Lin, Prashanth Gurunath Shivakumar, ... (+7 more)
|
👻
Ghosted
|
cs.CL
|
31 |
2 years ago |
| 489 |
SEF-VC: Speaker Embedding Free Zero-Shot Voice Conversion with Cross Attention
Junjie Li, Yiwei Guo, ... (+2 more)
|
👻
Ghosted
|
cs.SD
|
31 |
2 years ago |
| 490 |
One model to rule them all ? Towards End-to-End Joint Speaker Diarization and Speech Recognition
Samuele Cornell, Jee-weon Jung, ... (+2 more)
|
👻
Ghosted
|
eess.AS
|
31 |
2 years ago |
| 491 |
Occluded Person Re-Identification via Relational Adaptive Feature Correction Learning
Minjung Kim, MyeongAh Cho, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
31 |
3 years ago |
| 492 |
Effectiveness of Mining Audio and Text Pairs from Public Data for Improving ASR Systems for Low-Resource Languages
Kaushal Santosh Bhogale, Abhigyan Raman, ... (+5 more)
|
👻
Ghosted
|
cs.CL
|
31 |
3 years ago |
| 493 |
Semi-Supervised Speech Recognition via Graph-based Temporal Classification
Niko Moritz, Takaaki Hori, Jonathan Le Roux
|
👻
Ghosted
|
cs.LG
|
30 |
5 years ago |
| 494 |
Graph Attention Networks for Speaker Verification
Jee-weon Jung, Hee-Soo Heo, ... (+2 more)
|
👻
Ghosted
|
eess.AS
|
30 |
5 years ago |
| 495 |
Bit Allocation for Multi-Task Collaborative Intelligence
Saeed Ranjbar Alvar, Ivan V. Bajić
|
👻
Ghosted
|
cs.LG
|
30 |
6 years ago |
| 496 |
Visually Guided Self Supervised Learning of Speech Representations
Abhinav Shukla, Konstantinos Vougioukas, ... (+3 more)
|
👻
Ghosted
|
eess.AS
|
30 |
6 years ago |
| 497 |
Partial AUC optimization based deep speaker embeddings with class-center learning for text-independent speaker verification
Zhongxin Bai, Xiao-Lei Zhang, Jingdong Chen
|
👻
Ghosted
|
cs.LG
|
30 |
6 years ago |
| 498 |
Sampling Strategies for GAN Synthetic Data
Binod Bhattarai, Seungryul Baek, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
30 |
6 years ago |
| 499 |
Semantic query-by-example speech search using visual grounding
Herman Kamper, Aristotelis Anastassiou, Karen Livescu
|
👻
Ghosted
|
cs.CL
|
30 |
7 years ago |
| 500 |
Distributed Deep Learning Strategies For Automatic Speech Recognition
Wei Zhang, Xiaodong Cui, ... (+5 more)
|
👻
Ghosted
|
cs.SD
|
30 |
7 years ago |