| 401 |
AdaFlow: Domain-Adaptive Density Estimator with Application to Anomaly Detection and Unpaired Cross-Domain Translation
Masataka Yamaguchi, Yuma Koizumi, Noboru Harada
|
👻
Ghosted
|
stat.ML
|
39 |
7 years ago |
| 402 |
Improving End-to-end Speech Recognition with Pronunciation-assisted Sub-word Modeling
Hainan Xu, Shuoyang Ding, Shinji Watanabe
|
👻
Ghosted
|
cs.CL
|
39 |
7 years ago |
| 403 |
Achievable Uplink Rates for Massive MIMO with Coarse Quantization
Christopher Mollén, Junil Choi, ... (+2 more)
|
👻
Ghosted
|
cs.IT
|
39 |
9 years ago |
| 404 |
SAMO: Speaker Attractor Multi-Center One-Class Learning for Voice Anti-Spoofing
Siwen Ding, You Zhang, Zhiyao Duan
|
👻
Ghosted
|
eess.AS
|
39 |
3 years ago |
| 405 |
McNet: Fuse Multiple Cues for Multichannel Speech Enhancement
Yujie Yang, Changsheng Quan, Xiaofei Li
|
👻
Ghosted
|
eess.AS
|
39 |
3 years ago |
| 406 |
Adversarial Example Detection by Classification for Deep Speech Recognition
Saeid Samizade, Zheng-Hua Tan, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
38 |
6 years ago |
| 407 |
Bootstrapping single-channel source separation via unsupervised spatial clustering on stereo mixtures
Prem Seetharaman, Gordon Wichern, ... (+2 more)
|
👻
Ghosted
|
cs.SD
|
38 |
7 years ago |
| 408 |
An HMM-based behavior modeling approach for continuous mobile authentication
Aditi Roy, Tzipora Halevi, Nasir Memon
|
👻
Ghosted
|
cs.CR
|
38 |
8 years ago |
| 409 |
End-to-End Speech Recognition Contextualization with Large Language Models
Egor Lakomkin, Chunyang Wu, ... (+4 more)
|
👻
Ghosted
|
eess.AS
|
38 |
2 years ago |
| 410 |
Less Is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging
Rohit Prabhavalkar, Yanzhang He, ... (+5 more)
|
👻
Ghosted
|
cs.CL
|
37 |
5 years ago |
| 411 |
Private Wireless Federated Learning with Anonymous Over-the-Air Computation
Burak Hasircioglu, Deniz Gunduz
|
👻
Ghosted
|
cs.CR
|
37 |
5 years ago |
| 412 |
ST-BERT: Cross-modal Language Model Pre-training For End-to-end Spoken Language Understanding
Minjeong Kim, Gyuwan Kim, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
37 |
5 years ago |
| 413 |
AIPNet: Generative Adversarial Pre-training of Accent-invariant Networks for End-to-end Speech Recognition
Yi-Chen Chen, Zhaojun Yang, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
37 |
6 years ago |
| 414 |
Adversarial Speaker Adaptation
Zhong Meng, Jinyu Li, Yifan Gong
|
👻
Ghosted
|
cs.LG
|
37 |
6 years ago |
| 415 |
On the Transferability of Adversarial Examples Against CNN-Based Image Forensics
Mauro Barni, Kassem Kallas, ... (+2 more)
|
👻
Ghosted
|
cs.CR
|
37 |
7 years ago |
| 416 |
Spherical clustering of users navigating 360° content
Silvia Rossi, Francesca De Simone, ... (+2 more)
|
👻
Ghosted
|
cs.MM
|
37 |
7 years ago |
| 417 |
Solving Inverse Problems with Hybrid Deep Image Priors: the challenge of preventing overfitting
Zhaodong Sun, Thomas Sanchez, ... (+2 more)
|
👻
Ghosted
|
eess.IV
|
36 |
5 years ago |
| 418 |
Unsupervised Style and Content Separation by Minimizing Mutual Information for Speech Synthesis
Ting-Yao Hu, Ashish Shrivastava, ... (+2 more)
|
👻
Ghosted
|
eess.AS
|
36 |
6 years ago |
| 419 |
Spiking neural networks trained with backpropagation for low power neuromorphic implementation of voice activity detection
Flavio Martinelli, Giorgia Dellaferrera, ... (+2 more)
|
👻
Ghosted
|
eess.AS
|
36 |
6 years ago |
| 420 |
Multimodal One-Shot Learning of Speech and Images
Ryan Eloff, Herman A. Engelbrecht, Herman Kamper
|
👻
Ghosted
|
cs.CL
|
36 |
7 years ago |
| 421 |
Deep Learning Based Speech Beamforming
Kaizhi Qian, Yang Zhang, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
36 |
8 years ago |
| 422 |
Spectral feature mapping with mimic loss for robust speech recognition
Deblin Bagchi, Peter Plantinga, ... (+2 more)
|
👻
Ghosted
|
cs.SD
|
36 |
8 years ago |
| 423 |
Relaxed Wasserstein with Applications to GANs
Xin Guo, Johnny Hong, ... (+2 more)
|
👻
Ghosted
|
stat.ML
|
36 |
8 years ago |
| 424 |
Fixed-point optimization of deep neural networks with adaptive step size retraining
Sungho Shin, Yoonho Boo, Wonyong Sung
|
👻
Ghosted
|
cs.LG
|
36 |
9 years ago |
| 425 |
Content-based Representations of audio using Siamese neural networks
Pranay Manocha, Rohan Badlani, ... (+4 more)
|
👻
Ghosted
|
cs.SD
|
36 |
8 years ago |
| 426 |
PQLM -- Multilingual Decentralized Portable Quantum Language Model for Privacy Protection
Shuyue Stella Li, Xiangyu Zhang, ... (+5 more)
|
👻
Ghosted
|
cs.LG
|
36 |
3 years ago |
| 427 |
ByteCover: Cover Song Identification via Multi-Loss Training
Xingjian Du, Zhesong Yu, ... (+3 more)
|
👻
Ghosted
|
cs.SD
|
35 |
5 years ago |
| 428 |
On the Stability of Graph Convolutional Neural Networks under Edge Rewiring
Henry Kenlay, Dorina Thanou, Xiaowen Dong
|
👻
Ghosted
|
cs.LG
|
35 |
5 years ago |
| 429 |
Self-supervised learning for audio-visual speaker diarization
Yifan Ding, Yong Xu, ... (+3 more)
|
👻
Ghosted
|
eess.AS
|
35 |
6 years ago |
| 430 |
End-to-end training of time domain audio separation and recognition
Thilo von Neumann, Keisuke Kinoshita, ... (+5 more)
|
👻
Ghosted
|
eess.AS
|
35 |
6 years ago |
| 431 |
Acoustically Grounded Word Embeddings for Improved Acoustics-to-Word Speech Recognition
Shane Settle, Kartik Audhkhasi, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
35 |
7 years ago |
| 432 |
Intelligent Reflecting Surface for Massive Device Connectivity: Joint Activity Detection and Channel Estimation
Shuhao Xia, Yuanming Shi
|
👻
Ghosted
|
eess.SP
|
35 |
6 years ago |
| 433 |
Active Eavesdropping via Spoofing Relay Attack
Yong Zeng, Rui Zhang
|
👻
Ghosted
|
cs.IT
|
35 |
10 years ago |
| 434 |
Axonal Delay As a Short-Term Memory for Feed Forward Deep Spiking Neural Networks
Pengfei Sun, Longwei Zhu, Dick Botteldooren
|
👻
Ghosted
|
cs.NE
|
35 |
4 years ago |
| 435 |
Phoneme Based Neural Transducer for Large Vocabulary Speech Recognition
Wei Zhou, Simon Berger, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
34 |
5 years ago |
| 436 |
Don't shoot butterfly with rifles: Multi-channel Continuous Speech Separation with Early Exit Transformer
Sanyuan Chen, Yu Wu, ... (+4 more)
|
👻
Ghosted
|
cs.SD
|
34 |
5 years ago |
| 437 |
Overlap Local-SGD: An Algorithmic Approach to Hide Communication Delays in Distributed SGD
Jianyu Wang, Hao Liang, Gauri Joshi
|
👻
Ghosted
|
cs.LG
|
34 |
6 years ago |
| 438 |
Cooperative Learning via Federated Distillation over Fading Channels
Jin-Hyun Ahn, Osvaldo Simeone, Joonhyuk Kang
|
👻
Ghosted
|
eess.SP
|
34 |
6 years ago |
| 439 |
Attentive Modality Hopping Mechanism for Speech Emotion Recognition
Seunghyun Yoon, Subhadeep Dey, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
34 |
6 years ago |
| 440 |
Learning a Representation for Cover Song Identification Using Convolutional Neural Network
Zhesong Yu, Xiaoshuo Xu, ... (+2 more)
|
👻
Ghosted
|
cs.MM
|
34 |
6 years ago |
| 441 |
End-to-End Sound Source Separation Conditioned On Instrument Labels
Olga Slizovskaia, Leo Kim, ... (+2 more)
|
👻
Ghosted
|
cs.SD
|
34 |
7 years ago |
| 442 |
End-to-end Speech Recognition with Adaptive Computation Steps
Mohan Li, Min Liu, Masanori Hattori
|
👻
Ghosted
|
eess.AS
|
34 |
7 years ago |
| 443 |
Scalable Sentiment for Sequence-to-sequence Chatbot Response with Performance Analysis
Chih-Wei Lee, Yau-Shian Wang, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
34 |
8 years ago |
| 444 |
Linguistic unit discovery from multi-modal inputs in unwritten languages: Summary of the "Speaking Rosetta" JSALT 2017 Workshop
Odette Scharenborg, Laurent Besacier, ... (+17 more)
|
👻
Ghosted
|
cs.CL
|
34 |
8 years ago |
| 445 |
Twitter User Geolocation using Deep Multiview Learning
Tien Huu Do, Duc Minh Nguyen, ... (+3 more)
|
👻
Ghosted
|
cs.SI
|
34 |
7 years ago |
| 446 |
Sparse Bayesian Dictionary Learning with a Gaussian Hierarchical Model
Linxiao Yang, Jun Fang, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
34 |
11 years ago |
| 447 |
Random Access for Massive MIMO Systems with Intra-Cell Pilot Contamination
Elisabeth de Carvalho, Emil Bjornson, ... (+2 more)
|
👻
Ghosted
|
cs.IT
|
34 |
10 years ago |
| 448 |
Extending Whisper with prompt tuning to target-speaker ASR
Hao Ma, Zhiyuan Peng, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
34 |
2 years ago |
| 449 |
Vision, Deduction and Alignment: An Empirical Study on Multi-modal Knowledge Graph Alignment
Yangning Li, Jiaoyan Chen, ... (+4 more)
|
👻
Ghosted
|
cs.AI
|
34 |
3 years ago |
| 450 |
A data set providing synthetic and real-world fisheye video sequences
Andrea Eichenseer, André Kaup
|
👻
Ghosted
|
eess.IV
|
34 |
3 years ago |