💀 The Wall of Shame

The most cited papers with no code. Sorted by the weight of their sins.

Page 5, showing 50 papers

# Paper Cause of Death Category Citations Published
201 Truly unsupervised acoustic word embeddings using weak top-down constraints in encoder-decoder models
Herman Kamper
👻 Ghosted cs.CL 71 7 years ago
202 Analyzing ASR pretraining for low-resource speech-to-text translation
Mihaela C. Stoian, Sameer Bansal, Sharon Goldwater
👻 Ghosted cs.CL 71 6 years ago
203 Unsupervised Contrastive Learning of Sound Event Representations
Eduardo Fonseca, Diego Ortego, ... (+3 more)
👻 Ghosted cs.SD 71 5 years ago
204 Towards Language-Universal End-to-End Speech Recognition
Suyoun Kim, Michael L. Seltzer
👻 Ghosted cs.CL 70 8 years ago
205 Two-Step Sound Source Separation: Training on Learned Latent Targets
Efthymios Tzinis, Shrikant Venkataramani, ... (+3 more)
👻 Ghosted cs.LG 70 6 years ago
206 FPGA Based Implementation of Deep Neural Networks Using On-chip Memory Only
Jinhwan Park, Wonyong Sung
👻 Ghosted cs.AR 69 10 years ago
207 A comparison of recent waveform generation and acoustic modeling methods for neural-network-based speech synthesis
Xin Wang, Jaime Lorenzo-Trueba, ... (+3 more)
👻 Ghosted eess.AS 69 8 years ago
208 Character-Level Language Modeling with Hierarchical Recurrent Neural Networks
Kyuyeon Hwang, Wonyong Sung
👻 Ghosted cs.LG 68 9 years ago
209 Towards Audio to Scene Image Synthesis using Generative Adversarial Network
Chia-Hung Wan, Shun-Po Chuang, Hung-Yi Lee
👻 Ghosted cs.CL 68 7 years ago
210 Deep Joint Source-Channel Coding for Wireless Image Retrieval
Mikolaj Jankowski, Deniz Gunduz, Krystian Mikolajczyk
👻 Ghosted cs.IT 68 6 years ago
211 Unsupervised Domain Adaptation for Speech Recognition via Uncertainty Driven Self-Training
Sameer Khurana, Niko Moritz, ... (+2 more)
👻 Ghosted cs.CL 68 5 years ago
212 When BERT Meets Quantum Temporal Convolution Learning for Text Classification in Heterogeneous Computing
Chao-Han Huck Yang, Jun Qi, ... (+3 more)
👻 Ghosted cs.CL 68 4 years ago
213 Adversarial Speaker Verification
Zhong Meng, Yong Zhao, ... (+2 more)
👻 Ghosted cs.SD 67 7 years ago
214 Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems
Yinghui Huang, Hong-Kwang Kuo, ... (+6 more)
👻 Ghosted cs.CL 67 5 years ago
215 Improved Mask-CTC for Non-Autoregressive End-to-End ASR
Yosuke Higuchi, Hirofumi Inaguma, ... (+3 more)
👻 Ghosted eess.AS 67 5 years ago
216 Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study
Xuankai Chang, Brian Yan, ... (+15 more)
👻 Ghosted cs.CL 67 2 years ago
217 Invariances and Data Augmentation for Supervised Music Transcription
John Thickstun, Zaid Harchaoui, ... (+2 more)
👻 Ghosted stat.ML 66 8 years ago
218 Muse: Multi-modal target speaker extraction with visual cues
Zexu Pan, Ruijie Tao, ... (+2 more)
👻 Ghosted eess.AS 65 5 years ago
219 Diffusion-based Generative Speech Source Separation
Robin Scheibler, Youna Ji, ... (+4 more)
👻 Ghosted eess.AS 65 3 years ago
220 Fixed-Point Performance Analysis of Recurrent Neural Networks
Sungho Shin, Kyuyeon Hwang, Wonyong Sung
👻 Ghosted cs.LG 64 10 years ago
221 Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments
Giovanni Morrone, Luca Pasa, ... (+4 more)
👻 Ghosted cs.CL 64 7 years ago
222 A Multi-Phase Gammatone Filterbank for Speech Separation via TasNet
David Ditter, Timo Gerkmann
👻 Ghosted eess.AS 64 6 years ago
223 Small-Footprint Keyword Spotting on Raw Audio Data with Sinc-Convolutions
Simon Mittermaier, Ludwig Kürzinger, ... (+2 more)
👻 Ghosted eess.AS 64 6 years ago
224 Character-Level Incremental Speech Recognition with Recurrent Neural Networks
Kyuyeon Hwang, Wonyong Sung
👻 Ghosted cs.CL 63 10 years ago
225 Quaternion Convolutional Neural Networks for Detection and Localization of 3D Sound Events
Danilo Comminiello, Marco Lella, ... (+2 more)
👻 Ghosted eess.AS 63 7 years ago
226 Demystifying TasNet: A Dissecting Approach
Jens Heitkaemper, Darius Jakobeit, ... (+3 more)
👻 Ghosted cs.SD 63 6 years ago
227 FuzzLLM: A Novel and Universal Fuzzing Framework for Proactively Discovering Jailbreak Vulnerabilities in Large Language Models
Dongyu Yao, Jianshu Zhang, ... (+2 more)
👻 Ghosted cs.CR 63 2 years ago
228 High efficiency compression for object detection
Hyomin Choi, Ivan V. Bajic
👻 Ghosted eess.IV 62 8 years ago
229 Generative Adversarial Speaker Embedding Networks for Domain Robust End-to-End Speaker Verification
Gautam Bhattacharya, Joao Monteiro, ... (+2 more)
👻 Ghosted eess.AS 62 7 years ago
230 Effect of data reduction on sequence-to-sequence neural TTS
Javier Latorre, Jakub Lachowicz, ... (+5 more)
👻 Ghosted cs.CL 62 7 years ago
231 Frequency and temporal convolutional attention for text-independent speaker recognition
Sarthak Yadav, Atul Rai
👻 Ghosted cs.SD 62 6 years ago
232 Semi-Supervised Spoken Language Understanding via Self-Supervised Speech and Language Model Pretraining
Cheng-I Lai, Yung-Sung Chuang, ... (+3 more)
👻 Ghosted cs.CL 62 5 years ago
233 VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching
Yiwei Guo, Chenpeng Du, ... (+3 more)
👻 Ghosted eess.AS 62 2 years ago
234 On the Influence of Momentum Acceleration on Online Learning
Kun Yuan, Bicheng Ying, Ali H. Sayed
👻 Ghosted math.OC 61 10 years ago
235 Low-resource expressive text-to-speech using data augmentation
Goeric Huybrechts, Thomas Merritt, ... (+4 more)
👻 Ghosted eess.AS 61 5 years ago
236 Distributed Scheduling using Graph Neural Networks
Zhongyuan Zhao, Gunjan Verma, ... (+3 more)
👻 Ghosted eess.SP 61 5 years ago
237 Grad-StyleSpeech: Any-speaker Adaptive Text-to-Speech Synthesis with Diffusion Models
Minki Kang, Dongchan Min, Sung Ju Hwang
👻 Ghosted eess.AS 61 3 years ago
238 Vision as an Interlingua: Learning Multilingual Semantic Embeddings of Untranscribed Speech
David Harwath, Galen Chuang, James Glass
👻 Ghosted cs.CL 60 8 years ago
239 Speaker-invariant Affective Representation Learning via Adversarial Training
Haoqi Li, Ming Tu, ... (+3 more)
👻 Ghosted eess.AS 60 6 years ago
240 EmoDiff: Intensity Controllable Emotional Text-to-Speech with Soft-Label Guidance
Yiwei Guo, Chenpeng Du, ... (+2 more)
👻 Ghosted eess.AS 60 3 years ago
241 End-to-end contextual speech recognition using class language models and a token passing decoder
Zhehuai Chen, Mahaveer Jain, ... (+3 more)
👻 Ghosted cs.CL 59 7 years ago
242 Efficient Video and Audio processing with Loihi 2
Sumit Bam Shrestha, Jonathan Timcheck, ... (+3 more)
👻 Ghosted cs.NE 59 2 years ago
243 Low-complexity Recurrent Neural Network-based Polar Decoder with Weight Quantization Mechanism
Chieh-Fang Teng, Chen-Hsi Wu, ... (+2 more)
👻 Ghosted eess.SP 58 7 years ago
244 Deep Signal Recovery with One-Bit Quantization
Shahin Khobahi, Naveed Naimipour, ... (+2 more)
👻 Ghosted eess.SP 58 7 years ago
245 C3DVQA: Full-Reference Video Quality Assessment with 3D Convolutional Neural Network
Munan Xu, Junming Chen, ... (+4 more)
👻 Ghosted eess.IV 58 6 years ago
246 FDDWNet: A Lightweight Convolutional Neural Network for Real-time Sementic Segmentation
Jia Liu, Quan Zhou, ... (+4 more)
👻 Ghosted cs.CV 58 6 years ago
247 Re-Translation Strategies For Long Form, Simultaneous, Spoken Language Translation
Naveen Arivazhagan, Colin Cherry, ... (+4 more)
👻 Ghosted cs.CL 58 6 years ago
248 Knowledge Distillation for Improved Accuracy in Spoken Question Answering
Chenyu You, Nuo Chen, Yuexian Zou
👻 Ghosted cs.CL 58 5 years ago
249 Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition
Qiujia Li, David Qiu, ... (+6 more)
👻 Ghosted eess.AS 58 5 years ago
250 Multi-stage Speaker Extraction with Utterance and Frame-Level Reference Signals
Meng Ge, Chenglin Xu, ... (+4 more)
👻 Ghosted eess.AS 58 5 years ago