Deep learning methods in speaker recognition: a review
November 14, 2019 Β· The Cartographer Β· π Periodica Polytechnica Electrical Engineering and Computer Science
"No code URL or promise found in abstract"
"Title-pattern auto-detect: Deep learning methods in speaker recognition: a review"
Evidence collected by the PWNC Scanner
Authors
DΓ‘vid SztahΓ³, GyΓΆrgy SzaszΓ‘k, AndrΓ‘s Beke
arXiv ID
1911.06615
Category
eess.AS: Audio & Speech
Cross-listed
cs.LG,
cs.SD,
stat.ML
Citations
53
Venue
Periodica Polytechnica Electrical Engineering and Computer Science
Last Checked
9 days ago
Abstract
This paper summarizes the applied deep learning practices in the field of speaker recognition, both verification and identification. Speaker recognition has been a widely used field topic of speech technology. Many research works have been carried out and little progress has been achieved in the past 5-6 years. However, as deep learning techniques do advance in most machine learning fields, the former state-of-the-art methods are getting replaced by them in speaker recognition too. It seems that DL becomes the now state-of-the-art solution for both speaker verification and identification. The standard x-vectors, additional to i-vectors, are used as baseline in most of the novel works. The increasing amount of gathered data opens up the territory to DL, where they are the most effective.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β Audio & Speech
R.I.P.
π»
Ghosted
R.I.P.
π»
Ghosted
LPCNet: Improving Neural Speech Synthesis Through Linear Prediction
R.I.P.
π»
Ghosted
VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking
R.I.P.
π»
Ghosted
TERA: Self-Supervised Learning of Transformer Encoder Representation for Speech
R.I.P.
π»
Ghosted
Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders
R.I.P.
π»
Ghosted