A Survey of Recent DNN Architectures on the TIMIT Phone Recognition Task
June 19, 2018 ยท The Cartographer ยท ๐ International Conference on Text, Speech and Dialogue
"No code URL or promise found in abstract"
"Title-pattern auto-detect: A Survey of Recent DNN Architectures on the TIMIT Phone Recognition Task"
Evidence collected by the PWNC Scanner
Authors
Josef Michalek, Jan Vanek
arXiv ID
1806.07974
Category
cs.CL: Computation & Language
Cross-listed
cs.HC
Citations
16
Venue
International Conference on Text, Speech and Dialogue
Last Checked
10 days ago
Abstract
In this survey paper, we have evaluated several recent deep neural network (DNN) architectures on a TIMIT phone recognition task. We chose the TIMIT corpus due to its popularity and broad availability in the community. It also simulates a low-resource scenario that is helpful in minor languages. Also, we prefer the phone recognition task because it is much more sensitive to an acoustic model quality than a large vocabulary continuous speech recognition (LVCSR) task. In recent years, many DNN published papers reported results on TIMIT. However, the reported phone error rates (PERs) were often much higher than a PER of a simple feed-forward (FF) DNN. That was the main motivation of this paper: To provide a baseline DNNs with open-source scripts to easily replicate the baseline results for future papers with lowest possible PERs. According to our knowledge, the best-achieved PER of this survey is better than the best-published PER to date.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Computation & Language
๐
๐
Old Age
๐
๐
Old Age
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
๐
๐
Old Age
XLNet: Generalized Autoregressive Pretraining for Language Understanding
๐ฎ
๐ฎ
The Ethereal
Effective Approaches to Attention-based Neural Machine Translation
๐
๐
Old Age
A large annotated corpus for learning natural language inference
๐
๐
Old Age