Comparing Time and Frequency Domain for Audio Event Recognition Using Deep Learning

March 18, 2016 · Declared Dead · 🏛 IEEE International Joint Conference on Neural Network

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Lars Hertel, Huy Phan, Alfred Mertins arXiv ID 1603.05824 Category cs.NE: Neural & Evolutionary Cross-listed cs.LG, cs.SD Citations 65 Venue IEEE International Joint Conference on Neural Network Last Checked 3 months ago

Abstract

Recognizing acoustic events is an intricate problem for a machine and an emerging field of research. Deep neural networks achieve convincing results and are currently the state-of-the-art approach for many tasks. One advantage is their implicit feature learning, opposite to an explicit feature extraction of the input signal. In this work, we analyzed whether more discriminative features can be learned from either the time-domain or the frequency-domain representation of the audio signal. For this purpose, we trained multiple deep networks with different architectures on the Freiburg-106 and ESC-10 datasets. Our results show that feature learning from the frequency domain is superior to the time domain. Moreover, additionally using convolution and pooling layers, to explore local structures of the audio signal, significantly improves the recognition performance and achieves state-of-the-art results.

📄 View on arXiv 🌐 View on ar5iv 📑 PDF 🎉 Report Code Found

Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

📜 Similar Papers

In the same crypt — Neural & Evolutionary

R.I.P. 👻 Ghosted

A Style-Based Generator Architecture for Generative Adversarial Networks

Tero Karras, Samuli Laine, Timo Aila

cs.NE 🏛 CVPR 📚 12.3K cites 7 years ago

R.I.P. 👻 Ghosted

Progressive Growing of GANs for Improved Quality, Stability, and Variation

Tero Karras, Timo Aila, ... (+2 more)

cs.NE 🏛 ICLR 📚 8.2K cites 8 years ago

R.I.P. 👻 Ghosted

Learning both Weights and Connections for Efficient Neural Networks

Song Han, Jeff Pool, ... (+2 more)

cs.NE 🏛 NeurIPS 📚 7.4K cites 10 years ago

R.I.P. 👻 Ghosted

LSTM: A Search Space Odyssey

Klaus Greff, Rupesh Kumar Srivastava, ... (+3 more)

cs.NE 🏛 IEEE TNNLS 📚 6.0K cites 11 years ago

R.I.P. 👻 Ghosted

A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks

Dan Hendrycks, Kevin Gimpel

cs.NE 🏛 ICLR 📚 4.0K cites 9 years ago

R.I.P. 👻 Ghosted

An Introduction to Convolutional Neural Networks

Keiron O'Shea, Ryan Nash

cs.NE 🏛 arXiv 📚 3.8K cites 10 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Language Models are Few-Shot Learners

Tom B. Brown, Benjamin Mann, ... (+29 more)

cs.CL 🏛 NeurIPS 📚 54.2K cites 6 years ago

R.I.P. 👻 Ghosted

PyTorch: An Imperative Style, High-Performance Deep Learning Library

Adam Paszke, Sam Gross, ... (+19 more)

cs.LG 🏛 NeurIPS 📚 49.7K cites 6 years ago

R.I.P. 👻 Ghosted

XGBoost: A Scalable Tree Boosting System

Tianqi Chen, Carlos Guestrin

cs.LG 🏛 KDD 📚 49.2K cites 10 years ago

R.I.P. 👻 Ghosted

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Sergey Ioffe, Christian Szegedy

cs.LG 🏛 ICML 📚 46.0K cites 11 years ago