Robust Audio Event Recognition with 1-Max Pooling Convolutional Neural Networks

April 21, 2016 · Declared Dead · 🏛 Interspeech

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors: Huy Phan, Lars Hertel, Marco Maass, Alfred Mertins
arXiv ID: 1604.06338
Category: cs.NE (Neural & Evolutionary)
Cross-listed: cs.LG, cs.SD
Citations: 128
Venue: Interspeech
Last Checked: 3 months ago

Abstract

We present in this paper a simple yet efficient convolutional neural network (CNN) architecture for robust audio event recognition. In contrast to deep CNN architectures with multiple convolutional and pooling layers topped with multiple fully connected layers, the proposed network consists of only three layers: a convolutional layer, a pooling layer, and a softmax layer. Two further features distinguish it from the deep architectures that have been proposed for the task: varying-size convolutional filters at the convolutional layer and a 1-max pooling scheme at the pooling layer. Intuitively, the network tends to select the most discriminative features from the whole audio signal for recognition. Our proposed CNN not only shows state-of-the-art performance on the standard task of robust audio event recognition but also outperforms other deep architectures by up to 4.5% in recognition accuracy, which is equivalent to a 76.3% relative error reduction.
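The three-layer structure described in the abstract can be illustrated with a minimal NumPy sketch of the forward pass. This is not the authors' implementation: the function names, the random weights, the ReLU activation, and the choice to let each filter span every frequency bin are all assumptions made for illustration, since the abstract specifies only varying-size filters, 1-max pooling, and a softmax layer.

```python
import numpy as np

def conv_1max_features(x, filters, biases):
    """Convolve each filter along time, then keep only its maximum response.

    x:       (n_freq, n_frames) time-frequency input (hypothetical layout)
    filters: list of (n_freq, width) arrays; widths may differ per filter
    biases:  one scalar bias per filter
    """
    feats = []
    for w, b in zip(filters, biases):
        width = w.shape[1]
        n_pos = x.shape[1] - width + 1  # number of valid time positions
        acts = np.array([np.sum(x[:, t:t + width] * w) + b for t in range(n_pos)])
        acts = np.maximum(acts, 0.0)    # ReLU (assumed; not stated in the abstract)
        feats.append(acts.max())        # 1-max pooling: one value per filter
    return np.array(feats)

def softmax(z):
    z = z - z.max()                     # shift for numerical stability
    e = np.exp(z)
    return e / e.sum()

def predict_proba(x, filters, biases, W_out, b_out):
    h = conv_1max_features(x, filters, biases)
    return softmax(W_out @ h + b_out)

# Toy setup with random, untrained weights (illustrative sizes only).
rng = np.random.default_rng(0)
n_freq, n_frames, n_classes = 8, 50, 3
widths = [3, 5, 7]                      # varying filter widths, two filters each
filters = [rng.standard_normal((n_freq, w)) for w in widths for _ in range(2)]
biases = np.zeros(len(filters))
W_out = rng.standard_normal((n_classes, len(filters)))
b_out = np.zeros(n_classes)

x = rng.standard_normal((n_freq, n_frames))
probs = predict_proba(x, filters, biases, W_out, b_out)
```

Because 1-max pooling keeps a single value per filter, the feature vector has one entry per filter regardless of how many frames the input has, as long as the input is at least as long as the widest filter. That is one plausible reading of how the network can draw on "the whole audio signal" without a fixed input length.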



📜 Similar Papers

In the same crypt — Neural & Evolutionary

R.I.P. 👻 Ghosted

A Style-Based Generator Architecture for Generative Adversarial Networks

Tero Karras, Samuli Laine, Timo Aila

cs.NE 🏛 CVPR 📚 12.3K cites 7 years ago

R.I.P. 👻 Ghosted

Progressive Growing of GANs for Improved Quality, Stability, and Variation

Tero Karras, Timo Aila, ... (+2 more)

cs.NE 🏛 ICLR 📚 8.2K cites 8 years ago

R.I.P. 👻 Ghosted

Learning both Weights and Connections for Efficient Neural Networks

Song Han, Jeff Pool, ... (+2 more)

cs.NE 🏛 NeurIPS 📚 7.4K cites 10 years ago

R.I.P. 👻 Ghosted

LSTM: A Search Space Odyssey

Klaus Greff, Rupesh Kumar Srivastava, ... (+3 more)

cs.NE 🏛 IEEE TNNLS 📚 6.0K cites 11 years ago

R.I.P. 👻 Ghosted

A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks

Dan Hendrycks, Kevin Gimpel

cs.NE 🏛 ICLR 📚 4.0K cites 9 years ago

R.I.P. 👻 Ghosted

An Introduction to Convolutional Neural Networks

Keiron O'Shea, Ryan Nash

cs.NE 🏛 arXiv 📚 3.8K cites 10 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Language Models are Few-Shot Learners

Tom B. Brown, Benjamin Mann, ... (+29 more)

cs.CL 🏛 NeurIPS 📚 54.2K cites 6 years ago

R.I.P. 👻 Ghosted

PyTorch: An Imperative Style, High-Performance Deep Learning Library

Adam Paszke, Sam Gross, ... (+19 more)

cs.LG 🏛 NeurIPS 📚 49.7K cites 6 years ago

R.I.P. 👻 Ghosted

XGBoost: A Scalable Tree Boosting System

Tianqi Chen, Carlos Guestrin

cs.LG 🏛 KDD 📚 49.2K cites 10 years ago

R.I.P. 👻 Ghosted

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Sergey Ioffe, Christian Szegedy

cs.LG 🏛 ICML 📚 46.0K cites 11 years ago