GANStrument: Adversarial Instrument Sound Synthesis with Pitch-invariant Instance Conditioning

November 10, 2022 · Entered Twilight · 🏛 IEEE International Conference on Acoustics, Speech, and Signal Processing

"No code URL or promise found in abstract"
"Derived repo from GitHub Pages (backfill)"

Evidence collected by the PWNC Scanner

Repo contents: .gitignore, _config.yml, index.md, media

Authors Gaku Narita, Junichi Shimizu, Taketo Akama arXiv ID 2211.05385 Category cs.SD: Sound Cross-listed cs.LG, eess.AS Citations 14 Venue IEEE International Conference on Acoustics, Speech, and Signal Processing Repository https://github.com/ganstrument/ganstrument-demo ⭐ 9 Last Checked 8 days ago

Abstract

We propose GANStrument, a generative adversarial model for instrument sound synthesis. Given a one-shot sound as input, it is able to generate pitched instrument sounds that reflect the timbre of the input within an interactive time. By exploiting instance conditioning, GANStrument achieves better fidelity and diversity of synthesized sounds and generalization ability to various inputs. In addition, we introduce an adversarial training scheme for a pitch-invariant feature extractor that significantly improves the pitch accuracy and timbre consistency. Experimental results show that GANStrument outperforms strong baselines that do not use instance conditioning in terms of generation quality and input editability. Qualitative examples are available online.