CSVideoNet: A Real-time End-to-end Learning Framework for High-frame-rate Video Compressive Sensing

December 15, 2016 · Entered Twilight · 🏛 arXiv.org

"Last commit was 5.0 years ago (≥5 year threshold)"

Evidence collected by the PWNC Scanner

Repo contents: 0.25_196_ExtractFrame.sh, GenerateTrainData1ChanVarMix.m, LICENSE, README.md, caffe, extractFeatures_5_25.m, genPhi.m, generateTrainCNN.m, generateValCNN.m, model, phi, store2hdf5.m, store2hdf5Mix.m, util

Authors Kai Xu, Fengbo Ren arXiv ID 1612.05203 Category cs.CV: Computer Vision Cross-listed cs.LG Citations 8 Venue arXiv.org Repository https://github.com/PSCLab-ASU/CSVideoNet ⭐ 22 Last Checked 1 month ago

Abstract

This paper addresses the real-time encoding-decoding problem for high-frame-rate video compressive sensing (CS). Unlike prior works that perform reconstruction using iterative optimization-based approaches, we propose a non-iterative model, named "CSVideoNet". CSVideoNet directly learns the inverse mapping of CS and reconstructs the original input in a single forward propagation. To overcome the limitations of existing CS cameras, we propose a multi-rate CNN and a synthesizing RNN to improve the trade-off between compression ratio (CR) and spatial-temporal resolution of the reconstructed videos. The experiment results demonstrate that CSVideoNet significantly outperforms the state-of-the-art approaches. With no pre/post-processing, we achieve 25dB PSNR recovery quality at 100x CR, with a frame rate of 125 fps on a Titan X GPU. Due to the feedforward and high-data-concurrency natures of CSVideoNet, it can take advantage of GPU acceleration to achieve three orders of magnitude speed-up over conventional iterative-based approaches. We share the source code at https://github.com/PSCLab-ASU/CSVideoNet.