Caffe con Troll: Shallow Ideas to Speed Up Deep Learning

April 16, 2015 ยท Declared Dead ยท ๐Ÿ› DanaC@SIGMOD

๐Ÿ‘ป CAUSE OF DEATH: Ghosted
No code link whatsoever

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Stefan Hadjis, Firas Abuzaid, Ce Zhang, Christopher Rรฉ arXiv ID 1504.04343 Category cs.LG: Machine Learning Cross-listed cs.CV, stat.ML Citations 73 Venue DanaC@SIGMOD Last Checked 3 months ago
Abstract
We present Caffe con Troll (CcT), a fully compatible end-to-end version of the popular framework Caffe with rebuilt internals. We built CcT to examine the performance characteristics of training and deploying general-purpose convolutional neural networks across different hardware architectures. We find that, by employing standard batching optimizations for CPU training, we achieve a 4.5x throughput improvement over Caffe on popular networks like CaffeNet. Moreover, with these improvements, the end-to-end training time for CNNs is directly proportional to the FLOPS delivered by the CPU, which enables us to efficiently train hybrid CPU-GPU systems for CNNs.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” Machine Learning

Died the same way โ€” ๐Ÿ‘ป Ghosted