Multi-View Dynamic Facial Action Unit Detection
April 25, 2017 ยท Entered Twilight ยท ๐ Image and Vision Computing
"Last commit was 6.0 years ago (โฅ5 year threshold)"
Evidence collected by the PWNC Scanner
Repo contents: .gitignore, Demo, Demo_OF, LICENSE, README.md, azure-pipelines.yml, bp4d_3splits.pkl, config.py, data_loader.py, generate_data, logger.py, main.py, main.sh, misc, models, requirements.txt, solver.py, split_train_val_test.py, utils.py
Authors
Andres Romero, Juan Leon, Pablo Arbelaez
arXiv ID
1704.07863
Category
cs.CV: Computer Vision
Citations
22
Venue
Image and Vision Computing
Repository
https://github.com/BCV-Uniandes/AUNets
โญ 152
Last Checked
1 month ago
Abstract
We propose a novel convolutional neural network approach to address the fine-grained recognition problem of multi-view dynamic facial action unit detection. We leverage recent gains in large-scale object recognition by formulating the task of predicting the presence or absence of a specific action unit in a still image of a human face as holistic classification. We then explore the design space of our approach by considering both shared and independent representations for separate action units, and also different CNN architectures for combining color and motion information. We then move to the novel setup of the FERA 2017 Challenge, in which we propose a multi-view extension of our approach that operates by first predicting the viewpoint from which the video was taken, and then evaluating an ensemble of action unit detectors that were trained for that specific viewpoint. Our approach is holistic, efficient, and modular, since new action units can be easily included in the overall system. Our approach significantly outperforms the baseline of the FERA 2017 Challenge, with an absolute improvement of 14% on the F1-metric. Additionally, it compares favorably against the winner of the FERA 2017 challenge. Code source is available at https://github.com/BCV-Uniandes/AUNets.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Computer Vision
๐
๐
Old Age
๐
๐
Old Age
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
R.I.P.
๐ป
Ghosted
You Only Look Once: Unified, Real-Time Object Detection
๐
๐
Old Age
SSD: Single Shot MultiBox Detector
๐
๐
Old Age
Squeeze-and-Excitation Networks
R.I.P.
๐ป
Ghosted