On Learning 3D Face Morphable Model from In-the-wild Images

August 28, 2018 ยท Entered Twilight ยท ๐Ÿ› IEEE Transactions on Pattern Analysis and Machine Intelligence

๐ŸŒ… TWILIGHT: Old Age
Predates the code-sharing era โ€” a pioneer of its time

"Last commit was 5.0 years ago (โ‰ฅ5 year threshold)"

Evidence collected by the PWNC Scanner

Repo contents: .gitignore, LICENSE, README.md, TF_newop, _3dmm_utils.py, config.py, expected_rendered_img_without_bg.png, images, main_non_linear_3DMM.py, mean_exp_para.npy, mean_m.npy, mean_shape.npy, model_non_linear_3DMM.py, model_non_linear_3DMM_proxy.py, ops.py, rendering_example.py, rendering_example_dev.py, rendering_ops.py, sample_data.npz, std_exp_para.npy, std_m.npy, std_shape.npy, utils.py

Authors Luan Tran, Xiaoming Liu arXiv ID 1808.09560 Category cs.CV: Computer Vision Citations 164 Venue IEEE Transactions on Pattern Analysis and Machine Intelligence Repository https://github.com/tranluan/Nonlinear_Face_3DMM โญ 678 Last Checked 1 month ago
Abstract
As a classic statistical model of 3D facial shape and albedo, 3D Morphable Model (3DMM) is widely used in facial analysis, e.g., model fitting, image synthesis. Conventional 3DMM is learned from a set of 3D face scans with associated well-controlled 2D face images, and represented by two sets of PCA basis functions. Due to the type and amount of training data, as well as, the linear bases, the representation power of 3DMM can be limited. To address these problems, this paper proposes an innovative framework to learn a nonlinear 3DMM model from a large set of in-the-wild face images, without collecting 3D face scans. Specifically, given a face image as input, a network encoder estimates the projection, lighting, shape and albedo parameters. Two decoders serve as the nonlinear 3DMM to map from the shape and albedo parameters to the 3D shape and albedo, respectively. With the projection parameter, lighting, 3D shape, and albedo, a novel analytically-differentiable rendering layer is designed to reconstruct the original input face. The entire network is end-to-end trainable with only weak supervision. We demonstrate the superior representation power of our nonlinear 3DMM over its linear counterpart, and its contribution to face alignment, 3D reconstruction, and face editing.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” Computer Vision