3D Face Modeling from Diverse Raw Scan Data

Traditional 3D face models learn a latent representation of faces using linear subspaces from no more than 300 training scans of a single database. The main roadblock of building a large-scale face model from diverse 3D databases lies in the lack of dense correspondence among raw scans. To address these problems, this paper proposes an innovative framework to jointly learn a nonlinear face model from a diverse set of raw 3D scan databases and establish dense point-to-point correspondence among their scans. Specifically, by treating input raw scans as unorganized point clouds, we explore the use of PointNet architectures for converting point clouds to identity and expression feature representations, from which the decoder networks recover their 3D face shapes. Further, we propose a weakly supervised learning approach that does not require correspondence label for the scans. We demonstrate the superior dense correspondence and representation power of our proposed method in shape and expression, and its contribution to single-image 3D face reconstruction.

Figure 1: Overview of our 3D face modeling method. A mixture of synthetic and real data is used to train the encoder-decoder network with supervised (green) and unsupervised (red) loss. Our network can be used for 3D dense correspondence and 3D face reconstruction.

Visualization and Analysis of Latent Space

3DFC implementation in Python and Pytorch may be downloaded from here.

3D Face Modeling from Diverse Raw Scan Data

Visualization and Analysis of Latent Space

3D Face Modeling Source Code

Publications