Beruflich Dokumente
Kultur Dokumente
Abstract
In this Paper, we present reconstruction of 3DMMs using DEMAP (detection and mapping)
algorithm that are strong priors on 3D face, shape and texture that could be leveraged in fitting
algorithms to accurate and robust the reconstructed complete 3D representations of faces from data
sources like in-the-wild 2D images or live input video. The process of reconstruction of 3DMMs
includes two major sections. First robust cascaded regression landmark detection from Random
Cascaded-Regression Copse(R-CR-C) for robust facial Landmark detection in order to generate
3D mesh frame models to convert that mesh frame into morphable model from live camera input.
It loads a face landmark’s mesh frame model trained with RCR-training, detects a face using Haar
cascade classifier and then runs the landmark detection algorithm. Second understanding the
annotated training data for face geometry algorithms that covers a large variety of pose estimation
and face parameters position. It proceeds furthers to mapping of 3D mesh frame with face
(detected from live camera) using landmark mapper localization algorithm of OpenCV or DLB
resulting the reconstruction of 3D morph able model. As a results 3DMM will be generated whose
resolution could vary with system specifications. In contrast to many existing fitting algorithms
our system achieves real time performance.
1. Introduction
A Real Time 3D Face Reconstruction
Modelling and synthesizing faces is an active research field in computer graphics and
computer vision. Characters modelling and animation techniques are instrumental in
increasing the level of realism in modern real time applications.
Many applications of virtual environments (VE) require the modelling of human-like
characters with high visual and physical accuracy. Many attempts have been made to
model, animate, and interact with human-like characters in VE. Modelling of these
characters, however, can become a non-trivial task when it comes to dealing with real-
time constraints [1][3][4].
The 3D modelling is used in a many industries example: films, animation and
gaming. A large number of software are being used in modelling a digital view of
character and objects models before their manufacturing. These kinds of software not
only generate the different part of an object but also assemble their functionality with
respect to their specifications.
Our main objective in this paper is to generate a human head model by assembling
the proposed features information via camera. The features of modelled character will be
as similar as in real.
Our approach is basically reconstruction of 3D Face Model using 2D video stream
that will be fix from the camera. Reconstruction of 3D face model from a 2D video is an
open problem in the field of computer vision. As requirement for 3D face reconstruction
is still growing, however, an accurate reconstruction of a 3D face model from 2D face
images still remains challenging due to training algorithm complexity [2][3] .
The proposed method of our 3D face reconstruction, belongs to the type of face
reproducing, in which, given a 2D input from live camera, its corresponding 3D face
version is generated. The reconstruction process of 3D facial models is a wide topic that
is considered hot area within the research community. This is an example of 3D face
frontalization, where approaches of Computer Vision and Computer Graphics are used to
solve a problem related to reconstruction of faces.
For the face reconstructing, many methods of reconstruction and 3D facial animation
have been proposed. The approaches considered in our approach is. i) based on the
location of facial landmarks, where we used to do mapping between the 2D input face
and the face model obtained used in the training stage, and ii) application of face
frontalization algorithm via face detection algorithm of open Cv.
2. Related Work
Research Many researchers have approached lot of techniques with variation results with
different prospectus. The work that we highlighted here, has multiple domain [5].Our
main concern is 3D face reconstruction with input obtained through live camera. But
many existing systems have been developed like face modeling from a base shape, face
deforming, face texturing and use of Gaussian filters to improve face modeling
techniques and to produce more like real face [5][6] .
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS
The work we have proposed is not very much new since it has work already done
from various perspectives yet the solution carries a different technique that is far cheaper
and easy to access as compared to existing ones like a research has covered face modeling
via Infrared Scan based face detection that carries 6 monochromatic cameras which is an
expansive set up. More ever use of 3D morph able model to map input face on an output
with a predefined set of landmarks provides far better and accurate results than most of
systems [7] .
3. Proposed Approach
Section A
A face shape that is an area used to detect landmark on is represented by the coordinates
of P landmarks
s = [x1, y1, · · ·, xP, yP ]T while s is set of coordinates of landmark. Given a facial image I
and an initial face shape estimate s0, the aim of CR- based approach is to find a shape
updater U:
two outer mouth corners and the mean of two outer eye corners, or the maximum of these
two distances. Here S f is calculated by updating face shape directly, but this attempt is not
very accurate after the first regressor’s estimated data, due to the rough initial shape
estimate from the face bounding box. However, the estimate becomes more accurate as
the current shape gets closer to the defined deep value and data of 3 regressors have been
obtained.
Technical view of this training can be seen in following diagram.
For instance, when we set Sf to the pupil distance and pick the shrinking parameter K =
2 for a 5-deep (D-deep: number of week regressor) CR copse, the patch size of area to put
landmark on decreases from half size of the inter landmark distance for the 1st regressor
to a quarter for the last one. Here we resize these patches to a fixed size (25 × 25 in our
case) and then extract local features.
The shape initialization and training data rescaling for landmark detection were
performed in the same way as mentioned above. The initial shape estimate was obtained
by putting the mean shape at the centre of the detected face bounding box. The training
data was increased in size by randomly fazing the initially recorded shape estimates. The
parameters of R-CR-C were adjust by cross validation where the width is set to 3 depth of
facial landmark is set to 5 and the regularization weight in order to provide additional
information for landmark accurate detection.
Section B:
Face detection has been improved in terms of speed with the application of haar-features
whose detailed data training supported by open CV’s algorithm haar Cascaded.
Implementations of this framework, such as OpenCV, provide different face classifiers
created by authors that used different datasets into their training. The performance and
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS
reliability of these classifiers varies a lot evaluating the performance of some classifiers
and also tested their accuracy. This paper focuses on evaluating facial classifiers
regarding facial features contained in the found face.
In this paper we used Haar cascaded algorithm a complete build in package integrated
with open CV. Simple method of using .xml file that contains coded haar like feature
based face detection can be represented:
("facedetector,f",po::value<fs::path>(&facedetector)-
>required()>default_value("../share/haarcascade_frontalfac alt2.xml"),
(haarcascade_frontalface_alt2.xml)")
V=¯V+Xtexαtex (4)
Where A are the 3D shape coordinates computed as the linear combination of the mean
shape ¯A the shape basis Xid and the expression basis X exp, while V is the texture that is
the linear combination of the mean texture ¯V and the texture basis X tex. The coefficients
{αid, αexp , αtex} defines a unique 3D face.
Let p={m, αid, αexp , αtex} denotes the 3DMM coefficients. The target of the
reconstruction module R is to estimate p=R(x), given an input image x. Since the intent is
for R to also be trainable with the rest of the framework, we use a CNN model based on
CASIA-Net [3] for this regression task. We apply z -score normalization to each
dimension of the parameters before training. A weighted parameter distance cost similar
to [4] is used:
Where W is the importance matrix whose diagonal is the weight of each parameter.
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS
Tool obtained Resolution 649x480(via VGA sensors) Based on video resolution captured
from the camera
processing
rate
Landmarks 28 With the Over all 22 View Based,
considered help of 1257 point Modular
Fiducials 160 points Eigenspace
3D points Semi methods of
have been automatic tracking eyes,
selected deature head, ,nose
which acta point
as control extraction
points to Snake
wrap contouring
cyber method by
ware scan Kass
mesh of
head
Face DDMs(Discrete Cyberware FreeForm Features FEM(finite
geometry Deformable scan of Deformatio tracking by elements
model face n kalman method) mesh
filter
LMTC(loca
l maximum
curvature
tracing)
algorithm
Resulted Functional face 3D 3D 3D Noval FACE like
Dimension model poor avatar model
resolution
frontalization
Input type Texture map Video of 2D pictures Video frame 2D recoded
via relaxation Live video
method actor’s
face
A Real Time 3D Face Reconstruction
Deformable polygonal 3D
face models with changing
texture [4]
FEM(finite elements
A Real Time 3D Face Reconstruction
5. Conclusion
The 3D Morphable models of the human face have prepared myriad of application in computer
graphics and computational photography. However, the acquisition, registration, modelling and
reconstruction of morphable models are currently point of interest that demands major efforts.
The 3DMMs have been used in face recognition and also introduces in face analysis, face
texture tracking, landmark and facial parameters and pose estimation based on 3D face data.
However, due to the type, amount and complexity of training data, as well as the landmark
mapping fine detailing bases, the representation power of 3DMM can be limited. In this Paper, we
present reconstruction of 3DMMs using DEMAP (detection and mapping) algorithm that are
strong priors on 3D face, shape and texture that could be leveraged in fitting algorithms to accurate
and robust the reconstructed complete 3D representations of faces from data sources like in-the-
wild 2D images or live input video.
References
[1] D. Yi, Z. Lei, S. Liao, and S. Z. Li. Learning face representation from scratch. arXiv
preprint:1411.7923, 2014.
[2]. X. Zhu, Z. Lei, X. Liu, H. Shi, and S. Z. Li. Face alignment across large poses: A 3D
solution. In CVPR, 2016.
[3]. Lee, Y., Terzopoulos, D., & Waters, K. (1995, September) Realistic modeling for facial
animation In Proceedings of the 22nd annual conference on Computer graphics and interactive
techniques (pp. 5562). ACM.
[4]. Prag, A. J. N. W. (1997). Making faces
[5]. Lee, W. S., & MagnenatThalmann, N. (2000) Fast head modeling for animation Image
and Vision Computing, 18(4), 355364.
[6]. Wei, X., Zhu, Z., Yin, L., & Ji, Q. (2004, July). A real time face tracking and animation
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS
system In Computer Vision and Pattern Recognition Workshop, 2004. CVPRW'04. Conference on
(pp. 7171). IEEE.
[7]. Essa, I., Basu, S., Darrell, T., & Pentland, A. (1996, June). Modeling, tracking and
interactive animation of faces and heads using input from video. In Computer Animation'96.
Proceedings (pp. 6879). IEEE
[8]. Chen Cao* YanlinWeng* Stephen Lin* Kun Zhou* at State Key Lab of CAD&CG,
Zhejiang University † 3D Shape Regression for Realtime Facial Animation Microsoft Research
Asia.
[9]. Patrik Huber1, Guosheng Hu2, Rafael Tena1_, Pouria Mortazavian3, Willem P.
Koppen1, William Christmas1, Matthias R¨atsch4 and Josef Kittler1 (2015) A Multi resolution
3D Morphable Face Model and Fitting Framework.
[10]. Rafael Padilla, Marly Costa, cicero Evaluation of Haar Cascade Classifiers for Face
Detection Conference paper April 2012.
[11]. ZhenHua Feng, Patrik Huber, Josef Kittler, Life Member, IEEE, William Christmas,
Xion Random CascadedRegression Copse for Robust Facial Landmark Detection.
[12]. Xi Yiny, Xiang Yuz, Kihyuk Sohnz, Xiaoming Liuy and Manmohan Chandraker
Towards LargePose Face Frontalization in the Wild Michigan State University University of
California, San Diego NEC Laboratories America.
[13]. Jesús P. MenaChalco, Luiz Velho, Roberto M. Cesar Junior 3D human face
reconstruction using principal components spaces 1 Institute of Mathematics and Statistics,
University of São Paulo, São Paulo – Brazil 2 National Institute of Pure and Applied
Mathematics, Rio de Janeiro – Braziljmena@vision.ime.usp.br,lvelho@impa.br,
cesar@ime.usp.br [2016].
[14]. Sandipan Banerjee, Joel Brogan, Janez Kriˇzaj, Aparna Bharati, ‘‘To Frontalize or Not To
Frontalize: Do We Really Need Elaborate Preprocessing To Improve Face Recognition?’’Dept. of
Computer Science & Engineering, University of Notre Dame, USA
[15]. Aaron S. Jackson, Adrian, Vasileios ,Georgios Tzimiropoulos1 ‘‘Large Pose 3D Face
Reconstruction from a Single Image via Direct Volumetric CNN Regression’’ 1 The University of
Nottingham, UK 2 Kingston University, UK
[16]. Elad Richardson, Matan Sela, Roy OrEl, Ron Kimme, Learning Detailed Face
Reconstruction from a Single Image, Department of Computer Science and Engineering,
University of Washington.
[17]. Christos, Yannis, Stefanos , Maja , Robust Statistical Face Frontalization ICVV paper.
A Real Time 3D Face Reconstruction