Text-Guided 3D Face Synthesis - From Generation to Editing

Yunjie Wu

Netease Fuxi AI Lab

Yapeng Meng

Tsinghua University

Zhipeng Hu

Netease Fuxi AI Lab

Lincheng Li

Netease Fuxi AI Lab

Haoqian Wu

Netease Fuxi AI Lab

Kun Zhou

Zhejiang University

Weiwei Xu

Zhejiang University

Xin Yu

University of Queensland

Abstract

Text-guided 3D face synthesis has achieved remarkable results by leveraging text-to-image (T2I) diffusion models. However, most existing works focus solely on the direct generation, ignoring the editing of the faces, restricting them from synthesizing customized 3D faces through iterative editing. In this paper, we propose a unified text-guided framework from face generation to editing. In the generation stage, we propose a geometry-texture decoupled generation to mitigate the loss of geometric details caused by coupling. Besides, decoupling enables us to utilize the generated geometry as a condition for texture generation, yielding highly geometry-texture aligned results. We further employ a fine-tuned texture diffusion model to enhance texture quality in both RGB and YUV space. In the editing stage, we first employ a pre-trained diffusion model to update facial geometry or texture based on the texts. To enable sequential editing, we introduce a UV domain consistency preservation regularization, preventing unintentional changes to irrelevant facial attributes. Besides, we propose a self-guided consistency weight strategy to improve editing efficacy while preserving consistency. Through comprehensive experiments and comparisons with existing methods, we showcase our method's superiority in face synthesis.



Example generated faces

FaceG2E generates high-fidelity facial geometry and texture from diverse captions.


Example single-round edited faces

FaceG2E enables high-fidelity, fine-grained 3D face editing following diverse instructions.


Example sequentially edited faces

FaceG2E enables sequential 3D face editing, resulting in finely-crafted 3D faces with customized details.


Animation or Relighting

Our synthesized faces can be semlessly integrated to existing CG pipeline, enables animation or relighting.

Animation Example

Relighting Example


Comparison with other methods

Project page template is borrowed from DreamFusion.