This study presents a novel text-driven method for robust portrait editing in sparse regions of the latent space. With the development of GANs and the introduction of powerful models such as StyleGAN, text-driven image generation and editing have made great progress in recent years, yet text-guided facial image generation still falls short in certain situations. Our model combines two pretrained models, CLIP2Latent and StyleGAN2, to conduct a preliminary exploration of this task. The latent code of the input portrait is edited and manipulated in the StyleGAN latent space via a CLIP-based text-driven generation module. Promising results are obtained, especially in sparse regions of the generator's latent space and when many attributes are changed simultaneously.
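The core idea above, driving a latent code toward a text description by minimizing a CLIP-style loss, can be sketched as a simple optimization loop. This is a minimal illustration only: `clip_loss`, its gradient, and the latent dimensionality are toy stand-ins, not the paper's actual CLIP2Latent or StyleGAN2 components; real systems backpropagate through CLIP and the generator instead of using an analytic toy gradient.

```python
import numpy as np

# Toy stand-in for CLIP-guided latent editing. In the real pipeline the loss
# measures CLIP similarity between the generated image and the text prompt;
# here a quadratic surrogate keeps the example self-contained and runnable.
rng = np.random.default_rng(0)
w_target = rng.normal(size=512)  # hypothetical latent implied by the text prompt


def clip_loss(w):
    # Placeholder for the CLIP image-text loss (assumption, not the paper's loss).
    return float(np.sum((w - w_target) ** 2))


def clip_loss_grad(w):
    # Analytic gradient of the toy loss; a real system would obtain this by
    # backpropagating through CLIP and the StyleGAN2 generator.
    return 2.0 * (w - w_target)


def edit_latent(w_init, lr=0.05, steps=200):
    """Gradient descent on the latent code: the text-driven module nudges the
    input portrait's latent toward the region matching the description."""
    w = w_init.copy()
    for _ in range(steps):
        w -= lr * clip_loss_grad(w)
    return w


w0 = rng.normal(size=512)  # latent code of the input portrait
w_edited = edit_latent(w0)
```

Because the update is applied directly to the latent code, the same loop can in principle edit several attributes at once, provided the prompt encodes all of them, which is the regime the abstract highlights.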