Generating cartoon images from face photos with cycle-consistent adversarial networks

Tao Zhang, Zhanjie Zhang, Wenjing Jia, Xiangjian He, Jie Yang

Research output: Journal PublicationArticlepeer-review

2 Citations (Scopus)

Abstract

The generative adversarial network (GAN) is first proposed in 2014, and this kind of network model is machine learning systems that can learn to measure a given distribution of data, one of the most important applications is style transfer. Style transfer is a class of vision and graphics problems where the goal is to learn the mapping between an input image and an output image. CYCLE-GAN is a classic GAN model, which has a wide range of scenarios in style transfer. Considering its unsupervised learning characteristics, the mapping is easy to be learned between an input image and an output image. However, it is difficult forCYCLE-GANto converge and generate high-quality images. In order to solve this problem, spectral normalization is introduced into each convolutional kernel of the discriminator. Every convolutional kernel reaches Lipschitz stability constraint with adding spectral normalization and the value of the convolutional kernel is limited to [0, 1], which promotes the training process of the proposed model. Besides, we use pretrained model (VGG16) to control the loss of image content in the position of l1 regularization. To avoid overfitting, l1 regularization term and l2 regularization term are both used in the object loss function. In terms of Frechet Inception Distance (FID) score evaluation, our proposed model achieves outstanding performance and preserves more discriminative features. Experimental results show that the proposed model converges faster and achieves better FID scores than the state of the art.

Original languageEnglish
Pages (from-to)2733-2747
Number of pages15
JournalComputers, Materials and Continua
Volume69
Issue number2
DOIs
Publication statusPublished - 2021
Externally publishedYes

Keywords

  • Frechet inception distance
  • Generative adversarial network
  • L1 regularization term
  • L2 regularization term
  • Lipschitz stability constraint
  • Spectral normalization
  • VGG16

ASJC Scopus subject areas

  • Biomaterials
  • Modelling and Simulation
  • Mechanics of Materials
  • Computer Science Applications
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Generating cartoon images from face photos with cycle-consistent adversarial networks'. Together they form a unique fingerprint.

Cite this