Font Style Transfer, with GAN & Unet

Introduction

This is a project that trains GAN-based model with alphabet characters, and generates various character images through interpolation. This model is an application and extension of the pix2pix model to English characters and words.

Model Structure

The model structure is GAN, which consists of Generator and Discriminator. It is based off of pix2pix with the addition of category embedding in Generator.

Generator gets Noto-Sans font type image for input. It works based on U-net which has Encoder and Decoder inside. Generator keep making better generated image during evaluated by Discriminator.

This U-net Architecture is details of Generator. Encoder(Left side) convert input image to encoded input vector. It extracts features of image, and then The Target Style Vector is concatenated with the encoded input vector. This pair-vectors are decoded by Decoder(Right side).

Discriminator gets Real target style image and Fake image for input, and calculate the probability of them to be seen as the Real target style image. At the same time, it also predicts the category of the font type. Using each Real/Fake Loss and Category Loss allows the discriminator to work in the better direction.

Results

How to render

Dependency

This work was trained with Tensorflow 2.5.0, CUDA 11.0 python 3.8.10.

requirements:

Python 3.7
CUDA / cudnn
Tensorflow 2.5.0
selenium / webdriver-manager
Pillow / opencv
scipy
skimage
imageio

pip install tensorflow-2.5.0 selenium webdriver-manager Pillow scipy opencv-python skimage imageio

Data set

We collected font data from Google fonts.
train datasets = 762 fonts (sans, serif, display) X 62 characters (small letters 26/capital letters 26/numbers 10)

Get datasets

Download package
Run data.py

python data.py

if you run data.py successfully, you can see the structure of folders as below:

font-style-transfer
├─ data_modules
├─ eval_modules
├─ pickledata      # Final output of data.py code
├─ resized_img
├─ train_set
├─ ttf_zips
├─ ttf2png
└─ ttfs

Training

python train.py \
--data_dir C:/fst/pickledata \
--embedding_name embedding.pickle \
--train_name train.pickle

Training arguments

--data_dir: folder path to pickledata folder
--embedding_name: name of embedding pickle
--train_name: name of train dataset pickle

Evaluation

If you want to evaluate generated images and use this evaluation code, prepare 256*128 sized evaluation dataset as below

When you get ready for evaluation, run eval.py

python eval.py \
--img_path IMG_PATH \
--weight_path WEIGHT_PATH

Evaluation arguments

--img_path: folder path to paired-concatenated image of ground truth images and generated images(256X128)
--weight_path: folder path to pretrained inception classifier weight(62 classes)

Package directory

data/      # Data preprocessing : Web crawling -> image(.png) -> pickle(.pkl)
├─ crawling.py     # make font list from google font & download ttf files
├─ ttf2png.py      # derive image files(png) from ttf file  
├─ preprocess.py   # resize images and concat base font & train font
└─ package.py      # transfer file type & split train/valid set 

eval/      # Evaluate the generated font
├─ eval_prdc.py    # Evaluate by prdc on generated images
├─ eval_IS.py      # Evaluate by IS on generated images
├─ eval_fid.py     # Evaluate by fid on generated images
└─ eval_ssim.py    # Evaluate by ssim on generated images

model/     # GAN : Generator(U-net), Discriminator
├─ load_data.py    # Load data 
├─ model.py        # Encoder, Decoder, Generator, Discrminator
├─ train.py        # Trainer, Save Model & logs
└─ utils.py        # Data preprocessing functions

Acknowledgements

Code derived and rehashed from ;

License

Apache 2.0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Font Style Transfer, with GAN & Unet

Introduction

Model Structure

Results

How to render

Dependency

Data set

Get datasets

Training

Training arguments

Evaluation

Evaluation arguments

Package directory

Acknowledgements

License

About

Releases

Packages

Contributors 3

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
data_modules		data_modules
image		image
README.md		README.md
load_data.py		load_data.py
model.py		model.py
results.py		results.py
train.py		train.py
utils.py		utils.py

cgvvxx/Font_Style_Transfer

Folders and files

Latest commit

History

Repository files navigation

Font Style Transfer, with GAN & Unet

Introduction

Model Structure

Results

How to render

Dependency

Data set

Get datasets

Training

Training arguments

Evaluation

Evaluation arguments

Package directory

Acknowledgements

License

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages