Handwriting Generation

Requirements

    Python>=3.8,
    gradio>=4.3.0,
    torch>=2.1.1,
    torchvision>=0.16.1,
    torchaudio>=2.1.1,
    lightning[extra]>=2.1.1,
    torchmetrics>=1.2.0,
    einops>=0.7.0,
    neptune>=1.8.3,
    dataclass-wizard>=0.22.2,
    setuptools>=68.2.2,
    h5py>=3.10.0,
    diffusers[torch]>=0.23.0,
    potracer>=0.0.4,
    clean-fid>=0.1.35

Datasets & Pre-processing

Download the IAM Dataset and IAM Online Dataset from https://fki.tic.heia-fr.ch/databases/iam-handwriting-database and https://fki.tic.heia-fr.ch/databases/iam-on-line-handwriting-database, respectively. Place them in the raw_data/IAMDB and raw_data/IAMonDB folders, respectively.

Then, to preprocess the dataset and save it to an H5 file, simply run the following command:

python3 prepare_data.py -c {RNN,Diffusion,LatentDiffusion}

Training from scratch

To train a diffusion model, run the following command:

python3 train.py -c Diffusion

Generate handwriting

To generate handwriting run the following command:

python3 synthesize.py -c LatentDiffusion -t "the quick brown fox jumps" -w 64

Full sampling

WIP

Commands

prepare_data.py

prepare_data.py [-h] -c {RNN,Diffusion,LatentDiffusion} [-cf CONFIG_FILE]

options:
  -h, --help            show this help message and exit
  -c {RNN,Diffusion,LatentDiffusion}, --config {RNN,Diffusion,LatentDiffusion}
                        Type of model
  -cf CONFIG_FILE, --config-file CONFIG_FILE
                        Filename for configs

train.py

train.py [-h] -c {RNN,Diffusion,LatentDiffusion} [-cf CONFIG_FILE] [-r] [-n]

options:
  -h, --help            show this help message and exit
  -c {RNN,Diffusion,LatentDiffusion}, --config {RNN,Diffusion,LatentDiffusion}
                        Type of model
  -cf CONFIG_FILE, --config-file CONFIG_FILE
                        Filename for configs
  -r, --remote          Flag indicating whether the model will be trained on a server with dedicated
                        GPUs, such as the A100
  -n, --neptune         Flag for using NeptuneLogger

synthesize.py

synthesize.py [-h] -c {RNN,Diffusion,LatentDiffusion} [-cf CONFIG_FILE] [-t TEXT] [-w WRITER]
                     [--color COLOR] [-s STYLE_PATH]

options:
  -h, --help            show this help message and exit
  -c {RNN,Diffusion,LatentDiffusion}, --config {RNN,Diffusion,LatentDiffusion}
                        Type of model
  -cf CONFIG_FILE, --config-file CONFIG_FILE
                        Filename for configs
  -t TEXT, --text TEXT  Text to generate
  -w WRITER, --writer WRITER
                        Writer style. If not provided, the default writer is selected randomly
  --color COLOR         Handwriting color. If not provided, the default color is black
  -s STYLE_PATH, --style_path STYLE_PATH
                        Filename for style. If not provided, the default style is selected randomly

full_sample.py

full_sample.py [-h] -c {Diffusion,LatentDiffusion} [-cf CONFIG_FILE] [--strict]

options:
  -h, --help            show this help message and exit
  -c {RNN,Diffusion,LatentDiffusion}, --config {RNN,Diffusion,LatentDiffusion}
                        Type of model
  -cf CONFIG_FILE, --config-file CONFIG_FILE
                        Filename for configs
  --strict              Strict mode for a dataset that excludes OOV words

Name		Name	Last commit message	Last commit date
Latest commit History 210 Commits
assets		assets
configs		configs
data		data
metrics		metrics
models		models
raw_data		raw_data
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
app.py		app.py
full_sample.py		full_sample.py
generate_dataset.py		generate_dataset.py
image_classification.py		image_classification.py
pdm.lock		pdm.lock
pdm.toml		pdm.toml
periodic_checkpoint.py		periodic_checkpoint.py
prepare_data.py		prepare_data.py
pyproject.toml		pyproject.toml
synthesize.py		synthesize.py
train.py		train.py
train_style_classificator.py		train_style_classificator.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Handwriting Generation

Requirements

Datasets & Pre-processing

Training from scratch

Generate handwriting

Full sampling

Commands

prepare_data.py

train.py

synthesize.py

full_sample.py

References

About

Languages

License

c0deplayer/handwriting-generation

Folders and files

Latest commit

History

Repository files navigation

Handwriting Generation

Requirements

Datasets & Pre-processing

Training from scratch

Generate handwriting

Full sampling

Commands

prepare_data.py

train.py

synthesize.py

full_sample.py

References

About

Topics

Resources

License

Stars

Watchers

Forks

Languages