We propose the two-stream convolution and the local autoregressive mask to maintain image consistency without information leakage; a minimal sketch of the mask construction is given below.
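As a reference, here is a minimal sketch of one plausible way to build such a mask: tokens outside the edited region serve as bidirectional context, while tokens inside it are revealed causally. The function name, the sequence ordering, and the `local` indicator are illustrative assumptions, not the exact released implementation.

```python
import torch

def local_autoregressive_mask(local: torch.Tensor) -> torch.Tensor:
    """Attention mask for N tokens: mask[i, j] == True lets i attend to j.

    `local` is a (N,) bool tensor marking tokens in the edited region.
    Global (unedited) tokens are visible everywhere; local tokens are
    only visible causally, so the edited region stays autoregressive
    and no future local information leaks into earlier predictions.
    """
    n = local.shape[0]
    attend_global = ~local.unsqueeze(0).expand(n, -1)        # see all global tokens
    causal = torch.tril(torch.ones(n, n, dtype=torch.bool))  # j <= i
    attend_local = local.unsqueeze(0) & causal               # see past local tokens only
    return attend_global | attend_local

# Example: 6 tokens, positions 2-3 form the edited region.
mask = local_autoregressive_mask(torch.tensor([0, 0, 1, 1, 0, 0], dtype=torch.bool))
```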
- Release the training code.
- Code for pose editing.
The code has only been tested with pytorch==1.3.1.
- Download the pretrained sketch-vqgan, image-vqgan, and transformer.
- For masked training, we provide irregular and segmentation masks (download) with different masking rates. You should define the mask file list before training (see flist_example.txt); a minimal sketch for generating such a list follows.
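The file list is just a text file with one mask path per line; a minimal sketch for producing it (the mask folder layout and the PNG extension are assumptions) is:

```python
import glob

# Hypothetical folder of downloaded irregular/segmentation masks.
mask_paths = sorted(glob.glob('masks/**/*.png', recursive=True))

# One path per line, in the same spirit as flist_example.txt.
with open('mask_flist.txt', 'w') as f:
    f.write('\n'.join(mask_paths))
```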
TS-VQGAN is designed slightly differently from VQGAN, so it can be trained with less GPU memory while keeping the same performance (a minimal block sketch follows the list):
- Attentions are removed.
- InstanceNorm is used instead of GroupNorm.
- Progressive channel widths.
- Fewer parameters for the sketch VQGAN.
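To make these choices concrete, here is a minimal sketch of an attention-free residual block with InstanceNorm and a progressive channel schedule; the widths, activation, and layer layout are illustrative assumptions, not the released architecture.

```python
import torch.nn as nn

class ResBlock(nn.Module):
    """Attention-free residual block using InstanceNorm instead of GroupNorm."""
    def __init__(self, in_ch, out_ch):
        super().__init__()
        self.block = nn.Sequential(
            nn.InstanceNorm2d(in_ch, affine=True), nn.ReLU(inplace=True),
            nn.Conv2d(in_ch, out_ch, 3, padding=1),
            nn.InstanceNorm2d(out_ch, affine=True), nn.ReLU(inplace=True),
            nn.Conv2d(out_ch, out_ch, 3, padding=1),
        )
        self.skip = nn.Conv2d(in_ch, out_ch, 1) if in_ch != out_ch else nn.Identity()

    def forward(self, x):
        return self.block(x) + self.skip(x)

# Progressive widths: channels grow only as resolution shrinks, keeping the
# memory-hungry high-resolution stages narrow (the schedule is an assumption).
widths = [64, 128, 256, 512]
encoder = nn.Sequential(*[
    nn.Sequential(ResBlock(c_in, c_out),
                  nn.Conv2d(c_out, c_out, 4, stride=2, padding=1))  # downsample
    for c_in, c_out in zip([widths[0]] + widths[:-1], widths)
])
```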
- Generate XDoG sketches (a reference sketch of the XDoG operator follows the command):

```bash
python generate_xdog_sketch.py --input_path <img folder> --output_path <output folder>
```
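For reference, XDoG is an extended difference-of-Gaussians edge operator with a soft tanh threshold; a minimal self-contained sketch (the parameter values are illustrative, not necessarily those used by generate_xdog_sketch.py) is:

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def xdog(gray, sigma=0.8, k=1.6, p=20.0, eps=0.01, phi=15.0):
    """XDoG on a grayscale image scaled to [0, 1]: returns a sketch in [0, 1]."""
    g1 = gaussian_filter(gray, sigma)
    g2 = gaussian_filter(gray, sigma * k)
    sharpened = (1 + p) * g1 - p * g2  # DoG-sharpened image
    # Soft threshold: flat regions go white, edges fall off with tanh.
    out = np.where(sharpened >= eps, 1.0, 1.0 + np.tanh(phi * (sharpened - eps)))
    return np.clip(out, 0.0, 1.0)
```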
- Train the sketch VQGAN:

```bash
python train_sketch_vqgan.py --path <model path> --config_path configs/vqgan_ffhq.yml --max_iters 150000 --learning_rate 1e-4 --gpu 0
```
- Train the image TS-VQGAN, first from scratch and then finetuned with a lower learning rate:

```bash
python train_image_tsvqgan.py --path <model path> --config_path configs/vqgan_ffhq.yml --max_iters 150000 --learning_rate 2e-4 --gpu 0
python train_image_tsvqgan.py --path <model path> --config_path configs/vqgan_ffhq.yml --max_iters 300000 --learning_rate 4e-5 --gpu 0 --finetune
```
- Train the iLAT transformer (a sketch of the local-only training loss follows the command):

```bash
python train_transformer.py --path <model path> --config_path configs/transformer_ffhq.yml \
    --sketch_model_path <sketch-VQGAN path> \
    --image_model_path <image-VQGAN path> \
    --max_iters 300000 --learning_rate 5e-5 --gpu 3
```
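Since only the edited region is generated autoregressively, one plausible training objective is a cross-entropy restricted to local token positions; the sketch below (tensor names and shapes are assumptions) illustrates the idea.

```python
import torch.nn.functional as F

def local_token_loss(logits, targets, local):
    """Cross-entropy over local positions only.

    logits:  (B, N, V) transformer outputs over a codebook of size V.
    targets: (B, N)    ground-truth token indices from the image VQGAN.
    local:   (B, N)    bool, True where the token lies in the edited region.
    """
    loss = F.cross_entropy(logits.transpose(1, 2), targets, reduction='none')
    return (loss * local.float()).sum() / local.float().sum().clamp(min=1)
```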
See face_editing_demo.ipynb for an interactive face-editing demo; a rough outline of the editing loop is sketched below.
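At a high level, the demo amounts to: encode the sketch and the image to tokens, resample the tokens of the edited region autoregressively under the LA mask, and decode. The sketch below is hypothetical; every object and method name is an assumption, not the repo's actual API.

```python
import torch

@torch.no_grad()
def edit_image(image_vqgan, sketch_vqgan, transformer, image, sketch, region):
    """Hypothetical editing loop; `region` is a (N,) bool mask of edited tokens."""
    img_tokens = image_vqgan.encode(image)     # (N,) discrete codebook indices
    sk_tokens = sketch_vqgan.encode(sketch)    # conditioning tokens
    for i in torch.nonzero(region).flatten():  # visit local positions in order
        logits = transformer(sk_tokens, img_tokens, region)  # (N, V)
        probs = torch.softmax(logits[i], dim=-1)
        img_tokens[i] = torch.multinomial(probs, 1).item()   # sample a local token
    return image_vqgan.decode(img_tokens)
```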