ControlTalk

Official code for "Controllable Talking Face Generation by Implicit Facial Keypoints Editing"

ControlTalk: A talking face generation method to control face expression deformation based on driven audio, constructing the head pose and facial expression (lip motion) for both single image or sequential video inputs in a unified manner.

demo_15s.mp4

News

2024/10/31: Inference code is now available!

Installation

# Create a python 3.10 conda env (you could also use virtualenv)
conda env create -f environment.yml

Inference

1. Download checkpoints

Download pretrained models from huggingface detailed guidance.

# Download hubert model
https://huggingface.co/TencentGameMate/chinese-hubert-large

# Download our pretrained model 
https://huggingface.co/Lavivis/ControlTalk

Put all pretrained models in ./checkpoints, the file structure should be like:

checkpoints
├── audio_encoder.pt
├── lipControlNet.pt
├── 20231128_210236_337a_e0362-checkpoint.pth.tar
├── TencentGameMate
├───└──chinese-hubert-large
├─────────└──config.json
├─────────└──pytorch_model.bin
├─────────└──preprocessor_config.json
└─────────└──chinese-hubert-large-fairseq-ckpt.pt

2. Inference

python inference.py \
        --source_video './data/drive_video.mp4' \
        --source_img_path  './data/example.png' \
        --audio './data/drive_audio.wav' \
        --save_as_video \
        --box -1 0 0 0 \
        # --img_mode   # if you only want to control the face expression

Training

Coming soon!

Acknowledgements

Citation

If our work and codebase is useful for you, please cite as:

@article{zhao2024controllable,
  title={Controllable Talking Face Generation by Implicit Facial Keypoints Editing},
  author={Zhao, Dong and Shi, Jiaying and Li, Wenjun and Wang, Shudong and Xu, Shenghui and Pan, Zhaoming},
  journal={arXiv preprint arXiv:2406.02880},
  year={2024}
}

License

Our code is released under MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
checkpoints		checkpoints
configs		configs
data		data
modules		modules
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml
hparams.py		hparams.py
inference.py		inference.py
local_utils.py		local_utils.py
model.py		model.py
render.py		render.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ControlTalk

News

Installation

Inference

1. Download checkpoints

2. Inference

Training

Acknowledgements

Citation

License

About

Releases

Packages

Languages

License

NetEase-Media/ControlTalk

Folders and files

Latest commit

History

Repository files navigation

ControlTalk

News

Installation

Inference

1. Download checkpoints

2. Inference

Training

Acknowledgements

Citation

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages