train

Overview

This project involves training two separate models for a face generation system: an audio model to extract mouth movement patterns and a rendering model to generate the final face. Each model requires its own data preparation and training process.

Audio Model Training (LSTM)

The audio model is distilled from other models and is designed to extract mouth movement patterns from audio input. The code for this model will be released soon.

Render Model Training (simplified DiNet)

Original Data Structure

Please ensure that your data directory structure is organized as follows:

|--/dir_to_data
|  |--/video0.mp4
|  |--/video1.mp4
|  |--/video2.mp4

Data Preparation

Next, prepare your video using the data_preparation script. Replace YOUR_VIDEO_PATH with the path to your video:

python train/data_preparation_face.py dir_to_data

After running the script, your data directory structure should be updated to:

|--/dir_to_data
|  |--/video0
|  |  |--/keypoint_rotate.pkl
|  |  |--/face_mat_mask.pkl
|  |  |--/image
|  |      |--/000000.png
|  |      |--/000001.png
|  |      |--/...
|  |--/video1
|  |  |--/keypoint_rotate.pkl
|  |  |--/face_mat_mask.pkl
|  |  |--/image
|  |      |--/000000.png
|  |      |--/000001.png
|  |      |--/...

Data Validation

Verify the prepared data using the following script:

python train/train_input_validation_render_model.py dir_to_data

you will get the following image:

Training

python train/train_render_model.py --train_data dir_to_data

Monitoring Training Progress

Monitor the training progress using TensorBoard.

tensorboard --logdir=checkpoint/Dinet_five_ref

Then, open http://localhost:6006/ in your web browser to view the training metrics.

Downloading the Pre-trained Model

A pre-trained model can be accessed via both Baidu Netdisk and Google Drive.

Baidu Netdisk Extraction Code: ym7k

Google Drive

Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
data_preparation_face.py		data_preparation_face.py
train_input_validation_render_model.py		train_input_validation_render_model.py
train_render_model.py		train_render_model.py
validation.jpg		validation.jpg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

train

train

README.md

Overview

Audio Model Training (LSTM)

Render Model Training (simplified DiNet)

Original Data Structure

Data Preparation

Data Validation

Training

Monitoring Training Progress

Downloading the Pre-trained Model

Files

train

Directory actions

More options

Directory actions

More options

Latest commit

History

train

Folders and files

parent directory

README.md

Overview

Audio Model Training (LSTM)

Render Model Training (simplified DiNet)

Original Data Structure

Data Preparation

Data Validation

Training

Monitoring Training Progress

Downloading the Pre-trained Model