DeepLabCut is an open-source, deep learning-based tool for precise, markerless pose estimation of user-defined body parts across a wide range of species and behaviors. Developed by Mathis et al. in 2018, it uses transfer learning to fine-tune deep neural networks on small labeled datasets, enabling accurate tracking without physical markers. This approach makes it possible to study complex motor behaviors in naturalistic settings, a significant advantage over traditional marker-based systems.
The training configuration for this DeepLabCut model uses a ResNet50 backbone with Group Normalization and 2048 output channels, optimized for keypoint detection via Gaussian heatmap targets. The model combines weighted loss functions: MSE for the heatmap predictions and Huber for location refinement. Data preprocessing includes RGB normalization and augmentations such as affine transformations (rotation up to ±30° and scaling) and Gaussian noise, with only minimal lateral transformations and no histogram equalization or motion blur. Images are rescaled dynamically, with scale factors between 0.4 and 1.0 and a shorter side between 128 and 1152 pixels. Training uses the AdamW optimizer with a learning rate of 0.0001 and a two-phase learning rate scheduler over 200 epochs. The batch size is 1, snapshots are saved every 25 epochs, and test mAP, evaluated every 10 epochs, serves as the key evaluation metric. This setup, running on an auto-detected device, is aimed at robust generalization and accurate tracking of animal movements, particularly for climbing behavior analysis.
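As a rough illustration of the optimizer settings above, the following PyTorch sketch sets up AdamW at a learning rate of 1e-4 with a two-phase step schedule over 200 epochs. The milestone epoch, the decay factor, and the stand-in model are assumptions; only the optimizer choice, the starting learning rate, and the epoch budget come from the configuration described here.

```python
import torch

# Stand-in module; in practice this would be the ResNet50-based pose model.
model = torch.nn.Conv2d(3, 6, kernel_size=3)

# AdamW at the stated starting learning rate of 1e-4.
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)

# Two-phase schedule: hold the learning rate at 1e-4, then drop it once by an
# assumed factor of 10 at an assumed milestone epoch, over the 200-epoch run.
scheduler = torch.optim.lr_scheduler.MultiStepLR(optimizer, milestones=[160], gamma=0.1)

for epoch in range(200):
    # ... one training epoch (batch size 1) would run here ...
    optimizer.step()      # placeholder for the per-batch optimizer updates
    scheduler.step()      # advance the learning-rate schedule once per epoch
```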
The DeepLabCut GUI was used to create the labeled data frames. The k-means algorithm was used to extract frames for annotation: 20-30 frames, covering both moving and stationary postures of the lizard, were extracted from each video, and 20 videos were extracted and labeled in total. Each anole was labeled consistently at the head, upper spine, lower spine, left hindleg, right hindleg, and tail. The labels were stored as .csv (pixel positions of each body part) and .h5 files in preparation for the training step. [image_to_add]
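The frame-extraction and labeling steps above can be reproduced with the standard DeepLabCut API roughly as follows. The project name, experimenter, and video paths are placeholders, and the body parts themselves are defined in the project's config.yaml rather than in these calls.

```python
import deeplabcut

# Placeholder project name, experimenter, and video list.
config_path = deeplabcut.create_new_project(
    "anole-climbing", "lab", ["videos/anole_01.mp4"], copy_videos=True
)

# Extract frames for annotation automatically using k-means clustering.
deeplabcut.extract_frames(config_path, mode="automatic", algo="kmeans",
                          userfeedback=False)

# Open the labeling GUI; the six body parts (head, upper spine, lower spine,
# left hindleg, right hindleg, tail) are listed in the project's config.yaml.
deeplabcut.label_frames(config_path)

# Package the labeled .csv/.h5 files into a training dataset.
deeplabcut.create_training_dataset(config_path)
```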
Training and validation loss vs. epochs, and model evaluation on the train and test data:
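A minimal sketch of how training and evaluation are invoked through the DeepLabCut API; the config path is a placeholder, and the epoch count, snapshot interval, and evaluation interval are set in the project's training configuration rather than passed here.

```python
import deeplabcut

config_path = "anole-climbing-lab/config.yaml"   # placeholder path

# Train with the settings described above (200 epochs, snapshots every
# 25 epochs, evaluation every 10 epochs, as set in the training config).
deeplabcut.train_network(config_path, shuffle=1)

# Evaluate on the train/test split and plot the predictions.
deeplabcut.evaluate_network(config_path, plotting=True)
```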
Here is the annotated video generated by the trained model:
Body part trajectories for all test videos are shown below:
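The annotated videos and trajectory plots can be produced with DeepLabCut's built-in functions, roughly as follows; the config and video paths are placeholders.

```python
import deeplabcut

config_path = "anole-climbing-lab/config.yaml"   # placeholder path
test_videos = ["videos/test_anole_01.mp4"]       # placeholder video list

# Run inference, overlay the detected body parts on the videos, and plot
# the body-part trajectories.
deeplabcut.analyze_videos(config_path, test_videos, videotype=".mp4")
deeplabcut.create_labeled_video(config_path, test_videos)
deeplabcut.plot_trajectories(config_path, test_videos)
```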
B-SOiD (Behavioral Segmentation of Open-field In DeepLabCut) allows users to discover behaviors using unsupervised learning, without the need for behavior-annotated data. Specifically, B-SOiD finds clusters in animal behavior using pose estimation data from another tool such as DeepLabCut. B-SOiD begins by extracting pose relationships such as distance, speed, and relative angle. Next, it applies a non-linear transformation, UMAP, to re-embed the data in a lower-dimensional space. Then, HDBSCAN is used to identify clusters, and the clustered features are fed as input to a random forest classifier; in the Python implementation, scikit-learn's RandomForestClassifier is used for this step. Finally, the classifier can be used to predict behavior categories in any related data.
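A minimal sketch of this B-SOiD-style pipeline (UMAP embedding, HDBSCAN clustering, then a scikit-learn random forest). The features are randomly generated stand-ins for the pose relationships computed from DeepLabCut output, and the parameter values are illustrative assumptions, not B-SOiD's actual defaults.

```python
import numpy as np
import umap
import hdbscan
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)
features = rng.normal(size=(5000, 21))        # stand-in pose features (distances, speeds, angles)

# 1. Non-linear dimensionality reduction with UMAP.
embedding = umap.UMAP(n_components=3, n_neighbors=60, min_dist=0.0,
                      random_state=0).fit_transform(features)

# 2. Density-based clustering of the low-dimensional embedding.
labels = hdbscan.HDBSCAN(min_cluster_size=50).fit_predict(embedding)

# 3. Train a random forest on the original features, using the cluster
#    labels as targets, so new frames can be classified directly.
keep = labels >= 0                            # drop HDBSCAN noise points
clf = RandomForestClassifier(n_estimators=200, random_state=0)
clf.fit(features[keep], labels[keep])

# The trained classifier can then predict behavior categories for any
# related pose-estimation data.
new_predictions = clf.predict(features[:100])
```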
VAME (Variational Animal Motion Embedding) finds patterns in animal movement, with a focus on identifying repetitive behaviors. Like B-SOiD, VAME uses pose estimation files from a program such as DeepLabCut to identify motifs in animal behavior. VAME first aligns the pose estimation data egocentrically and splits it into fixed-length time windows. Next, it uses a bi-directional recurrent neural network (biRNN) with an encoder-decoder architecture within a Variational Autoencoder (VAE) framework to learn latent representations of the data. Both a reconstruction decoder and a prediction decoder are trained, so that the latent space captures the information needed to reconstruct the current window and to predict upcoming frames. The data is then embedded into the final latent space, and a Hidden Markov Model (HMM) is used to segment the latent space into behavioral motifs.
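The following is a simplified PyTorch sketch, not the actual VAME implementation, of a bidirectional-RNN VAE with separate reconstruction and prediction decoders. The layer sizes, window lengths, feature count, and loss weighting are illustrative assumptions.

```python
import torch
import torch.nn as nn

class BiRNNVAE(nn.Module):
    def __init__(self, n_features=12, hidden=64, latent=16, window=30, future=15):
        super().__init__()
        self.window, self.future = window, future
        # Bidirectional GRU encoder compresses each fixed-length pose window.
        self.encoder = nn.GRU(n_features, hidden, batch_first=True, bidirectional=True)
        self.to_mu = nn.Linear(2 * hidden, latent)
        self.to_logvar = nn.Linear(2 * hidden, latent)
        # Two decoders: one reconstructs the input window,
        # one predicts the next `future` time steps.
        self.rec_decoder = nn.GRU(latent, hidden, batch_first=True)
        self.rec_out = nn.Linear(hidden, n_features)
        self.pred_decoder = nn.GRU(latent, hidden, batch_first=True)
        self.pred_out = nn.Linear(hidden, n_features)

    def forward(self, x):
        _, h = self.encoder(x)                       # h: (2, batch, hidden)
        h = torch.cat([h[0], h[1]], dim=-1)          # concatenate both directions
        mu, logvar = self.to_mu(h), self.to_logvar(h)
        z = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)   # reparameterize
        rec_in = z.unsqueeze(1).repeat(1, self.window, 1)
        pred_in = z.unsqueeze(1).repeat(1, self.future, 1)
        rec, _ = self.rec_decoder(rec_in)
        pred, _ = self.pred_decoder(pred_in)
        return self.rec_out(rec), self.pred_out(pred), mu, logvar

def vae_loss(x, x_future, rec, pred, mu, logvar, beta=1.0):
    # Reconstruction term + prediction term + KL divergence.
    rec_loss = nn.functional.mse_loss(rec, x)
    pred_loss = nn.functional.mse_loss(pred, x_future)
    kl = -0.5 * torch.mean(1 + logvar - mu.pow(2) - logvar.exp())
    return rec_loss + pred_loss + beta * kl

# Example usage with random stand-in data: 8 windows of 30 frames,
# 12 pose features each, with 15 future frames as the prediction target.
model = BiRNNVAE()
x = torch.randn(8, 30, 12)
x_future = torch.randn(8, 15, 12)
rec, pred, mu, logvar = model(x)
loss = vae_loss(x, x_future, rec, pred, mu, logvar)
```

In a full pipeline, the per-window latent vectors (e.g., mu) would then be segmented into behavioral motifs, for example with a Gaussian HMM.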