This is the repository for the paper Progress-Aware Online Action Segmentation for Egocentric Procedural Task Videos, accepted at CVPR 2024.
Most of the code is adapted from MS-TCN.
- main.py: Script to train and evaluate the model.
- model.py: Contains the implementation of the neural network models (MultiStageModel, SingleStageModel, etc.); a minimal architecture sketch follows the list below.
- batch_gen.py: Script for generating batches of data for training and evaluation.
- eval.py: Evaluation script.
- utils/: Utility functions, including write_graph_from_transcripts and write_progress_values.
- data/: Directory containing datasets, including ground truth and feature files.
  - GTEA: Download the GTEA data from link1 or link2. Please refer to ms-tcn or CVPR2024-FACT.
  - EgoProceL: Download the EgoProceL data from G-Drive. Please refer to CVPR2024-FACT.
  - EgoPER: Download the EgoPER data from G-Drive. Please refer to EgoPER for the original data.
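Since most of the code is adapted from MS-TCN, the following is a minimal sketch of that multi-stage temporal convolutional backbone, given for orientation only; the actual model.py extends it with the causal, graph, and progress options described below, and the layer counts and feature sizes here are illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class DilatedResidualLayer(nn.Module):
    """One dilated temporal convolution block with a residual connection."""
    def __init__(self, dilation, in_channels, out_channels):
        super().__init__()
        # padding = dilation keeps the temporal length unchanged
        self.conv_dilated = nn.Conv1d(in_channels, out_channels, 3,
                                      padding=dilation, dilation=dilation)
        self.conv_1x1 = nn.Conv1d(out_channels, out_channels, 1)
        self.dropout = nn.Dropout()

    def forward(self, x):
        out = F.relu(self.conv_dilated(x))
        out = self.dropout(self.conv_1x1(out))
        return x + out

class SingleStageModel(nn.Module):
    """A stack of dilated residual layers followed by a frame-wise classifier."""
    def __init__(self, num_layers, num_f_maps, dim, num_classes):
        super().__init__()
        self.conv_1x1 = nn.Conv1d(dim, num_f_maps, 1)
        self.layers = nn.ModuleList(
            [DilatedResidualLayer(2 ** i, num_f_maps, num_f_maps)
             for i in range(num_layers)])
        self.conv_out = nn.Conv1d(num_f_maps, num_classes, 1)

    def forward(self, x):
        out = self.conv_1x1(x)
        for layer in self.layers:
            out = layer(out)
        return self.conv_out(out)

class MultiStageModel(nn.Module):
    """Later stages refine the softmax predictions of earlier stages."""
    def __init__(self, num_stages, num_layers, num_f_maps, dim, num_classes):
        super().__init__()
        self.stage1 = SingleStageModel(num_layers, num_f_maps, dim, num_classes)
        self.stages = nn.ModuleList(
            [SingleStageModel(num_layers, num_f_maps, num_classes, num_classes)
             for _ in range(num_stages - 1)])

    def forward(self, x):                 # x: (batch, dim, T)
        out = self.stage1(x)
        outputs = out.unsqueeze(0)
        for stage in self.stages:
            out = stage(F.softmax(out, dim=1))
            outputs = torch.cat((outputs, out.unsqueeze(0)), dim=0)
        return outputs                    # (num_stages, batch, num_classes, T)
```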
To generate the target progress values:
python utils/write_progress_values.py
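The exact target definition is implemented in utils/write_progress_values.py. As a rough illustration of the idea (an assumption about the construction, not the script itself), per-frame progress can be derived from frame-level ground-truth labels as the relative position of each frame inside its action segment:

```python
import numpy as np

def progress_from_framewise_labels(labels):
    """Map a frame-level label sequence to per-frame progress targets.

    Within every contiguous action segment, progress rises linearly and
    reaches 1.0 at the last frame of the segment.
    """
    labels = np.asarray(labels)
    progress = np.zeros(len(labels), dtype=np.float32)
    start = 0
    for t in range(1, len(labels) + 1):
        # close the current segment when the label changes or the video ends
        if t == len(labels) or labels[t] != labels[start]:
            length = t - start
            progress[start:t] = np.arange(1, length + 1) / length
            start = t
    return progress

# Example: a segment of length 4 -> 0.25, 0.5, 0.75, 1.0; then length 2 -> 0.5, 1.0
print(progress_from_framewise_labels([0, 0, 0, 0, 3, 3]))
```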
To generate task graphs from video transcripts:
python utils/write_graph.py
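The graphs are built by utils/write_graph.py from the training transcripts. A minimal sketch of the general idea (again an assumption, not the script itself) is to record which action is observed to directly follow which across the transcripts:

```python
import numpy as np

def graph_from_transcripts(transcripts, num_classes):
    """Build a directed transition graph over action classes.

    transcripts: list of transcripts, each an ordered list of action ids.
    Returns an adjacency matrix A with A[i, j] = 1 if action j directly
    follows action i in at least one training transcript.
    """
    adjacency = np.zeros((num_classes, num_classes), dtype=np.float32)
    for transcript in transcripts:
        for prev_action, next_action in zip(transcript[:-1], transcript[1:]):
            adjacency[prev_action, next_action] = 1.0
    return adjacency

# Example: two different orderings of the same task
transcripts = [[0, 1, 2, 4], [0, 2, 1, 4]]
print(graph_from_transcripts(transcripts, num_classes=5))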
To train the model, use the following command:
python main.py --action train --dataset <dataset_name> --split <split_number> --exp_id protas --causal --graph --learnable_graph [other options]
To test the model, use the following command:
python main.py --action predict --dataset <dataset_name> --split <split_number> --exp_id protas --causal --graph --learnable_graph [other options]
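For example, to train and then test on split 1 of GTEA (the dataset key gtea is an assumption; check main.py/batch_gen.py for the exact names accepted by --dataset):
python main.py --action train --dataset gtea --split 1 --exp_id protas --causal --graph --learnable_graph
python main.py --action predict --dataset gtea --split 1 --exp_id protas --causal --graph --learnable_graph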
Note: In principle, to test the model in an online setting you should use the --action predict_online argument, which makes predictions frame by frame. However, when the model is causal, its prediction at each frame depends only on frames up to that frame, so using --action predict produces the same results while being much more efficient.
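A small self-contained check of that equivalence, using a toy causal convolution rather than the repository's model: with left-only padding, the output at frame t of a single full-sequence pass matches the output obtained by running the model on the prefix that ends at t.

```python
import torch
import torch.nn as nn

class ToyCausalConv(nn.Module):
    """1D convolution made causal by padding only on the left."""
    def __init__(self, channels, kernel_size=3, dilation=2):
        super().__init__()
        self.left_pad = (kernel_size - 1) * dilation
        self.conv = nn.Conv1d(channels, channels, kernel_size, dilation=dilation)

    def forward(self, x):                      # x: (batch, channels, T)
        x = nn.functional.pad(x, (self.left_pad, 0))
        return self.conv(x)

torch.manual_seed(0)
model = ToyCausalConv(channels=4).eval()
x = torch.randn(1, 4, 50)

with torch.no_grad():
    offline = model(x)                         # one pass over the whole video
    online = torch.cat(                        # frame-by-frame, prefixes only
        [model(x[:, :, : t + 1])[:, :, -1:] for t in range(x.shape[-1])], dim=-1)

print(torch.allclose(offline, online, atol=1e-6))  # True
```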
If you find the project helpful, we would appreciate it if you cite our work:
@article{Shen:CVPR24,
author = {Y.~Shen and E.~Elhamifar},
title = {Progress-Aware Online Action Segmentation for Egocentric Procedural Task Videos},
journal = {{IEEE} Conference on Computer Vision and Pattern Recognition},
year = {2024}}