Skip to content

Ego4DSounds: A diverse egocentric dataset with high action-audio correspondence

Notifications You must be signed in to change notification settings

Ego4DSounds/Ego4DSounds

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 

Repository files navigation

Ego4DSounds

Dataset introduced in "Action2Sound: Ambient-Aware Generation of Action Sounds from Egocentric Videos".

This repository contains scripts for processing the Ego4DSounds dataset. It includes functionality for loading video and audio data and extracting clips using metadata.

Contents

  • extract_ego4d_clips.py: Extracts clips from the Ego4D dataset
  • dataset.py: Defines the Ego4DSounds dataset class for loading, processing, and extracting video and audio clips
  • Metadata files: train_clips_1.2m.csv, test_clips_11k.csv, ego4d.json

About

Ego4DSounds: A diverse egocentric dataset with high action-audio correspondence

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages