Skip to content
This repository has been archived by the owner on Sep 3, 2023. It is now read-only.

Commit

Permalink
formatting
Browse files Browse the repository at this point in the history
  • Loading branch information
NeubertJonas authored Aug 1, 2023
1 parent 3f19bd5 commit 6cbdc3e
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions audio_video/README.md
Original file line number Diff line number Diff line change
@@ -1,8 +1,8 @@
# Working with Audio & Video Files

*The detailed instructions below are out-of-date, but remain here for historical purposes. I originally planned to utilize Praat to automatically sync the separately recorded audio and video files, but this has been proven too unreliable (e.g., when the start time was more than 30 seconds apart). Instead, I synced the files manually via Adobe Premiere Pro, which also includes great filters to further enhance the audio quality for better clarity.
**The detailed instructions below are out-of-date, but remain here for historical purposes. I originally planned to utilize Praat to automatically sync the separately recorded audio and video files, but this has been proven too unreliable (e.g., when the start time was more than 30 seconds apart). Instead, I synced the files manually via Adobe Premiere Pro, which also includes great filters to further enhance the audio quality for better clarity.**

Concerning transcription, I initially used [OpenAI's Whisper](https://github.com/openai/whisper) package, but then moved on to the [WhisperX](https://github.com/m-bain/whisperX) package because it is faster, more accurate, does not suffer from timestamp drift, and includes speaker labels. WhisperX is still very much experimental and, thus, more complicated to install for the less tech-savvy. It also requires one to have a (somewhat) recent nVidia graphics card.*
**Concerning transcription, I initially used [OpenAI's Whisper](https://github.com/openai/whisper) package, but then moved on to the [WhisperX](https://github.com/m-bain/whisperX) package because it is faster, more accurate, does not suffer from timestamp drift, and includes speaker labels. WhisperX is still very much experimental and, thus, more complicated to install for the less tech-savvy. It also requires one to have a (somewhat) recent nVidia graphics card.**

If you still wanna give it a try, I recommend having a look at its GitHub repo. For reference, I used the following parameter for transcribing:

Expand Down

0 comments on commit 6cbdc3e

Please sign in to comment.