Hindi-Whisper-Automatic-Speech-Recognition

Transcription of Hindi audio using Whisper OpenAI and Whisper-medium model fine-tuned for Hindi

Overview

This repository contains the implementation of the Hindi Whisper Automatic Speech Recognition (ASR) model using OpenAI's Whisper. It includes the transcription of audio files and evaluation of performance using Word Error Rate (WER). The dataset used is the Kathbath dataset.

Project Description

The project involves:

Transcription: Using the Whisper ASR model to transcribe Hindi audio files.
Evaluation: Calculating the Word Error Rate (WER) to assess the accuracy of the transcriptions.
Analysis: Analyzing errors such as substitutions, deletions, and insertions in the transcriptions.

Usage

Download and Prepare Data: The dataset is automatically downloaded. The dataset used is Kathbath dataset.
Transcription: The ASR model transcribes audio files from the Kathbath dataset.
Evaluation: The script calculates the WER and analyzes substitutions, deletions, and insertions.

Results

We observe that as expected, whisper medium fine-tuned on hindi provides much better results than custom whisper openAI.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
Hindi_Whisper_ASR_model.ipynb		Hindi_Whisper_ASR_model.ipynb
README.md		README.md
Whisper-medium model.ipynb		Whisper-medium model.ipynb
audiodata.csv		audiodata.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Hindi-Whisper-Automatic-Speech-Recognition

Overview

Project Description

Usage

Results

About

Releases

Packages

Languages

wannasleepforlong/Hindi-Whisper-Automatic-Speech-Recognition

Folders and files

Latest commit

History

Repository files navigation

Hindi-Whisper-Automatic-Speech-Recognition

Overview

Project Description

Usage

Results

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages