Whisper Web

Web application for accurate speech-to-text conversion, powered by OpenAI Whisper and Pyannote.

Features

Performance

Currently, the model can process 2 hours of audio in 12 minutes on an RTX 3060 graphics card.

TODO

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
.github/workflows		.github/workflows
.vscode		.vscode
back		back
src		src
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
index.html		index.html
package.json		package.json
svelte.config.js		svelte.config.js
tsconfig.json		tsconfig.json
tsconfig.node.json		tsconfig.node.json
vite.config.ts		vite.config.ts
yarn.lock		yarn.lock