Skip to content
View manbaaaa's full-sized avatar
🫡
🫡

Highlights

  • Pro

Block or report manbaaaa

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 172 24 Updated Oct 31, 2024

Official implementation of "Separate Anything You Describe"

Python 1,612 117 Updated Oct 25, 2024

本项目使用了EcapaTdnn、ResNetSE、ERes2Net、CAM++等多种先进的声纹识别模型,同时本项目也支持了MelSpectrogram、Spectrogram、MFCC、Fbank等多种数据预处理方法

Python 223 44 Updated Oct 28, 2024

High-quality and streaming Speech-to-Speech interactive agent in a single file. 只用一个文件实现的流式全双工语音交互原型智能体!

Python 160 16 Updated Nov 1, 2024

Speech recognition

C 567 104 Updated Sep 4, 2024

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Python 1,988 156 Updated Oct 30, 2024

Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment

Python 15 2 Updated Oct 31, 2024

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Jupyter Notebook 3,635 321 Updated Oct 27, 2024

Text-To-Speech for NotebookLM

9 Updated Oct 31, 2024

Android下音视频对讲演示程序

Java 158 57 Updated Oct 31, 2024

an extremely simple tool for separating vocals and background music, completely localized for web operation, using 2stems/4stems/5stems models 这是一个极简的人声和背景音乐分离工具,本地化网页操作,无需连接外网

Python 1,314 151 Updated Jul 29, 2024

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Python 619 43 Updated Oct 27, 2024

Official Implementation of Rectified Flow (ICLR2023 Spotlight)

Python 908 53 Updated Jul 20, 2024

An Open-source Streaming High-fidelity Neural Audio Codec

Python 430 20 Updated Oct 28, 2024

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 50,338 7,214 Updated Nov 1, 2024

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Python 1,812 532 Updated Oct 27, 2023

Offline Speaker Diarization with SenseVoice by Sherpa ONNX.

Python 7 Updated Oct 29, 2024

Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。

Python 1,321 157 Updated Oct 18, 2024

一个超轻量级、可以在移动端实时运行的数字人模型

Python 711 116 Updated Oct 14, 2024

Indonesian speech/phoneme recognizer powered by Kaldi 2.0 (lhotse, icefall, sherpa).

Python 7 3 Updated Jun 30, 2023

Synchronized Translation for Videos. Video dubbing

Python 803 150 Updated Oct 23, 2024

Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 5,812 525 Updated Oct 24, 2024

GLM-4-Voice | 端到端中英语音对话模型

Python 1,997 152 Updated Oct 31, 2024

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

1,607 226 Updated Oct 16, 2024

基于Pytorch的OCR工具库,支持常用的文字检测和识别算法

Python 1,381 305 Updated Sep 2, 2024

Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation implemented by Pytorch

Python 414 66 Updated Feb 14, 2023

🍒 Cherry Studio is a desktop client that supports for multiple LLM providers

TypeScript 1,183 65 Updated Nov 1, 2024

TextrolSpeech: A Text Style Control Speech Corpus With Codec Language Text-to-Speech Models (2024 ICASSP)

Python 132 4 Updated Aug 29, 2024

A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline

Python 44 2 Updated Oct 23, 2024

通过LLM进行进行字幕断句分割,处理和优化字幕文件,将自动语音识别(ASR)数据的分段合并与拆分,

Python 4 1 Updated Oct 25, 2024
Next