-
MIPT
- London, UK
- https://www.linkedin.com/in/asmekal/
Highlights
- Pro
Lists (4)
Sort Name ascending (A-Z)
Stars
An extremely fast Python package and project manager, written in Rust.
Voxtral: Convert Mistral into a end2end SpeechLM. No information bottleneck, preserves prosody, learns interruptions from data. Unlike GPT4o (closed) or Moshi (complex), it's open, simple, natural.
This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive A…
🐫 CAMEL: Finding the Scaling Law of Agents. A multi-agent framework. https://www.camel-ai.org
Speech To Speech: an effort for an open-sourced and modular GPT4-o
If Reddit's content was completely AI-generated.
The fastest way to create an HTML app
The official gpt4free repository | various collection of powerful language models
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
A feature-rich command-line audio/video downloader
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
[Arxiv-2024] MotionLLM: Understanding Human Behaviors from Human Motions and Videos
[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation
Unofficial Implementation of Animate Anyone by Novita AI
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
Unofficial Implementation of Animate Anyone
An efficient video loader for deep learning with smart shuffling that's super easy to digest
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥
Search photos on Unsplash based on OpenAI's CLIP model, support search with joint image+text queries and attention visualization.
[CVPR 2024] X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model
Label Studio is a multi-type data labeling and annotation tool with standardized output format
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation