Store
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Source code for free AI video upscaler tool
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Generate music in different genres using text and audio prompts.
Rust CLI and API server for Voxtral TTS from Mistral
Separate Anything You Describe (https://huggingface.co/spaces/Audio-AGI/AudioSep)
Uncensored Deepfakes for images and videos without training and an easy-to-use GUI.
🎙️ Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning. High-quality text-to-speech synthesis supporting zero-shot voice cloning and streaming inference with natural emotional expression.
[v0.5.1] FramePack Video App offering multiple generation types: Original, F1, video extension, end frame. Features include: LoRA support, job queueing, advanced timestamped prompts, offline mode, a post-processing suite including upscaling, interpolation, filters and more!

Autonomous 16x16 Chess-Grid research agent (KoboldCPP + Qwen GGUF). Walks a grid of Markdown knowledge cells, synthesizes short papers, scores novelty, updates a persistent soul.md.
TTS app built around the EchoTTS model. TTS, Dub, and voice cloning.
Industry leading face manipulation platform
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
RC Stable Audio ToolsFeatured
Advanced Gradio UI for Stable Audio https://github.com/RoyalCities/RC-stable-audio-tools
FooocusFeatured
Minimal Stable Diffusion UI
Multi-Voice Text-to-Speech for Stories and Audiobooks. Supports Kokoro and Chatterbox TTS engines with GPU acceleration.
One-click install & launch for Stable Diffusion WebUI. Free, local, no API key needed. Just type a prompt and create images.
An Efficient Framework For High Fidelity Face Swapping
Upgraded to v1.0!