Store
SongGeneration Studio (Featured)
AI Song Generation with Full Style Control - Generate complete songs with lyrics, vocals, and instrumental tracks using Tencent AI Lab's SongGeneration (LeVo) model. [NVIDIA ONLY]
Phosphene (Featured)
[MAC ONLY] Local generative video panel for Apple Silicon. Joint audio+video via LTX 2.3 (MLX). T2V, I2V, FFLF, Extend. Lossless H.264. LoRA picker + CivitAI browser. Free, open source.
Director's Console (Featured)
Unified AI VFX pipeline with CPE prompt engineering, storyboard canvas, and multi-node orchestrator. https://github.com/NickPittas/DirectorsConsole
Generate editable website timeline reports from Wayback Machine captures.
Hermes Agent with modern WebUI (nesquena/hermes-webui). Persistent memory, multi-provider AI (OpenAI, Anthropic, Gemini, DeepSeek, OpenRouter), scheduled cron jobs, skills, and sessions. Three-panel interface with chat, tasks, memory, and workspace browser. https://github.com/nesquena/hermes-webui
Z-Image, Flux2 Klein, & SeedVR2 with a Gradio UI. Uses a built-in ComfyUI backend for speed and efficiency! [8GB+ VRAM, 16GB+ RAM]
Highly optimized Gradio UI for AI video creation on GPU-poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video, and Flux. https://github.com/deepbeepmeep/Wan2GP
Ultimate-TTS-Studio (Featured)
Kokoro, KittenTTS, Higgs Audio, Chatterbox/Multi, Fish-Speech, F5, index-tts, indextts2, VoxCPM, and VibeVoice in one app.
ACE-Step 1.5 (Featured)
A powerful local music generation model, claimed to outperform most commercial alternatives.
Side-Step (Featured)
Optimized training script for ACE-Step with low-VRAM support for local GPUs.
Professional browser-based video editor. Open source CapCut alternative. 100% browser-based, no cloud uploads, no watermarks.
A multi-voice AI audiobook generator built on Qwen3-TTS: annotate scripts with an LLM, assign a unique voice to each character, add per-line style instructions for delivery, clone voices from reference audio, design new voices from text descriptions, train custom voices with LoRA fine-tuning, and export to MP3 or Audacity multi-track projects.
Overworld (Featured)
[NVIDIA GPU REQUIRED] Realtime world generator powered by the Overworld Waypoint world model.
MagicQuill (Featured)
An intelligent, interactive image editing system. Easily erase and add objects in a user-friendly interface.
VoxCPM (Featured)
Tokenizer-free multilingual TTS and voice cloning with low-VRAM and VoxCPM2 Web UI/API launch modes.
FlashVSR - Video and Image Upscaler: [Runs on 12GB VRAM, 32GB RAM] Diffusion-Based Streaming Video Super-Resolution
FaceFusion 3.5.4 (Featured)
Industry-leading face manipulation platform
VibeVoice Realtime (Featured)
Realtime streaming TTS demo using microsoft/VibeVoice-Realtime-0.5B
LTX-Desktop Video Generation + Editor - Powered By WanGP
