Store
ColorFusion-XL Video Colorization (PAL-stabil, RTX3080-optimiert)
AudioCraft Plus is an all-in-one WebUI for the original AudioCraft, adding many quality features on top https://github.com/GrandaddyShmax/audiocraft_plus
All in one Gradio interface for chatterbox. Voice cloning from uploaded audio samples, automatic text processing for long content and real-time speech generation with configurable parameters. (Minimum Requirements 4GB VRAM / Recommended Requirements 8GB VRAM)
VibeVoice RealtimeFeatured
Realtime streaming TTS demo using microsoft/VibeVoice-Realtime-0.5B
PinokioLangGraph — Agent-StateSync for SillyTavern. A Pinokio script that runs a FastAPI + LangGraph agent as middleware between SillyTavern and your LLMs.
LTX-Desktop Video Generation + Editor - Powered By WanGP
High-quality rapid TTS voice cloning model (150x+ realtime) — 48kHz speech, voice cloning
Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP
Wan2GPFeatured
Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP
LivePortraitFeatured
Bring portraits to life! https://github.com/KwaiVGI/LivePortrait
High-Quality Text-to-Speech for Indian Languages
Fast Lipsync application for smaller GPU's.
ForgeFeatured
[NVIDIA ONLY] The most efficient way to run FLUX (Optimized to run even on low memory machines, as low as 3GB VRAM with 512x512 resolution) https://github.com/lllyasviel/stable-diffusion-webui-forge
Automatically remove watermarks from videos generated by Sora AI.
[AMD ONLY] Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video, Flux and more. (On Windows supported by all dedicated AMD GPUs from RDNA 2 - RDNA 4)
Flexible Automapper for Beatsaber made for any difficulty
Hermes ModFeatured
A full Hermes skin manager for browsing, editing, saving, and activating CLI skins directly from Pinokio.
Standalone Text-to-Speech using Orpheus TTS with a Gradio UI
🗣️ PersonaPlex - NVIDIA's real-time speech-to-speech conversational AI model. Natural full-duplex conversations with customizable personas and voices. Requires an NVIDIA GPU on Windows or Linux (16-24GB VRAM recommended), 32GB RAM, and a Hugging Face account.
Lightweight CPU text-to-speech with preset voices and optional Hugging Face-authenticated voice cloning.
