Store
High-quality image generation and editing powered by SenseNova-U1-8B-MoT (NEO-Unify architecture). Supports text-to-image, image-to-image editing, and reasoning mode.
Real-time AI face swap with CoreML acceleration for macOS.
Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP
Ultimate-TTS-StudioFeatured
Kokoro, KittenTTS, Higgs audio, Chatterbox/Multi, Fish-Speech, F5 & index-tts & indextts2, VoxCPM and VibeVoice in one app
Video to Openpose & DWPose (All OS supported) https://github.com/sdbds/vid2pose
The open-source ElevenLabs alternative. Local voice cloning, video dubbing, and real-time dictation — 646 languages, no API keys.
ACE-Step 1.5Featured
The most powerful local music generation model that outperforms most commercial alternatives.
A lightweight local launcher and control panel for llama.cpp.
Video translation & dubbing with voice cloning — 100% local, zero API. Supports 10 languages, automatic transcription, translation, and AI dubbing.
Based on BFS - Best Face Swap, VisoMaster, and SwapAnyHead.
Side-StepFeatured
Optimized Training script for Ace-Step with low VRAM support for local GPUs.
ApplioFeatured
A simple, high-quality voice conversion tool focused on ease of use and performance.

使用本地 LM Studio AI 免費校正 ASR 課程字幕,支援 PDF 參考資料,不需 API Key
Local GPU-accelerated music video generator: Gradio UI, analysis, SDXL backgrounds, NVENC output.
Flask video browser with previews and streaming.
Professional browser-based video editor. Open source CapCut alternative. 100% browser-based, no cloud uploads, no watermarks.
Local dashboard for Dynatrace problems and metrics with Azure OpenAI incident analysis.
a state-of-the-art open-source model for fast feedforward 3D reconstruction from a single image, developed in collaboration between Tripo AI and Stability AI. https://huggingface.co/spaces/stabilityai/TripoSR
A multi-voice AI audiobook generator built on Qwen3-TTS — annotate scripts with an LLM, assign unique voices to each character, per-line style instructions for delivery, clone voices from reference audio, design new voices from text descriptions, train custom voices with LoRA fine-tuning, and export to MP3 or Audacity multi-track projects