Blizaine/Qwen3-TTS-MLX-WebUI-Enhancedv5.0updated 9d ago
High-quality text-to-speech with Beautiful Web UI & API, optimized for Apple Silicon using MLX. Features include Custom Voice (preset speakers), Voice Design (natural language), and Voice Cloning. With enhanced features for saving custom voices and long-form / endless TTS streaming.
Dub & translate any short video — locally, offline. Voice clone / per-speaker cast / voice packs, on-screen text localized in place, subtitle styling, blur-or-solid mask covers, funny re-dub. One process (FastAPI serves the React SPA), 6 UI languages.
Upload two audio files to compare their voices. The app will provide a similarity score and indicate if the voices match or not, along with performance metrics.
BazedFrog/SongGeneration-Studiov3.7updated 10d ago
AI Song Generation with Full Style Control - Generate complete songs with lyrics, vocals, and instrumental tracks using Tencent AI Lab's SongGeneration (LeVo) model. [NVIDIA ONLY]