Welcome to Kokoro, a high-quality text-to-speech synthesis program powered by deep learning. This tool converts any text into high-fidelity speech in just a few seconds. Simply input text, select a voice, adjust the speed, and enjoy the generated audio.
Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP
kristopher-miles/threadspeak-audiobookv5.1updated 2mo ago
An AI audiobook generator built on Qwen3-TTS. Annotate your book with an LLM, assign voices, per-line style instructions for delivery, clone voices from reference audio, design new voices from text descriptions, and export to MP3 or Audacity multi-track projects