Store
ACE-Step 1.5Featured
The most powerful local music generation model that outperforms most commercial alternatives.
ApplioFeatured
A simple, high-quality voice conversion tool focused on ease of use and performance.
AudioSepFeatured
Separate Anything You Describe (https://huggingface.co/spaces/Audio-AGI/AudioSep)
ComfyuiFeatured
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. https://github.com/comfyanonymous/ComfyUI
HallucinatorFeatured
[NVIDIA ONLY] Autocomplete any voice(s), powered by Hertz AI (Standard Intelligence)
MMAudioFeatured
Generate synchronized audio from video and/or text inputs https://github.com/hkchengrex/MMAudio
OpenAudioFeatured
Multilingual Text-to-Speech with Voice Cloning (Supports: English, Japanese, Korean, Chinese, French, German, Arabic, and Spanish) https://github.com/fishaudio/fish-speech
Qwen3-TTSFeatured
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team
StableDAWFeatured
Browser-based AI audio DAW for Stable Audio 3 with text-to-audio, inpainting, LoRA training, FFmpeg effects, waveform editing, sequencer, piano roll, and persistent library. https://github.com/gantasmo/stabledaw
Ultimate-TTS-StudioFeatured
Kokoro, KittenTTS, Higgs audio, Chatterbox/Multi, Fish-Speech, F5 & index-tts & indextts2, VoxCPM and VibeVoice in one app
A multi-voice AI audiobook generator built on Qwen3-TTS — annotate scripts with an LLM, assign unique voices to each character, per-line style instructions for delivery, clone voices from reference audio, design new voices from text descriptions, train custom voices with LoRA fine-tuning, and export to MP3 or Audacity multi-track projects
Slice any audio file into fixed-length (10-60s) clips and export them all at once. Powered by ffmpeg.
Pinokio wrapper: installs HeartMuLa heartlib + downloads checkpoints + launches a Gradio UI for music generation.
Local/cloud AI YouTube video generator: script, visuals, voiceover, thumbnail, MP4.
