pinokiofactory/Ultimate-TTS-Studiov3.7updated 3d ago
Kokoro, KittenTTS, Higgs audio, Chatterbox/Multi, Fish-Speech, F5 & index-tts & indextts2, VoxCPM and VibeVoice in one app
26 check-insNVIDIAAMDApple
francescofugazzi/ml-sharp-pinokiov0.3updated 1mo ago
One-click 3D Gaussian Splatting generation from a single image.
@franzipol5 check-insNVIDIAAMDApple
Qwen3-TTSFeatured
SUP3RMASS1VE/Qwen3-TTS-Pinokiov5.0updated 1mo ago
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team
@sup3rmass1ve21 check-insNVIDIAAMDApple
e2-f5-ttsFeatured
pinokiofactory/e2-f5-ttsv3.7updated 1mo ago
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching https://huggingface.co/spaces/mrfakename/E2-F5-TTS
12 check-insNVIDIAAMDApple
pinokiofactory/Hunyuan3d-2-lowvramv3.7updated 1mo ago
Text/Image to 3D (Cross Platform: Mac + Windows + Linux): High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models. https://github.com/deepbeepmeep/Hunyuan3D-2GP
6 check-insNVIDIAAMDApple
OpenAudioFeatured
pinokiofactory/openaudiov3.7updated 29d ago
Multilingual Text-to-Speech with Voice Cloning (Supports: English, Japanese, Korean, Chinese, French, German, Arabic, and Spanish) https://github.com/fishaudio/fish-speech
10 check-insNVIDIAAMDApple
ForgeFeatured
pinokiofactory/stable-diffusion-webui-forgev2.0updated 15d ago
[NVIDIA ONLY] The most efficient way to run FLUX (Optimized to run even on low memory machines, as low as 3GB VRAM with 512x512 resolution) https://github.com/lllyasviel/stable-diffusion-webui-forge
20 check-insNVIDIAAMDApple
pinokiofactory/whisper-webuiv3.7updated 3mo ago
A Web UI for easy subtitle using whisper model.
2 check-insNVIDIAAMDApple
ComfyuiFeatured
pinokiofactory/comfyv3.7updated 17d ago
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. https://github.com/comfyanonymous/ComfyUI
62 check-insNVIDIAAMDApple
MagicQuillFeatured
pinokiofactory/MagicQuillv3.7updated 9d ago
An intelligent, interactive Image Editing System. Easily erase and add objects on a user-friendly interface.
6 check-insNVIDIAAMDApple
Wan2GPFeatured
pinokiofactory/wanv3.7updated 15d ago
Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP
145 check-insNVIDIAAMDApple
BazedFrog/SongGeneration-Studiov3.7updated 4h ago
AI Song Generation with Full Style Control - Generate complete songs with lyrics, vocals, and instrumental tracks using Tencent AI Lab's SongGeneration (LeVo) model. [NVIDIA ONLY]
12 check-insNVIDIAAMDApple