cocktailpeanutlabs/differential-diffusion-uiv1.2updated 3mo ago
Differential Diffusion modifies an image according to a text prompt, and according to a map that specifies the amount of change in each region https://differential-diffusion.github.io/
2 check-insNVIDIAAMDApple
parler-ttsFeatured
cocktailpeanutlabs/parler-ttsv1.5updated 1y ago
a lightweight text-to-speech (TTS) model that can generate high-quality speech with features that can be controlled using a simple text prompt (e.g. gender, background noise, speaking rate, pitch and reverberation). https://huggingface.co/spaces/parler-tts/parler_tts_mini
1 check-inNVIDIAAMDApple
cocktailpeanutlabs/storydiffusion-comicsv3.0updated 3mo ago
create a story by generating consistent images https://github.com/HVision-NKU/StoryDiffusion
2 check-insNVIDIAAMDApple
SillyTavernFeatured
pinokiofactory/sillytavernv1.5updated 2mo ago
a local-install interface that allows you to interact with text generation AIs (LLMs) to chat and roleplay with custom characters. https://docs.sillytavern.app/
4 check-insNVIDIAAMDApple
FooocusFeatured
cocktailpeanutlabs/fooocusv3.7updated 19d ago
Minimal Stable Diffusion UI
11 check-insNVIDIAAMDApple
InvokeFeatured
pinokiofactory/invokev3.7updated 4mo ago
The Gen AI Platform for Pro Studios https://github.com/invoke-ai/InvokeAI
4 check-insNVIDIAAMDApple
pinokiofactory/StyleTTS2_Studiov3.7updated 4mo ago
Build your own voice for StyleTTS2
2 check-insNVIDIAAMDApple
Open WebUIFeatured
pinokiofactory/open-webuiv3.4.0updated 1mo ago
User-friendly WebUI for LLMs, supported LLM runners include Ollama and OpenAI-compatible APIs https://github.com/open-webui/open-webui
14 check-insNVIDIAAMDApple
FramePackFeatured
pinokiofactory/Frame-Packv3.7updated 1mo ago
[NVIDIA ONLY] Generate Video Progressively. FramePack is a next-frame (next-frame-section) prediction neural network structure that generates videos progressively. https://github.com/lllyasviel/FramePack
10 check-insNVIDIAAMDApple
facefusion/facefusion-pinokiov5.0updated 11d ago
Industry leading face manipulation platform
82 check-insNVIDIAAMDApple
pinokiofactory/Ultimate-TTS-Studiov3.7updated 3d ago
Kokoro, KittenTTS, Higgs audio, Chatterbox/Multi, Fish-Speech, F5 & index-tts & indextts2, VoxCPM and VibeVoice in one app
26 check-insNVIDIAAMDApple
francescofugazzi/ml-sharp-pinokiov0.3updated 1mo ago
One-click 3D Gaussian Splatting generation from a single image.
@franzipol5 check-insNVIDIAAMDApple
Qwen3-TTSFeatured
SUP3RMASS1VE/Qwen3-TTS-Pinokiov5.0updated 1mo ago
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team
@sup3rmass1ve21 check-insNVIDIAAMDApple
pinokiofactory/vibevoice-realtimev5.0updated 14d ago
Realtime streaming TTS demo using microsoft/VibeVoice-Realtime-0.5B
5 check-insNVIDIAAMDApple
ForgeFeatured
pinokiofactory/stable-diffusion-webui-forgev2.0updated 15d ago
[NVIDIA ONLY] The most efficient way to run FLUX (Optimized to run even on low memory machines, as low as 3GB VRAM with 512x512 resolution) https://github.com/lllyasviel/stable-diffusion-webui-forge
20 check-insNVIDIAAMDApple
ComfyuiFeatured
pinokiofactory/comfyv3.7updated 17d ago
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. https://github.com/comfyanonymous/ComfyUI
62 check-insNVIDIAAMDApple
Wan2GPFeatured
pinokiofactory/wanv3.7updated 15d ago
Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP
145 check-insNVIDIAAMDApple
BazedFrog/SongGeneration-Studiov3.7updated 3h ago
AI Song Generation with Full Style Control - Generate complete songs with lyrics, vocals, and instrumental tracks using Tencent AI Lab's SongGeneration (LeVo) model. [NVIDIA ONLY]
12 check-insNVIDIAAMDApple