cocktailpeanut/ldm3d.pinokioupdated 1y ago
[NVIDIA GPU ONLY] One click installer for Intel's ldm3d
@cocktailpeanut0 check-insNVIDIAAMDApple
cocktailpeanut/densediffusion.pinokioupdated 1y ago
Dense Text-to-Image Generation with Attention Modulation
@cocktailpeanut0 check-insNVIDIAAMDApple
cocktailpeanut/VALL-E-X.pinokioupdated 1y ago
An open source implementation of Microsoft's VALL-E X zero-shot TTS model
@cocktailpeanut0 check-insNVIDIAAMDApple
cocktailpeanut/realtime-lcm.pinokioupdated 1y ago
Demo showcasing ~real-time Latent Consistency Model pipeline with Diffusers and a MJPEG stream server (https://github.com/radames/Real-Time-Latent-Consistency-Model)
@cocktailpeanut0 check-insNVIDIAAMDApple
cocktailpeanut/diffusers-sdxl-turboupdated 1y ago
Demo showcasing ~real-time Latent Consistency Model pipeline with Diffusers and a MJPEG stream server (https://github.com/radames/Real-Time-Latent-Consistency-Model)
@cocktailpeanut1 check-inNVIDIAAMDApple
shadowburn0/lavie.pinokioupdated 1y ago
Text-to-Video (T2V) generation framework from Vchitect https://github.com/Vchitect/LaVie
0 check-insNVIDIAAMDApple
cocktailpeanut/mirrorupdated 1y ago
An AI powered mirror
@cocktailpeanut0 check-insNVIDIAAMDApple
cocktailpeanutlabs/deusupdated 1y ago
A Realtime Creation Engine
0 check-insNVIDIAAMDApple
pinokiofactory/florence-samv2.0updated 1y ago
Integrates Florence2 and SAM2 models for detailed image captioning and object detection. Florence2 generates detailed captions that are then used to perform phrase grounding. The Segment Anything Model 2 (SAM2) converts these phrase-grounded boxes into masks. https://huggingface.co/spaces/SkalskiP/florence-sam
1 check-inNVIDIAAMDApple
pinokiofactory/accdiffusionv2.0updated 1y ago
0 check-insNVIDIAAMDApple
Feedjer/stable-diffusion-webui-ux.pinokiov1.5updated 1y ago
Stable Diffusion web UI UX: https://github.com/anapnoe/stable-diffusion-webui-ux
4 check-insNVIDIAAMDApple
Feedjer/AniPortrait.pinokiov1.5updated 1y ago
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation:https://github.com/Zejun-Yang/AniPortrait
0 check-insNVIDIAAMDApple
Feedjer/Langflow.pinokiov1.5updated 1y ago
Langflow is a dynamic graph where each node is an executable unit. Its modular and interactive design fosters rapid experimentation and prototyping, pushing hard on the limits of creativity: https://github.com/langflow-ai/langflow
0 check-insNVIDIAAMDApple
Feedjer/HunyuanDiT.pinokiov1.5updated 1y ago
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding/ https://github.com/Tencent/HunyuanDiT
0 check-insNVIDIAAMDApple
Feedjer/Omost.pinokiov1.5updated 1y ago
Your image is almost there!:https://github.com/lllyasviel/Omost
0 check-insNVIDIAAMDApple
Feedjer/Flowise.pinokiov1.5updated 1y ago
Drag & drop UI to build your customized LLM flow: https://github.com/FlowiseAI/Flowise
0 check-insNVIDIAAMDApple
Feedjer/cambrian.pinokiov1.5updated 1y ago
[Need 24GB VRAM] Cambrian-1 is a family of multimodal LLMs with a vision-centric design: https://github.com/cambrian-mllm/cambrian
0 check-insNVIDIAAMDApple
pinokiofactory/doughv1updated 1y ago
Dough is a open source tool for steering AI animations with precision
1 check-inNVIDIAAMDApple
cocktailpeanutlabs/moondream1v1.1updated 1y ago
moondream1 is a tiny (1.6B parameter) vision language model trained by @vikhyatk that performs on par with models twice its size. It is trained on the LLaVa training dataset, and initialized with SigLIP as the vision tower and Phi-1.5 as the text encoder. https://huggingface.co/spaces/vikhyatk/moondream1
0 check-insNVIDIAAMDApple