Store
[NVIDIA GPU ONLY] One-click installer for Intel's ldm3d
Dense Text-to-Image Generation with Attention Modulation
An open source implementation of Microsoft's VALL-E X zero-shot TTS model
Demo showcasing a ~real-time Latent Consistency Model pipeline with Diffusers and an MJPEG stream server (https://github.com/radames/Real-Time-Latent-Consistency-Model)
Text-to-Video (T2V) generation framework from Vchitect https://github.com/Vchitect/LaVie
An AI powered mirror
A Realtime Creation Engine
Vid2DensePose
Convert your videos to DensePose and use them with MagicAnimate https://github.com/Flode-Labs/vid2densepose
Integrates Florence2 and SAM2 models for detailed image captioning and object detection. Florence2 generates detailed captions that are then used to perform phrase grounding. The Segment Anything Model 2 (SAM2) converts these phrase-grounded boxes into masks. https://huggingface.co/spaces/SkalskiP/florence-sam
Stable Diffusion web UI UX: https://github.com/anapnoe/stable-diffusion-webui-ux
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation: https://github.com/Zejun-Yang/AniPortrait
Langflow is a dynamic graph where each node is an executable unit. Its modular and interactive design fosters rapid experimentation and prototyping, pushing hard on the limits of creativity: https://github.com/langflow-ai/langflow
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding: https://github.com/Tencent/HunyuanDiT
Your image is almost there!: https://github.com/lllyasviel/Omost
Drag & drop UI to build your customized LLM flow: https://github.com/FlowiseAI/Flowise
[Need 24GB VRAM] Cambrian-1 is a family of multimodal LLMs with a vision-centric design: https://github.com/cambrian-mllm/cambrian
Dough is an open source tool for steering AI animations with precision
moondream1 is a tiny (1.6B parameter) vision language model trained by @vikhyatk that performs on par with models twice its size. It is trained on the LLaVa training dataset, and initialized with SigLIP as the vision tower and Phi-1.5 as the text encoder. https://huggingface.co/spaces/vikhyatk/moondream1
