Pinokio

Launcher updates

Audiochunker

@manatheturipa4d ago

AudioChunker

Audiochunker — slice any audio into perfectly timed clips, instantly Audiochunker is a lightweight Pinokio ap...

NEXUS OS

@ramshi10d ago

Free AI Agents, Forever

Included a feature called Router, which you can manually connect and plug in free models across many provider...

Phosphene

@bizarro19d ago

Phosphene 3.2.4 - Ideogram 4 now runs on M1 and M2 Macs

Phosphene 3.2.4 fixes Ideogram 4 for slower Apple GPUs. It now runs on M1 and M2 Macs, not just the fast ones...

Higgs Audio Studio

@nerual_dreming20d ago

new tts here!

Underfit

@cocktailpeanut20d ago

Underfit: Stable Audio 3 LoRA Training Studio

Underfit is for musicians, sound designers, sample makers, and audio experimenters who want Stable Audio 3 to...

Type:All

Platform:All

GPU:All

Recommended Latest Check-ins

Sort:Latest

Diffusers SDXL Turbo

cocktailpeanut/diffusers-sdxl-turboupdated 1y ago

Demo showcasing ~real-time Latent Consistency Model pipeline with Diffusers and a MJPEG stream server (https://github.com/radames/Real-Time-Latent-Consistency-Model)

@cocktailpeanut1 check-inNVIDIAAMDApple

lavie

shadowburn0/lavie.pinokioupdated 1y ago

Text-to-Video (T2V) generation framework from Vchitect https://github.com/Vchitect/LaVie

0 check-insNVIDIAAMDApple

Mirror

cocktailpeanut/mirrorupdated 1y ago

An AI powered mirror

@cocktailpeanut0 check-insNVIDIAAMDApple

DEUS

cocktailpeanutlabs/deusupdated 1y ago

A Realtime Creation Engine

0 check-insNVIDIAAMDApple

Vid2DensePose

cocktailpeanut/densepose.pinokioupdated 1y ago

Convert your videos to densepose and use it on MagicAnimate https://github.com/Flode-Labs/vid2densepose

#ai #utility

@cocktailpeanut0 check-insNVIDIAAMDApple

MLFocalLengths

nandometzger/MLFocalLengthsupdated 1y ago

Estimating the Focal Length of a Monocular Image

0 check-insNVIDIAAMDApple

florence-sam

pinokiofactory/florence-samv2.0updated 1y ago

Integrates Florence2 and SAM2 models for detailed image captioning and object detection. Florence2 generates detailed captions that are then used to perform phrase grounding. The Segment Anything Model 2 (SAM2) converts these phrase-grounded boxes into masks. https://huggingface.co/spaces/SkalskiP/florence-sam

1 check-inNVIDIAAMDApple

accdiffusion

pinokiofactory/accdiffusionv2.0updated 1y ago

0 check-insNVIDIAAMDApple

forge-legacy-extensions

lllyasviel/forge-legacy-extensionsupdated 1y ago

some archived legacy forge extensions

0 check-insNVIDIAAMDApple

FoleyCrafter

open-mmlab/FoleyCrafterupdated 1y ago

[IJCV] FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师，给你的无声视频添加生动而且同步的音效 😝

0 check-insNVIDIAAMDApple

Fooocus-inswapper

machineminded/fooocus-inswapperupdated 1y ago

Focus on prompting and generating with an inswapper integration

0 check-insNVIDIAAMDApple

TTS-Indonesia-Gratis

drat/TTS-Indonesia-Gratisupdated 1y ago

Aplikasi ini digunakan untuk menghasilkan suara berbasis teks dengan berbagai pilihan pembicara. Teknologi yang digunakan meliputi model text-to-speech (TTS) yang canggih dengan konversi teks ke fonem. Model yang dipakai dilatih khusus untuk bahasa Indonesia, Jawa dan Sunda.

0 check-insNVIDIAAMDApple

LivePortrait

cocktailpeanut/LivePortraitupdated 1y ago

Bring portraits to life!

@cocktailpeanut0 check-insNVIDIAAMDApple

stable-diffusion-webui-ux

Feedjer/stable-diffusion-webui-ux.pinokiov1.5updated 1y ago

Stable Diffusion web UI UX: https://github.com/anapnoe/stable-diffusion-webui-ux

4 check-insNVIDIAAMDApple

AniPortrait

Feedjer/AniPortrait.pinokiov1.5updated 1y ago

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation:https://github.com/Zejun-Yang/AniPortrait

0 check-insNVIDIAAMDApple

Langflow

Feedjer/Langflow.pinokiov1.5updated 1y ago

Langflow is a dynamic graph where each node is an executable unit. Its modular and interactive design fosters rapid experimentation and prototyping, pushing hard on the limits of creativity: https://github.com/langflow-ai/langflow

0 check-insNVIDIAAMDApple

HunyuanDiT

Feedjer/HunyuanDiT.pinokiov1.5updated 1y ago

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding/ https://github.com/Tencent/HunyuanDiT

0 check-insNVIDIAAMDApple

Omost

Feedjer/Omost.pinokiov1.5updated 1y ago

Your image is almost there!:https://github.com/lllyasviel/Omost

0 check-insNVIDIAAMDApple

Flowise

Feedjer/Flowise.pinokiov1.5updated 1y ago

Drag & drop UI to build your customized LLM flow: https://github.com/FlowiseAI/Flowise

0 check-insNVIDIAAMDApple

cambrian

Feedjer/cambrian.pinokiov1.5updated 1y ago

[Need 24GB VRAM] Cambrian-1 is a family of multimodal LLMs with a vision-centric design: https://github.com/cambrian-mllm/cambrian

0 check-insNVIDIAAMDApple

WantedFeed

Most wanted

Follow to get notified when a launcher drops.

#1pinokio

AI Browser

#2OpenMontage

World's first open-source, agentic video production system. 12 pipelines, 52 tools, 500+ agent skills. Turn your AI coding assistant into a full video production studio.

#3Local-AI-Image-Generator

A fully self-contained, offline AI image generation studio for Windows. Runs Stable Diffusion (Safetensors/GGUF) locally with zero manual setup. Auto-configures CUDA for Nvidia GPUs and Vulkan for AMD/Intel Arc cards. Zero system-wide dependencies required.

#4MoneyPrinterTurbo

利用AI大模型，一键生成高清短视频 Generate short videos with one click using AI LLM.

#5Memorwise

A local, open-source alternative to NotebookLM. Chat with your documents using any LLM.

#6diffusiongemma-lab

Local web UI for the DiffusionGemma diffusion LLM — watch answers crystallize out of noise

#7openmed

open-source healthcare ai

#8GitHub - rsxdalv/TTS-WebUI: A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!

A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, ...

#9meshflow

Repository for the CVPR 2026 paper MeshFlow Efficient Artistic Mesh Generation via MeshVAE and Flow-based Diffusion Transformer by Weiyu Li, Antoine Toisoul, Tom Monnier, Roman Shapovalov, Rakesh Ranjan, Ping Tan and Andrea Vedaldi.

#10SkyReels-V2

SkyReels-V2: Infinite-length Film Generative model

Global feed

Latest posts from the community.

great

@apples · glitchframe

great

Feedback & Feature Request: Local Voice Training & Export for Qwen3-TTS

@hisuinoi · Qwen3-TTS

Hello, First of all, thank you for this excellent tool. I have been thoroughly testing Qwen3-TTS and ...

Convert Range of Pages to TXT File

@gvmoon · LightOnOCR-2-1B

This is great! Is it possible to have in the interface the ability to queue a range of pages to be ex...

pls help..thanks

@omar1984 · Wan2GP - AMD1

ACE UI Feedback: Functional bugs regarding Model Switching, Cover Art, and Generation limits

@hisuinoi · ACE-Step UI

What happened? The ACE UI exhibits several functional disconnects where UI elements do not trigger th...

Global radar

Projects people are discovering or following now.

Followed4 min

RVC

1 Click Installer for Retrieval-based-Voice-Conversion-WebUI (https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI)

Followed5 min

Alexandria

A multi-voice AI audiobook generator built on Qwen3-TTS — annotate scripts with an LLM, assign unique voices to each character, per-line style instructions for delivery, clone voices from reference audio, design new voices from text descriptions, train custom voices with LoRA fine-tuning, and export to MP3 or Audacity multi-track projects

Followed8 min

Wan2GP

Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP

Followed8 min

Stable Audio 3

Launcher for Stable Audio 3 Small Music, Small SFX, and NVIDIA Medium using public cocktailpeanut Hugging Face mirrors. https://github.com/Stability-AI/stable-audio-3

Followed8 min

Qwen3-TTS MLX WebUI Enhanced

High-quality text-to-speech with Beautiful Web UI & API, optimized for Apple Silicon using MLX. Features include Custom Voice (preset speakers), Voice Design (natural language), and Voice Cloning. With enhanced features for saving custom voices and long-form / endless TTS streaming.

Launcher updates

Store