Pinokio

Launcher updates

Audiochunker

@manatheturipa3d ago

AudioChunker

Audiochunker — slice any audio into perfectly timed clips, instantly Audiochunker is a lightweight Pinokio ap...

NEXUS OS

@ramshi10d ago

Free AI Agents, Forever

Included a feature called Router, which you can manually connect and plug in free models across many provider...

Phosphene

@bizarro19d ago

Phosphene 3.2.4 - Ideogram 4 now runs on M1 and M2 Macs

Phosphene 3.2.4 fixes Ideogram 4 for slower Apple GPUs. It now runs on M1 and M2 Macs, not just the fast ones...

Higgs Audio Studio

@nerual_dreming19d ago

new tts here!

Underfit

@cocktailpeanut20d ago

Underfit: Stable Audio 3 LoRA Training Studio

Underfit is for musicians, sound designers, sample makers, and audio experimenters who want Stable Audio 3 to...

Type:api

Platform:All

GPU:All

Recommended Latest Check-ins

Sort:Latest

Gymnast Photo Sorter

arnold2006/gymnast-photo-sorterv7.0updated 1mo ago

Sort competition photos into folders by gymnast using AI face recognition.

1 check-inNVIDIAAMDApple

Fooocus

cocktailpeanutlabs/fooocusv3.7updated 1mo ago

Minimal Stable Diffusion UI

#ai #image-generation

15 check-insNVIDIAAMDApple

diffusers-image-fill

pinokiofactory/diffusers-image-fillv3.7updated 1mo ago

Remove objects from an image https://huggingface.co/spaces/OzzyGT/diffusers-image-fill

#ai #image-edit

0 check-insNVIDIAAMDApple

pyramidflow

pinokiofactory/pyramidflowv3.7updated 1mo ago

Pyramd Flow Video Generation AI (text-to-video & image-to-video) https://github.com/jy0205/Pyramid-Flow

#video-generation #ai

3 check-insNVIDIAAMDApple

zonos

pinokiofactory/zonosv3.7updated 1mo ago

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS providers. https://github.com/Zyphra/Zonos

#ai #tts

6 check-insNVIDIAAMDApple

Dia

pinokiofactory/diav3.7updated 1mo ago

Dia is a 1.6B parameter text to speech model created by Nari Labs. Dia directly generates highly realistic dialogue from a transcript. You can condition the output on audio, enabling emotion and tone control. The model can also produce nonverbal communications like laughter, coughing, clearing throat, etc. https://github.com/nari-labs/dia

#ai #tts

0 check-insNVIDIAAMDApple

Euraika Avatar Studio

Euraika-Labs/duix-avatar-pinokiov7.0updated 1mo ago

Local-first AI avatar video studio powered by duixcom/Duix-Avatar, Docker, and a consent-aware browser studio.

0 check-insNVIDIAAMDApple

ComfyComfyUI

drago87/ComfyComfyUIv7.0updated 1mo ago

A web control panel for ComfyUI

@drago870 check-insNVIDIAAMDApple

Whisper-WebUI

6Morpheus6/whisper-webuiv3.7updated 1mo ago

A Web UI for easy subtitle using whisper model.

@morpheus

1 check-inNVIDIAAMDApple

Applio

pinokiofactory/appliov3.7updated 1mo ago

A simple, high-quality voice conversion tool focused on ease of use and performance.

#ai #audio #voice-clone

7 check-insNVIDIAAMDApple

e2-f5-tts

pinokiofactory/e2-f5-ttsv3.7updated 1mo ago

F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching https://huggingface.co/spaces/mrfakename/E2-F5-TTS

#tts #voice-clone #ai

14 check-insNVIDIAAMDApple

IndexTTS-2

6Morpheus6/IndexTTS2v3.7updated 1mo ago

Emotionally Expressive and Duration-Controlled Auto-Regressive Zero-Shot Text-to-Speech application

@morpheus

1 check-inNVIDIAAMDApple

Sam3D

6Morpheus6/Sam3D-bodyv3.7updated 1mo ago

Create 3D Meshes of Body Poses from Images.

#3d

@morpheus

1 check-inNVIDIAAMDApple

Lens

tehmod/lens-pinokiov7.0updated 1mo ago

Unofficial Pinokio launcher for Microsoft Lens text-to-image inference. Tested on Linux with an RTX 5090.

1 check-inNVIDIAAMDApple

T2I-L2P

tehmod/t2i-l2p-pinokiov7.0updated 1mo ago

L2P pixel-space text-to-image generation demo

1 check-inNVIDIAAMDApple

SRT 字幕校正（LM Studio）

vincentchiou/srt-correctionv2.0updated 1mo ago

使用本地 LM Studio AI 免費校正 ASR 課程字幕，支援 PDF 參考資料，不需 API Key

0 check-insNVIDIAAMDApple

X-Voice

6Morpheus6/X-Voicev5.0updated 1mo ago

X-Voice is a multilingual text-to-speech system that enables one speaker to speak 27 languages.

@morpheus

2 check-insNVIDIAAMDApple

Kokoro-FastAPI

6Morpheus6/Kokoro-FastAPIv3.7updated 1mo ago

A FastAPI wrapper for KokoroTTS. Integrates with Open-WebUI and other API-driven AI applications.

@morpheus

1 check-inNVIDIAAMDApple

OpenClaw (aka ClawdBot)

stoutimon/stoutimon-openclaw.pinokiov1.0.1updated 1mo ago

The AI that actually does things https://openclaw.ai

1 check-inNVIDIAAMDApple

3D Gen Studio

hoodtronik/3DGenStudio-pinokiov7.0updated 1mo ago

Local web UI for orchestrating 3D generation pipelines via ComfyUI / Tripo / Tencent. https://github.com/visualbruno/3DGenStudio

@hoodtronik0 check-insNVIDIAAMDApple

WantedFeed

Most wanted

Follow to get notified when a launcher drops.

#1pinokio

AI Browser

#2OpenMontage

World's first open-source, agentic video production system. 12 pipelines, 52 tools, 500+ agent skills. Turn your AI coding assistant into a full video production studio.

#3Local-AI-Image-Generator

A fully self-contained, offline AI image generation studio for Windows. Runs Stable Diffusion (Safetensors/GGUF) locally with zero manual setup. Auto-configures CUDA for Nvidia GPUs and Vulkan for AMD/Intel Arc cards. Zero system-wide dependencies required.

#4MoneyPrinterTurbo

利用AI大模型，一键生成高清短视频 Generate short videos with one click using AI LLM.

#5theDAW

All-in-one AI music studio on Stable Audio 3 and a CUDA port of Magenta RealTime. It generates audio from text, separates stems with Demucs, transcribes to MIDI and notation, edits a multitrack timeline with a real-time Web Audio FX rack and automation, masters, DJs with stem decks, runs a live VJ engine, and plays from a Quest by hand over ADB.

#6Memorwise

A local, open-source alternative to NotebookLM. Chat with your documents using any LLM.

#7diffusiongemma-lab

Local web UI for the DiffusionGemma diffusion LLM — watch answers crystallize out of noise

#8openmed

open-source healthcare ai

#9GitHub - rsxdalv/TTS-WebUI: A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!

A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, ...

#10meshflow

Repository for the CVPR 2026 paper MeshFlow Efficient Artistic Mesh Generation via MeshVAE and Flow-based Diffusion Transformer by Weiyu Li, Antoine Toisoul, Tom Monnier, Roman Shapovalov, Rakesh Ranjan, Ping Tan and Andrea Vedaldi.

Global feed

Launcher updates

Store