Pinokio

Launcher updates

Audiochunker

@manatheturipa 3d ago

AudioChunker

Audiochunker: slice any audio into perfectly timed clips, instantly. Audiochunker is a lightweight Pinokio ap...

NEXUS OS

@ramshi 9d ago

Free AI Agents, Forever

Includes a feature called Router, which lets you manually connect and plug in free models across many provider...

Phosphene

@bizarro19d ago

Phosphene 3.2.4 - Ideogram 4 now runs on M1 and M2 Macs

Phosphene 3.2.4 fixes Ideogram 4 for slower Apple GPUs. It now runs on M1 and M2 Macs, not just the fast ones...

Higgs Audio Studio

@nerual_dreming 19d ago

new tts here!

Underfit

@cocktailpeanut 20d ago

Underfit: Stable Audio 3 LoRA Training Studio

Underfit is for musicians, sound designers, sample makers, and audio experimenters who want Stable Audio 3 to...

Recommended Latest Check-ins

KittenTTS 😻

PierrunoYT/KittenTTS-Pinokio v7.0 updated 7h ago

Ultra-lightweight text-to-speech (15M-80M params) — CPU optimized, 8 voices, ONNX-powered

@pierrunoyt 2 check-ins NVIDIA AMD Apple

MOSS-TTS

PierrunoYT/MossTTS-Pinokio v5.0 updated 12h ago

All-in-one Gradio UI for the MOSS-TTS Family: voice cloning, dialogue generation, voice design from text, and sound effects.

@pierrunoyt

3 check-ins NVIDIA AMD Apple

Z-Fusion

ai-anchorite/Z-Fusion v3.7 updated 15h ago

Z-Image, Flux2 Klein, & SeedVR2 with a Gradio UI. Uses a built-in ComfyUI backend for speed and efficiency! [8GB+VRAM, 16GB+ RAM]

#image-generation

@anchorite

22 check-ins NVIDIA AMD Apple

Image to Prompt

cocktailpeanut/image-to-prompt v7.0 updated 1d ago

Generate editable Ideogram JSON prompts from uploaded images.

#ai #prompt-helper #prompting

@cocktailpeanut

6 check-ins NVIDIA AMD Apple

OpenVoiceUI

MCERQUA/OpenVoiceUI v3.7 updated 1d ago

AI Voice Assistant — voice conversations, animated face, canvas, music generation, and more.

@metamike 4 check-ins NVIDIA AMD Apple

FaceFusion 3.6.1

facefusion/facefusion-pinokio v5.0 updated 2d ago

Industry leading face manipulation platform

#faceswap #facefusion #face #ff #1 ##faceswap-#ai-#video #ai #video

113 check-ins NVIDIA AMD Apple

SearXNG

cocktailpeanut/searxng.pinokio v7.0 updated 2d ago

A privacy-respecting metasearch engine that runs locally.

#search #searchengine

@cocktailpeanut

4 check-ins NVIDIA AMD Apple

Alexandria

Finrandojin/alexandria-audiobook v5.0 updated 3d ago

A multi-voice AI audiobook generator built on Qwen3-TTS — annotate scripts with an LLM, assign unique voices to each character, per-line style instructions for delivery, clone voices from reference audio, design new voices from text descriptions, train custom voices with LoRA fine-tuning, and export to MP3 or Audacity multi-track projects

#audiobook #text-to-audiobook #audio-generation #tts

@finrandojin

6 check-ins NVIDIA AMD Apple

Ultimate-TTS-Studio

pinokiofactory/Ultimate-TTS-Studio v3.7 updated 4d ago

Kokoro, KittenTTS, Higgs audio, Chatterbox/Multi, Fish-Speech, F5 & index-tts & indextts2, VoxCPM and VibeVoice in one app

#tts #ai #gradio #voice

38 check-ins NVIDIA AMD Apple

Wan2GP

6Morpheus6/wan2gp v3.7 updated 5d ago

Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP

#video-generation #wan2gp

@morpheus

98 check-ins NVIDIA AMD Apple

Wan2GP - AMD

6Morpheus6/wan2gp-amd v3.7 updated 5d ago

[AMD ONLY] Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video, Flux and more. (On Windows supported by all dedicated AMD GPUs from RDNA 2 - RDNA 4)

#ai #wan #wan2gp

@morpheus

53 check-ins NVIDIA AMD Apple

StableDAW

cocktailpeanut/stabledaw.pinokio v7.0 updated 6d ago

Browser-based AI audio DAW for Stable Audio 3 with text-to-audio, inpainting, LoRA training, FFmpeg effects, waveform editing, sequencer, piano roll, and persistent library. https://github.com/gantasmo/stabledaw

#audio #music #daw #ai #audio-generation #stableaudio #stableaudio3

@cocktailpeanut

18 check-ins NVIDIA AMD Apple

ZastTranslate — Beta 1.06

zast57/ZastTranslate v5.0 updated 6d ago

Video translation & dubbing with voice cloning — 100% local, zero API. Supports 30 languages (VoxCPM 2) with per-language CPS calibration, automatic transcription, translation, and AI dubbing.

4 check-ins NVIDIA AMD Apple

ComfyUI

pinokiofactory/comfy v3.7 updated 7d ago

The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface. https://github.com/comfyanonymous/ComfyUI

#comfyui #ai #video #image #image-generation #audio #comfy #video-generation #node-interface

83 check-ins NVIDIA AMD Apple

Wan2GP

pinokiofactory/wan v3.7 updated 7d ago

Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP

#video-generation #wan #wan2gp #video #image #ai #1 #image-generation #gradio

229 check-ins NVIDIA AMD Apple

Qwen3-TTS MLX WebUI Enhanced

Blizaine/Qwen3-TTS-MLX-WebUI-Enhanced v5.0 updated 9d ago

High-quality text-to-speech with Beautiful Web UI & API, optimized for Apple Silicon using MLX. Features include Custom Voice (preset speakers), Voice Design (natural language), and Voice Cloning. With enhanced features for saving custom voices and long-form / endless TTS streaming.

#mlx #qwen #tts #ai #mac

@blizaine

63 check-ins NVIDIA AMD Apple

SongGeneration Studio

BazedFrog/SongGeneration-Studio v3.7 updated 10d ago

AI Song Generation with Full Style Control - Generate complete songs with lyrics, vocals, and instrumental tracks using Tencent AI Lab's SongGeneration (LeVo) model. [NVIDIA ONLY]

#music #song #ai

20 check-ins NVIDIA AMD Apple

AI Video Clipper & LoRA Captioner

manat0912/AI-Video-Clipper-LoRA-Pinokio v5.0 updated 10d ago

Automatically clip videos and generate captions for LoRA training using advanced vision models like Gemma-3, Qwen3-VL, and Qwen2-VL.

@manatheturipa 1 check-in NVIDIA AMD Apple

MuseTalk

manat0912/TalkingMuse v3.7 updated 10d ago

MuseTalk is a cutting-edge video-to-video (V2V) lip-sync solution engineered to deliver highly accurate and natural mouth movements synchronized to audio input. Precision LipSync: Realistic and seamless synchronization of speech audio to facial movements. Efficiently designed to run on 8–12 GB VRAM,

@manatheturipa

2 check-ins NVIDIA AMD Apple

Hermes Mod

cocktailpeanut/hermes-mod v6.0 updated 12d ago

A full Hermes skin manager for browsing, editing, saving, and activating CLI skins directly from Pinokio.

#agent #utility #hermes #hermes-agent #mod #nous-research #nousresearch

@cocktailpeanut

10 check-ins NVIDIA AMD Apple

Most wanted

Follow to get notified when a launcher drops.

#1 pinokio

AI Browser

#2 OpenMontage

World's first open-source, agentic video production system. 12 pipelines, 52 tools, 500+ agent skills. Turn your AI coding assistant into a full video production studio.

#3 Local-AI-Image-Generator

A fully self-contained, offline AI image generation studio for Windows. Runs Stable Diffusion (Safetensors/GGUF) locally with zero manual setup. Auto-configures CUDA for Nvidia GPUs and Vulkan for AMD/Intel Arc cards. Zero system-wide dependencies required.

#4 MoneyPrinterTurbo

Generate high-definition short videos with one click using AI large language models.

#5 Memorwise

A local, open-source alternative to NotebookLM. Chat with your documents using any LLM.

#6 theDAW

All-in-one AI music studio on Stable Audio 3 and a CUDA port of Magenta RealTime. It generates audio from text, separates stems with Demucs, transcribes to MIDI and notation, edits a multitrack timeline with a real-time Web Audio FX rack and automation, masters, DJs with stem decks, runs a live VJ engine, and plays from a Quest by hand over ADB.

#7 diffusiongemma-lab

Local web UI for the DiffusionGemma diffusion LLM — watch answers crystallize out of noise

#8 openmed

Open-source healthcare AI

#9 meshflow

Repository for the CVPR 2026 paper "MeshFlow: Efficient Artistic Mesh Generation via MeshVAE and Flow-based Diffusion Transformer" by Weiyu Li, Antoine Toisoul, Tom Monnier, Roman Shapovalov, Rakesh Ranjan, Ping Tan, and Andrea Vedaldi.

#10 GitHub - rsxdalv/TTS-WebUI

A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!
