Pinokio

Launcher updates

Audiochunker

@manatheturipa3d ago

AudioChunker

Audiochunker — slice any audio into perfectly timed clips, instantly Audiochunker is a lightweight Pinokio ap...

NEXUS OS

@ramshi10d ago

Free AI Agents, Forever

Included a feature called Router, which you can manually connect and plug in free models across many provider...

Phosphene

@bizarro19d ago

Phosphene 3.2.4 - Ideogram 4 now runs on M1 and M2 Macs

Phosphene 3.2.4 fixes Ideogram 4 for slower Apple GPUs. It now runs on M1 and M2 Macs, not just the fast ones...

Higgs Audio Studio

@nerual_dreming19d ago

new tts here!

Underfit

@cocktailpeanut20d ago

Underfit: Stable Audio 3 LoRA Training Studio

Underfit is for musicians, sound designers, sample makers, and audio experimenters who want Stable Audio 3 to...

Type:api

Platform:All

GPU:AMD

Recommended Latest Check-ins

Sort:Latest

VoxCPM 2

PierrunoYT/VoxCPM-2-Pinokiov5.0updated 8h ago

Tokenizer-free TTS for context-aware speech, voice cloning, and voice design. 2B params, 48kHz, 30 languages (Gradio UI).

@pierrunoyt 2 check-insNVIDIAAMDApple

Z-Fusion

ai-anchorite/Z-Fusionv3.7updated 16h ago

Z-Image, Flux2 Klein, & SeedVR2 with a Gradio UI. Uses a built-in ComfyUI backend for speed and efficiency! [8GB+VRAM, 16GB+ RAM]

#image-generation

@anchorite

22 check-insNVIDIAAMDApple

OmniVoice

PierrunoYT/OmniVoice-Pinokiov5.0updated 1d ago

Zero-shot multilingual TTS (600+ languages) with voice cloning and voice design — Gradio UI (app/app.py)

@pierrunoyt 5 check-insNVIDIAAMDApple

FaceFusion 3.6.1

facefusion/facefusion-pinokiov5.0updated 2d ago

Industry leading face manipulation platform

#faceswap #facefusion #face #ff #1 ##faceswap-#ai-#video #ai #video

113 check-insNVIDIAAMDApple

Alexandria

Finrandojin/alexandria-audiobookv5.0updated 3d ago

A multi-voice AI audiobook generator built on Qwen3-TTS — annotate scripts with an LLM, assign unique voices to each character, per-line style instructions for delivery, clone voices from reference audio, design new voices from text descriptions, train custom voices with LoRA fine-tuning, and export to MP3 or Audacity multi-track projects

#audiobook #text-to-audiobook #audio-generation #tts

@finrandojin

6 check-insNVIDIAAMDApple

Ultimate-TTS-Studio

pinokiofactory/Ultimate-TTS-Studiov3.7updated 4d ago

Kokoro, KittenTTS, Higgs audio, Chatterbox/Multi, Fish-Speech, F5 & index-tts & indextts2, VoxCPM and VibeVoice in one app

#tts #ai #gradio #voice

38 check-insNVIDIAAMDApple

Clarity Refiners UI

pinokiofactory/clarity-refiners-uiv3.7updated 5d ago

An enhanced local port of finegrain-image-enhancer powered by Refiners (https://huggingface.co/spaces/finegrain/finegrain-image-enhancer), which was adapted from philz1337x's Clarity Upscaler (https://github.com/philz1337x/clarity-upscaler)

#image #ai #image-edit #upscaler

5 check-insNVIDIAAMDApple

Wan2GP

6Morpheus6/wan2gpv3.7updated 5d ago

Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP

#video-generation #wan2gp

@morpheus

98 check-insNVIDIAAMDApple

Wan2GP - AMD

6Morpheus6/wan2gp-amdv3.7updated 5d ago

[AMD ONLY] Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video, Flux and more. (On Windows supported by all dedicated AMD GPUs from RDNA 2 - RDNA 4)

#ai #wan #wan2gp

@morpheus

53 check-insNVIDIAAMDApple

StableDAW

cocktailpeanut/stabledaw.pinokiov7.0updated 6d ago

Browser-based AI audio DAW for Stable Audio 3 with text-to-audio, inpainting, LoRA training, FFmpeg effects, waveform editing, sequencer, piano roll, and persistent library. https://github.com/gantasmo/stabledaw

#audio #music #daw #ai #audio-generation #stableaudio #stableaudio3

@cocktailpeanut

18 check-insNVIDIAAMDApple

Comfyui

pinokiofactory/comfyv3.7updated 7d ago

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. https://github.com/comfyanonymous/ComfyUI

#comfyui #ai #video #image #image-generation #audio #comfy #video-generation #node-interface

83 check-insNVIDIAAMDApple

Wan2GP

cocktailpeanutlabs/wanv3.7updated 7d ago

Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP

27 check-insNVIDIAAMDApple

Wan2GP

pinokiofactory/wanv3.7updated 7d ago

Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP

#video-generation #wan #wan2gp #video #image #ai #1 #image-generation #gradio

229 check-insNVIDIAAMDApple

LTX-Desktop-WanGP

hoodtronik/LTX-Desktop-WanGP-Pinokiov5.0updated 8d ago

Pinokio launcher for LTX-Desktop-WanGP (local video generation with WanGP backend)

@hoodtronik

3 check-insNVIDIAAMDApple

Fooocus2026

mikecastrodemaria/Fooocus2026-pinokiov3.6updated 8d ago

A personal fork of lllyasviel/Fooocus v2.5.5 with quality-of-life features: Save Preset, CivitAI Model Settings, LoRA trigger words, Embeddings panel, Wildcards editor, Vary-with-aspect-ratio, Custom Resolution, Asset Browser, Restart UI button.

@supersoniquestudio

4 check-insNVIDIAAMDApple

Qwen3-TTS MLX WebUI Enhanced

Blizaine/Qwen3-TTS-MLX-WebUI-Enhancedv5.0updated 9d ago

High-quality text-to-speech with Beautiful Web UI & API, optimized for Apple Silicon using MLX. Features include Custom Voice (preset speakers), Voice Design (natural language), and Voice Cloning. With enhanced features for saving custom voices and long-form / endless TTS streaming.

#mlx #qwen #tts #ai #mac

@blizaine

63 check-insNVIDIAAMDApple

SongGeneration Studio

BazedFrog/SongGeneration-Studiov3.7updated 10d ago

AI Song Generation with Full Style Control - Generate complete songs with lyrics, vocals, and instrumental tracks using Tencent AI Lab's SongGeneration (LeVo) model. [NVIDIA ONLY]

#music #song #ai

20 check-insNVIDIAAMDApple

RMBG-2-Studio

pinokiofactory/RMBG-2-Studiov3.7updated 11d ago

Enhanced background remove and replace app built around BRIA-RMBG-2.0 https://huggingface.co/briaai/RMBG-2.0

#ai #image-edit #remove-background

4 check-insNVIDIAAMDApple

roop-unleashed-wip

Adutchguy/roop-unleashed-wipv3.7updated 15d ago

Swap faces in photos and videos in seconds — no training required. Powered by InsightFace and ONNX, with optional TensorRT acceleration, multi-face targeting, enhancement pipelines, and a clean one-click interface.

@adutchguy

7 check-insNVIDIAAMDApple

AudioGradio

cocktailpeanut/audiogradio.pinokioupdated 15d ago

One click installer for AudioCraft MusicGen and AudioGen Gradio UI (Requires at least Pinokio v0.0.56)

@cocktailpeanut 6 check-insNVIDIAAMDApple

WantedFeed

Most wanted

Follow to get notified when a launcher drops.

#1pinokio

AI Browser

#2OpenMontage

World's first open-source, agentic video production system. 12 pipelines, 52 tools, 500+ agent skills. Turn your AI coding assistant into a full video production studio.

#3Local-AI-Image-Generator

A fully self-contained, offline AI image generation studio for Windows. Runs Stable Diffusion (Safetensors/GGUF) locally with zero manual setup. Auto-configures CUDA for Nvidia GPUs and Vulkan for AMD/Intel Arc cards. Zero system-wide dependencies required.

#4MoneyPrinterTurbo

利用AI大模型，一键生成高清短视频 Generate short videos with one click using AI LLM.

#5theDAW

All-in-one AI music studio on Stable Audio 3 and a CUDA port of Magenta RealTime. It generates audio from text, separates stems with Demucs, transcribes to MIDI and notation, edits a multitrack timeline with a real-time Web Audio FX rack and automation, masters, DJs with stem decks, runs a live VJ engine, and plays from a Quest by hand over ADB.

#6Memorwise

A local, open-source alternative to NotebookLM. Chat with your documents using any LLM.

#7diffusiongemma-lab

Local web UI for the DiffusionGemma diffusion LLM — watch answers crystallize out of noise

#8openmed

open-source healthcare ai

#9GitHub - rsxdalv/TTS-WebUI: A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!

A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, ...

#10meshflow

Repository for the CVPR 2026 paper MeshFlow Efficient Artistic Mesh Generation via MeshVAE and Flow-based Diffusion Transformer by Weiyu Li, Antoine Toisoul, Tom Monnier, Roman Shapovalov, Rakesh Ranjan, Ping Tan and Andrea Vedaldi.

Global feed

Launcher updates

Store