Pinokio

Launcher updates

Audiochunker

@manatheturipa3d ago

AudioChunker

Audiochunker — slice any audio into perfectly timed clips, instantly Audiochunker is a lightweight Pinokio ap...

NEXUS OS

@ramshi9d ago

Free AI Agents, Forever

Included a feature called Router, which you can manually connect and plug in free models across many provider...

Phosphene

@bizarro19d ago

Phosphene 3.2.4 - Ideogram 4 now runs on M1 and M2 Macs

Phosphene 3.2.4 fixes Ideogram 4 for slower Apple GPUs. It now runs on M1 and M2 Macs, not just the fast ones...

Higgs Audio Studio

@nerual_dreming19d ago

new tts here!

Underfit

@cocktailpeanut20d ago

Underfit: Stable Audio 3 LoRA Training Studio

Underfit is for musicians, sound designers, sample makers, and audio experimenters who want Stable Audio 3 to...

Type:api

Platform:All

GPU:All

Recommended Latest Check-ins

Sort:Latest

Qwen3-TTS MLX WebUI Enhanced

Blizaine/Qwen3-TTS-MLX-WebUI-Enhancedv5.0updated 9d ago

High-quality text-to-speech with Beautiful Web UI & API, optimized for Apple Silicon using MLX. Features include Custom Voice (preset speakers), Voice Design (natural language), and Voice Cloning. With enhanced features for saving custom voices and long-form / endless TTS streaming.

#mlx #qwen #tts #ai #mac

@blizaine

63 check-insNVIDIAAMDApple

IOPaint

Gan-Explore/IOPaint-pinokiov2.0updated 9d ago

AI-powered image inpainting. Remove objects, people, defects from images.

0 check-insNVIDIAAMDApple

Castwright

dudarenok-maker/Castwrightv1.0updated 9d ago

Any book, performed by a full cast — effortlessly.

0 check-insNVIDIAAMDApple

CCS — Pyme Ledger AI

vtomasv/pyme-ledger-ai.pinokiov1.5.0updated 9d ago

Clasificación inteligente de gastos con IA local para PyMEs — 100% offline | CCS

0 check-insNVIDIAAMDApple

Dub Studio

timoncool/dub-studio-pinokiov7.0updated 10d ago

Dub & translate any short video — locally, offline. Voice clone / per-speaker cast / voice packs, on-screen text localized in place, subtitle styling, blur-or-solid mask covers, funny re-dub. One process (FastAPI serves the React SPA), 6 UI languages.

@nerual_dreming

1 check-inNVIDIAAMDApple

SongGeneration Studio

BazedFrog/SongGeneration-Studiov3.7updated 10d ago

AI Song Generation with Full Style Control - Generate complete songs with lyrics, vocals, and instrumental tracks using Tencent AI Lab's SongGeneration (LeVo) model. [NVIDIA ONLY]

#music #song #ai

20 check-insNVIDIAAMDApple

AI Video Clipper & LoRA Captioner

manat0912/AI-Video-Clipper-LoRA-Pinokiov5.0updated 10d ago

Automatically clip videos and generate captions for LoRA training using advanced vision models like Gemma-3, Qwen3-VL, and Qwen2-VL.

@manatheturipa1 check-inNVIDIAAMDApple

MuseTalk

manat0912/TalkingMusev3.7updated 10d ago

MuseTalk is a cutting-edge video-to-video (V2V) lip-sync solution engineered to deliver highly accurate and natural mouth movements synchronized to audio input. Precision LipSync: Realistic and seamless synchronization of speech audio to facial movements. Efficiently designed to run on 8–12 GB VRAM,

@manatheturipa

2 check-insNVIDIAAMDApple

RMBG-2-Studio

pinokiofactory/RMBG-2-Studiov3.7updated 11d ago

Enhanced background remove and replace app built around BRIA-RMBG-2.0 https://huggingface.co/briaai/RMBG-2.0

#ai #image-edit #remove-background

4 check-insNVIDIAAMDApple

Higgs Audio Studio

timoncool/HiggsAudio-Studio-pinokiov7.0updated 11d ago

Higgs Audio v3 TTS + AI text director, voice cloning, podcast & audiobook (multi-speaker). 100+ languages, offline, NVIDIA GPU.

@nerual_dreming

4 check-insNVIDIAAMDApple

Alexandria

on22s/alexandria-audiobook2v5.0updated 11d ago

A multi-voice AI audiobook generator built on Qwen3-TTS — annotate scripts with an LLM, assign unique voices to each character, per-line style instructions for delivery, clone voices from reference audio, design new voices from text descriptions, train custom voices with LoRA fine-tuning, and export to MP3 or Audacity multi-track projects

0 check-insNVIDIAAMDApple

Orpheus-TTS-FastAPI

pinokiofactory/Orpheus-TTS-FastAPIv3.7updated 11d ago

Orpheus TTS is an open-source text-to-speech system built on the Llama-3b backbone. Orpheus demonstrates the emergent capabilities of using LLMs for speech synthesis https://github.com/canopyai/Orpheus-TTS

#ai #tts

0 check-insNVIDIAAMDApple

BASI WAN KENOBI

shitcoinsherpa/Basi-Wan-Kenobiv3.7updated 12d ago

[NVIDIA Only] Wan2.2 video Studio (text/image-to-video, restyle, keyframe, talking-character S2V) + LoRA Gym + MOVA joint audio+video. From 12GB VRAM.

0 check-insNVIDIAAMDApple

BenchClaw

Agnuxo1/benchclawv1.0updated 12d ago

P2PCLAW Agent Benchmark — connect any LLM agent (Claude, GPT, Gemini, Qwen, Kimi, DeepSeek…) and get scored on 10 dimensions + Tribunal IQ. Dashboard runs locally on :8787, leaderboard at p2pclaw.com/app/benchmark.

0 check-insNVIDIAAMDApple

Hermes Mod

cocktailpeanut/hermes-modv6.0updated 12d ago

A full Hermes skin manager for browsing, editing, saving, and activating CLI skins directly from Pinokio.

#agent #utility #hermes #hermes-agent #mod #nous-research #nousresearch

@cocktailpeanut

10 check-insNVIDIAAMDApple

Game Creator

alvescrafter/game-creator.pinokioupdated 13d ago

Prompt Orchestrator that turns module-based game design (genre, mechanics, visuals, menus, audio) into a complete, playable HTML5 game generated by your chosen AI provider. Supports OpenAI, Gemini, Claude, Ollama, and LM Studio. Every game ships as a single self-contained HTML file.

@alvescrafter0 check-insNVIDIAAMDApple

ScribeTube

PierrunoYT/ScribeTubev5.0updated 14d ago

Download and transcribe many YouTube videos or whole playlists at once with faster-whisper. Outputs txt, srt, vtt, or json.

@pierrunoyt0 check-insNVIDIAAMDApple

Odysseus

cocktailpeanut/odysseus.pinokiov7.0updated 14d ago

Self-hosted AI workspace for local-first chat, agents, tools, memory, research, documents, email, and model endpoint management.

#agent #odysseus #ai #deepresearch

@cocktailpeanut

6 check-insNVIDIAAMDApple

Whisper-WebUI

pinokiofactory/whisper-webuiv3.7updated 14d ago

A Web UI for easy subtitle using whisper model.

#whisper #ai #gradio #tts

2 check-insNVIDIAAMDApple

DEMON

b2renger/DEMON-pinokiov7.0updated 14d ago

Diffusion Engine for Musical Orchestrated Noise — a real-time streaming diffusion engine for music generation, built on ACE-Step v1.5. Requires an NVIDIA GPU.

0 check-insNVIDIAAMDApple

WantedFeed

Most wanted

Follow to get notified when a launcher drops.

#1pinokio

AI Browser

#2OpenMontage

World's first open-source, agentic video production system. 12 pipelines, 52 tools, 500+ agent skills. Turn your AI coding assistant into a full video production studio.

#3Local-AI-Image-Generator

A fully self-contained, offline AI image generation studio for Windows. Runs Stable Diffusion (Safetensors/GGUF) locally with zero manual setup. Auto-configures CUDA for Nvidia GPUs and Vulkan for AMD/Intel Arc cards. Zero system-wide dependencies required.

#4MoneyPrinterTurbo

利用AI大模型，一键生成高清短视频 Generate short videos with one click using AI LLM.

#5Memorwise

A local, open-source alternative to NotebookLM. Chat with your documents using any LLM.

#6theDAW

All-in-one AI music studio on Stable Audio 3 and a CUDA port of Magenta RealTime. It generates audio from text, separates stems with Demucs, transcribes to MIDI and notation, edits a multitrack timeline with a real-time Web Audio FX rack and automation, masters, DJs with stem decks, runs a live VJ engine, and plays from a Quest by hand over ADB.

#7diffusiongemma-lab

Local web UI for the DiffusionGemma diffusion LLM — watch answers crystallize out of noise

#8openmed

open-source healthcare ai

#9meshflow

Repository for the CVPR 2026 paper MeshFlow Efficient Artistic Mesh Generation via MeshVAE and Flow-based Diffusion Transformer by Weiyu Li, Antoine Toisoul, Tom Monnier, Roman Shapovalov, Rakesh Ranjan, Ping Tan and Andrea Vedaldi.

#10GitHub - rsxdalv/TTS-WebUI: A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!

A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, ...

Global feed

Launcher updates

Store