Pinokio

Launcher updates

Audiochunker

@manatheturipa3d ago

AudioChunker

Audiochunker — slice any audio into perfectly timed clips, instantly Audiochunker is a lightweight Pinokio ap...

NEXUS OS

@ramshi10d ago

Free AI Agents, Forever

Included a feature called Router, which you can manually connect and plug in free models across many provider...

Phosphene

@bizarro19d ago

Phosphene 3.2.4 - Ideogram 4 now runs on M1 and M2 Macs

Phosphene 3.2.4 fixes Ideogram 4 for slower Apple GPUs. It now runs on M1 and M2 Macs, not just the fast ones...

Higgs Audio Studio

@nerual_dreming19d ago

new tts here!

Underfit

@cocktailpeanut20d ago

Underfit: Stable Audio 3 LoRA Training Studio

Underfit is for musicians, sound designers, sample makers, and audio experimenters who want Stable Audio 3 to...

Type:All

Platform:Windows

GPU:All

Recommended Latest Check-ins

Sort:Latest

Forge Neo

6Morpheus6/forge-neov2.0updated 3h ago

[NVIDIA ONLY] Stable Diffusion WebUI Forge supporting Flux, Qwen, wan, nunchaku and more in a lightweight WebUI. https://github.com/Haoming02/sd-webui-forge-classic/tree/neo

@morpheus

12 check-insNVIDIAAMDApple

PocketTTS

PierrunoYT/pocket-tts-pinokiov5.0updated 8h ago

Lightweight CPU text-to-speech with preset voices and optional Hugging Face-authenticated voice cloning.

@pierrunoyt1 check-inNVIDIAAMDApple

LuxTTS 🎙️

PierrunoYT/LuxTTS-Pinokiov7.0updated 8h ago

High-quality rapid TTS voice cloning model (150x+ realtime) — 48kHz speech, voice cloning

@pierrunoyt1 check-inNVIDIAAMDApple

OrpheusTTS

PierrunoYT/OrpheusTTS-Pinokiov7.0updated 8h ago

Standalone Text-to-Speech using Orpheus TTS with a Gradio UI

@pierrunoyt1 check-inNVIDIAAMDApple

Z-Image-Turbo

PierrunoYT/Z-Image-Pinokiov5.0updated 8h ago

⚡️ Efficient 6B parameter image generation model with sub-second inference. Generate high-quality, photorealistic images with only 8 inference steps. Features bilingual text rendering (Chinese & English) and Single-Stream Diffusion Transformer architecture.

#image-generation

@pierrunoyt 2 check-insNVIDIAAMDApple

VoxCPM 2

PierrunoYT/VoxCPM-2-Pinokiov5.0updated 8h ago

Tokenizer-free TTS for context-aware speech, voice cloning, and voice design. 2B params, 48kHz, 30 languages (Gradio UI).

@pierrunoyt 2 check-insNVIDIAAMDApple

KittenTTS 😻

PierrunoYT/KittenTTS-Pinokiov7.0updated 9h ago

Ultra-lightweight text-to-speech (15M-80M params) — CPU optimized, 8 voices, ONNX-powered

@pierrunoyt 2 check-insNVIDIAAMDApple

Soprano TTS

PierrunoYT/soprano-tts-pinokiov5.0updated 9h ago

Instant, Ultra-Realistic Text-to-Speech

@pierrunoyt1 check-inNVIDIAAMDApple

PRX-1024 Text-to-Image

PierrunoYT/Photoroom-PRX-Pinokiov5.0updated 9h ago

Gradio web interface for Photoroom's PRX-1024-t2i-beta text-to-image model

@pierrunoyt 2 check-insNVIDIAAMDApple

TranslateGemma

PierrunoYT/TranslateGemma-Pinokiov5.0updated 9h ago

🌍 TranslateGemma - Google's open-source multilingual translation AI. Translate text across 55+ languages and extract/translate text from images. Powered by Gemma 3 architecture.

@pierrunoyt1 check-inNVIDIAAMDApple

Higgs Audio V2 Enhanced

PierrunoYT/Higgs-Audio-V2-Pinokiov5.0updated 9h ago

Advanced text-to-speech with voice cloning, multi-speaker support, and background music generation using Higgs Audio V2

@pierrunoyt1 check-inNVIDIAAMDApple

SmolLM3-3B Chatbot

PierrunoYT/SmolLM3-3B-Pinokiov5.0updated 13h ago

Advanced 3B parameter language model with Gradio web interface, GPU acceleration, and complete privacy

@pierrunoyt 2 check-insNVIDIAAMDApple

OmniVoice Studio

PierrunoYT/OmniVoice-Studio-Pinokiov7.0updated 13h ago

The open-source ElevenLabs alternative. Local voice cloning, video dubbing, and real-time dictation — 646 languages, no API keys.

@pierrunoyt

1 check-inNVIDIAAMDApple

PRX Pixel

PierrunoYT/PRX-Pixel-Pinokiov5.0updated 13h ago

Pixel-space PRX text-to-image pipeline (~7B params, Qwen3-VL text encoder, no VAE)

@pierrunoyt1 check-inNVIDIAAMDApple

MOSS-TTS

PierrunoYT/MossTTS-Pinokiov5.0updated 13h ago

All-in-one Gradio UI for the MOSS-TTS Family: voice cloning, dialogue generation, voice design from text, and sound effects.

@pierrunoyt

3 check-insNVIDIAAMDApple

Sana

PierrunoYT/Sana-Pinokiov5.0updated 13h ago

Fast Image Generation with Sana Diffusion Model

@pierrunoyt

2 check-insNVIDIAAMDApple

Z-Fusion

ai-anchorite/Z-Fusionv3.7updated 17h ago

Z-Image, Flux2 Klein, & SeedVR2 with a Gradio UI. Uses a built-in ComfyUI backend for speed and efficiency! [8GB+VRAM, 16GB+ RAM]

#image-generation

@anchorite

22 check-insNVIDIAAMDApple

DramaBox

PierrunoYT/DramaBox-TTS-Pinokiov5.0updated 1d ago

Expressive TTS with voice cloning, prompt-driven speech synthesis built on LTX-2.3 by Resemble AI

#ai #tts #voice-clone

@pierrunoyt

5 check-insNVIDIAAMDApple

Cohere Transcribe

PierrunoYT/cohere-transcribe-pinokiov5.0updated 1d ago

State-of-the-art open-source speech recognition model supporting 14 languages. 2B parameter ASR model from Cohere Labs.

@pierrunoyt1 check-inNVIDIAAMDApple

OmniVoice

PierrunoYT/OmniVoice-Pinokiov5.0updated 1d ago

Zero-shot multilingual TTS (600+ languages) with voice cloning and voice design — Gradio UI (app/app.py)

@pierrunoyt 5 check-insNVIDIAAMDApple

WantedFeed

Most wanted

Follow to get notified when a launcher drops.

#1pinokio

AI Browser

#2OpenMontage

World's first open-source, agentic video production system. 12 pipelines, 52 tools, 500+ agent skills. Turn your AI coding assistant into a full video production studio.

#3Local-AI-Image-Generator

A fully self-contained, offline AI image generation studio for Windows. Runs Stable Diffusion (Safetensors/GGUF) locally with zero manual setup. Auto-configures CUDA for Nvidia GPUs and Vulkan for AMD/Intel Arc cards. Zero system-wide dependencies required.

#4MoneyPrinterTurbo

利用AI大模型，一键生成高清短视频 Generate short videos with one click using AI LLM.

#5theDAW

All-in-one AI music studio on Stable Audio 3 and a CUDA port of Magenta RealTime. It generates audio from text, separates stems with Demucs, transcribes to MIDI and notation, edits a multitrack timeline with a real-time Web Audio FX rack and automation, masters, DJs with stem decks, runs a live VJ engine, and plays from a Quest by hand over ADB.

#6Memorwise

A local, open-source alternative to NotebookLM. Chat with your documents using any LLM.

#7diffusiongemma-lab

Local web UI for the DiffusionGemma diffusion LLM — watch answers crystallize out of noise

#8openmed

open-source healthcare ai

#9GitHub - rsxdalv/TTS-WebUI: A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!

A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, ...

#10meshflow

Repository for the CVPR 2026 paper MeshFlow Efficient Artistic Mesh Generation via MeshVAE and Flow-based Diffusion Transformer by Weiyu Li, Antoine Toisoul, Tom Monnier, Roman Shapovalov, Rakesh Ranjan, Ping Tan and Andrea Vedaldi.

Global feed

Launcher updates

Store