Pinokio

Launcher updates

Audiochunker

@manatheturipa3d ago

AudioChunker

Audiochunker — slice any audio into perfectly timed clips, instantly Audiochunker is a lightweight Pinokio ap...

NEXUS OS

@ramshi10d ago

Free AI Agents, Forever

Included a feature called Router, which you can manually connect and plug in free models across many provider...

Phosphene

@bizarro19d ago

Phosphene 3.2.4 - Ideogram 4 now runs on M1 and M2 Macs

Phosphene 3.2.4 fixes Ideogram 4 for slower Apple GPUs. It now runs on M1 and M2 Macs, not just the fast ones...

Higgs Audio Studio

@nerual_dreming19d ago

new tts here!

Underfit

@cocktailpeanut20d ago

Underfit: Stable Audio 3 LoRA Training Studio

Underfit is for musicians, sound designers, sample makers, and audio experimenters who want Stable Audio 3 to...

Type:All

Platform:All

GPU:All

Recommended Latest Check-ins

Sort:Latest

Youtube2MP3

PierrunoYT/Youtube2MP3-Pinokiov5.0updated 11h ago

🎵 YouTube to MP3 downloader with a simple Gradio UI. Paste a YouTube link to download MP3. Requires ffmpeg installed on your system.

@pierrunoyt0 check-insNVIDIAAMDApple

Higgs Audio V2 Enhanced

PierrunoYT/Higgs-Audio-V2-Pinokiov5.0updated 11h ago

Advanced text-to-speech with voice cloning, multi-speaker support, and background music generation using Higgs Audio V2

@pierrunoyt1 check-inNVIDIAAMDApple

LFM2.5 Reader + Q&A

PierrunoYT/LFM2.5-Pinokiov5.0updated 11h ago

Paste long text, clean it into readable sections, summarize each section, and ask questions in-browser with WebGPU. Choose between LFM2.5 230M, 350M, 1.2B-Instruct, and 1.2B-Thinking.

@pierrunoyt0 check-insNVIDIAAMDApple

ComfyUI-UniRig

pozzettiandrea/comfyui-unirigupdated 13h ago

ComfyUI wrapper for UniRig

0 check-insNVIDIAAMDApple

FrameCrop

arnold2006/framecropv4.0updated 14h ago

Batch-crop images to a chosen aspect ratio using a draggable/resizable crop overlay on each image thumbnail.

0 check-insNVIDIAAMDApple

SmolLM3-3B Chatbot

PierrunoYT/SmolLM3-3B-Pinokiov5.0updated 14h ago

Advanced 3B parameter language model with Gradio web interface, GPU acceleration, and complete privacy

@pierrunoyt 2 check-insNVIDIAAMDApple

dots.tts-base

PierrunoYT/dots.tts-Pinokiov5.0updated 14h ago

2B-parameter fully continuous, end-to-end autoregressive text-to-speech with zero-shot voice cloning. https://huggingface.co/rednote-hilab/dots.tts-base

@pierrunoyt1 check-inNVIDIAAMDApple

OmniVoice Studio

PierrunoYT/OmniVoice-Studio-Pinokiov7.0updated 14h ago

The open-source ElevenLabs alternative. Local voice cloning, video dubbing, and real-time dictation — 646 languages, no API keys.

@pierrunoyt

1 check-inNVIDIAAMDApple

PRX Pixel

PierrunoYT/PRX-Pixel-Pinokiov5.0updated 14h ago

Pixel-space PRX text-to-image pipeline (~7B params, Qwen3-VL text encoder, no VAE)

@pierrunoyt1 check-inNVIDIAAMDApple

MOSS-TTS

PierrunoYT/MossTTS-Pinokiov5.0updated 14h ago

All-in-one Gradio UI for the MOSS-TTS Family: voice cloning, dialogue generation, voice design from text, and sound effects.

@pierrunoyt

3 check-insNVIDIAAMDApple

Sana

PierrunoYT/Sana-Pinokiov5.0updated 14h ago

Fast Image Generation with Sana Diffusion Model

@pierrunoyt

2 check-insNVIDIAAMDApple

Studio Hub KH

theng12/studiohub-macv3.6updated 15h ago

Control plane for the KH Studio family — live health grid, unified model catalog, and unified-memory monitoring for Image/Music/Voice/Chat/Video Studio. One canonical API for clients like Story Studio KH.

0 check-insNVIDIAAMDApple

자막 학습 플레이어

passivejobs01/subtitle-playerv7.0updated 16h ago

유튜브·로컬 영상에 원어+한글 이중 자막을 만들어 외국어를 공부하는 로컬 플레이어 — 잇츠매거진

0 check-insNVIDIAAMDApple

Z-Fusion

ai-anchorite/Z-Fusionv3.7updated 18h ago

Z-Image, Flux2 Klein, & SeedVR2 with a Gradio UI. Uses a built-in ComfyUI backend for speed and efficiency! [8GB+VRAM, 16GB+ RAM]

#image-generation

@anchorite

22 check-insNVIDIAAMDApple

Adoption & Child Care

Brianmwanza-bit/adoption-and-child-care-mainv7.0updated 1d ago

Android app for adoption and child-care management with AI-assisted coding.

0 check-insNVIDIAAMDApple

OpenPrompt

manat0912/openpromptv5.0updated 1d ago

AI-powered prompt helper for image & video generation. Supports local LLMs (Ollama, LM Studio) and cloud APIs (Gemini, DeepSeek, OpenRouter).

@manatheturipa0 check-insNVIDIAAMDApple

DramaBox

PierrunoYT/DramaBox-TTS-Pinokiov5.0updated 1d ago

Expressive TTS with voice cloning, prompt-driven speech synthesis built on LTX-2.3 by Resemble AI

#ai #tts #voice-clone

@pierrunoyt

5 check-insNVIDIAAMDApple

Cohere Transcribe

PierrunoYT/cohere-transcribe-pinokiov5.0updated 1d ago

State-of-the-art open-source speech recognition model supporting 14 languages. 2B parameter ASR model from Cohere Labs.

@pierrunoyt1 check-inNVIDIAAMDApple

ChatterBox

PierrunoYT/chatterbox-tts-pinokiov5.0updated 1d ago

AI-Powered Text-to-Speech with Voice Cloning using Chatterbox TTS and a Gradio interface. Includes Turbo, Multilingual (23+ languages), and Original models. Runs locally; CUDA GPU recommended, CPU supported. Windows, Mac, and Linux.

@pierrunoyt0 check-insNVIDIAAMDApple

Audio Flamingo 3

PierrunoYT/Audio-Flamingo-3-Pinokiov7.0updated 1d ago

NVIDIA's Audio Flamingo 3 - Large Audio-Language Model for speech, sound, and music understanding with Gradio web interface

@pierrunoyt0 check-insNVIDIAAMDApple

WantedFeed

Most wanted

Follow to get notified when a launcher drops.

#1pinokio

AI Browser

#2OpenMontage

World's first open-source, agentic video production system. 12 pipelines, 52 tools, 500+ agent skills. Turn your AI coding assistant into a full video production studio.

#3Local-AI-Image-Generator

A fully self-contained, offline AI image generation studio for Windows. Runs Stable Diffusion (Safetensors/GGUF) locally with zero manual setup. Auto-configures CUDA for Nvidia GPUs and Vulkan for AMD/Intel Arc cards. Zero system-wide dependencies required.

#4MoneyPrinterTurbo

利用AI大模型，一键生成高清短视频 Generate short videos with one click using AI LLM.

#5theDAW

All-in-one AI music studio on Stable Audio 3 and a CUDA port of Magenta RealTime. It generates audio from text, separates stems with Demucs, transcribes to MIDI and notation, edits a multitrack timeline with a real-time Web Audio FX rack and automation, masters, DJs with stem decks, runs a live VJ engine, and plays from a Quest by hand over ADB.

#6Memorwise

A local, open-source alternative to NotebookLM. Chat with your documents using any LLM.

#7diffusiongemma-lab

Local web UI for the DiffusionGemma diffusion LLM — watch answers crystallize out of noise

#8openmed

open-source healthcare ai

#9GitHub - rsxdalv/TTS-WebUI: A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!

A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, ...

#10meshflow

Repository for the CVPR 2026 paper MeshFlow Efficient Artistic Mesh Generation via MeshVAE and Flow-based Diffusion Transformer by Weiyu Li, Antoine Toisoul, Tom Monnier, Roman Shapovalov, Rakesh Ranjan, Ping Tan and Andrea Vedaldi.

Global feed

Launcher updates

Store