Pinokio

Launcher updates

Audiochunker

@manatheturipa3d ago

AudioChunker

Audiochunker — slice any audio into perfectly timed clips, instantly Audiochunker is a lightweight Pinokio ap...

NEXUS OS

@ramshi10d ago

Free AI Agents, Forever

Included a feature called Router, which you can manually connect and plug in free models across many provider...

Phosphene

@bizarro19d ago

Phosphene 3.2.4 - Ideogram 4 now runs on M1 and M2 Macs

Phosphene 3.2.4 fixes Ideogram 4 for slower Apple GPUs. It now runs on M1 and M2 Macs, not just the fast ones...

Higgs Audio Studio

@nerual_dreming19d ago

new tts here!

Underfit

@cocktailpeanut20d ago

Underfit: Stable Audio 3 LoRA Training Studio

Underfit is for musicians, sound designers, sample makers, and audio experimenters who want Stable Audio 3 to...

Type:api

Platform:All

GPU:All

Recommended Latest Check-ins

Sort:Latest

SmolLM3-3B Chatbot

PierrunoYT/SmolLM3-3B-Pinokiov5.0updated 12h ago

Advanced 3B parameter language model with Gradio web interface, GPU acceleration, and complete privacy

@pierrunoyt 2 check-insNVIDIAAMDApple

dots.tts-base

PierrunoYT/dots.tts-Pinokiov5.0updated 13h ago

2B-parameter fully continuous, end-to-end autoregressive text-to-speech with zero-shot voice cloning. https://huggingface.co/rednote-hilab/dots.tts-base

@pierrunoyt1 check-inNVIDIAAMDApple

OmniVoice Studio

PierrunoYT/OmniVoice-Studio-Pinokiov7.0updated 13h ago

The open-source ElevenLabs alternative. Local voice cloning, video dubbing, and real-time dictation — 646 languages, no API keys.

@pierrunoyt

1 check-inNVIDIAAMDApple

PRX Pixel

PierrunoYT/PRX-Pixel-Pinokiov5.0updated 13h ago

Pixel-space PRX text-to-image pipeline (~7B params, Qwen3-VL text encoder, no VAE)

@pierrunoyt1 check-inNVIDIAAMDApple

MOSS-TTS

PierrunoYT/MossTTS-Pinokiov5.0updated 13h ago

All-in-one Gradio UI for the MOSS-TTS Family: voice cloning, dialogue generation, voice design from text, and sound effects.

@pierrunoyt

3 check-insNVIDIAAMDApple

Sana

PierrunoYT/Sana-Pinokiov5.0updated 13h ago

Fast Image Generation with Sana Diffusion Model

@pierrunoyt

2 check-insNVIDIAAMDApple

Studio Hub KH

theng12/studiohub-macv3.6updated 13h ago

Control plane for the KH Studio family — live health grid, unified model catalog, and unified-memory monitoring for Image/Music/Voice/Chat/Video Studio. One canonical API for clients like Story Studio KH.

0 check-insNVIDIAAMDApple

자막 학습 플레이어

passivejobs01/subtitle-playerv7.0updated 15h ago

유튜브·로컬 영상에 원어+한글 이중 자막을 만들어 외국어를 공부하는 로컬 플레이어 — 잇츠매거진

0 check-insNVIDIAAMDApple

Z-Fusion

ai-anchorite/Z-Fusionv3.7updated 16h ago

Z-Image, Flux2 Klein, & SeedVR2 with a Gradio UI. Uses a built-in ComfyUI backend for speed and efficiency! [8GB+VRAM, 16GB+ RAM]

#image-generation

@anchorite

22 check-insNVIDIAAMDApple

Adoption & Child Care

Brianmwanza-bit/adoption-and-child-care-mainv7.0updated 1d ago

Android app for adoption and child-care management with AI-assisted coding.

0 check-insNVIDIAAMDApple

OpenPrompt

manat0912/openpromptv5.0updated 1d ago

AI-powered prompt helper for image & video generation. Supports local LLMs (Ollama, LM Studio) and cloud APIs (Gemini, DeepSeek, OpenRouter).

@manatheturipa0 check-insNVIDIAAMDApple

DramaBox

PierrunoYT/DramaBox-TTS-Pinokiov5.0updated 1d ago

Expressive TTS with voice cloning, prompt-driven speech synthesis built on LTX-2.3 by Resemble AI

#ai #tts #voice-clone

@pierrunoyt

5 check-insNVIDIAAMDApple

Cohere Transcribe

PierrunoYT/cohere-transcribe-pinokiov5.0updated 1d ago

State-of-the-art open-source speech recognition model supporting 14 languages. 2B parameter ASR model from Cohere Labs.

@pierrunoyt1 check-inNVIDIAAMDApple

ChatterBox

PierrunoYT/chatterbox-tts-pinokiov5.0updated 1d ago

AI-Powered Text-to-Speech with Voice Cloning using Chatterbox TTS and a Gradio interface. Includes Turbo, Multilingual (23+ languages), and Original models. Runs locally; CUDA GPU recommended, CPU supported. Windows, Mac, and Linux.

@pierrunoyt0 check-insNVIDIAAMDApple

Audio Flamingo 3

PierrunoYT/Audio-Flamingo-3-Pinokiov7.0updated 1d ago

NVIDIA's Audio Flamingo 3 - Large Audio-Language Model for speech, sound, and music understanding with Gradio web interface

@pierrunoyt0 check-insNVIDIAAMDApple

MLX Media

CharafChnioune/AceJAM-Studiov7.0updated 1d ago

Create songs, albums and artwork locally on Apple MLX with ACE-Step v1.5, MFLUX, local agents and LoRA training.

0 check-insNVIDIAAMDApple

Higgs Audio v3 TTS

PierrunoYT/HiggsAudioV3-Pinokiov7.0updated 1d ago

Pinokio launcher for Higgs Audio v3 TTS with Gradio UI, SGLang-Omni backend, and automatic model download.

@pierrunoyt0 check-insNVIDIAAMDApple

OmniVoice

PierrunoYT/OmniVoice-Pinokiov5.0updated 1d ago

Zero-shot multilingual TTS (600+ languages) with voice cloning and voice design — Gradio UI (app/app.py)

@pierrunoyt 5 check-insNVIDIAAMDApple

Transcribr

PierrunoYT/Transcribr-Pinokiov5.0updated 1d ago

Bulk transcribe many YouTube videos, whole playlists, or your own uploaded audio/video files at once with faster-whisper. Outputs txt, srt, vtt, or json.

@pierrunoyt0 check-insNVIDIAAMDApple

PersonaPlex

PierrunoYT/PersonaPlex-Pinokiov5.0updated 1d ago

🗣️ PersonaPlex - NVIDIA's real-time speech-to-speech conversational AI model. Natural full-duplex conversations with customizable personas and voices.

@pierrunoyt 3 check-insNVIDIAAMDApple

WantedFeed

Most wanted

Follow to get notified when a launcher drops.

#1pinokio

AI Browser

#2OpenMontage

World's first open-source, agentic video production system. 12 pipelines, 52 tools, 500+ agent skills. Turn your AI coding assistant into a full video production studio.

#3Local-AI-Image-Generator

A fully self-contained, offline AI image generation studio for Windows. Runs Stable Diffusion (Safetensors/GGUF) locally with zero manual setup. Auto-configures CUDA for Nvidia GPUs and Vulkan for AMD/Intel Arc cards. Zero system-wide dependencies required.

#4MoneyPrinterTurbo

利用AI大模型，一键生成高清短视频 Generate short videos with one click using AI LLM.

#5theDAW

All-in-one AI music studio on Stable Audio 3 and a CUDA port of Magenta RealTime. It generates audio from text, separates stems with Demucs, transcribes to MIDI and notation, edits a multitrack timeline with a real-time Web Audio FX rack and automation, masters, DJs with stem decks, runs a live VJ engine, and plays from a Quest by hand over ADB.

#6Memorwise

A local, open-source alternative to NotebookLM. Chat with your documents using any LLM.

#7diffusiongemma-lab

Local web UI for the DiffusionGemma diffusion LLM — watch answers crystallize out of noise

#8openmed

open-source healthcare ai

#9GitHub - rsxdalv/TTS-WebUI: A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!

A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, ...

#10meshflow

Repository for the CVPR 2026 paper MeshFlow Efficient Artistic Mesh Generation via MeshVAE and Flow-based Diffusion Transformer by Weiyu Li, Antoine Toisoul, Tom Monnier, Roman Shapovalov, Rakesh Ranjan, Ping Tan and Andrea Vedaldi.

Global feed

Launcher updates

Store