Pinokio

Launcher updates

Audiochunker

@manatheturipa3d ago

AudioChunker

Audiochunker — slice any audio into perfectly timed clips, instantly Audiochunker is a lightweight Pinokio ap...

NEXUS OS

@ramshi10d ago

Free AI Agents, Forever

Included a feature called Router, which you can manually connect and plug in free models across many provider...

Phosphene

@bizarro19d ago

Phosphene 3.2.4 - Ideogram 4 now runs on M1 and M2 Macs

Phosphene 3.2.4 fixes Ideogram 4 for slower Apple GPUs. It now runs on M1 and M2 Macs, not just the fast ones...

Higgs Audio Studio

@nerual_dreming19d ago

new tts here!

Underfit

@cocktailpeanut20d ago

Underfit: Stable Audio 3 LoRA Training Studio

Underfit is for musicians, sound designers, sample makers, and audio experimenters who want Stable Audio 3 to...

Type:api

Platform:All

GPU:All

Recommended Latest Check-ins

Sort:Latest

ComfyUI

cocktailpeanut/comfyui.pinokioupdated 1mo ago

Stable Diffusion & Stable Video Diffusion GUI

#comfyui

@cocktailpeanut

27 check-insNVIDIAAMDApple

ChatterBox

PierrunoYT/chatterbox-tts-appv3.7updated 1mo ago

AI-Powered Text-to-Speech with Voice Cloning using Chatterbox TTS and Gradio interface. Includes Turbo, Multilingual (23+ languages), and Original models.

@pierrunoyt 2 check-insNVIDIAAMDApple

React Portfolio Launcher

RAHUL-0568/portfolioupdated 1mo ago

One-click installer and launcher for the optimized React + Tailwind + Theatre.js portfolio

0 check-insNVIDIAAMDApple

VoxCPM2 Portable

timoncool/VoxCPM2_portable-pinokiov6.0.0updated 1mo ago

ElevenLabs at home. Multilingual TTS with Voice Design, Voice Cloning, and end-to-end LoRA fine-tuning straight from a video or podcast. Built on VoxCPM2 by OpenBMB. 30 languages incl. Russian.

@nerual_dreming

2 check-insNVIDIAAMDApple

ReClip

krynsky/reclip-pinokiov5.0updated 1mo ago

Self-hosted, open-source video and audio downloader with a clean web UI. Supports YouTube, TikTok, Instagram, X, and 1000+ other sites via yt-dlp.

#utility

@krynsky

7 check-insNVIDIAAMDApple

AceJAM

cocktailpeanut/acejam.pinokiov7.0updated 1mo ago

Describe any song in plain English, compose it locally with an embedded Qwen GGUF model, and generate it with ACE-Step v1.5.

#ai #song

@cocktailpeanut

13 check-insNVIDIAAMDApple

HiDream O1 Image FP8

cocktailpeanut/hidream-o1v7.0updated 1mo ago

One-click launcher for the original HiDream-O1-Image web UI using lazy-downloaded drbaph Dev or Full FP8 checkpoints through a root FP8 runner. Requires an NVIDIA CUDA GPU.

#ai #image-generation

@cocktailpeanut

3 check-insNVIDIAAMDApple

InstantIR

pinokiofactory/instantirv3.7updated 1mo ago

restore low-res images, restore broken images, recreate a new version of the image with a prompt https://huggingface.co/spaces/fffiloni/InstantIR

#ai #image-edit

0 check-insNVIDIAAMDApple

Kokoro-TTS-Multilingual.git

6Morpheus6/Kokoro-TTS-Multilingualv3.7updated 1mo ago

Super fast Multilingual TTS supporting 54 voices across 8 languages.

@morpheus

1 check-inNVIDIAAMDApple

InfiniteTalk (Pinokio)

gregor-chronos-games-com/infinitetalk.pinokioupdated 1mo ago

One-click install & launcher for MeiGen-AI/InfiniteTalk

5 check-insNVIDIAAMDApple

Mega-ASR

vdruts/mega-asr.pinokiov7.0updated 1mo ago

Robust automatic speech recognition for challenging real-world audio. Handles noise, far-field, echo, reverberation, and more using a foundation model trained on 2.6M samples across 54 acoustic scenarios.

0 check-insNVIDIAAMDApple

omnigen

pinokiofactory/omnigenv3.7updated 1mo ago

A unified image generation model that you can use to perform various tasks, including but not limited to text-to-image generation, subject-driven generation, Identity-Preserving Generation, and image-conditioned generation. https://huggingface.co/spaces/Shitao/OmniGen

#ai #image-generation

1 check-inNVIDIAAMDApple

facepoke

pinokiofactory/facepokev3.7updated 1mo ago

[NVIDIA Only] Select a portrait, click to move the head around https://github.com/jbilcke-hf/FacePoke

#ai #image-generation

0 check-insNVIDIAAMDApple

Allegro-txt2vid

pinokiofactory/Allegro-txt2vid-installv3.7updated 1mo ago

[NVIDIA ONLY] Generate videos with Allegro txt2vid model https://github.com/rhymes-ai/Allegro

#video-generation #ai

1 check-inNVIDIAAMDApple

hallo

pinokiofactory/hallov3.7updated 1mo ago

[NVIDIA Only] Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation https://github.com/fudan-generative-vision/hallo

#lipsync #video-generation #ai

4 check-insNVIDIAAMDApple

Hy-MT2

PierrunoYT/Tencent-HY-MT1.5-Pinokiov1.0.0updated 1mo ago

Hy-MT2 multilingual translation — Gradio UI for 33-language translation with Hy-MT2-1.8B and Hy-MT2-7B.

@pierrunoyt0 check-insNVIDIAAMDApple

Voice Studio KH

theng12/voicestudio-macv3.6updated 1mo ago

Apple Silicon TTS — Kokoro, VoxCPM, Bark, Qwen3-TTS, Orpheus, Chatterbox, Spark-TTS. PyTorch + MLX variants.

0 check-insNVIDIAAMDApple

Image Studio KH

theng12/imagestudio-macv3.6updated 1mo ago

Apple Silicon image studio — FLUX.2 (klein/dev), FLUX.1 (schnell/dev/Kontext/lite), HiDream, Shuttle, Qwen-Image. Powered by MLX/mflux.

1 check-inNVIDIAAMDApple

Gymnastics Photo Sorter

arnold2006/image-sorterupdated 1mo ago

AI-powered tool that automatically sorts thousands of gymnastics competition photos into folders by team and individual gymnast. Uses YOLOv8, CLIP, InsightFace, ReID and FAISS – fully offline, CUDA-accelerated.

0 check-insNVIDIAAMDApple

Music Studio KH

theng12/musicstudio-macv3.6updated 1mo ago

Apple Silicon music generation — MusicGen, Stable Audio Open, Bark.

1 check-inNVIDIAAMDApple

WantedFeed

Most wanted

Follow to get notified when a launcher drops.

#1pinokio

AI Browser

#2OpenMontage

World's first open-source, agentic video production system. 12 pipelines, 52 tools, 500+ agent skills. Turn your AI coding assistant into a full video production studio.

#3Local-AI-Image-Generator

A fully self-contained, offline AI image generation studio for Windows. Runs Stable Diffusion (Safetensors/GGUF) locally with zero manual setup. Auto-configures CUDA for Nvidia GPUs and Vulkan for AMD/Intel Arc cards. Zero system-wide dependencies required.

#4MoneyPrinterTurbo

利用AI大模型，一键生成高清短视频 Generate short videos with one click using AI LLM.

#5theDAW

All-in-one AI music studio on Stable Audio 3 and a CUDA port of Magenta RealTime. It generates audio from text, separates stems with Demucs, transcribes to MIDI and notation, edits a multitrack timeline with a real-time Web Audio FX rack and automation, masters, DJs with stem decks, runs a live VJ engine, and plays from a Quest by hand over ADB.

#6Memorwise

A local, open-source alternative to NotebookLM. Chat with your documents using any LLM.

#7diffusiongemma-lab

Local web UI for the DiffusionGemma diffusion LLM — watch answers crystallize out of noise

#8openmed

open-source healthcare ai

#9GitHub - rsxdalv/TTS-WebUI: A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!

A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, ...

#10meshflow

Repository for the CVPR 2026 paper MeshFlow Efficient Artistic Mesh Generation via MeshVAE and Flow-based Diffusion Transformer by Weiyu Li, Antoine Toisoul, Tom Monnier, Roman Shapovalov, Rakesh Ranjan, Ping Tan and Andrea Vedaldi.

Global feed

Launcher updates

Store