Pinokio

Launcher updates

Audiochunker

@manatheturipa · 5d ago

AudioChunker

Audiochunker: slice any audio into perfectly timed clips, instantly. Audiochunker is a lightweight Pinokio ap...

NEXUS OS

@ramshi · 11d ago

Free AI Agents, Forever

Includes a feature called Router, which lets you manually connect and plug in free models across many provider...

Phosphene

@bizarro · 20d ago

Phosphene 3.2.4 - Ideogram 4 now runs on M1 and M2 Macs

Phosphene 3.2.4 fixes Ideogram 4 for slower Apple GPUs. It now runs on M1 and M2 Macs, not just the fast ones...

Higgs Audio Studio

@nerual_dreming · 21d ago

new tts here!

Underfit

@cocktailpeanut · 21d ago

Underfit: Stable Audio 3 LoRA Training Studio

Underfit is for musicians, sound designers, sample makers, and audio experimenters who want Stable Audio 3 to...

Open-Sora-Plan

PKU-YuanGroup/Open-Sora-Plan · updated 8mo ago

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

0 check-ins · NVIDIA · AMD · Apple

livecc

showlab/livecc · updated 8mo ago

LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025)

0 check-ins · NVIDIA · AMD · Apple

VisoMaster-Experimental

asdf31jsa/VisoMaster-Experimental · updated 8mo ago

Powerful & Easy-to-Use Video Face Swapping and Editing Software

0 check-ins · NVIDIA · AMD · Apple

Hunyuan3D-2

tencent/hunyuan3d-2 · updated 8mo ago

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

0 check-ins · NVIDIA · AMD · Apple

RVC-realtime

Feedjer/RVC-realtime · v2.0 · updated 8mo ago

[WINDOWS/LINUX ONLY] Easily train a good VC model with voice data <= 10 mins!: https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI

1 check-in · NVIDIA · AMD · Apple

PyLaia

jpuigcerver/pylaia · updated 8mo ago

A deep learning toolkit specialized for handwritten document analysis

0 check-ins · NVIDIA · AMD · Apple

civitai-companion

rbbrdckybk/civitai-companion · updated 8mo ago

Utility for extracting prompt metadata from Civitai AI images, auto-downloading the associated resources, and outputting/formatting the prompt information.

0 check-ins · NVIDIA · AMD · Apple

index-tts-2

SUP3RMASS1VE/Index-TTS-2-Pinokio · v3.7 · updated 8mo ago

@sup3rmass1ve · 0 check-ins · NVIDIA · AMD · Apple

AI-Video-Transcriber

wendy7756/AI-Video-Transcriber · updated 8mo ago

Transcribe and summarize video content using AI. Open-source, multi-platform, and supports multiple languages.

0 check-ins · NVIDIA · AMD · Apple

bolt.diy

stackblitz-labs/bolt.diy · updated 8mo ago

Prompt, run, edit, and deploy full-stack web applications using any LLM you want!

0 check-ins · NVIDIA · AMD · Apple

Spanish-F5

jpgallegoar/Spanish-F5 · updated 8mo ago

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

0 check-ins · NVIDIA · AMD · Apple

upscale-enhance

wengjiyao/upscale-enhance · updated 8mo ago

High-quality video and image super-resolution powered by Real-ESRGAN. Upscale your media with advanced AI.

0 check-ins · NVIDIA · AMD · Apple

argos-translate

argosopentech/argos-translate · updated 8mo ago

Open-source offline translation library written in Python

0 check-ins · NVIDIA · AMD · Apple

dots.ocr-fix-demo

PRITHIVSAKTHIUR/dots.ocr-fix-demo · updated 8mo ago

This Gradio application demonstrates the capabilities of the "dots.ocr" model, a powerful multilingual document parser.

0 check-ins · NVIDIA · AMD · Apple

UVR5-UI

eddycrack864/uvr5-ui · updated 8mo ago

Ultimate Vocal Remover 5 with Gradio UI. Separate an audio file into various stems, using multiple models

0 check-ins · NVIDIA · AMD · Apple

StoryCraft

TheAwaken1/StoryCraft · v3.7 · updated 8mo ago

Generate engaging 1 to 5-minute short stories with LLMs and convert them to audio with Coqui TTS. Supports voice cloning, built-in speakers, and multiple languages.

@theawakenone · 0 check-ins · NVIDIA · AMD · Apple

ollama-voice

maudoin/ollama-voice · updated 8mo ago

Plug Whisper audio transcription into a local Ollama server and output TTS audio responses

0 check-ins · NVIDIA · AMD · Apple

Hunyuan3D-Omni

Tencent-Hunyuan/Hunyuan3D-Omni · updated 8mo ago

Hunyuan3D-Omni: A Unified Framework for Controllable Generation of 3D Assets

0 check-ins · NVIDIA · AMD · Apple

Hunyuan3D-2.1

Tencent-Hunyuan/Hunyuan3D-2.1 · updated 8mo ago

From Images to High-Fidelity 3D Assets with Production-Ready PBR Material

0 check-ins · NVIDIA · AMD · Apple

VACE

ali-vilab/VACE · updated 8mo ago

[ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing

0 check-ins · NVIDIA · AMD · Apple

Most wanted

Follow to get notified when a launcher drops.

#1 pinokio

AI Browser

#2 OpenMontage

World's first open-source, agentic video production system. 12 pipelines, 52 tools, 500+ agent skills. Turn your AI coding assistant into a full video production studio.

#3 Local-AI-Image-Generator

A fully self-contained, offline AI image generation studio for Windows. Runs Stable Diffusion (Safetensors/GGUF) locally with zero manual setup. Auto-configures CUDA for Nvidia GPUs and Vulkan for AMD/Intel Arc cards. Zero system-wide dependencies required.

#4 MoneyPrinterTurbo

Generate high-definition short videos with one click using AI large language models.

#5 Memorwise

A local, open-source alternative to NotebookLM. Chat with your documents using any LLM.

#6 openmed

Open-source healthcare AI

#7 diffusiongemma-lab

Local web UI for the DiffusionGemma diffusion LLM — watch answers crystallize out of noise

#8 rsxdalv/TTS-WebUI

A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!

#9 ChatTTS

A generative speech model for daily dialogue.

#10 meshflow

Repository for the CVPR 2026 paper "MeshFlow: Efficient Artistic Mesh Generation via MeshVAE and Flow-based Diffusion Transformer" by Weiyu Li, Antoine Toisoul, Tom Monnier, Roman Shapovalov, Rakesh Ranjan, Ping Tan, and Andrea Vedaldi.

Global feed

Latest posts from the community.

'Error during processing: No faces detected in any frame of the video'

@xmilosobil · LatentSync

I was having this error on Windows; it turned out to be a video path construction problem within Python...

FlashAttention error

@zela · theDAW

I encountered an error when creating a small-rf model. I tried uninstalling FlashAttention, but it di...

Modules installation

@987654321pinokio · Odysseus

Hello everyone. How can I add modules to Odysseus?

ENOENT: no such file or directory, stat 'D:\pinokio\api\wan2gp-amd.git\Error:'

@espei · Wan2GP - AMD · 2

need some help with these ENOENT: no such file or directory, stat 'D:\pinokio\api\wan2gp-amd.git\Erro...

Recommendations

@morpheus · theDAW · 3

Hi :) The app installs fine. But there are a few problems in the details... ffmpeg is installed by de...

Global radar

Projects people are discovering or following now.

Followed · just now

Z-Image-Turbo

⚡️ Efficient 6B parameter image generation model with sub-second inference. Generate high-quality, photorealistic images with only 8 inference steps. Features bilingual text rendering (Chinese & English) and Single-Stream Diffusion Transformer architecture.