Pinokio

Launcher updates

Audiochunker

@manatheturipa5d ago

AudioChunker

Audiochunker — slice any audio into perfectly timed clips, instantly Audiochunker is a lightweight Pinokio ap...

NEXUS OS

@ramshi12d ago

Free AI Agents, Forever

Included a feature called Router, which you can manually connect and plug in free models across many provider...

Phosphene

@bizarro21d ago

Phosphene 3.2.4 - Ideogram 4 now runs on M1 and M2 Macs

Phosphene 3.2.4 fixes Ideogram 4 for slower Apple GPUs. It now runs on M1 and M2 Macs, not just the fast ones...

Higgs Audio Studio

@nerual_dreming21d ago

new tts here!

Underfit

@cocktailpeanut22d ago

Underfit: Stable Audio 3 LoRA Training Studio

Underfit is for musicians, sound designers, sample makers, and audio experimenters who want Stable Audio 3 to...

Type:All

Platform:All

GPU:All

Recommended Latest Check-ins

Sort:Latest

CatVTON

Zheng-Chong/CatVTONupdated 6mo ago

[ICLR 2025] CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) and 3) Simplified Inference (< 8G VRAM for 1024X768 resolution).

0 check-insNVIDIAAMDApple

ClearerVoice-Studio

gotoolkits/ClearerVoice-Studiov2.0updated 6mo ago

0 check-insNVIDIAAMDApple

viterbox-tts

iamdinhthuan/viterbox-ttsupdated 6mo ago

Contribute to iamdinhthuan/viterbox-tts development by creating an account on GitHub.

0 check-insNVIDIAAMDApple

Real-Time-Voice-Cloning

CorentinJ/Real-Time-Voice-Cloningupdated 6mo ago

Clone a voice in 5 seconds to generate arbitrary speech in real-time

0 check-insNVIDIAAMDApple

Umo

linus74rn/UmoPinokiov1.0updated 6mo ago

Multi-Identity Consistency for Image Customization via Matching Reward https://github.com/bytedance/UMO

0 check-insNVIDIAAMDApple

Wan2GP-on-Colab

Square-Zero-Labs/Wan2GP-on-Colabupdated 6mo ago

Run Wan2GP on Google Colab

0 check-insNVIDIAAMDApple

Skywork-R1V

SkyworkAI/Skywork-R1Vupdated 6mo ago

Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI, specializing in vision-language reasoning.

0 check-insNVIDIAAMDApple

presenton-python-sdk

presenton/presenton-python-sdkupdated 6mo ago

Contribute to presenton/presenton-python-sdk development by creating an account on GitHub.

0 check-insNVIDIAAMDApple

ultimate-rvc

JackismyShephard/ultimate-rvcupdated 6mo ago

An app for creating audio-based content such as song covers and speech using Retrieval-based Voice Conversion.

0 check-insNVIDIAAMDApple

jesse

jesse-ai/jesseupdated 6mo ago

An advanced crypto trading bot written in Python

0 check-insNVIDIAAMDApple

reshard-safetensors

NotTheStallion/reshard-safetensorsupdated 6mo ago

This repo helps you understand how safetensors are structured to store different layers of an LLM and re-shard/re-chunk safetensors files even if they don't fit in the GPU.. ( No Autoclass )

0 check-insNVIDIAAMDApple

Resemble Enhance

sealad886/pinokio-resemble-enhancev2.0updated 6mo ago

AI-powered speech denoising + enhancement (Gradio web demo + CLI).

0 check-insNVIDIAAMDApple

Track-Anything

gaomingqi/track-anythingupdated 6mo ago

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

0 check-insNVIDIAAMDApple

Z-Image Fusion

DenisJunio/Z-Image-Fusionv3.7updated 6mo ago

Fast, high-quality image generation using comfyui via a Gradio UI

0 check-insNVIDIAAMDApple

ReEzSynth

FuouM/ReEzSynthupdated 6mo ago

EbSynth in Python, version 2

0 check-insNVIDIAAMDApple

ultravox

fixie-ai/ultravoxupdated 6mo ago

A fast multimodal LLM for real-time voice

0 check-insNVIDIAAMDApple

sd-forge-ollama

Haoming02/sd-forge-ollamaupdated 6mo ago

Integrate LLM into Forge Webui via Ollama

0 check-insNVIDIAAMDApple

Gemini-Watermark-Remover

dinoBOLT/Gemini-Watermark-Removerupdated 6mo ago

An AI powered extension to get rid of the Gemini watermark

0 check-insNVIDIAAMDApple

codexffmpeg

GyanD/codexffmpegupdated 6mo ago

Support for https://www.gyan.dev/ffmpeg

0 check-insNVIDIAAMDApple

airunner

Capsize-Games/airunnerupdated 6mo ago

Offline inference engine for art, real-time voice conversations, LLM powered chatbots and automated workflows

0 check-insNVIDIAAMDApple

WantedFeed

Most wanted

Follow to get notified when a launcher drops.

#1pinokio

AI Browser

#2OpenMontage

World's first open-source, agentic video production system. 12 pipelines, 52 tools, 500+ agent skills. Turn your AI coding assistant into a full video production studio.

#3Local-AI-Image-Generator

A fully self-contained, offline AI image generation studio for Windows. Runs Stable Diffusion (Safetensors/GGUF) locally with zero manual setup. Auto-configures CUDA for Nvidia GPUs and Vulkan for AMD/Intel Arc cards. Zero system-wide dependencies required.

#4MoneyPrinterTurbo

利用AI大模型，一键生成高清短视频 Generate short videos with one click using AI LLM.

#5Memorwise

A local, open-source alternative to NotebookLM. Chat with your documents using any LLM.

#6openmed

open-source healthcare ai

#7diffusiongemma-lab

Local web UI for the DiffusionGemma diffusion LLM — watch answers crystallize out of noise

#8GitHub - rsxdalv/TTS-WebUI: A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!

A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, ...

#9ChatTTS

A generative speech model for daily dialogue.

#10meshflow

Repository for the CVPR 2026 paper MeshFlow Efficient Artistic Mesh Generation via MeshVAE and Flow-based Diffusion Transformer by Weiyu Li, Antoine Toisoul, Tom Monnier, Roman Shapovalov, Rakesh Ranjan, Ping Tan and Andrea Vedaldi.

Global feed

Latest posts from the community.

how I fixed this with Claude.ai but please fix the instal others less stubborn

@solonecpt · FaceFusion 3.6.1

What happened? Install continuously failed and would not correct Steps to reproduce 1. Your system (O...

Failed from Install

@solonecpt · AI4AnimationPy

What happened? After install it didn't work Steps to reproduce 1.Running on Pop!OS Your system (OS / ...

it keeps repeating the installation of the microsoft visualstudio how do i fix it?

@debihyahia30 · Wan2GP

What happened? Steps to reproduce 1. Your system (OS / GPU / RAM / VRAM / etc.) Logs / full error output

Bug after installation while generating video on Apple Macbook air M1 with 8GB

@dineshpathak · Phosphene

[21:48:22] caffeinate active — Mac won't idle-sleep while queue is running [21:48:22] Run via helper:...

'Error during processing: No faces detected in any frame of the video'

@xmilosobil · LatentSync2

i was having this error on windows, it turned out to be video path construction problem within python...

Global radar

Projects people are discovering or following now.

Followed9 min

Kokoro-TTS

Welcome to Kokoro, a high-quality text-to-speech synthesis program powered by deep learning. This tool converts any text into high-fidelity speech in just a few seconds. Simply input text, select a voice, adjust the speed, and enjoy the generated audio.

Followed14 min

remove-video-bg

Video background removal tool https://huggingface.co/spaces/amirgame197/Remove-Video-Background

Followed16 min

Wan2GP

Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP

Followed17 min

Comfyui

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. https://github.com/comfyanonymous/ComfyUI

Followed23 min

Kokoro-TTS-Multilingual.git

Super fast Multilingual TTS supporting 54 voices across 8 languages.

Launcher updates

Store