Pinokio

Launcher updates

Audiochunker

@manatheturipa · 5d ago

AudioChunker

Audiochunker: slice any audio into perfectly timed clips, instantly. Audiochunker is a lightweight Pinokio ap...

NEXUS OS

@ramshi · 12d ago

Free AI Agents, Forever

Includes a feature called Router, which lets you manually connect and plug in free models across many provider...

Phosphene

@bizarro · 21d ago

Phosphene 3.2.4 - Ideogram 4 now runs on M1 and M2 Macs

Phosphene 3.2.4 fixes Ideogram 4 for slower Apple GPUs. It now runs on M1 and M2 Macs, not just the fast ones...

Higgs Audio Studio

@nerual_dreming · 21d ago

new tts here!

Underfit

@cocktailpeanut · 22d ago

Underfit: Stable Audio 3 LoRA Training Studio

Underfit is for musicians, sound designers, sample makers, and audio experimenters who want Stable Audio 3 to...


Kokoro-TTS-Local

PierrunoYT/Kokoro-TTS-Local · updated 6mo ago

A local implementation of the Kokoro Text-to-Speech model, featuring dynamic module loading, automatic dependency management, and a web interface.

@pierrunoyt · 0 check-ins · NVIDIA · AMD · Apple

sam-3d-body

facebookresearch/sam-3d-body · updated 6mo ago

The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints and datasets, and example notebooks that show how to use the model.

0 check-ins · NVIDIA · AMD · Apple

AI-Scientist-v2

SakanaAI/AI-Scientist-v2 · updated 6mo ago

The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search

0 check-ins · NVIDIA · AMD · Apple

AI-Scientist

SakanaAI/AI-Scientist · updated 6mo ago

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

0 check-ins · NVIDIA · AMD · Apple

ml-sharp

apple/ml-sharp · updated 6mo ago

Sharp Monocular View Synthesis in Less Than a Second

0 check-ins · NVIDIA · AMD · Apple

video2robot

AIM-Intelligence/video2robot · updated 6mo ago

End-to-end pipeline converting generative videos (Veo, Sora) to humanoid robot motions

0 check-ins · NVIDIA · AMD · Apple

InfiniteTalk

MeiGen-AI/InfiniteTalk · updated 6mo ago

Unlimited-length talking video generation that supports image-to-video and video-to-video generation

0 check-ins · NVIDIA · AMD · Apple

MultiTalk

MeiGen-AI/MultiTalk · updated 6mo ago

[NeurIPS 2025] Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation

0 check-ins · NVIDIA · AMD · Apple

The-3rd-Eye

Ordinary0x/The-3rd-Eye · updated 6mo ago

The 3rd Eye is a modular OSINT (Open Source Intelligence) framework built on an agent-based, graph-driven architecture. It automates public information discovery, identity correlation, and exposure analysis across multiple platforms, and generates structured intelligence reports. The system follows a LangGraph agent design.

0 check-ins · NVIDIA · AMD · Apple

facefusion

hendrybui/facefusion · updated 6mo ago

Industry-leading face manipulation platform

0 check-ins · NVIDIA · AMD · Apple

HunyuanWorld-1.0

Tencent-Hunyuan/HunyuanWorld-1.0 · updated 6mo ago

Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model

0 check-ins · NVIDIA · AMD · Apple

Dolphin

bytedance/Dolphin · updated 6mo ago

The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.

0 check-ins · NVIDIA · AMD · Apple

manga-image-translator

zyddnys/manga-image-translator · updated 6mo ago

Translate manga/image: one-click translation of text in all kinds of images. https://cotrans.touhou.ai/ (no longer working)

0 check-ins · NVIDIA · AMD · Apple

ai-text-to-image-generator

mmehmetisik/ai-text-to-image-generator · updated 6mo ago

AI-powered image generation tool using Hugging Face API and Stable Diffusion. Create images from text prompts with multiple style options.

0 check-ins · NVIDIA · AMD · Apple

ComfyUI-Civitai-Discovery-Hub

Light-x02/ComfyUI-Civitai-Discovery-Hub · updated 6mo ago

This ComfyUI node lets you browse the Civitai gallery directly within the interface, featuring infinite scroll, advanced filters (including NSFW), and favorites management. It also allows you to retrieve prompts, metadata, and images/videos to seamlessly reuse them in your workflows.

0 check-ins · NVIDIA · AMD · Apple

generative-models

Stability-AI/generative-models · updated 6mo ago

Generative Models by Stability AI

0 check-ins · NVIDIA · AMD · Apple

flymyai-lora-trainer

FlyMyAI/flymyai-lora-trainer · updated 6mo ago

Qwen-Image text-to-image LoRA trainer

0 check-ins · NVIDIA · AMD · Apple

stable-diffusion-videos

nateraw/stable-diffusion-videos · updated 6mo ago

Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts

0 check-ins · NVIDIA · AMD · Apple

OpenKombai

gojodennis/OpenKombai · updated 6mo ago

OpenKombai: A free, privacy-first alternative to Kombai. Instantly convert screenshots and designs into production-ready React + Tailwind code using local LLMs (Llama 3.2 Vision & Qwen 2.5). No API keys, zero cloud costs.

0 check-ins · NVIDIA · AMD · Apple

SwitchLight-Studio

beeble-ai/SwitchLight-Studio · updated 6mo ago


0 check-ins · NVIDIA · AMD · Apple


Most wanted

Follow to get notified when a launcher drops.

#1 pinokio

AI Browser

#2 OpenMontage

World's first open-source, agentic video production system. 12 pipelines, 52 tools, 500+ agent skills. Turn your AI coding assistant into a full video production studio.

#3 Local-AI-Image-Generator

A fully self-contained, offline AI image generation studio for Windows. Runs Stable Diffusion (Safetensors/GGUF) locally with zero manual setup. Auto-configures CUDA for Nvidia GPUs and Vulkan for AMD/Intel Arc cards. Zero system-wide dependencies required.

#4 MoneyPrinterTurbo

Generate high-definition short videos with one click using large AI models.

#5 Memorwise

A local, open-source alternative to NotebookLM. Chat with your documents using any LLM.

#6 openmed

Open-source healthcare AI

#7 diffusiongemma-lab

Local web UI for the DiffusionGemma diffusion LLM — watch answers crystallize out of noise

#8 rsxdalv/TTS-WebUI

A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!

#9 ChatTTS

A generative speech model for daily dialogue.

#10 meshflow

Repository for the CVPR 2026 paper "MeshFlow: Efficient Artistic Mesh Generation via MeshVAE and Flow-based Diffusion Transformer" by Weiyu Li, Antoine Toisoul, Tom Monnier, Roman Shapovalov, Rakesh Ranjan, Ping Tan and Andrea Vedaldi.

Global feed

Latest posts from the community.

How I fixed this with Claude.ai, but please fix the install for others less stubborn

@solonecpt · FaceFusion 3.6.1

What happened? Install continuously failed and would not correct Steps to reproduce 1. Your system (O...

Failed from Install

@solonecpt · AI4AnimationPy

What happened? After install it didn't work. Steps to reproduce 1. Running on Pop!_OS Your system (OS / ...

It keeps repeating the installation of Microsoft Visual Studio. How do I fix it?

@debihyahia30 · Wan2GP

What happened? Steps to reproduce 1. Your system (OS / GPU / RAM / VRAM / etc.) Logs / full error output

Bug after installation while generating video on Apple MacBook Air M1 with 8GB

@dineshpathak · Phosphene

[21:48:22] caffeinate active — Mac won't idle-sleep while queue is running [21:48:22] Run via helper:...

'Error during processing: No faces detected in any frame of the video'

@xmilosobil · LatentSync2

I was having this error on Windows; it turned out to be a video path construction problem within Python...

Global radar

Projects people are discovering or following now.

Followed · 1 min

Kimodo

Kimodo generates high-quality 3D human and robot motions and is controlled through text prompts

Followed · 1 min

Wan2GP

Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP

Followed · 7 min

HeartMuLa Studio

A professional, Suno-like music generation studio for HeartLib. https://github.com/fspecii/HeartMuLa-Studio

Followed · 9 min

XTTS

Clone voices into different languages using just a quick 3-second audio clip (a local version of https://huggingface.co/spaces/coqui/xtts).

Followed · 10 min

LTX-2.3 (Windows)

Lightricks LTX-2.3 video generation (22B distilled-1.1) with a Gradio UI. Auto-detects GPU and configures offload.
