Pinokio

Launcher updates

Audiochunker

@manatheturipa · 4d ago

AudioChunker

Audiochunker: slice any audio into perfectly timed clips, instantly. Audiochunker is a lightweight Pinokio ap...

NEXUS OS

@ramshi · 10d ago

Free AI Agents, Forever

Includes a feature called Router, which lets you manually connect and plug in free models across many provider...

Phosphene

@bizarro · 20d ago

Phosphene 3.2.4 - Ideogram 4 now runs on M1 and M2 Macs

Phosphene 3.2.4 fixes Ideogram 4 for slower Apple GPUs. It now runs on M1 and M2 Macs, not just the fast ones...

Higgs Audio Studio

@nerual_dreming · 20d ago

new tts here!

Underfit

@cocktailpeanut · 21d ago

Underfit: Stable Audio 3 LoRA Training Studio

Underfit is for musicians, sound designers, sample makers, and audio experimenters who want Stable Audio 3 to...

[NVIDIA GPU ONLY] LGM

cocktailpeanutlabs/lgm · v3.0 · updated 1y ago

LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation https://huggingface.co/spaces/ashawkey/LGM

#3dgen #ai #3d

0 check-ins · NVIDIA · AMD · Apple

dust3r

cocktailpeanutlabs/dust3r · v1.3 · updated 1y ago

Geometric 3D Vision Made Easy https://dust3r.europe.naverlabs.com/

#3dgen #ai

0 check-ins · NVIDIA · AMD · Apple

ZETA

cocktailpeanutlabs/zeta · v1.2 · updated 1y ago

Zero-Shot Text-Based Audio Editing Using DDPM Inversion https://huggingface.co/spaces/hilamanor/audioEditing

#ai #audio-edit #audio-generation

1 check-in · NVIDIA · AMD · Apple

Arc2Face

cocktailpeanutlabs/arc2face · v1.5 · updated 1y ago

A Foundation Model of Human Faces https://huggingface.co/spaces/FoivosPar/Arc2Face

#ai #face

0 check-ins · NVIDIA · AMD · Apple

spright

cocktailpeanutlabs/spright · v1.5 · updated 1y ago

Generate images with spatial accuracy https://huggingface.co/spaces/SPRIGHT-T2I/SPRIGHT-T2I

#ai #image-generation

0 check-ins · NVIDIA · AMD · Apple

CustomNet

cocktailpeanutlabs/customnet · v1.5 · updated 1y ago

A unified encoder-based framework for object customization in text-to-image diffusion models https://huggingface.co/spaces/TencentARC/CustomNet

#ai

0 check-ins · NVIDIA · AMD · Apple

Stable Cascade

cocktailpeanutlabs/stablecascade · v3.0 · updated 1y ago

Stable Cascade from StabilityAI

0 check-ins · NVIDIA · AMD · Apple

gligen

cocktailpeanutlabs/gligen · v1.2 · updated 1y ago

An intuitive GUI for GLIGEN that uses ComfyUI in the backend https://github.com/mut-ex/gligen-gui

#ai

0 check-ins · NVIDIA · AMD · Apple

CosXL

cocktailpeanutlabs/cosxl · v1.5 · updated 1y ago

Edit images with just a prompt. An unofficial demo for CosXL and CosXL Edit from Stability AI: https://huggingface.co/spaces/multimodalart/cosxl

0 check-ins · NVIDIA · AMD · Apple

face-to-all

cocktailpeanutlabs/face-to-all · v1.5 · updated 1y ago

Diffusers InstantID + ControlNet, inspired by face-to-many from fofr (https://x.com/fofrAI). A localized version of https://huggingface.co/spaces/multimodalart/face-to-all

#ai

0 check-ins · NVIDIA · AMD · Apple

instantstyle

cocktailpeanutlabs/instantstyle · v1.5 · updated 1y ago

Upload an image and generate new images in that image's style. Instant generation with no LoRA required https://huggingface.co/spaces/InstantX/InstantStyle

#ai #image-generation

0 check-ins · NVIDIA · AMD · Apple

parler-tts

cocktailpeanutlabs/parler-tts · v1.5 · updated 1y ago

A lightweight text-to-speech (TTS) model that generates high-quality speech with features controlled by a simple text prompt (e.g. gender, background noise, speaking rate, pitch and reverberation). https://huggingface.co/spaces/parler-tts/parler_tts_mini

#ai #tts

1 check-in · NVIDIA · AMD · Apple

ZeST

cocktailpeanutlabs/zest · v1.5 · updated 1y ago

ZeST: Zero-Shot Material Transfer from a Single Image. Local port of https://huggingface.co/spaces/fffiloni/ZeST (Project: https://ttchengab.github.io/zest/)

#ai #image-edit

1 check-in · NVIDIA · AMD · Apple

LlamaFactory

pinokiofactory/llamafactory · v1.5 · updated 1y ago

Unify Efficient Fine-Tuning of 100+ LLMs https://github.com/hiyouga/LLaMA-Factory

#ai #training

1 check-in · NVIDIA · AMD · Apple

StableAudio

Ripkore/stableaudio · v1.5 · updated 1y ago

An Open Source Model for Audio Samples and Sound Design https://github.com/Stability-AI/stable-audio-tools

@ripkore · 0 check-ins · NVIDIA · AMD · Apple

moshi

pinokiofactory/moshi · v2.0 · updated 1y ago

[Mac only] A speech-text foundation model for real-time dialogue https://github.com/kyutai-labs/moshi

#ai

0 check-ins · NVIDIA · AMD · Apple

Open-Interface

AmberSahdev/Open-Interface · updated 1y ago

Control Any Computer Using LLMs.

0 check-ins · NVIDIA · AMD · Apple

audiocraft-webui

CoffeeVampir3/audiocraft-webui · updated 1y ago

Quick webui for audiocraft

0 check-ins · NVIDIA · AMD · Apple

ultimatevocalremovergui

Anjok07/ultimatevocalremovergui · updated 1y ago

GUI for a Vocal Remover that uses Deep Neural Networks.

0 check-ins · NVIDIA · AMD · Apple

audiocraft

facebookresearch/audiocraft · updated 1y ago

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

0 check-ins · NVIDIA · AMD · Apple

Most wanted

Follow to get notified when a launcher drops.

#1 pinokio

AI Browser

#2 OpenMontage

World's first open-source, agentic video production system. 12 pipelines, 52 tools, 500+ agent skills. Turn your AI coding assistant into a full video production studio.

#3 Local-AI-Image-Generator

A fully self-contained, offline AI image generation studio for Windows. Runs Stable Diffusion (Safetensors/GGUF) locally with zero manual setup. Auto-configures CUDA for Nvidia GPUs and Vulkan for AMD/Intel Arc cards. Zero system-wide dependencies required.

#4 MoneyPrinterTurbo

Generate high-definition short videos with one click using AI large language models.

#5 Memorwise

A local, open-source alternative to NotebookLM. Chat with your documents using any LLM.

#6 openmed

Open-source healthcare AI

#7 diffusiongemma-lab

Local web UI for the DiffusionGemma diffusion LLM — watch answers crystallize out of noise

#8 rsxdalv/TTS-WebUI

A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!

#9 meshflow

Repository for the CVPR 2026 paper "MeshFlow: Efficient Artistic Mesh Generation via MeshVAE and Flow-based Diffusion Transformer" by Weiyu Li, Antoine Toisoul, Tom Monnier, Roman Shapovalov, Rakesh Ranjan, Ping Tan and Andrea Vedaldi.

#10 ChatTTS

A generative speech model for daily dialogue.

Global feed

Latest posts from the community.

Hanging on the step to verify ffmpeg install on my M1 MacBook

@bsanderson33 · Wan2GP

I tried to install Wan2GP on my M1 MacBook. It's my first time installing, and the install keeps hanging on install ...

crispz-studio : a 100% local, Fooocus-style studio for Z-Image (no ComfyUI / SwarmUI)

@mikecastrodemaria · crispz-studio

I've been building crispz-studio: a standalone creation + enhancement tool for Tongyi Z-Image, fully ...

I've been quietly maintaining a Fooocus2026 fork since it went LTS. It now has tag autocomplete, a job queue, an asset browser and a few other things

@mikecastrodemaria · Fooocus2026

When Fooocus went into maintenance mode I couldn't let it go, it's still the best "type prompt, get g...

how to set GPU 1 instead of GPU 0?

@pats007 · Wan2GP

What are you trying to do? Generate image or video. Wan2GP is using GPU 0 (Intel) instead of the Nvidia GPU ...

FP-Studio does not obey timestamped prompts

@andreaswb · FramePack-Studio

FP-Studio documents which part of a timestamped prompt is used for the generation. But 1. as it genera...

Global radar

Projects people are discovering or following now.

Followed · 1 min

Comfyui

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. https://github.com/comfyanonymous/ComfyUI

Followed · 4 min

Openvoice2

Openvoice 2 Web UI - A local web UI for Openvoice2, a multilingual voice cloning TTS https://x.com/myshell_ai/status/1783161876052066793

Followed · 6 min

OpenVoice

Instantly clone any voice from any text to any speech, in any language https://huggingface.co/spaces/myshell-ai/OpenVoice

Followed · 8 min

PersonaPlex

🗣️ PersonaPlex - NVIDIA's real-time speech-to-speech conversational AI model. Natural full-duplex conversations with customizable personas and voices.

Followed · 8 min

ChatterBox

AI-Powered Text-to-Speech with Voice Cloning using Chatterbox TTS and a Gradio interface. Includes Turbo, Multilingual (23+ languages), and Original models. Runs locally; CUDA GPU recommended, CPU supported. Windows, Mac, and Linux.
