Store
Fast Image Generation with Sana Diffusion Model
Hunyuan Translation Model Version 1.5 — Gradio UI for 33-language translation with HY-MT1.5-1.8B and HY-MT1.5-7B.
[NVIDIA ONLY] Stable Video Diffusion Streamlit App. Currently supports Nvidia GPU machines only.
ACE-Step UIFeatured
Open source UI for ACE-Step 1.5 music generation.
Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP
Suno at home. Local AI music generation studio — full songs with vocals, lyrics, covers, and music videos. Built on ACE-Step 1.5 XL.
Text to audio, open sourced by Meta
Vibe KanbanFeatured
Local web UI for orchestrating AI coding agents and tasks (BloopAI/vibe-kanban).
ComfyuiFeatured
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. https://github.com/comfyanonymous/ComfyUI
Distill and quantize models using TorchAO with intelligent GPU/CPU management
Cross-platform shell.run tests for [[ ]] multiline argument passing.
AI4AnimationPyFeatured
A Python framework for AI-driven character animation using neural networks.
P2PCLAW Agent Benchmark — connect any LLM agent (Claude, GPT, Gemini, Qwen, Kimi, DeepSeek…) and get scored on 10 dimensions + Tribunal IQ. Dashboard runs locally on :8787, leaderboard at p2pclaw.com/app/benchmark.
AceJAMFeatured
Describe any song in plain English, compose it locally with an embedded Qwen GGUF model, and generate it with ACE-Step v1.5.
TexturizerFeatured
Minimal NVIDIA-first web app for texturing existing meshes with Hunyuan3D-2GP/mmgp while preserving rigged GLB structure when the vertex layout stays compatible.
Generate music in different genres using text and audio prompts.
Separate Anything You Describe (https://huggingface.co/spaces/Audio-AGI/AudioSep)
Uncensored Deepfakes for images and videos without training and an easy-to-use GUI.
🎙️ Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning. High-quality text-to-speech synthesis supporting zero-shot voice cloning and streaming inference with natural emotional expression.
[v0.5.1] FramePack Video App offering multiple generation types: Original, F1, video extension, end frame. Features include: LoRA support, job queueing, advanced timestamped prompts, offline mode, a post-processing suite including upscaling, interpolation, filters and more!
