Store
MAGNeT (Featured)
MAGNeT is a text-to-music and text-to-sound model capable of generating high-quality audio samples conditioned on text descriptions https://github.com/facebookresearch/audiocraft/blob/main/docs/MAGNET.md
[NVIDIA Only] Text-driven, intelligent image restoration that blends AI technology with creativity to give every image a brand-new life https://supir.xpixel.group
A simple, high-quality image generation tool to create stunning illusions.
Stable Diffusion Trainer: https://github.com/bmaltais/kohya_ss
Host a GPT action from your desktop (for ChatGPT Custom GPT)

One-click install & launcher for MeiGen-AI/InfiniteTalk
Higgs Audio Text-to-Speech Playground (Requires Python 3.10+)
One-click launcher for Stable Diffusion web UI (AUTOMATIC1111/stable-diffusion-webui)

Text+Image → Video with Allegro-TI2V (Rhymes AI), local one-click via Pinokio
A powerful tool for extending images to different aspect ratios using Stable Diffusion XL.
Gradio UI for YuE music generation model
Pinokio app to install and run sdbds/YuE-for-windows, tuned defaults for a single RTX 4060 Ti 16GB GPU. Uses Torch 2.5.1+cu124 and requirements-uv.txt.
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface https://github.com/comfyanonymous/ComfyUI
DreamO: A Unified Framework for Image Customization
Transform lyric transcriptions into karaoke-style MP4 videos. Built on Python-Lyric-Transcriber, this Gradio UI uses Whisper for transcription, an LLM for lyric edits, and Demucs for vocal separation. A fun tool for karaoke fans, though outputs may vary.
Dough is an open-source tool for steering AI animations with precision
[NVIDIA Only] Dead-simple web UI for training FLUX LoRA with low-VRAM support (from 12GB)