Store
[NVIDIA ONLY] [RTX 50 Support] Image generation, image editing and free-form manipulation with a VLM (Minimum Requirements 12GB VRAM / 32GB RAM Recommended Requirements 24GB VRAM / 48GB RAM)
(WINDOWS)NVIDIA, Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
create a story by generating consistent images https://github.com/HVision-NKU/StoryDiffusion
An Open Source Model for Audio Samples and Sound Design https://github.com/Stability-AI/stable-audio-tools
Unify Efficient Fine-Tuning of 100+ LLMs https://github.com/hiyouga/LLaMA-Factory
Describe UI and see it rendered live. Ask for changes and convert HTML to React, Svelte, Web Components, etc. Like vercel v0, but open source https://github.com/wandb/openui
Simple script examples that highlight all the Pinokio APIs
Manage your ComfyUI environments with Docker
OneTrainer para Pinokio vato loco
Forget everything you thought you knew about AI art generation - RuinedFooocus is here to completely reinvent the game!
NeuTTS Air is the world’s first super-realistic, on-device, TTS speech language model with instant voice cloning. Built off a 0.5B
Fast and High-Quality Zero-Shot voice clone Text-to-Speech with Flow Matching
Fast and High-Quality Zero-Shot voice clone Text-to-Speech with Flow Matching Multilingual
[WINDOWS/LINUX ONLY] Easily train a good VC model with voice data <= 10 mins!: https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI
Generate engaging 1 to 5-minute short stories with LLMs and convert them to audio with Coqui TTS, supports voice cloning, built in speakers and multilingual.
MagicAnimate MiniFeatured
[NVIDIA GPU Only] An optimized version of MagicAnimate https://github.com/sdbds/magic-animate-for-windows
[NVIDIA ONLY] High-Quality and Efficient 3D Mesh Generation from a Single Image (Minimum requirements 12GB VRAM / 24GB RAM)
Image Dataset Tagger for Stable Diffusion / Lora / DreamBooth Training: https://github.com/mikeknapp/candy-machine
