Store
Super fast Multilingual TTS supporting 54 voices across 8 languages.
[ICCV'25 Best Paper Candidate] Official Implementations for Paper: Dynamic Typography: Bringing Text to Life via Video Diffusion Prior
SongBloom, a novel framework for full-length song generation
2D Video to 3D Generator, Creating Side-By-Side (SBS) Videos easily
✨ Open-source AI hackers for your apps 👨🏻‍💻
[NVIDIA ONLY] Requires 24GB VRAM (use the lowvram option; it has the same quality). High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models. https://github.com/Tencent/Hunyuan3D-2
Di♪♪Rhythm 2: Efficient And High Fidelity Song Generation Via Block Flow Matching
DecartAI/Lucy-Edit-ComfyUI (GitHub repository)
A simple web-based tool for Spriting and Pixel art. - piskelapp/piskel
Wan2.2-Lightning: Speed up wan2.2 model with distillation
sammy030275-lang/WAN2.2-14B-Rapid-AllInOne (GitHub repository)
GSFixer: Improving 3D Gaussian Splatting with Reference-Guided Video Diffusion Priors
Florence2 (Featured)
An advanced vision foundation model from Microsoft https://huggingface.co/spaces/gokaygokay/Florence-2
[WINDOWS, NVIDIA ONLY] Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
[NVIDIA ONLY] Image generation, image editing and free-form manipulation with a VLM (Minimum requirements: 12GB VRAM / 32GB RAM. Recommended requirements: 24GB VRAM / 48GB RAM)
[NVIDIA ONLY] [RTX 50 Support] Image generation, image editing and free-form manipulation with a VLM (Minimum requirements: 12GB VRAM / 32GB RAM. Recommended requirements: 24GB VRAM / 48GB RAM)
in preparation...
