Store
The best vocal remover application on the internet, and it's totally free and open source!
Florence-2 Image Captioning
Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait
No fortress, purely open ground. OpenManus is Coming.
DiffuEraser is a diffusion model for video inpainting, which performs great content completeness and temporal consistency while maintaining acceptable efficiency.
[Nvidia GPU only] High-Quality Image Restoration Following Human Instructions
diffusers-image-fillFeatured
Remove objects from an image https://huggingface.co/spaces/OzzyGT/diffusers-image-fill
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".
Contribute to cubiq/ComfyUI_InstantID development by creating an account on GitHub.
Contribute to cubiq/ComfyUI_essentials development by creating an account on GitHub.
Contribute to cubiq/ComfyUI_IPAdapter_plus development by creating an account on GitHub.
An enhanced version of Fooocus giving you access to all of the latest AI image generation models
Spark-TTS Inference Code
Upgraded repo includes more capabilities, converted the cmd .py scripts to function more intuitively, added 147 different depth output colour map methods, introduced batch image as well as video processing, everything is automatically saved to an outputs folder (w/ file-naming conventions) & I've converted the .pth models to .safetensors.
[NVIDIA ONLY] Temporally Consistent Human Image Animation using Diffusion Model https://showlab.github.io/magicanimate/
[NVIDIA ONLY] AllTalk-TTS is a unified UI for E5-TTS, XTTS, Vite TTS, Piper TTS, Parler TTS and RVC, based on CoquiTTS, including a finetune mode.
A Vietnamese Voice Cloning Text-to-Speech Model ✨
Slightly improved official version for finetune xtts
