SUP3RMASS1VE/UVR5-WebUIv2.0updated 1y ago
The best vocal remover application on the internet, and it's totally free and open source!
@sup3rmass1ve0 check-insNVIDIAAMDApple
SUP3RMASS1VE/Florence-2-Image-Captioningv3.6updated 1y ago
Florence-2 Image Captioning
@sup3rmass1ve0 check-insNVIDIAAMDApple
SUP3RMASS1VE/KD-Talkerv3.6updated 1y ago
Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait
@sup3rmass1ve0 check-insNVIDIAAMDApple
apguan/openmanusupdated 1y ago
No fortress, purely open ground. OpenManus is Coming.
0 check-insNVIDIAAMDApple
lixiaowen-xw/DiffuEraserupdated 1y ago
DiffuEraser is a diffusion model for video inpainting, which performs great content completeness and temporal consistency while maintaining acceptable efficiency.
0 check-insNVIDIAAMDApple
tjoen/instructir.pinokioupdated 1y ago
[Nvidia GPU only] High-Quality Image Restoration Following Human Instructions
1 check-inNVIDIAAMDApple
sdbds/TRELLIS-for-windowsupdated 1y ago
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".
0 check-insNVIDIAAMDApple
cubiq/ComfyUI_InstantIDupdated 1y ago
Contribute to cubiq/ComfyUI_InstantID development by creating an account on GitHub.
0 check-insNVIDIAAMDApple
cubiq/ComfyUI_essentialsupdated 1y ago
Contribute to cubiq/ComfyUI_essentials development by creating an account on GitHub.
0 check-insNVIDIAAMDApple
cubiq/ComfyUI_IPAdapter_plusupdated 1y ago
Contribute to cubiq/ComfyUI_IPAdapter_plus development by creating an account on GitHub.
0 check-insNVIDIAAMDApple
Deepfake-Zatylkin/LatentSync-Pinokiov3.2updated 1y ago
0 check-insNVIDIAAMDApple
6Morpheus6/FooocusPlusv3.0updated 1y ago
An enhanced version of Fooocus giving you access to all of the latest AI image generation models
@morpheus0 check-insNVIDIAAMDApple
SparkAudio/Spark-TTSupdated 1y ago
Spark-TTS Inference Code
0 check-insNVIDIAAMDApple
MackinationsAi/Upgraded-Depth-Anything-V2updated 1y ago
Upgraded repo includes more capabilities, converted the cmd .py scripts to function more intuitively, added 147 different depth output colour map methods, introduced batch image as well as video processing, everything is automatically saved to an outputs folder (w/ file-naming conventions) & I've converted the .pth models to .safetensors.
0 check-insNVIDIAAMDApple
cocktailpeanut/MagicAnimate.pinokiov3.0updated 1y ago
[NVIDIA ONLY] Temporally Consistent Human Image Animation using Diffusion Model https://showlab.github.io/magicanimate/
@cocktailpeanut0 check-insNVIDIAAMDApple
pinokiofactory/AllTalk-TTSv3.3updated 1y ago
[NVIDIA ONLY] AllTalk-TTS is a unified UI for E5-TTS, XTTS, Vite TTS, Piper TTS, Parler TTS and RVC, based on CoquiTTS, including a finetune mode.
1 check-inNVIDIAAMDApple
thinhlpg/vixtts-demoupdated 1y ago
A Vietnamese Voice Cloning Text-to-Speech Model ✨
0 check-insNVIDIAAMDApple
daswer123/xtts-finetune-webuiupdated 1y ago
Slightly improved official version for finetune xtts
0 check-insNVIDIAAMDApple
masonjames/VASR-for-Pinokiov1.5updated 1y ago
0 check-insNVIDIAAMDApple