Launcher updates

More
Zheng-Chong/CatVTONupdated 6mo ago
[ICLR 2025] CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) and 3) Simplified Inference (< 8G VRAM for 1024X768 resolution).
0 check-insNVIDIAAMDApple
gotoolkits/ClearerVoice-Studiov2.0updated 6mo ago
0 check-insNVIDIAAMDApple
iamdinhthuan/viterbox-ttsupdated 6mo ago
Contribute to iamdinhthuan/viterbox-tts development by creating an account on GitHub.
0 check-insNVIDIAAMDApple
CorentinJ/Real-Time-Voice-Cloningupdated 6mo ago
Clone a voice in 5 seconds to generate arbitrary speech in real-time
0 check-insNVIDIAAMDApple
linus74rn/UmoPinokiov1.0updated 6mo ago
Multi-Identity Consistency for Image Customization via Matching Reward https://github.com/bytedance/UMO
0 check-insNVIDIAAMDApple
Square-Zero-Labs/Wan2GP-on-Colabupdated 6mo ago
Run Wan2GP on Google Colab
0 check-insNVIDIAAMDApple
SkyworkAI/Skywork-R1Vupdated 6mo ago
Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI, specializing in vision-language reasoning.
0 check-insNVIDIAAMDApple
presenton/presenton-python-sdkupdated 6mo ago
Contribute to presenton/presenton-python-sdk development by creating an account on GitHub.
0 check-insNVIDIAAMDApple
JackismyShephard/ultimate-rvcupdated 6mo ago
An app for creating audio-based content such as song covers and speech using Retrieval-based Voice Conversion.
0 check-insNVIDIAAMDApple
jesse-ai/jesseupdated 6mo ago
An advanced crypto trading bot written in Python
0 check-insNVIDIAAMDApple
NotTheStallion/reshard-safetensorsupdated 6mo ago
This repo helps you understand how safetensors are structured to store different layers of an LLM and re-shard/re-chunk safetensors files even if they don't fit in the GPU.. ( No Autoclass )
0 check-insNVIDIAAMDApple
sealad886/pinokio-resemble-enhancev2.0updated 6mo ago
AI-powered speech denoising + enhancement (Gradio web demo + CLI).
0 check-insNVIDIAAMDApple
gaomingqi/track-anythingupdated 6mo ago
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
0 check-insNVIDIAAMDApple
DenisJunio/Z-Image-Fusionv3.7updated 6mo ago
Fast, high-quality image generation using comfyui via a Gradio UI
0 check-insNVIDIAAMDApple
FuouM/ReEzSynthupdated 6mo ago
EbSynth in Python, version 2
0 check-insNVIDIAAMDApple
fixie-ai/ultravoxupdated 6mo ago
A fast multimodal LLM for real-time voice
0 check-insNVIDIAAMDApple
Haoming02/sd-forge-ollamaupdated 6mo ago
Integrate LLM into Forge Webui via Ollama
0 check-insNVIDIAAMDApple
dinoBOLT/Gemini-Watermark-Removerupdated 6mo ago
An AI powered extension to get rid of the Gemini watermark
0 check-insNVIDIAAMDApple
GyanD/codexffmpegupdated 6mo ago
Support for https://www.gyan.dev/ffmpeg
0 check-insNVIDIAAMDApple
Capsize-Games/airunnerupdated 6mo ago
Offline inference engine for art, real-time voice conversations, LLM powered chatbots and automated workflows
0 check-insNVIDIAAMDApple