fox1245/Wan2GPupdated 8mo ago
A fast AI Video Generator for the GPU Poor. Supports Wan 2.1/2.2, Hunyuan Video, LTX Video and Flux.
0 check-insNVIDIAAMDApple
bytebot-ai/bytebotupdated 8mo ago
Bytebot is a self-hosted AI desktop agent that automates computer tasks through natural language commands, operating within a containerized Linux desktop environment.
0 check-insNVIDIAAMDApple
Phantom-video/Phantomupdated 8mo ago
Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment
0 check-insNVIDIAAMDApple
SUP3RMASS1VE/Re-Size-Image-Outpaintv3.7updated 8mo ago
A powerful tool for extending images to different aspect ratios using Stable Diffusion XL.
@sup3rmass1ve0 check-insNVIDIAAMDApple
liinlin88888-bot/YuE-UIupdated 8mo ago
Gradio UI for YuE music generation model
1 check-inNVIDIAAMDApple
hzwer/ECCV2022-RIFEupdated 8mo ago
ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation
0 check-insNVIDIAAMDApple
megvii-research/ECCV2022-RIFEupdated 8mo ago
ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation
0 check-insNVIDIAAMDApple
liinlin88888-bot/YuE-for-windowsv0.2updated 8mo ago
Pinokio app to install and run sdbds/YuE-for-windows, tuned defaults for a single RTX 4060 Ti 16GB GPU. Uses Torch 2.5.1+cu124 and requirements-uv.txt.
0 check-insNVIDIAAMDApple
thewh1teagle/phonikud-chatterboxupdated 8mo ago
Contribute to thewh1teagle/phonikud-chatterbox development by creating an account on GitHub.
0 check-insNVIDIAAMDApple
cocktailpeanutlabs/comfyuiv1.3updated 8mo ago
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface https://github.com/comfyanonymous/ComfyUI
2 check-insNVIDIAAMDApple
absadiki/subsaiupdated 8mo ago
🎞️ Subtitles generation tool (Web-UI + CLI + Python package) powered by OpenAI's Whisper and its variants 🎞️ - absadiki/subsai
0 check-insNVIDIAAMDApple
SUP3RMASS1VE/DreamOv3.7updated 8mo ago
DreamO: A Unified Framework for Image Customization
@sup3rmass1ve0 check-insNVIDIAAMDApple
TheAwaken1/AIraoke-Pinokiov2.0updated 8mo ago
Transform lyric transcriptions into karaoke-style MP4 videos. Built on Python-Lyric-Transcriber, this Gradio UI uses Whisper for transcription, an LLM for lyric edits, and Demucs for vocal separation. A fun tool for karaoke fans, though outputs may vary.
@theawakenone1 check-inNVIDIAAMDApple
Deathdadev/DetailGen3Dv3.7updated 8mo ago
@death0 check-insNVIDIAAMDApple
harry2141985/VibeVoiceupdated 8mo ago
Frontier Open-Source Text-to-Speech
0 check-insNVIDIAAMDApple
joellliu/DiffProtectupdated 8mo ago
Contribute to joellliu/DiffProtect development by creating an account on GitHub.
0 check-insNVIDIAAMDApple
ivanfioravanti/chatbot-ollamaupdated 8mo ago
Chatbot Ollama is an open source chat UI for Ollama.
0 check-insNVIDIAAMDApple
bytedance/UI-TARSupdated 8mo ago
Pioneering Automated GUI Interaction with Native Agents
0 check-insNVIDIAAMDApple
lyogavin/airllmupdated 8mo ago
AirLLM 70B inference with single 4GB GPU
0 check-insNVIDIAAMDApple
banodoco/Dough-pinokiov1updated 8mo ago
Dough is a open source tool for steering AI animations with precision
0 check-insNVIDIAAMDApple