Store
A fast AI Video Generator for the GPU Poor. Supports Wan 2.1/2.2, Hunyuan Video, LTX Video and Flux.
Bytebot is a self-hosted AI desktop agent that automates computer tasks through natural language commands, operating within a containerized Linux desktop environment.
Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment
A powerful tool for extending images to different aspect ratios using Stable Diffusion XL.
Gradio UI for YuE music generation model
ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation
ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation
Pinokio app to install and run sdbds/YuE-for-windows, tuned defaults for a single RTX 4060 Ti 16GB GPU. Uses Torch 2.5.1+cu124 and requirements-uv.txt.
Contribute to thewh1teagle/phonikud-chatterbox development by creating an account on GitHub.
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface https://github.com/comfyanonymous/ComfyUI
🎞️ Subtitles generation tool (Web-UI + CLI + Python package) powered by OpenAI's Whisper and its variants 🎞️ - absadiki/subsai
DreamO: A Unified Framework for Image Customization
Transform lyric transcriptions into karaoke-style MP4 videos. Built on Python-Lyric-Transcriber, this Gradio UI uses Whisper for transcription, an LLM for lyric edits, and Demucs for vocal separation. A fun tool for karaoke fans, though outputs may vary.
Frontier Open-Source Text-to-Speech
Contribute to joellliu/DiffProtect development by creating an account on GitHub.
Chatbot Ollama is an open source chat UI for Ollama.
Pioneering Automated GUI Interaction with Native Agents
AirLLM 70B inference with single 4GB GPU
Dough is a open source tool for steering AI animations with precision
