Store

Runs inference using HuggingFace models
interacting with the Ovis2-8B model. The script allows users to load the model, process image and video inputs, and generate text-based responses using a conversational chatbot.
MegaTTS app
自动优化seedance-1.0-pro提示词根据官方docs,一键生成无水印视频
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
One-click install and run Wav2Lip with preloaded models
Higgs Audio Text-to-Speech Playground
Running Quantized Higgs Audio (Web UI + OpenAI compatible API Server)
Mirror of git://git.ffmpeg.org/fateserver
Official inference repo for FLUX.1 models
Simple, scalable AI model deployment on GPU clusters
A collaboration friendly studio for NeRFs
Fully local AI vtuber that can see your screen and talk in real time
kernel mode mouse accel
This is the official implementation of our paper: "MiniMax-Remover: Taming Bad Noise Helps Video Object Removal"
Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer"(ICCV2025)