Store
智能视频多语言AI配音/翻译工具 - Linly-Dubbing — “AI赋能,语言无界”
Quantized Attention that achieves speedups of 2.1-3.1x and 2.7-5.1x compared to FlashAttention2 and xformers, respectively, without lossing end-to-end metrics across various models.
Synthetic identity documents dataset
A Gradio web UI for Large Language Models https://github.com/oobabooga/text-generation-webui
[CVPR 2025] MatAnyone: Stable Video Matting with Consistent Memory Propagation
Text2midi is the first end-to-end model for generating MIDI files from textual descriptions. By leveraging pretrained large language models and a powerful autoregressive transformer decoder, text2midi allows users to create symbolic music that aligns with detailed textual prompts, including musical attributes like chords, tempo, and style.
Contribute to Sebix599/Ford-Radio-Code development by creating an account on GitHub.
An editing tool that uses AI to transcribe, understand content and search for anything in your footage, integrated with ChatGPT and other AI models
A new one shot head swapping approach
Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or even people from your pictures, and replace (powered by stable diffusion) anything in your pictures. https://www.iopaint.com/
LLM-Based Pseudo Music Captioning
Contribute to Erisvaldo2/Corte-autom-tico-v-deo-YouTube- development by creating an account on GitHub.
Next generation face swapper and enhancer

A one-click installer for setting up RFdiffusion using Pinokio.
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
A local and uncensored AI entity.
Official Code for MotionCtrl [SIGGRAPH 2024]
[WINDOWS/LINUX ONLY] Easily train a good VC model with voice data <= 10 mins!: https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI