Wanted
1,640 projectsNon-launcher projects without a Pinokio launcher yet.
Translate the video from one language to another and embed dubbing & subtitles.
Industry leading face manipulation platform
Contribute to veo-3/veo-3 development by creating an account on GitHub.
Focus even better on prompting and generating
🎙️ Qwen3-TTS-DubFlow: An open-source, human-in-the-loop AI dubbing workbench for novels, games, podcasts, and more. Features a "Design-then-Clone" workflow powered by Qwen3-TTS to achieve consistent identity and context-aware emotional performance.
Modified version of Chatterbox that accepts text files as input and no character restrictions. I use it to make audiobooks, especially for my kids.
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech
DeepFaceLab is the leading software for creating deepfakes.
From Images to High-Fidelity 3D Assets with Production-Ready PBR Material
Stable Diffusion web UI
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
A Gradio-based web UI for voice cloning and voice design, powered by Qwen3-TTS & VibeVoice. Can use Whisper or VibeVoice-ASR for automatic transcription.
Simple and easy to use DDNS. Support Aliyun, Tencent Cloud, Dnspod, Cloudflare, Callback, Huawei Cloud, Baidu Cloud, Porkbun, GoDaddy, Namecheap, NameSilo...
MOVA: Towards Scalable and Synchronized Video–Audio Generation
Send files from one device to many in real-time.
Image Background Removal Toolkit - Open Source and API Models
⚡️ Blazing-fast batch subtitle translation for SRT/ASS/VTT/LRC — 70+ languages, AI-powered 批量字幕翻译
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Lets make video diffusion practical!
