Store
High-Quality Text-to-Speech for Indian Languages
[ICLR 26 Oral] Stable Video Infinity: Infinite-Length Video Generation with Error Recycling
NanoBanana PPT Skills 基于 AI 自动生成高质量 PPT 图片和视频的强大工具,支持智能转场和交互式播放
Create 3D Meshes of Objects from Images.
State of the art OSINT tool. | A powerful open-source alternative to other face search engines.
ComfyUI HiTem3D Integration - Generate 3D models from images using HiTem3D API
A GUI for masking/rotoscoping video using AI models
[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.
A SillyTavern extension that creates character with LLMs.
Simple and easy to use DDNS. Support Aliyun, Tencent Cloud, Dnspod, Cloudflare, Callback, Huawei Cloud, Baidu Cloud, Porkbun, GoDaddy, Namecheap, NameSilo...
Contribute to POWERFULMOVES/PMOVES.AI development by creating an account on GitHub.
Forked from Wan2GP, a fast AI Video Generator, for the Apple Silicon.
Contribute to SUP3RMASS1VE/RoopUnleashed development by creating an account on GitHub.
Speech-to-text with NVIDIA Canary in Rust
Video inpainting (object removal / video completion) - sczhou/ProPainter
DiffRhythmFeatured
Generate songs with AI (up to 4 min 45 sec). Both with lyrics or instrumental https://github.com/ASLP-lab/DiffRhythm
Official inference repo for FLUX.2 models
Added support for russian language in train/inference scripts + example of train 60 hours
An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
