Store
MOSS-Audio is an open-source foundation model for unified audio understanding, enabling speech, sound, music, captioning, QA, and reasoning in real-world scenarios.
LTX-Desktop Video Generation + Editor - Powered By WanGP
A Python framework for AI-driven character animation using neural networks.
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
High-quality rapid TTS voice cloning model (150x+ realtime) — 48kHz speech, voice cloning
[ICLR 2026] RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO, designed for fine-tuning.
Wan2GP (Featured)
Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP
LivePortrait (Featured)
Bring portraits to life! https://github.com/KwaiVGI/LivePortrait
High-Quality Text-to-Speech for Indian Languages
Fast lip-sync application for smaller GPUs.
Forge (Featured)
[NVIDIA ONLY] The most efficient way to run FLUX (Optimized to run even on low memory machines, as low as 3GB VRAM with 512x512 resolution) https://github.com/lllyasviel/stable-diffusion-webui-forge
Automatically remove watermarks from videos generated by Sora AI.
Native and Compact Structured Latents for 3D Generation
[AMD ONLY] Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video, Flux and more. (On Windows supported by all dedicated AMD GPUs from RDNA 2 - RDNA 4)
Flexible automapper for Beat Saber, made for any difficulty
[SIGGRAPH 2026] AniGen: Unified S^3 Fields for Animatable 3D Asset Generation
Upload a short recording of the voice you want to change and a reference clip of the target voice (or leave it blank to anonymize). Adjust simple sliders for speed, pitch, and style, then the app c...
