Store
End-to-end realtime stack for connecting humans and AI
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Open-source text-to-speech for European languages with voice cloning
MimikaStudio - Flutter Web + Python: Voice Cloning, TTS & Audiobook Creator
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
Industry leading face manipulation platform
Fast, high-quality video generation with audio, using FA3
Generate 3D models from images, capture angles, and create new images with AI
hallo (Featured)
[NVIDIA Only] Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation https://github.com/fudan-generative-vision/hallo
GUI-focused roop
Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice cloning.
Synthalingua - Real Time Translation
Contribute to LukaDarsalia/colormnet_dinov3 development by creating an account on GitHub.
StoryDiffusion Comics (Featured)
Create a story by generating consistent images. https://github.com/HVision-NKU/StoryDiffusion
