Turn Video Moments into Hand-Drawn Stories
The official code of Yume
MoCha: End-to-End Video Character Replacement without Structural Guidance
A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!
kijai/ComfyUI-FramePackWrapper: GitHub repository.
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
aura-sr-upscaler (Featured)
AuraSR-v2 - An open reproduction of the GigaGAN Upscaler from fal.ai https://huggingface.co/spaces/gokaygokay/AuraSR-v2
OCR model that handles complex tables, forms, and handwriting with full layout.
Autoforge takes a picture and generates a 3D layer STL file that you can print with a 3D printer.
Text To Speech Synthesis with Vosk
presenton/presenton-js-sdk: GitHub repository.
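AI资讯日报 (AI News Daily) is a content aggregation and generation platform powered by Cloudflare Workers. Each day it curates the latest developments in AI, including industry news, trending open-source projects, cutting-edge academic papers, and social media posts from prominent tech figures. It processes and summarizes them with Google Gemini models, then automatically publishes the result to GitHub Pages as a daily AI report.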
AMD 0.9B efficient text-to-video diffusion model
Invoke (Featured)
The Gen AI Platform for Pro Studios https://github.com/invoke-ai/InvokeAI
A ComfyUI custom node for 3D camera angle control. Provides an interactive Three.js viewport to adjust camera angles and outputs formatted prompt strings for multi-angle image generation.
