SakanaAI/AI-Scientist-v2updated 4mo ago
The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search
0 check-insNVIDIAAMDApple
SakanaAI/AI-Scientistupdated 4mo ago
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬
0 check-insNVIDIAAMDApple
apple/ml-sharpupdated 4mo ago
Sharp Monocular View Synthesis in Less Than a Second
0 check-insNVIDIAAMDApple
MeiGen-AI/InfiniteTalkupdated 4mo ago
​​Unlimited-length talking video generation​​ that supports image-to-video and video-to-video generation
0 check-insNVIDIAAMDApple
MeiGen-AI/MultiTalkupdated 4mo ago
[NeurIPS 2025] Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation
0 check-insNVIDIAAMDApple
Ordinary0x/The-3rd-Eyeupdated 4mo ago
The 3rd Eye is a modular OSINT (Open Source Intelligence) framework built on an agent-based, graph-driven architecture. It automates public information discovery, identity correlation, and exposure analysis across multiple platforms, and generates structured intelligence reports. The system follows a LangGraph agent design.
0 check-insNVIDIAAMDApple
hendrybui/facefusionupdated 4mo ago
Industry leading face manipulation platform
0 check-insNVIDIAAMDApple
Tencent-Hunyuan/HunyuanWorld-1.0updated 4mo ago
Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model
0 check-insNVIDIAAMDApple
bytedance/Dolphinupdated 4mo ago
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
0 check-insNVIDIAAMDApple
zyddnys/manga-image-translatorupdated 4mo ago
Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/ (no longer working)
0 check-insNVIDIAAMDApple
mmehmetisik/ai-text-to-image-generatorupdated 4mo ago
AI-powered image generation tool using Hugging Face API and Stable Diffusion. Create images from text prompts with multiple style options.
0 check-insNVIDIAAMDApple
Light-x02/ComfyUI-Civitai-Discovery-Hubupdated 4mo ago
This ComfyUI node lets you browse the Civitai gallery directly within the interface, featuring infinite scroll, advanced filters (including NSFW), and favorites management. It also allows you to retrieve prompts, metadata, and images/videos to seamlessly reuse them in your workflows.
0 check-insNVIDIAAMDApple
Stability-AI/generative-modelsupdated 4mo ago
Generative Models by Stability AI
0 check-insNVIDIAAMDApple
FlyMyAI/flymyai-lora-trainerupdated 4mo ago
Qwen-Image text to image lora trainer
0 check-insNVIDIAAMDApple
nateraw/stable-diffusion-videosupdated 4mo ago
Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts
0 check-insNVIDIAAMDApple
gojodennis/OpenKombaiupdated 5mo ago
OpenKombai: A free, privacy-first alternative to Kombai. Instantly convert screenshots and designs into production-ready React + Tailwind code using local LLMs (Llama 3.2 Vision & Qwen 2.5). No API keys, zero cloud costs.
0 check-insNVIDIAAMDApple
beeble-ai/SwitchLight-Studioupdated 5mo ago
Contribute to beeble-ai/SwitchLight-Studio development by creating an account on GitHub.
0 check-insNVIDIAAMDApple
Zheng-Chong/CatVTONupdated 5mo ago
[ICLR 2025] CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) and 3) Simplified Inference (< 8G VRAM for 1024X768 resolution).
0 check-insNVIDIAAMDApple
gotoolkits/ClearerVoice-Studiov2.0updated 5mo ago
0 check-insNVIDIAAMDApple
iamdinhthuan/viterbox-ttsupdated 5mo ago
Contribute to iamdinhthuan/viterbox-tts development by creating an account on GitHub.
0 check-insNVIDIAAMDApple