pinokiofactory/Hunyuan3d-2-lowvram v3.7 updated 25d ago
Text/Image to 3D (Cross Platform: Mac + Windows + Linux): High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models. https://github.com/deepbeepmeep/Hunyuan3D-2GP
Text-to-Speech for 16 Indian languages: Assamese, Bengali, Bodo, English (Indian accent), Hinglish, Gujarati, Hindi, Kannada, Malayalam, Manipuri, Marathi, Odia, Punjabi, Rajasthani, Tamil, and Telugu. SOTA models based on FastPitch and HiFi-GAN V1.
[NVIDIA ONLY] Super Optimized Gradio UI for Hunyuan Video Generator that works on GPU-poor machines. Generates videos up to 10~14 seconds long. https://github.com/deepbeepmeep/HunyuanVideoGP
[NVIDIA ONLY] Generate Video Progressively. FramePack is a next-frame (next-frame-section) prediction neural network structure that generates videos progressively. https://github.com/lllyasviel/FramePack
hoodtronik/musubi-tuner.pinokio v2.0 updated 28d ago
Train LoRA / LoHa / LoKr for Wan2.2, FLUX.2, Z-Image, HunyuanVideo, and more — one-click install of kohya-ss/musubi-tuner with its built-in Gradio GUI.
Pixel-Aligned 3D Generation from Images (SIGGRAPH 2026). Generate high-fidelity 3D GLB assets with PBR textures from a single image, powered by the TRELLIS.2 backbone. Native Windows install using prebuilt CUDA wheels (Python 3.12 / torch 2.8 / CUDA 12.8).