humanaigc/swapanyheadupdated 10mo ago
Project page for ICCV 2025 paper "Controllable and Expressive One-Shot Video Head Swapping"
0 check-insNVIDIAAMDApple
openai/whisperupdated 10mo ago
Robust Speech Recognition via Large-Scale Weak Supervision - openai/whisper
0 check-insNVIDIAAMDApple
6Morpheus6/omnigen2v3.7updated 10mo ago
Unified Image Understanding and Generation. Text-to-Image Generation, In-context Generation, Instruction-guided Image Editing, Visual Understanding (Minimum Requirements 12GBV RAM / 48GB RAM, Recommended Requirements 24GB VRAM / 32GB RAM)
@morpheus0 check-insNVIDIAAMDApple
cocktailpeanutlabs/protov4.0updated 10mo ago
0 check-insNVIDIAAMDApple
Omodaka9375/MIDIfrenupdated 10mo ago
MIDIfren is an Audio Stem & MIDI Processor in Python🎵. Convert audio to MIDI, extract stems, sonify MIDI files ...
0 check-insNVIDIAAMDApple
cocktailpeanutlabs/triposrv1.2updated 10mo ago
a state-of-the-art open-source model for fast feedforward 3D reconstruction from a single image, developed in collaboration between Tripo AI and Stability AI. https://huggingface.co/spaces/stabilityai/TripoSR
3 check-insNVIDIAAMDApple
bmaltais/kohya_ssupdated 10mo ago
Contribute to bmaltais/kohya_ss development by creating an account on GitHub.
0 check-insNVIDIAAMDApple
presenton/presenton_dockerupdated 10mo ago
Contribute to presenton/presenton_docker development by creating an account on GitHub.
0 check-insNVIDIAAMDApple
presenton/presenton_electronupdated 10mo ago
Contribute to presenton/presenton_electron development by creating an account on GitHub.
0 check-insNVIDIAAMDApple
reo224/FLUX_Lora_Tarin_202506v2.1updated 10mo ago
[NVIDIA Only] Dead simple web UI for training FLUX LoRA with LOW VRAM support (From 12GB)
0 check-insNVIDIAAMDApple
Endergr/ai-video-generatorupdated 10mo ago
Full-stack AI video generation app with image/text input and premium NSFW toggle
0 check-insNVIDIAAMDApple
Rudrabha/Wav2Lipupdated 10mo ago
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
0 check-insNVIDIAAMDApple
Myumiitsu/propainter.pinokioupdated 10mo ago
ICCV‑23 video in‑/out‑painting
0 check-insNVIDIAAMDApple
mannaandpoem/OpenManusupdated 10mo ago
Contribute to mannaandpoem/OpenManus development by creating an account on GitHub.
0 check-insNVIDIAAMDApple
6Morpheus6/iopaint-pinokiov3.7updated 10mo ago
Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or even people from your pictures, and replace (powered by stable diffusion) anything in your pictures. https://www.iopaint.com/
@morpheus1 check-inNVIDIAAMDApple
elloza/slides2video-pinokioupdated 10mo ago
Public repository of the slides2video app for pinokio
0 check-insNVIDIAAMDApple
lokesh476/IndicF5-Pinokiov1.0updated 11mo ago
Text-to-Speech using IndicF5 for Indian languages
0 check-insNVIDIAAMDApple
CarlGao4/Demucs-Guiupdated 11mo ago
A GUI for music separation AI demucs
0 check-insNVIDIAAMDApple
Deathdadev/Direct3D-S2-Pinokiov3.7updated 11mo ago
[NVIDIA ONLY] Direct3D-S2 is a scalable 3D shape generation framework leveraging sparse volumetric representations for high-resolution outputs. It features Spatial Sparse Attention (SSA), a novel mechanism that accelerates Diffusion Transformer computations on sparse data, achieving up to 9.6× speedup in training. The unified Sparse VAE architecture maintains a consistent sparse volumetric format across input, latent, and output stages, significantly improving efficiency and stability.
@death0 check-insNVIDIAAMDApple
appotry/GLM4Voicev1.0updated 11mo ago
GLM-4-Voice | 端到端中英语音对话模型
0 check-insNVIDIAAMDApple