lokesh476/IndicF5-Pinokio v1.0 updated 10mo ago
Text-to-Speech using IndicF5 for Indian languages
0 check-ins NVIDIA AMD Apple
Deathdadev/Direct3D-S2-Pinokio v3.7 updated 11mo ago
[NVIDIA ONLY] Direct3D-S2 is a scalable 3D shape generation framework leveraging sparse volumetric representations for high-resolution outputs. It features Spatial Sparse Attention (SSA), a novel mechanism that accelerates Diffusion Transformer computations on sparse data, achieving up to 9.6× speedup in training. The unified Sparse VAE architecture maintains a consistent sparse volumetric format across input, latent, and output stages, significantly improving efficiency and stability.
@death 0 check-ins NVIDIA AMD Apple
appotry/GLM4Voice v1.0 updated 11mo ago
GLM-4-Voice | End-to-end Chinese-English spoken dialogue model
0 check-ins NVIDIA AMD Apple
Feedjer/LocalAIVtuber v2.0 updated 11mo ago
A tool for hosting AI vtubers that runs fully locally and offline: https://github.com/0Xiaohei0/LocalAIVtuber
0 check-ins NVIDIA AMD Apple
mgalore/fluxgym-enhanced v2.1 updated 11mo ago
[NVIDIA Only] Dead simple web UI for training FLUX LoRA with LOW VRAM support (From 12GB)
1 check-in NVIDIA AMD Apple
cocktailpeanut/control-lora.comfyui.pinokio updated 11mo ago
Install Control-Lora Models and Workflows to ComfyUI with 1 click
@cocktailpeanut 0 check-ins NVIDIA AMD Apple
SUP3RMASS1VE/Fish-Speech v3.7 updated 11mo ago
@sup3rmass1ve 0 check-ins NVIDIA AMD Apple
SUP3RMASS1VE/HunyuanPortrait v3.7 updated 11mo ago
@sup3rmass1ve 0 check-ins NVIDIA AMD Apple
TheAwaken1/AutoGif-Pinokio v2.0 updated 11mo ago
Transform YouTube videos into stunning animated GIFs with perfectly-timed, stylized subtitles and eye-catching effects.
@theawakenone 1 check-in NVIDIA AMD Apple
peanutcocktail/ghtest v3.7 updated 11mo ago
github
0 check-ins NVIDIA AMD Apple
petermg/DreamO_Pinokio v3.7 updated 11mo ago
0 check-ins NVIDIA AMD Apple
Deathdadev/Gepeto-improved v3.7 updated 11mo ago
@death 1 check-in NVIDIA AMD Apple
SUP3RMASS1VE/Kokoro-TTS v3.2 updated 11mo ago
A local implementation of the Kokoro Text-to-Speech model
@sup3rmass1ve 0 check-ins NVIDIA AMD Apple
supersonic13/sdxs-pinokio v1.2 updated 11mo ago
SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions. https://github.com/halr9000/sdxs
0 check-ins NVIDIA AMD Apple
SUP3RMASS1VE/SD-Next v3.7 updated 11mo ago
SD.Next: All-in-one WebUI for AI generative image and video creation
@sup3rmass1ve 0 check-ins NVIDIA AMD Apple
SUP3RMASS1VE/IC-Light-Ultimate-Studio v3.7 updated 11mo ago
This project is an enhanced version of the IC-Light repository, designed for advanced image relighting and enhancement using Stable Diffusion and deep learning techniques
@sup3rmass1ve 0 check-ins NVIDIA AMD Apple
patbhakta/HuibleTTS v3.7 updated 11mo ago
Dia is a 1.6B parameter text to speech model created by Nari Labs. Dia directly generates highly realistic dialogue from a transcript. You can condition the output on audio, enabling emotion and tone control. The model can also produce nonverbal communications like laughter, coughing, clearing throat, etc. https://github.com/nari-labs/dia
0 check-ins NVIDIA AMD Apple
drago87/TabbyAPI-Pinokio v2.0 updated 1y ago
A local-install LLM backend
@drago87 0 check-ins NVIDIA AMD Apple
petermg/InfiniteYou_Flux_LoRA_Support v3.7 updated 1y ago
[NVIDIA ONLY - WINDOWS ONLY] InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity [LoRA support fork] https://github.com/petermg/InfiniteYou
0 check-ins NVIDIA AMD Apple