xxlong0/Wonder3Dupdated 1y ago
Single Image to 3D using Cross-Domain Diffusion for 3D Generation
0 check-insNVIDIAAMDApple
cocktailpeanut/zonosupdated 1y ago
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with鈥攐r even surpassing鈥攖op TTS providers.
@cocktailpeanut0 check-insNVIDIAAMDApple
sdbds/MangaNinjia-for-windowsupdated 1y ago
Official implementation of "MangaNinja: Line Art Colorization with Precise Reference Following"
0 check-insNVIDIAAMDApple
SociallyIneptWeeb/AICoverGenupdated 1y ago
A WebUI to create song covers with any RVC v2 trained AI voice from YouTube videos or audio files.
0 check-insNVIDIAAMDApple
huggingface/Qwen2.5-Coderupdated 1y ago
Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.
0 check-insNVIDIAAMDApple
deepbeepmeep/YuEGPupdated 1y ago
YuE: Open Full-song Generation Foundation for the GPU Poor
0 check-insNVIDIAAMDApple
kfatehi/IOPaint.pinokioupdated 1y ago
A free and open-source inpainting tool powered by SOTA AI model https://github.com/Sanster/IOPaint
0 check-insNVIDIAAMDApple
Dawn-India/Z-Mirrorupdated 1y ago
Official Zee Repository: Telegram bot which can download direct links, torrents, nzb, google drive, telegram document, mega links, any file/folder from rclone supported clouds, all yt-dlp supported sites and jdownloader supported sites, then upload them to google drive, telegram cloud or to one of rclone supported clouds.
0 check-insNVIDIAAMDApple
gabotechs/MusicGPTupdated 1y ago
Generate music based on natural language prompts using LLMs running locally
0 check-insNVIDIAAMDApple
alisson-anjos/YuE-exllamav2-UIupdated 1y ago
Contribute to alisson-anjos/YuE-exllamav2-UI development by creating an account on GitHub.
0 check-insNVIDIAAMDApple
MDShoons/RVC-V3-v2.0updated 1y ago
A simple, high-quality voice conversion tool focused on ease of use and performance. https://github.com/IAHispano/Applio
1 check-inNVIDIAAMDApple
RobViren/kokovoicelabupdated 1y ago
A cli app for experimenting with kokoro voice creating and mixing using the available voices to interpolate new ones
0 check-insNVIDIAAMDApple
advimman/lamaupdated 1y ago
馃 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
0 check-insNVIDIAAMDApple
saic-mdal/lamaupdated 1y ago
馃 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
0 check-insNVIDIAAMDApple
mesolitica/MeloTTS-MSupdated 1y ago
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese, Korean and Malay.
0 check-insNVIDIAAMDApple
yxzysy/YFJanusv3.2updated 1y ago
0 check-insNVIDIAAMDApple
matatonic/openedai-whisperupdated 1y ago
An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper.
0 check-insNVIDIAAMDApple
visomaster/visomaster-assetsupdated 1y ago
Contribute to visomaster/visomaster-assets development by creating an account on GitHub.
0 check-insNVIDIAAMDApple
Gourieff/sd-webui-reactor-sfwupdated 1y ago
(SFW Friendly) Fast and Simple Face Swap Extension for StableDiffusion WebUI (A1111, SD.Next, Cagliostro)
0 check-insNVIDIAAMDApple
stefantrajanov/fluxgym-trainerv2.1updated 1y ago
[NVIDIA Only] Dead simple web UI for training FLUX LoRA with LOW VRAM support (From 12GB)
0 check-insNVIDIAAMDApple