Launcher updates

More
PKU-YuanGroup/Open-Sora-Planupdated 8mo ago
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
0 check-insNVIDIAAMDApple
showlab/liveccupdated 8mo ago
LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025)
0 check-insNVIDIAAMDApple
asdf31jsa/VisoMaster-Experimentalupdated 8mo ago
Powerful & Easy-to-Use Video Face Swapping and Editing Software
0 check-insNVIDIAAMDApple
tencent/hunyuan3d-2updated 8mo ago
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
0 check-insNVIDIAAMDApple
Feedjer/RVC-realtimev2.0updated 8mo ago
[WINDOWS/LINUX ONLY] Easily train a good VC model with voice data <= 10 mins!: https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI
1 check-inNVIDIAAMDApple
jpuigcerver/pylaiaupdated 8mo ago
A deep learning toolkit specialized for handwritten document analysis
0 check-insNVIDIAAMDApple
rbbrdckybk/civitai-companionupdated 8mo ago
Utility for extracting prompt metadata from Civitai AI images, auto-downloading the associated resources, and outputting/formatting the prompt information.
0 check-insNVIDIAAMDApple
SUP3RMASS1VE/Index-TTS-2-Pinokiov3.7updated 8mo ago
@sup3rmass1ve0 check-insNVIDIAAMDApple
wendy7756/AI-Video-Transcriberupdated 8mo ago
Transcribe and summarize video content using AI. Open-source, multi-platform, and supports multiple languages.
0 check-insNVIDIAAMDApple
stackblitz-labs/bolt.diyupdated 8mo ago
Prompt, run, edit, and deploy full-stack web applications using any LLM you want!
0 check-insNVIDIAAMDApple
jpgallegoar/Spanish-F5updated 8mo ago
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
0 check-insNVIDIAAMDApple
wengjiyao/upscale-enhanceupdated 8mo ago
High-quality video and image super-resolution powered by Real-ESRGAN. Upscale your media with advanced AI.
0 check-insNVIDIAAMDApple
argosopentech/argos-translateupdated 8mo ago
Open-source offline translation library written in Python
0 check-insNVIDIAAMDApple
PRITHIVSAKTHIUR/dots.ocr-fix-demoupdated 8mo ago
This Gradio application demonstrates the capabilities of the "dots.ocr" model, a powerful multilingual document parser.
0 check-insNVIDIAAMDApple
eddycrack864/uvr5-uiupdated 8mo ago
Ultimate Vocal Remover 5 with Gradio UI. Separate an audio file into various stems, using multiple models
0 check-insNVIDIAAMDApple
TheAwaken1/StoryCraftv3.7updated 8mo ago
Generate engaging 1 to 5-minute short stories with LLMs and convert them to audio with Coqui TTS, supports voice cloning, built in speakers and multilingual.
@theawakenone0 check-insNVIDIAAMDApple
maudoin/ollama-voiceupdated 8mo ago
plug whisper audio transcription to a local ollama server and ouput tts audio responses
0 check-insNVIDIAAMDApple
Tencent-Hunyuan/Hunyuan3D-Omniupdated 8mo ago
Hunyuan3D-Omni: A Unified Framework for Controllable Generation of 3D Assets
0 check-insNVIDIAAMDApple
Tencent-Hunyuan/Hunyuan3D-2.1updated 8mo ago
From Images to High-Fidelity 3D Assets with Production-Ready PBR Material
0 check-insNVIDIAAMDApple
ali-vilab/VACEupdated 8mo ago
[ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing
0 check-insNVIDIAAMDApple