OpenBMB/MiniCPM-oupdated 3mo ago
A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone
0 check-insNVIDIAAMDApple
Shubhamsaboo/awesome-llm-appsupdated 3mo ago
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.
0 check-insNVIDIAAMDApple
ravindergandhi/Roop-Floyd-Pinokiov1.0updated 3mo ago
Next-generation face-swapping and enhancement (Codeberg fork of Roop). Easy GUI for images & videos.
0 check-insNVIDIAAMDApple
funnn123/ff-pinokiov1.5updated 3mo ago
Industry leading face manipulation platform
6 check-insNVIDIAAMDApple
manat0912/DreamID-V-Pinokiov5.0updated 3mo ago
DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer
@manatheturipa1 check-inNVIDIAAMDApple
taylorchu/2cent-ttsupdated 3mo ago
Contribute to taylorchu/2cent-tts development by creating an account on GitHub.
0 check-insNVIDIAAMDApple
6Morpheus6/xtts.pinokiov3.7updated 3mo ago
clone voices into different languages by using just a quick 3-second audio clip. (a local version of https://huggingface.co/spaces/coqui/xtts)
@morpheus4 check-insNVIDIAAMDApple
pnnbao97/VieNeu-TTSupdated 3mo ago
Vietnamese TTS with instant voice cloning • On-device • Real-time CPU inference • 24kHz audio quality
0 check-insNVIDIAAMDApple
huggingface.co/spaces/fancyfeast/joy-caption-alpha-two
This application lets you upload an image and generate a caption tailored to your choice of style and length. You can select from options like descriptive, informal, or specific formats like traini...
0 check-insNVIDIAAMDApple
ishandutta2007/open-antigravityupdated 3mo ago
🚀🪐🌕🌑☄️🛸 Opensource equivalent of Google's Antigravity
0 check-insNVIDIAAMDApple
OpenHands/OpenHandsupdated 3mo ago
🙌 OpenHands: AI-Driven Development
0 check-insNVIDIAAMDApple
huggingface.co/Wan-AI/Wan2.2-TI2V-5B
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
0 check-insNVIDIAAMDApple
overcrash66/video-translatorupdated 3mo ago
Transform any video into a professional multilingual production with natural voice cloning, lip-sync, and on-screen text translation. No cloud APIs, no subscriptions, no data leaving your machine.
0 check-insNVIDIAAMDApple
bytedance/LatentSyncupdated 3mo ago
Taming Stable Diffusion for Lip Sync!
0 check-insNVIDIAAMDApple
huggingface.co/TheBloke/Trurl-2-13B-GGUF
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
0 check-insNVIDIAAMDApple
zsyOAOA/InvSRupdated 3mo ago
Arbitrary-steps Image Super-resolution via Diffusion Inversion (CVPR 2025)
0 check-insNVIDIAAMDApple
huggingface.co/papers/2312.08914
Join the discussion on this paper page
0 check-insNVIDIAAMDApple
huggingface.co/Qwen/Qwen3-TTS-12Hz-0.6B-Base
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
0 check-insNVIDIAAMDApple
huggingface.co/Comfy-Org/Wan_2.2_ComfyUI_Repackaged
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
0 check-insNVIDIAAMDApple
huggingface.co/spaces/alexnasa/Wan2.2-Animate-ZEROGPU
Edit Videos with Wan 2.2
0 check-insNVIDIAAMDApple