Store
[CVPR 2025 Highlight] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos
from pumpCurry
Generates Minecraft skins with a text prompt using the HuggingFace "monadical-labs/minecraft-skin-generator" model.
This is an official PyTorch implementation of our NeurIPS 2023 paper "GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization"
Contribute to yyang181/colormnet development by creating an account on GitHub.
JARVIS - A ASSISTANT WHICH IS NEEDED FOR EVERYONE
A Fully Self-Hosted Solution for Full-Duplex Voice Interaction - FireRedTeam/FireRedChat
[NVIDIA ONLY] Text-driven, intelligent restoration, blending AI technology with creativity to give every image a brand new life https://supir.xpixel.group
tts for home assistant [vosk-tts]
A fast AI Video Generator for the GPU Poor. Supports Wan 2.1/2.2, Qwen Image, Hunyuan Video, LTX Video and Flux.
A simple, high-quality image generation tool to create stunning illusions.
Stable Diffusion Trainer: https://github.com/bmaltais/kohya_ss
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
Speech Toolkit for Malaysian language, https://malaya-speech.readthedocs.io/
ComfyUI custom node for the VibeVoice TTS. Expressive, long-form, multi-speaker conversational audio
ScriptFlow: Free, open-source AI-powered transcription tool. Transcribe any video / playlist or audio from files, YouTube, or 1000+ websites with a fast, lightweight, and private GUI application, and Many Export Formats.
OpenRecall is a fully open-source, privacy-first alternative to proprietary solutions like Microsoft's Windows Recall. With OpenRecall, you can easily access your digital history, enhancing your memory and productivity without compromising your privacy.
Contribute to AI4Bharat/IndicF5 development by creating an account on GitHub.
