cocktailpeanut/bakllava.pinokioupdated 2y ago
llama.cpp with BakLLaVA model describes what does it see (https://github.com/Fuzzy-Search/realtime-bakllava)
@cocktailpeanut0 check-insNVIDIAAMDApple
cocktailpeanut/stable-diffusion-webui-forgeupdated 2y ago
Contribute to cocktailpeanut/stable-diffusion-webui-forge development by creating an account on GitHub.
@cocktailpeanut0 check-insNVIDIAAMDApple
continue-revolution/sd-webui-segment-anythingupdated 2y ago
Segment Anything for Stable Diffusion WebUI
0 check-insNVIDIAAMDApple
SUDO-AI-3D/zero123plusupdated 2y ago
Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.
0 check-insNVIDIAAMDApple
aolko/sd-webui-forge.pinokioupdated 2y ago
Stable Diffusion UI with patches by lllyasviel
0 check-insNVIDIAAMDApple
tjoen/infernosaber.pinokioupdated 2y ago
Flexible Automapper for Beatsaber made for any difficulty
0 check-insNVIDIAAMDApple
isurulkh/YouTube-Video-Summarizerupdated 2y ago
A Streamlit app that uses Google's AI to summarize YouTube video transcripts, providing concise, point-form notes. Perfect for quick content overviews.
0 check-insNVIDIAAMDApple
higgsfield-ai/higgsfieldupdated 2y ago
Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters
0 check-insNVIDIAAMDApple
coqui-ai/xtts-streaming-serverupdated 2y ago
Contribute to coqui-ai/xtts-streaming-server development by creating an account on GitHub.
0 check-insNVIDIAAMDApple
Jaden-J/Coqui-TTS-XTTS-v2-updated 2y ago
๐Ÿธ๐Ÿ’ฌ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
0 check-insNVIDIAAMDApple
cocktailpeanut/whisper-webuiupdated 2y ago
A Web UI for easy subtitle using whisper model.
@cocktailpeanut0 check-insNVIDIAAMDApple
Akegarasu/dataset-tag-editorupdated 2y ago
A fork of WebUI to edit dataset captions for txt2img models
0 check-insNVIDIAAMDApple
peanutcocktail/videocrafterupdated 2y ago
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
0 check-insNVIDIAAMDApple
cocktailpeanut/moondream1v1updated 2y ago
moondream1 is a tiny (1.6B parameter) vision language model trained by @vikhyatk that performs on par with models twice its size. It is trained on the LLaVa training dataset, and initialized with SigLIP as the vision tower and Phi-1.5 as the text encoder. https://huggingface.co/spaces/vikhyatk/moondream1
@cocktailpeanut0 check-insNVIDIAAMDApple
numz/sd-wav2lip-uhqupdated 2y ago
Wav2Lip UHQ extension for Automatic1111
0 check-insNVIDIAAMDApple
zsxkib/Moore-AnimateAnyoneupdated 2y ago
Unofficial Re-Trained AnimateAnyone (Image + DWPose Video โ†’ Animated Video of Image)
0 check-insNVIDIAAMDApple
Spaceish/facefusion-nsfwupdated 2y ago
nsfw protection bypass for the Next generation face swapper and enhancer
0 check-insNVIDIAAMDApple
peanutcocktail/photomakerupdated 2y ago
PhotoMaker
0 check-insNVIDIAAMDApple
candywrap/moore-animateanyone-for-windowsupdated 2y ago
Contribute to candywrap/Moore-AnimateAnyone-for-windows development by creating an account on GitHub.
0 check-insNVIDIAAMDApple
HunxByts/GhostTrackupdated 2y ago
Useful tool to track location or mobile number
0 check-insNVIDIAAMDApple