Launcher updates

Feedjer/cambrian.pinokio | v1.5 | updated 1y ago
[Need 24GB VRAM] Cambrian-1 is a family of multimodal LLMs with a vision-centric design: https://github.com/cambrian-mllm/cambrian
0 check-ins | NVIDIA, AMD, Apple
pinokiofactory/dough | v1 | updated 1y ago
Dough is an open-source tool for steering AI animations with precision.
1 check-in | NVIDIA, AMD, Apple
cocktailpeanutlabs/moondream1 | v1.1 | updated 1y ago
moondream1 is a tiny (1.6B parameter) vision language model trained by @vikhyatk that performs on par with models twice its size. It is trained on the LLaVA training dataset and initialized with SigLIP as the vision tower and Phi-1.5 as the text encoder. https://huggingface.co/spaces/vikhyatk/moondream1
0 check-ins | NVIDIA, AMD, Apple
rimsila/fooocus-API-pinokio | v1.5 | updated 2y ago
1 check-in | NVIDIA, AMD, Apple
GivEN29/autogen-studio-pinokio | updated 2y ago
Declaratively define and modify agents and multi-agent workflows through a point and click, drag and drop interface (e.g., you can select the parameters of two agents that will communicate to solve your task).
0 check-ins | NVIDIA, AMD, Apple
cocktailpeanut/bark.pinokio | updated 2y ago
Upload a clean 20-second WAV file of the vocal persona you want to mimic, type your text-to-speech prompt, and hit submit! A local version of https://huggingface.co/spaces/fffiloni/instant-TTS-Bark-cloning
@cocktailpeanut | 0 check-ins | NVIDIA, AMD, Apple
cocktailpeanut/ms-video2video.pinokio | updated 2y ago
Enhances the resolution and spatiotemporal continuity of text-generated and image-generated videos.
@cocktailpeanut | 0 check-ins | NVIDIA, AMD, Apple
cocktailpeanut/xinference.pinokio | updated 2y ago
LLM Web UI and API
@cocktailpeanut | 0 check-ins | NVIDIA, AMD, Apple
cocktailpeanut/sdxl-turbo | updated 2y ago
A Real-Time Text-to-Image Generation Model
@cocktailpeanut | 2 check-ins | NVIDIA, AMD, Apple
cocktailpeanutlabs/paligemma | v1.5 | updated 2y ago
An open vision-language model by Google. PaliGemma is designed as a versatile model for transfer to a wide range of vision-language tasks, such as image and short video captioning, visual question answering, text reading, object detection, and object segmentation. https://huggingface.co/spaces/google/paligemma
0 check-ins | NVIDIA, AMD, Apple
supersonic13/sillytavern-pinokio | v1.5 | updated 2y ago
Brought to you by Cohee, RossAscends, and the SillyTavern community, SillyTavern is a local-install interface that allows you to interact with text generation AIs (LLMs) to chat and roleplay with custom characters.
1 check-in | NVIDIA, AMD, Apple
cocktailpeanutlabs/invokeai | v1.1 | updated 2y ago
Generative AI for Professional Creatives
0 check-ins | NVIDIA, AMD, Apple
Feedjer/ConsistentID.pinokio | v1.5 | updated 2y ago
Customized ID Consistent for human: https://github.com/JackAILab/ConsistentID
0 check-ins | NVIDIA, AMD, Apple
bycloud-ai/rerender_a_video-windows | updated 2y ago
Rerender_A_Video: Zero-Shot Text-Guided Video-to-Video Translation
0 check-ins | NVIDIA, AMD, Apple
Shahnab/singing-songstarter.pinokio | v1.5 | updated 2y ago
1 check-in | NVIDIA, AMD, Apple
Shyk92/XTTS-RVC-UI.pinokio | updated 2y ago
A Gradio UI for XTTSv2 and RVC, allowing for real-time voice conversion.
0 check-ins | NVIDIA, AMD, Apple
supersonic13/superprompter-pinokio | v1.0 | updated 2y ago
SuperPrompter is a Python-based application that utilises the SuperPrompt-v1 model to generate optimised text prompts for AI/LLM image generation (for use with Stable Diffusion, etc.) from user prompts.
0 check-ins | NVIDIA, AMD, Apple
supersonic13/onetrainer-pinokio | v1.2 | updated 2y ago
The script utilizes various deep learning models to create detailed character cards, including names, summaries, personalities, greeting messages, and character avatars.
0 check-ins | NVIDIA, AMD, Apple
ngoqquyen/facefusion-pinokio | v1 | updated 2y ago
Next generation face swapper and enhancer
2 check-ins | NVIDIA, AMD, Apple
cocktailpeanut/bakllava.pinokio | updated 2y ago
llama.cpp with the BakLLaVA model describes what it sees (https://github.com/Fuzzy-Search/realtime-bakllava)
@cocktailpeanut | 0 check-ins | NVIDIA, AMD, Apple