Store

AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of advanced features, such as a settings page, low VRAM support, DeepSpeed, narrator, model finetuning, custom models, wav file maintenance. It can also be used with 3rd Party software via JSON calls.
AutoShorts can generate short videos with the help of AI. It can generate popular types of video seen on YouTube Shorts and TikTok.
A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
Customized fork of Rope Deepfake software featuring live streaming capabilities and support for Deepfacelive models
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
AI powered speech denoising and enhancement
Generative AI for Professional Creatives
a multiplayer street view bingo game
A minimal and universal controller for FLUX.1 https://github.com/Yuanshi9815/OminiControl
Control browsers
UI for Image database management: https://github.com/Eugeoter/waifuset
A Web UI for easy subtitle using fish-speech model.
Easily train a good VC model with voice data <= 10 mins!
بوت باحث لتمكين المستخدمين من الاستفادة من مميزات المنصة من خلال تطبيق Telegram
chat-with-mlxFeatured
[Mac Onlyl] An all-in-one LLMs Chat UI for Apple Silicon Mac using MLX Framework. https://github.com/qnguyen3/chat-with-mlx