Store
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Multi-Identity Consistency for Image Customization via Matching Reward https://github.com/bytedance/UMO
Run Wan2GP on Google Colab
Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI, specializing in vision-language reasoning.
# SillyTavern Character Generator
A pinokio script for https://github.com/Tremontaine/character-card-generator
When used with KoboldCPP, use http://localhost:5001/v1, where 5001 is the port KoboldCPP reports at startup.
The Text API Key field must be filled in. Leaving it empty causes an error, so enter any value.
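The two settings above can be sketched as a small helper. This is only an illustration: the function name `koboldcpp_settings`, the dictionary keys, and the `"placeholder"` value are assumptions, not part of the generator or of KoboldCPP.

```python
def koboldcpp_settings(port=5001, host="localhost", api_key=""):
    """Build the two values described above (hypothetical helper).

    port: the port KoboldCPP reports at startup (5001 in the example above).
    api_key: any non-empty string; an empty key is replaced by a dummy value
             because the field must not be left blank.
    """
    base_url = f"http://{host}:{port}/v1"
    # Any value is accepted, so fall back to a dummy string when none is given.
    key = api_key.strip() or "placeholder"
    return {"base_url": base_url, "api_key": key}


if __name__ == "__main__":
    settings = koboldcpp_settings()
    print(settings["base_url"])
    print(settings["api_key"])
```

If KoboldCPP reports a different port at startup, pass it as `port`; the rest of the URL stays the same.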
Contribute to presenton/presenton-python-sdk development by creating an account on GitHub.
An app for creating audio-based content such as song covers and speech using Retrieval-based Voice Conversion.
An advanced crypto trading bot written in Python
This repo helps you understand how safetensors files are structured to store the different layers of an LLM, and how to re-shard/re-chunk safetensors files even if they don't fit on the GPU (no AutoClass).
Generate high-definition short videos with one click using large AI models.
AI-powered speech denoising + enhancement (Gradio web demo + CLI).
Fast, high-quality image generation using ComfyUI via a Gradio UI
EbSynth in Python, version 2
A fast multimodal LLM for real-time voice
Integrate an LLM into Forge WebUI via Ollama
An AI-powered extension to get rid of the Gemini watermark
Support for https://www.gyan.dev/ffmpeg
Offline inference engine for art, real-time voice conversations, LLM-powered chatbots, and automated workflows
Voice Synthesis Platform with Smart Chunking, Batch Processing, and Voice Cloning capabilities.
Jimeng (Dreamina) free API, adapted for mobile browsers
