Store
调用Seedream 4.0的api服务实现本地生图。A custom node for ComfyUI to generate images using Volcano Engine's Seedream API.
Fast and memory-efficient exact attention
Contribute to SUP3RMASS1VE/VibeVoice-Realtime development by creating an account on GitHub.
Contribute to theroyallab/YALS development by creating an account on GitHub.
zuluCrypt is a front end to cryptsetup and tcplay and it allows easy management of encrypted block devices
自动化上传视频到社交媒体:抖音、小红书、视频号、tiktok、youtube、bilibili
Contribute to TensorStack-AI/AmuseAI development by creating an account on GitHub.
A fast and flexible random prompt generator for ComfyUI with 12 columns (Empty / Pre-filled SFW / Pre-filled NSFW)
Fast and Simple Face Swap Extension Node for ComfyUI (SFW)
DiaFeatured
Dia is a 1.6B parameter text to speech model created by Nari Labs. Dia directly generates highly realistic dialogue from a transcript. You can condition the output on audio, enabling emotion and tone control. The model can also produce nonverbal communications like laughter, coughing, clearing throat, etc. https://github.com/nari-labs/dia
A web interface for managing and interacting with Ollama models
A web interface for managing and interacting with Ollama models
zonosFeatured
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS providers. https://github.com/Zyphra/Zonos
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS providers. https://github.com/Zyphra/Zonos
Automatically create music videos. Synchronize the cuts to the music's beat.
A Step Towards Music Generation Foundation Model
AI Prompt Optimization Platform is a professional prompt engineering tool designed to help users optimize AI model prompts, enhancing the effectiveness and accuracy of AI interactions. The platform integrates intelligent optimization algorithms, deep reasoning analysis, visualization debugging tools, and community sharing features, providing compre
echomimic2Featured
[NVIDIA ONLY] Make virtual avatars talk whatever you want with an image and an audio clip https://github.com/antgroup/echomimic_v2
Echo-TTS inference codebase
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
