Store
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice cloning.
Advanced CLI tool that scans your hardware and tells you exactly which LLM or sLLM models you can run locally, with full Ollama integration.
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
A collection of utility nodes from the VNCCS project that are useful not only for the project's primary goals but also for everyday ComfyUI workflows.
Freeze (package) Python programs into stand-alone executables
Batch resize images to 512, 768, or 1024px on the longest side while preserving aspect ratio. Supports JPG, PNG, BMP, GIF, TIFF, and WebP.
Next generation face swapper and enhancer
A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single and multi-speaker voice synthesis directly within your ComfyUI workflows.
🧙 Build, run, and manage data pipelines for integrating and transforming data.
Enter a description for your character (using the trigger word “img”) and optionally upload face photos, then provide a list of prompts—one per line—for each scene. The app creates a series of imag...
Open-source alternative to Higgsfield AI — Free AI image generation & cinema studio with 20+ models (Flux, SDXL, Midjourney, Ideogram). Self-hosted, customizable, MIT licensed.

Batch resize images to predefined sizes (512px, 768px, 1024px) while maintaining aspect ratio
Kortix – build, manage and train AI Agents.
Generate a video from an image with a text prompt.
BiRefNet for background removal
ComfyUI-Qwen3-TTS brings Alibaba's Qwen3-TTS models to ComfyUI. Multi-GPU support: CUDA, Apple Silicon (MPS), Intel Arc (XPU), and CPU.
MOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis. It features long-context modeling, flexible speaker control, and multilingual support, while enabling zero-shot voice cloning from short audio references.