Store
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Wyoming protocol server for faster whisper speech to text system
Upload an image and provide a text prompt to create a short, animated video. The app uses AI to bring your image to life with synchronized audio.
Transform your written prompts into original images using advanced AI technology. Simply type what you want to see, adjust settings like size and style, and watch as the system creates a custom ima...
The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface. Now ZLUDA enhanced for better AMD GPU performance.
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
One-click installer for Microsoft TRELLIS.2: High-quality 3D asset generation from images with PBR textures.
Google's official AI agent for your terminal. Access Gemini 2.5 Pro with 1M token context window directly from the command line.
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
Taming Stable Diffusion for Lip Sync!
The world’s fastest framework for building websites.
Easy to use GUI for Qwen TTS 3 for voice creating and cloning
Advancing Open-source World Models
This application converts written text into spoken words. Users input text and can optionally provide reference audio to specify the speaker's voice. The result is a generated audio file that reads...
Enjoy the magic of Diffusion models!
SkyReels V3: Multimodal Video Generation Model
LangConfig is a open-source visual workflow builder designed to make AI agent development accessible to everyone. Build, test, and share LangChain and LangGraph agent workflows with an intuitive drag-and-drop interface.
Qwen3-ASR is an open-source series of ASR models developed by the Qwen team at Alibaba Cloud, supporting stable multilingual speech/music/song recognition, language detection and timestamp prediction.
