Store
Ultra-lightweight text-to-speech (15M-80M params): CPU-optimized, 8 voices, ONNX-powered
Instant, Ultra-Realistic Text-to-Speech
A web interface for the Moondream3 vision-language model featuring image captioning, visual question answering, object detection, and object pointing.
High-quality Text-to-Speech powered by VyvoTTS LFM2 model with easy-to-use web interface
Gradio web interface for Photoroom's PRX-1024-t2i-beta text-to-image model
🎵 YouTube to MP3 downloader with a simple Gradio UI. Paste a YouTube link to download MP3. Requires ffmpeg installed on your system.
TranslateGemma - Google's open-source multilingual translation AI. Translate text across 55+ languages and extract/translate text from images. Powered by Gemma 3 architecture.
Advanced text-to-speech with voice cloning, multi-speaker support, and background music generation using Higgs Audio V2
Advanced 3B parameter language model with Gradio web interface, GPU acceleration, and complete privacy
State-of-the-art open-source speech recognition model supporting 14 languages. 2B parameter ASR model from Cohere Labs.
Paste long text, clean it into readable sections, summarize each section, and ask questions in-browser with WebGPU.
🎙️ Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning. High-quality text-to-speech synthesis supporting zero-shot voice cloning and streaming inference with natural emotional expression.
We're on a journey to advance and democratize artificial intelligence through open source and open science.
Chatterbox Saudi Arabic TTS Demo
Use Hugging Face with JavaScript
Open Claude is an open-source coding-agent CLI for OpenAI, Gemini, DeepSeek, Ollama, Codex, GitHub Models, and 200+ models via OpenAI-compatible APIs.
Functionality optimizations based on the LTX desktop version
ClawX is a desktop app that provides a graphical interface for OpenClaw AI agents. It turns CLI-based AI orchestration into a desktop experience, with no need to use the terminal. The China website is https://clawx.com.cn.
Contribute to ggroup-ai-lab/gwen-tts development by creating an account on GitHub.
