PierrunoYT/KittenTTS-Pinokiov5.0updated 1mo ago
Ultra-lightweight text-to-speech (15M-80M params) โ€” CPU optimized, 8 voices, ONNX-powered
@pierrunoyt2 check-insNVIDIAAMDApple
PierrunoYT/soprano-tts-pinokiov5.0updated 1mo ago
Instant, Ultra-Realistic Text-to-Speech
@pierrunoyt1 check-inNVIDIAAMDApple
PierrunoYT/moondream-3-pinokiov5.0updated 1mo ago
A web interface for the Moondream3 vision-language model featuring image captioning, visual question answering, object detection, and object pointing.
@pierrunoyt0 check-insNVIDIAAMDApple
PierrunoYT/VyvoTTS-LFM2-Pinokiov5.0updated 1mo ago
High-quality Text-to-Speech powered by VyvoTTS LFM2 model with easy-to-use web interface
@pierrunoyt0 check-insNVIDIAAMDApple
PierrunoYT/Photoroom-PRX-Pinokiov5.0updated 1mo ago
Gradio web interface for Photoroom's PRX-1024-t2i-beta text-to-image model
@pierrunoyt2 check-insNVIDIAAMDApple
PierrunoYT/Youtube2MP3-Pinokiov5.0updated 1mo ago
๐ŸŽต YouTube to MP3 downloader with a simple Gradio UI. Paste a YouTube link to download MP3. Requires ffmpeg installed on your system.
@pierrunoyt0 check-insNVIDIAAMDApple
PierrunoYT/TranslateGemma-Pinokiov5.0updated 1mo ago
๐ŸŒ TranslateGemma - Google's open-source multilingual translation AI. Translate text across 55+ languages and extract/translate text from images. Powered by Gemma 3 architecture.
@pierrunoyt1 check-inNVIDIAAMDApple
PierrunoYT/Higgs-Audio-V2-Pinokiov1.0.0updated 1mo ago
Advanced text-to-speech with voice cloning, multi-speaker support, and background music generation using Higgs Audio V2
@pierrunoyt1 check-inNVIDIAAMDApple
PierrunoYT/SmolLM3-3B-Pinokiov1.0.0updated 1mo ago
Advanced 3B parameter language model with Gradio web interface, GPU acceleration, and complete privacy
@pierrunoyt2 check-insNVIDIAAMDApple
PierrunoYT/cohere-transcribe-pinokiov5.0updated 1mo ago
State-of-the-art open-source speech recognition model supporting 14 languages. 2B parameter ASR model from Cohere Labs.
@pierrunoyt1 check-inNVIDIAAMDApple
PierrunoYT/LFM2.5-350M-Pinokiov5.0updated 1mo ago
Paste long text, clean it into readable sections, summarize each section, and ask questions in-browser with WebGPU.
@pierrunoyt1 check-inNVIDIAAMDApple
6morpheus6/glm-tts-pinokiov1.0.0updated 1mo ago
๐ŸŽ™๏ธ Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning. High-quality text-to-speech synthesis supporting zero-shot voice cloning and streaming inference with natural emotional expression.
@morpheus0 check-insNVIDIAAMDApple
huggingface.co/MYAIGF/ai-girlfriend
Weโ€™re on a journey to advance and democratize artificial intelligence through open source and open science.
0 check-insNVIDIAAMDApple
huggingface.co/spaces/omarelshehy/NAMAA-Egyptian-Voice
Chatterbox Saudi Arabic TTS Demo
0 check-insNVIDIAAMDApple
huggingface.co/g-group-ai-lab/gwen-tts-0.6B
Weโ€™re on a journey to advance and democratize artificial intelligence through open source and open science.
0 check-insNVIDIAAMDApple
huggingface/huggingface.jsupdated 1mo ago
Use Hugging Face with JavaScript
0 check-insNVIDIAAMDApple
Gitlawb/openclaudeupdated 1mo ago
Open Claude Is Open-source coding-agent CLI for OpenAI, Gemini, DeepSeek, Ollama, Codex, GitHub Models, and 200+ models via OpenAI-compatible APIs.
0 check-insNVIDIAAMDApple
hero8152/LTX2.3-Multifunctionalupdated 1mo ago
Functionality optimization based on LTX desktop version
0 check-insNVIDIAAMDApple
ValueCell-ai/ClawXupdated 1mo ago
ClawX is a desktop app that provides a graphical interface for OpenClaw AI agents. It turns CLI-based AI orchestration into a desktop experience without using the terminal. China website is https://clawx.com.cn.
0 check-insNVIDIAAMDApple
ggroup-ai-lab/gwen-ttsupdated 1mo ago
Contribute to ggroup-ai-lab/gwen-tts development by creating an account on GitHub.
0 check-insNVIDIAAMDApple