Launcher updates

6Morpheus6/forge-neo v2.0 updated 3h ago
[NVIDIA ONLY] Stable Diffusion WebUI Forge supporting Flux, Qwen, wan, nunchaku and more in a lightweight WebUI. https://github.com/Haoming02/sd-webui-forge-classic/tree/neo
@morpheus12 check-ins NVIDIA AMD Apple
PierrunoYT/pocket-tts-pinokio v5.0 updated 8h ago
Lightweight CPU text-to-speech with preset voices and optional Hugging Face-authenticated voice cloning.
@pierrunoyt 1 check-in NVIDIA AMD Apple
PierrunoYT/LuxTTS-Pinokio v7.0 updated 8h ago
High-quality rapid TTS voice cloning model (150x+ realtime) — 48kHz speech, voice cloning
@pierrunoyt 1 check-in NVIDIA AMD Apple
PierrunoYT/OrpheusTTS-Pinokio v7.0 updated 8h ago
Standalone Text-to-Speech using Orpheus TTS with a Gradio UI
@pierrunoyt 1 check-in NVIDIA AMD Apple
PierrunoYT/Z-Image-Pinokio v5.0 updated 8h ago
⚡️ Efficient 6B parameter image generation model with sub-second inference. Generate high-quality, photorealistic images with only 8 inference steps. Features bilingual text rendering (Chinese & English) and Single-Stream Diffusion Transformer architecture.
@pierrunoyt 2 check-ins NVIDIA AMD Apple
PierrunoYT/VoxCPM-2-Pinokio v5.0 updated 8h ago
Tokenizer-free TTS for context-aware speech, voice cloning, and voice design. 2B params, 48kHz, 30 languages (Gradio UI).
@pierrunoyt 2 check-ins NVIDIA AMD Apple
PierrunoYT/KittenTTS-Pinokio v7.0 updated 9h ago
Ultra-lightweight text-to-speech (15M-80M params) — CPU optimized, 8 voices, ONNX-powered
@pierrunoyt 2 check-ins NVIDIA AMD Apple
PierrunoYT/soprano-tts-pinokio v5.0 updated 9h ago
Instant, Ultra-Realistic Text-to-Speech
@pierrunoyt 1 check-in NVIDIA AMD Apple
PierrunoYT/Photoroom-PRX-Pinokio v5.0 updated 9h ago
Gradio web interface for Photoroom's PRX-1024-t2i-beta text-to-image model
@pierrunoyt 2 check-ins NVIDIA AMD Apple
PierrunoYT/TranslateGemma-Pinokio v5.0 updated 9h ago
🌍 TranslateGemma - Google's open-source multilingual translation AI. Translate text across 55+ languages and extract/translate text from images. Powered by Gemma 3 architecture.
@pierrunoyt 1 check-in NVIDIA AMD Apple
PierrunoYT/Higgs-Audio-V2-Pinokio v5.0 updated 9h ago
Advanced text-to-speech with voice cloning, multi-speaker support, and background music generation using Higgs Audio V2
@pierrunoyt 1 check-in NVIDIA AMD Apple
PierrunoYT/SmolLM3-3B-Pinokio v5.0 updated 13h ago
Advanced 3B parameter language model with Gradio web interface, GPU acceleration, and complete privacy
@pierrunoyt 2 check-ins NVIDIA AMD Apple
PierrunoYT/OmniVoice-Studio-Pinokio v7.0 updated 13h ago
The open-source ElevenLabs alternative. Local voice cloning, video dubbing, and real-time dictation — 646 languages, no API keys.
@pierrunoyt 1 check-in NVIDIA AMD Apple
PierrunoYT/PRX-Pixel-Pinokio v5.0 updated 13h ago
Pixel-space PRX text-to-image pipeline (~7B params, Qwen3-VL text encoder, no VAE)
@pierrunoyt 1 check-in NVIDIA AMD Apple
PierrunoYT/MossTTS-Pinokio v5.0 updated 13h ago
All-in-one Gradio UI for the MOSS-TTS Family: voice cloning, dialogue generation, voice design from text, and sound effects.
@pierrunoyt 3 check-ins NVIDIA AMD Apple
PierrunoYT/Sana-Pinokio v5.0 updated 13h ago
Fast Image Generation with Sana Diffusion Model
@pierrunoyt 2 check-ins NVIDIA AMD Apple
ai-anchorite/Z-Fusion v3.7 updated 17h ago
Z-Image, Flux2 Klein, & SeedVR2 with a Gradio UI. Uses a built-in ComfyUI backend for speed and efficiency! [8GB+VRAM, 16GB+ RAM]
@anchorite22 check-ins NVIDIA AMD Apple
PierrunoYT/cohere-transcribe-pinokio v5.0 updated 1d ago
State-of-the-art open-source speech recognition model supporting 14 languages. 2B parameter ASR model from Cohere Labs.
@pierrunoyt 1 check-in NVIDIA AMD Apple
PierrunoYT/OmniVoice-Pinokio v5.0 updated 1d ago
Zero-shot multilingual TTS (600+ languages) with voice cloning and voice design — Gradio UI (app/app.py)
@pierrunoyt 5 check-ins NVIDIA AMD Apple