Launcher updates

PierrunoYT/SmolLM3-3B-Pinokio | v5.0 | updated 12h ago
Advanced 3B parameter language model with Gradio web interface, GPU acceleration, and complete privacy
@pierrunoyt | 2 check-ins | NVIDIA, AMD, Apple
PierrunoYT/dots.tts-Pinokio | v5.0 | updated 13h ago
2B-parameter fully continuous, end-to-end autoregressive text-to-speech with zero-shot voice cloning. https://huggingface.co/rednote-hilab/dots.tts-base
@pierrunoyt | 1 check-in | NVIDIA, AMD, Apple
PierrunoYT/OmniVoice-Studio-Pinokio | v7.0 | updated 13h ago
The open-source ElevenLabs alternative. Local voice cloning, video dubbing, and real-time dictation — 646 languages, no API keys.
@pierrunoyt | 1 check-in | NVIDIA, AMD, Apple
PierrunoYT/PRX-Pixel-Pinokio | v5.0 | updated 13h ago
Pixel-space PRX text-to-image pipeline (~7B params, Qwen3-VL text encoder, no VAE)
@pierrunoyt | 1 check-in | NVIDIA, AMD, Apple
PierrunoYT/MossTTS-Pinokio | v5.0 | updated 13h ago
All-in-one Gradio UI for the MOSS-TTS Family: voice cloning, dialogue generation, voice design from text, and sound effects.
@pierrunoyt | 3 check-ins | NVIDIA, AMD, Apple
PierrunoYT/Sana-Pinokio | v5.0 | updated 13h ago
Fast Image Generation with Sana Diffusion Model
@pierrunoyt | 2 check-ins | NVIDIA, AMD, Apple
theng12/studiohub-mac | v3.6 | updated 13h ago
Control plane for the KH Studio family — live health grid, unified model catalog, and unified-memory monitoring for Image/Music/Voice/Chat/Video Studio. One canonical API for clients like Story Studio KH.
0 check-ins | NVIDIA, AMD, Apple
passivejobs01/subtitle-player | v7.0 | updated 15h ago
Local player for studying foreign languages: creates dual subtitles (original language plus Korean) for YouTube and local videos. By 잇츠매거진.
0 check-ins | NVIDIA, AMD, Apple
ai-anchorite/Z-Fusion | v3.7 | updated 16h ago
Z-Image, Flux2 Klein, & SeedVR2 with a Gradio UI. Uses a built-in ComfyUI backend for speed and efficiency! [8GB+ VRAM, 16GB+ RAM]
@anchorite22 check-ins | NVIDIA, AMD, Apple
Brianmwanza-bit/adoption-and-child-care-main | v7.0 | updated 1d ago
Android app for adoption and child-care management with AI-assisted coding.
0 check-ins | NVIDIA, AMD, Apple
manat0912/openprompt | v5.0 | updated 1d ago
AI-powered prompt helper for image & video generation. Supports local LLMs (Ollama, LM Studio) and cloud APIs (Gemini, DeepSeek, OpenRouter).
@manatheturipa | 0 check-ins | NVIDIA, AMD, Apple
PierrunoYT/cohere-transcribe-pinokio | v5.0 | updated 1d ago
State-of-the-art open-source speech recognition model supporting 14 languages. 2B parameter ASR model from Cohere Labs.
@pierrunoyt | 1 check-in | NVIDIA, AMD, Apple
PierrunoYT/chatterbox-tts-pinokio | v5.0 | updated 1d ago
AI-Powered Text-to-Speech with Voice Cloning using Chatterbox TTS and a Gradio interface. Includes Turbo, Multilingual (23+ languages), and Original models. Runs locally; CUDA GPU recommended, CPU supported. Windows, Mac, and Linux.
@pierrunoyt | 0 check-ins | NVIDIA, AMD, Apple
PierrunoYT/Audio-Flamingo-3-Pinokio | v7.0 | updated 1d ago
NVIDIA's Audio Flamingo 3: Large Audio-Language Model for speech, sound, and music understanding with Gradio web interface
@pierrunoyt | 0 check-ins | NVIDIA, AMD, Apple
CharafChnioune/AceJAM-Studio | v7.0 | updated 1d ago
Create songs, albums and artwork locally on Apple MLX with ACE-Step v1.5, MFLUX, local agents and LoRA training.
0 check-ins | NVIDIA, AMD, Apple
PierrunoYT/HiggsAudioV3-Pinokio | v7.0 | updated 1d ago
Pinokio launcher for Higgs Audio v3 TTS with Gradio UI, SGLang-Omni backend, and automatic model download.
@pierrunoyt | 0 check-ins | NVIDIA, AMD, Apple
PierrunoYT/OmniVoice-Pinokio | v5.0 | updated 1d ago
Zero-shot multilingual TTS (600+ languages) with voice cloning and voice design — Gradio UI (app/app.py)
@pierrunoyt | 5 check-ins | NVIDIA, AMD, Apple
PierrunoYT/Transcribr-Pinokio | v5.0 | updated 1d ago
Bulk transcribe many YouTube videos, whole playlists, or your own uploaded audio/video files at once with faster-whisper. Outputs txt, srt, vtt, or json.
@pierrunoyt | 0 check-ins | NVIDIA, AMD, Apple
PierrunoYT/PersonaPlex-Pinokio | v5.0 | updated 1d ago
🗣️ PersonaPlex: NVIDIA's real-time speech-to-speech conversational AI model. Natural full-duplex conversations with customizable personas and voices.
@pierrunoyt | 3 check-ins | NVIDIA, AMD, Apple