Launcher updates

More
PierrunoYT/Youtube2MP3-Pinokiov5.0updated 9h ago
🎵 YouTube to MP3 downloader with a simple Gradio UI. Paste a YouTube link to download MP3. Requires ffmpeg installed on your system.
@pierrunoyt0 check-insNVIDIAAMDApple
PierrunoYT/Higgs-Audio-V2-Pinokiov5.0updated 9h ago
Advanced text-to-speech with voice cloning, multi-speaker support, and background music generation using Higgs Audio V2
@pierrunoyt1 check-inNVIDIAAMDApple
PierrunoYT/LFM2.5-Pinokiov5.0updated 10h ago
Paste long text, clean it into readable sections, summarize each section, and ask questions in-browser with WebGPU. Choose between LFM2.5 230M, 350M, 1.2B-Instruct, and 1.2B-Thinking.
@pierrunoyt0 check-insNVIDIAAMDApple
pozzettiandrea/comfyui-unirigupdated 12h ago
ComfyUI wrapper for UniRig
0 check-insNVIDIAAMDApple
arnold2006/framecropv4.0updated 13h ago
Batch-crop images to a chosen aspect ratio using a draggable/resizable crop overlay on each image thumbnail.
0 check-insNVIDIAAMDApple
PierrunoYT/SmolLM3-3B-Pinokiov5.0updated 13h ago
Advanced 3B parameter language model with Gradio web interface, GPU acceleration, and complete privacy
@pierrunoyt2 check-insNVIDIAAMDApple
PierrunoYT/dots.tts-Pinokiov5.0updated 13h ago
2B-parameter fully continuous, end-to-end autoregressive text-to-speech with zero-shot voice cloning. https://huggingface.co/rednote-hilab/dots.tts-base
@pierrunoyt1 check-inNVIDIAAMDApple
PierrunoYT/OmniVoice-Studio-Pinokiov7.0updated 13h ago
The open-source ElevenLabs alternative. Local voice cloning, video dubbing, and real-time dictation — 646 languages, no API keys.
@pierrunoyt1 check-inNVIDIAAMDApple
PierrunoYT/PRX-Pixel-Pinokiov5.0updated 13h ago
Pixel-space PRX text-to-image pipeline (~7B params, Qwen3-VL text encoder, no VAE)
@pierrunoyt1 check-inNVIDIAAMDApple
PierrunoYT/MossTTS-Pinokiov5.0updated 13h ago
All-in-one Gradio UI for the MOSS-TTS Family: voice cloning, dialogue generation, voice design from text, and sound effects.
@pierrunoyt3 check-insNVIDIAAMDApple
PierrunoYT/Sana-Pinokiov5.0updated 13h ago
Fast Image Generation with Sana Diffusion Model
@pierrunoyt2 check-insNVIDIAAMDApple
theng12/studiohub-macv3.6updated 13h ago
Control plane for the KH Studio family — live health grid, unified model catalog, and unified-memory monitoring for Image/Music/Voice/Chat/Video Studio. One canonical API for clients like Story Studio KH.
0 check-insNVIDIAAMDApple
passivejobs01/subtitle-playerv7.0updated 15h ago
유튜브·로컬 영상에 원어+한글 이중 자막을 만들어 외국어를 공부하는 로컬 플레이어 — 잇츠매거진
0 check-insNVIDIAAMDApple
ai-anchorite/Z-Fusionv3.7updated 16h ago
Z-Image, Flux2 Klein, & SeedVR2 with a Gradio UI. Uses a built-in ComfyUI backend for speed and efficiency! [8GB+VRAM, 16GB+ RAM]
@anchorite22 check-insNVIDIAAMDApple
Brianmwanza-bit/adoption-and-child-care-mainv7.0updated 1d ago
Android app for adoption and child-care management with AI-assisted coding.
0 check-insNVIDIAAMDApple
manat0912/openpromptv5.0updated 1d ago
AI-powered prompt helper for image & video generation. Supports local LLMs (Ollama, LM Studio) and cloud APIs (Gemini, DeepSeek, OpenRouter).
@manatheturipa0 check-insNVIDIAAMDApple
PierrunoYT/cohere-transcribe-pinokiov5.0updated 1d ago
State-of-the-art open-source speech recognition model supporting 14 languages. 2B parameter ASR model from Cohere Labs.
@pierrunoyt1 check-inNVIDIAAMDApple
PierrunoYT/chatterbox-tts-pinokiov5.0updated 1d ago
AI-Powered Text-to-Speech with Voice Cloning using Chatterbox TTS and a Gradio interface. Includes Turbo, Multilingual (23+ languages), and Original models. Runs locally; CUDA GPU recommended, CPU supported. Windows, Mac, and Linux.
@pierrunoyt0 check-insNVIDIAAMDApple
PierrunoYT/Audio-Flamingo-3-Pinokiov7.0updated 1d ago
NVIDIA's Audio Flamingo 3 - Large Audio-Language Model for speech, sound, and music understanding with Gradio web interface
@pierrunoyt0 check-insNVIDIAAMDApple