r4dius/LongCat-AudioDiT-pinokiov1.0.0updated 1mo ago
Pinokio wrapper for LongCat-AudioDiT with selectable 1B / 3.5B model downloads.
0 check-insNVIDIAAMDApple
neviah/BuffedGoosev5.0updated 1mo ago
Goose sidecar dashboard draft with Pinokio onboarding-focused launcher.
@ramshi0 check-insNVIDIAAMDApple
matthewhand/open-hivemindv1.0updated 1mo ago
Run the Open-Hivemind multi-agent orchestrator locally with Pinokio.
0 check-insNVIDIAAMDApple
PierrunoYT/VidLingo-Pinokiov6.0.0updated 1mo ago
YouTube to MP3, Cohere transcription, TranslateGemma translation, OmniVoice TTS. https://github.com/PierrunoYT/VidLingo-Pinokio
@pierrunoyt0 check-insNVIDIAAMDApple
senigami/audiobook-studio.pinokiov3.7updated 1mo ago
Local-first AI audiobook production with voice cloning and chapter repair tools. This is the easiest way to install locally, including an optional demo voice library so you can start exploring right away. Live demo: senigami.github.io/audiobook-studio
@senigami8 check-insNVIDIAAMDApple
PierrunoYT/Download-Transcribe-Translate-Pinokiov5.0updated 1mo ago
YouTube to MP3, Cohere transcription, TranslateGemma translation.
@pierrunoyt0 check-insNVIDIAAMDApple
6Morpheus6/stable-diffusion-webui-forgev2.0updated 1mo ago
[NVIDIA ONLY] The most efficient way to run FLUX (Optimized to run even on low memory machines, as low as 3GB VRAM with 512x512 resolution) https://github.com/lllyasviel/stable-diffusion-webui-forge
@morpheus4 check-insNVIDIAAMDApple
PierrunoYT/flux-2-klein-pinokiov5.0updated 1mo ago
🎨 FLUX.2 [klein] - Fast text-to-image generation with Black Forest Labs' FLUX.2 models. 6 variants available: 4B/9B (full precision) plus NVFP4/FP8 quantized versions. Consumer GPUs (~13GB) to high-end (~29GB) for sub-second image generation with outstanding quality.
@pierrunoyt4 check-insNVIDIAAMDApple
PierrunoYT/liquid-audio-pinokiov5.0updated 1mo ago
Liquid Audio - LFM2.5-Audio-1.5B: speech-to-speech, ASR, and TTS powered by Liquid AI.
@pierrunoyt0 check-insNVIDIAAMDApple
PierrunoYT/KittenTTS-Pinokiov5.0updated 1mo ago
Ultra-lightweight text-to-speech (15M-80M params) — CPU optimized, 8 voices, ONNX-powered
@pierrunoyt2 check-insNVIDIAAMDApple
PierrunoYT/soprano-tts-pinokiov5.0updated 1mo ago
Instant, Ultra-Realistic Text-to-Speech
@pierrunoyt1 check-inNVIDIAAMDApple
PierrunoYT/moondream-3-pinokiov5.0updated 1mo ago
A web interface for the Moondream3 vision-language model featuring image captioning, visual question answering, object detection, and object pointing.
@pierrunoyt0 check-insNVIDIAAMDApple
PierrunoYT/VyvoTTS-LFM2-Pinokiov5.0updated 1mo ago
High-quality Text-to-Speech powered by VyvoTTS LFM2 model with easy-to-use web interface
@pierrunoyt0 check-insNVIDIAAMDApple
PierrunoYT/Photoroom-PRX-Pinokiov5.0updated 1mo ago
Gradio web interface for Photoroom's PRX-1024-t2i-beta text-to-image model
@pierrunoyt2 check-insNVIDIAAMDApple
PierrunoYT/Youtube2MP3-Pinokiov5.0updated 1mo ago
🎵 YouTube to MP3 downloader with a simple Gradio UI. Paste a YouTube link to download MP3. Requires ffmpeg installed on your system.
@pierrunoyt0 check-insNVIDIAAMDApple
PierrunoYT/TranslateGemma-Pinokiov5.0updated 1mo ago
🌍 TranslateGemma - Google's open-source multilingual translation AI. Translate text across 55+ languages and extract/translate text from images. Powered by Gemma 3 architecture.
@pierrunoyt1 check-inNVIDIAAMDApple
PierrunoYT/Higgs-Audio-V2-Pinokiov1.0.0updated 1mo ago
Advanced text-to-speech with voice cloning, multi-speaker support, and background music generation using Higgs Audio V2
@pierrunoyt1 check-inNVIDIAAMDApple
PierrunoYT/SmolLM3-3B-Pinokiov1.0.0updated 1mo ago
Advanced 3B parameter language model with Gradio web interface, GPU acceleration, and complete privacy
@pierrunoyt2 check-insNVIDIAAMDApple
PierrunoYT/cohere-transcribe-pinokiov5.0updated 1mo ago
State-of-the-art open-source speech recognition model supporting 14 languages. 2B parameter ASR model from Cohere Labs.
@pierrunoyt1 check-inNVIDIAAMDApple