Pierre Bruno

@pierrunoyt
3 posts19 checkpointsJoined 1/27/2026, 9:46:34 AM
Creations by @pierrunoyt
40 total
DramaBoxUpdated 5 hours ago
https://github.com/PierrunoYT/DramaBox-TTS-Pinokio
Expressive TTS with voice cloning, prompt-driven speech synthesis built on LTX-2.3 by Resemble AI
OmniVoice StudioUpdated 4 days ago
https://github.com/PierrunoYT/OmniVoice-Studio-Pinokio
The open-source ElevenLabs alternative. Local voice cloning, video dubbing, and real-time dictation — 646 languages, no API keys.
ChatterBoxUpdated last week
https://github.com/PierrunoYT/chatterbox-tts-app
AI-Powered Text-to-Speech with Voice Cloning using Chatterbox TTS and Gradio interface
Audio Flamingo 3Updated 2 weeks ago
https://github.com/PierrunoYT/Audio-Flamingo-3-Pinokio
NVIDIA's Audio Flamingo 3 - Large Audio-Language Model for speech, sound, and music understanding with Gradio web interface
LuxTTS 🎙️Updated 2 weeks ago
https://github.com/PierrunoYT/LuxTTS-Pinokio
High-quality rapid TTS voice cloning model (150x+ realtime) — 48kHz speech, voice cloning
OrpheusTTSUpdated 2 weeks ago
https://github.com/PierrunoYT/OrpheusTTS-Pinokio
Standalone Text-to-Speech using Orpheus TTS with a Gradio UI
PersonaPlexUpdated 2 weeks ago
https://github.com/PierrunoYT/PersonaPlex-Pinokio
🗣️ PersonaPlex - NVIDIA's real-time speech-to-speech conversational AI model. Natural full-duplex conversations with customizable personas and voices. Requires an NVIDIA GPU on Windows or Linux (16-24GB VRAM recommended), 32GB RAM, and a Hugging Face account.
PocketTTSUpdated 2 weeks ago
https://github.com/PierrunoYT/pocket-tts-pinokio
Lightweight CPU text-to-speech with preset voices and optional Hugging Face-authenticated voice cloning.
SanaUpdated 2 weeks ago
https://github.com/PierrunoYT/Sana-Pinokio
Fast Image Generation with Sana Diffusion Model
HY-MT1.5Updated 2 weeks ago
https://github.com/PierrunoYT/Tencent-HY-MT1.5-Pinokio
Hunyuan Translation Model Version 1.5 — Gradio UI for 33-language translation with HY-MT1.5-1.8B and HY-MT1.5-7B.
GLM-TTSUpdated 3 weeks ago
https://github.com/PierrunoYT/GLM-TTS-Pinokio
🎙️ Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning. High-quality text-to-speech synthesis supporting zero-shot voice cloning and streaming inference with natural emotional expression.
Z-Image-TurboUpdated 3 weeks ago
https://github.com/PierrunoYT/Z-Image-Pinokio
⚡️ Efficient 6B parameter image generation model with sub-second inference. Generate high-quality, photorealistic images with only 8 inference steps. Features bilingual text rendering (Chinese & English) and Single-Stream Diffusion Transformer architecture.
MOSS-TTSUpdated 4 weeks ago
https://github.com/PierrunoYT/MossTTS-Pinokio
All-in-one Gradio UI for the MOSS-TTS Family: voice cloning, dialogue generation, voice design from text, and sound effects.
OmniVoiceUpdated 4 weeks ago
https://github.com/PierrunoYT/OmniVoice-Pinokio
Zero-shot multilingual TTS (600+ languages) with voice cloning and voice design — Gradio UI (app/app.py)
LFM2.5-450M-VLUpdated last month
https://github.com/PierrunoYT/LFM2.5-450M-VL-Pinokio
LFM2.5-VL-450M (Liquid AI): compact vision–language model for image understanding. Gradio UI with upload/URL, prompt, and generation sliders.
VoxCPM 2Updated last month
https://github.com/PierrunoYT/VoxCPM-2-Pinokio
Tokenizer-free TTS for context-aware speech, voice cloning, and voice design. 2B params, 48kHz, 30 languages (Gradio UI).
VoxCPM 2Updated last month
https://github.com/PierrunoYT/VoxCPM-1.5-Pinokio
Tokenizer-free TTS for context-aware speech, voice cloning, and voice design. 2B params, 48kHz, 30 languages (Gradio UI).
VidLingoUpdated last month
https://github.com/PierrunoYT/VidLingo-Pinokio
YouTube to MP3, Cohere transcription, TranslateGemma translation, OmniVoice TTS. https://github.com/PierrunoYT/VidLingo-Pinokio
Transcribe StudioUpdated last month
https://github.com/PierrunoYT/Download-Transcribe-Translate-Pinokio
YouTube to MP3, Cohere transcription, TranslateGemma translation.
FLUX.2 [klein]Updated last month
https://github.com/PierrunoYT/flux-2-klein-pinokio
🎨 FLUX.2 [klein] - Fast text-to-image generation with Black Forest Labs' FLUX.2 models. 6 variants available: 4B/9B (full precision) plus NVFP4/FP8 quantized versions. Consumer GPUs (~13GB) to high-end (~29GB) for sub-second image generation with outstanding quality.