Pierre Bruno

@pierrunoyt

3 posts21 checkpointsJoined 1/27/2026, 9:46:34 AM

@pierrunoyt

Activity 24 Posts 3 Checkpoints 21 Apps 8 Creations 50 Following 0 Followers 4

Creations by @pierrunoyt

50 total

TranscribrUpdated 14 hours ago

https://github.com/PierrunoYT/Transcribr

Bulk transcribe many YouTube videos, whole playlists, or your own uploaded audio/video files at once with faster-whisper. Outputs txt, srt, vtt, or json.

PersonaPlexUpdated 4 days ago

https://github.com/PierrunoYT/PersonaPlex-Pinokio

🗣️ PersonaPlex - NVIDIA's real-time speech-to-speech conversational AI model. Natural full-duplex conversations with customizable personas and voices.

OmniVoiceUpdated 4 days ago

https://github.com/PierrunoYT/OmniVoice-Pinokio

Zero-shot multilingual TTS (600+ languages) with voice cloning and voice design — Gradio UI (app/app.py)

Higgs Audio v3 TTSUpdated 4 days ago

https://github.com/PierrunoYT/HiggsAudioV3-Pinokio

Pinokio launcher for Higgs Audio v3 TTS with Gradio UI, SGLang-Omni backend, and automatic model download.

DramaBoxUpdated 5 days ago

https://github.com/PierrunoYT/DramaBox-TTS-Pinokio

Expressive TTS with voice cloning, prompt-driven speech synthesis built on LTX-2.3 by Resemble AI

SanaUpdated 5 days ago

https://github.com/PierrunoYT/Sana-Pinokio

Fast Image Generation with Sana Diffusion Model

ScribeTubeUpdated last week

https://github.com/PierrunoYT/ScribeTube

Download and transcribe many YouTube videos or whole playlists at once with faster-whisper. Outputs txt, srt, vtt, or json.

MOSS-TTSUpdated last week

https://github.com/PierrunoYT/MossTTS-Pinokio

All-in-one Gradio UI for the MOSS-TTS Family: voice cloning, dialogue generation, voice design from text, and sound effects.

Ideogram 4 StudioUpdated 2 weeks ago

https://github.com/PierrunoYT/Ideogram-4-Pinokio

Ideogram 4 (nf4) open-weights text-to-image model (9.3B params, Qwen3-VL-8B text encoder, structured JSON prompting, native 2k resolution)

PRX PixelUpdated 2 weeks ago

https://github.com/PierrunoYT/PRX-Pixel-Pinokio

Pixel-space PRX text-to-image pipeline (~7B params, Qwen3-VL text encoder, no VAE)

OmniVoice StudioUpdated 2 weeks ago

https://github.com/PierrunoYT/OmniVoice-Studio-Pinokio

The open-source ElevenLabs alternative. Local voice cloning, video dubbing, and real-time dictation — 646 languages, no API keys.

dots.tts-baseUpdated 3 weeks ago

https://github.com/PierrunoYT/dots.tts-Pinokio

2B-parameter fully continuous, end-to-end autoregressive text-to-speech with zero-shot voice cloning. https://huggingface.co/rednote-hilab/dots.tts-base

VidLingoUpdated 4 weeks ago

https://github.com/PierrunoYT/VidLingo-Pinokio

YouTube to MP3, Cohere transcription, TranslateGemma translation, OmniVoice TTS. https://github.com/PierrunoYT/VidLingo-Pinokio

RealRestorerUpdated 4 weeks ago

https://github.com/PierrunoYT/RealRestorer-Pinokio

Generalizable real-world image restoration (diffusers + Gradio). CUDA recommended; first run downloads HF weights.

Voxtral UIUpdated 4 weeks ago

https://github.com/PierrunoYT/Voxtral-UI-Pinokio

Run Mistral AI's Voxtral locally with a Gradio web interface (Transformers backend, no vLLM required).

SmolLM3-3B ChatbotUpdated 4 weeks ago

https://github.com/PierrunoYT/SmolLM3-3B-Pinokio

Advanced 3B parameter language model with Gradio web interface, GPU acceleration, and complete privacy

LFM2.5-350M Reader + Q&AUpdated 4 weeks ago

https://github.com/PierrunoYT/LFM2.5-350M-Pinokio

Paste long text, clean it into readable sections, summarize each section, and ask questions in-browser with WebGPU.

Higgs Audio V2 EnhancedUpdated 4 weeks ago

https://github.com/PierrunoYT/Higgs-Audio-V2-Pinokio

Advanced text-to-speech with voice cloning, multi-speaker support, and background music generation using Higgs Audio V2

TranslateGemmaUpdated 4 weeks ago

https://github.com/PierrunoYT/TranslateGemma-Pinokio

🌍 TranslateGemma - Google's open-source multilingual translation AI. Translate text across 55+ languages and extract/translate text from images. Powered by Gemma 3 architecture.

Youtube2MP3Updated 4 weeks ago

https://github.com/PierrunoYT/Youtube2MP3-Pinokio

🎵 YouTube to MP3 downloader with a simple Gradio UI. Paste a YouTube link to download MP3. Requires ffmpeg installed on your system.

Creations

More · 50

Transcribr

Bulk transcribe many YouTube videos, whole playlists, or your own uploaded audio/video files at once with faster-whisper. Outputs txt, srt, vtt, or json.Updated 14 hours ago

PersonaPlex

🗣️ PersonaPlex - NVIDIA's real-time speech-to-speech conversational AI model. Natural full-duplex conversations with customizable personas and voices.Updated 4 days ago

OmniVoice

Zero-shot multilingual TTS (600+ languages) with voice cloning and voice design — Gradio UI (app/app.py)Updated 4 days ago

Higgs Audio v3 TTS

Pinokio launcher for Higgs Audio v3 TTS with Gradio UI, SGLang-Omni backend, and automatic model download.Updated 4 days ago

DramaBox

Expressive TTS with voice cloning, prompt-driven speech synthesis built on LTX-2.3 by Resemble AIUpdated 5 days ago

Sana

Fast Image Generation with Sana Diffusion ModelUpdated 5 days ago

ScribeTube

Download and transcribe many YouTube videos or whole playlists at once with faster-whisper. Outputs txt, srt, vtt, or json.Updated last week

MOSS-TTS

All-in-one Gradio UI for the MOSS-TTS Family: voice cloning, dialogue generation, voice design from text, and sound effects.Updated last week

Ideogram 4 Studio

Ideogram 4 (nf4) open-weights text-to-image model (9.3B params, Qwen3-VL-8B text encoder, structured JSON prompting, native 2k resolution)Updated 2 weeks ago

PRX Pixel

Pixel-space PRX text-to-image pipeline (~7B params, Qwen3-VL text encoder, no VAE)Updated 2 weeks ago