Pinokio

Type:api

Platform:All

GPU:All

Tag:#ttsx

Latest Check-ins Name

Sort:Check-ins

Wan2GP

pinokiofactory/wanv3.7updated 7d ago

Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP

#video-generation #wan #wan2gp #video #image #ai #1 #image-generation #gradio

229 check-insNVIDIAAMDApple

Comfyui

pinokiofactory/comfyv3.7updated 7d ago

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. https://github.com/comfyanonymous/ComfyUI

#comfyui #ai #video #image #image-generation #audio #comfy #video-generation #node-interface

83 check-insNVIDIAAMDApple

Qwen3-TTS MLX WebUI Enhanced

Blizaine/Qwen3-TTS-MLX-WebUI-Enhancedv5.0updated 9d ago

High-quality text-to-speech with Beautiful Web UI & API, optimized for Apple Silicon using MLX. Features include Custom Voice (preset speakers), Voice Design (natural language), and Voice Cloning. With enhanced features for saving custom voices and long-form / endless TTS streaming.

#mlx #qwen #tts #ai #mac

@blizaine

63 check-insNVIDIAAMDApple

Ultimate-TTS-Studio

pinokiofactory/Ultimate-TTS-Studiov3.7updated 4d ago

Kokoro, KittenTTS, Higgs audio, Chatterbox/Multi, Fish-Speech, F5 & index-tts & indextts2, VoxCPM and VibeVoice in one app

#tts #ai #gradio #voice

38 check-insNVIDIAAMDApple

Voicebox

cocktailpeanut/voicebox.pinokiov5.0updated 28d ago

Local-first voice synthesis studio powered by Qwen3-TTS.

#tts #voice-clone

@cocktailpeanut

33 check-insNVIDIAAMDApple

Qwen3-TTS

SUP3RMASS1VE/Qwen3-TTS-Pinokiov5.0updated 2mo ago

Qwen3-TTS is an open-source series of TTS models developed by the Qwen team

#tts #voice #qwen3-tts #ai

@sup3rmass1ve

26 check-insNVIDIAAMDApple

e2-f5-tts

pinokiofactory/e2-f5-ttsv3.7updated 1mo ago

F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching https://huggingface.co/spaces/mrfakename/E2-F5-TTS

#tts #voice-clone #ai

14 check-insNVIDIAAMDApple

OpenAudio

pinokiofactory/openaudiov3.7updated 2mo ago

Multilingual Text-to-Speech with Voice Cloning (Supports: English, Japanese, Korean, Chinese, French, German, Arabic, and Spanish) https://github.com/fishaudio/fish-speech

#openaudio #ai #audio #gradio #tts

14 check-insNVIDIAAMDApple

zonos

pinokiofactory/zonosv3.7updated 1mo ago

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS providers. https://github.com/Zyphra/Zonos

#ai #tts

6 check-insNVIDIAAMDApple

DramaBox

PierrunoYT/DramaBox-TTS-Pinokiov5.0updated 1d ago

Expressive TTS with voice cloning, prompt-driven speech synthesis built on LTX-2.3 by Resemble AI

#ai #tts #voice-clone

@pierrunoyt

5 check-insNVIDIAAMDApple

VibeVoice Realtime

pinokiofactory/vibevoice-realtimev5.0updated 2mo ago

Realtime streaming TTS demo using microsoft/VibeVoice-Realtime-0.5B

#ai #tts

5 check-insNVIDIAAMDApple

OpenVoice

cocktailpeanutlabs/openvoicev1updated 5mo ago

Instantly clone any voice from any text to any speech, in any language https://huggingface.co/spaces/myshell-ai/OpenVoice

#tts #ai

5 check-insNVIDIAAMDApple

Openvoice2

cocktailpeanutlabs/openvoice2v3.0updated 6mo ago

Openvoice 2 Web UI - A local web UI for Openvoice2, a multilingual voice cloning TTS https://x.com/myshell_ai/status/1783161876052066793

#ai #tts

3 check-insNVIDIAAMDApple

Whisper-WebUI

pinokiofactory/whisper-webuiv3.7updated 14d ago

A Web UI for easy subtitle using whisper model.

#whisper #ai #gradio #tts

2 check-insNVIDIAAMDApple

VoxCPM

IAnMove/voxcpm2-pinokio-launcherv7.0updated 2mo ago

Tokenizer-free multilingual TTS and voice cloning with low-VRAM and VoxCPM2 Web UI/API launch modes.

#ai #tts

@theinaog

2 check-insNVIDIAAMDApple

StyleTTS2 Studio

pinokiofactory/StyleTTS2_Studiov3.7updated 6mo ago

Build your own voice for StyleTTS2

#ai #tts

2 check-insNVIDIAAMDApple

MeloTTS

cocktailpeanutlabs/melottsv1.2updated 10mo ago

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean https://github.com/myshell-ai/MeloTTS

#ai #tts

2 check-insNVIDIAAMDApple

parler-tts

cocktailpeanutlabs/parler-ttsv1.5updated 1y ago

a lightweight text-to-speech (TTS) model that can generate high-quality speech with features that can be controlled using a simple text prompt (e.g. gender, background noise, speaking rate, pitch and reverberation). https://huggingface.co/spaces/parler-tts/parler_tts_mini

#ai #tts

1 check-inNVIDIAAMDApple

XTTS

cocktailpeanut/xtts.pinokiov3.0updated 2mo ago

clone voices into different languages by using just a quick 3-second audio clip. (a local version of https://huggingface.co/spaces/coqui/xtts)

#ai #tts

@cocktailpeanut1 check-inNVIDIAAMDApple

Orpheus-TTS-FastAPI

pinokiofactory/Orpheus-TTS-FastAPIv3.7updated 11d ago

Orpheus TTS is an open-source text-to-speech system built on the Llama-3b backbone. Orpheus demonstrates the emergent capabilities of using LLMs for speech synthesis https://github.com/canopyai/Orpheus-TTS

#ai #tts

0 check-insNVIDIAAMDApple

Store