Pinokio

Type:All

Platform:All

GPU:All

Tag:#aix

Latest Check-ins Name

Sort:Latest

DramaBox

PierrunoYT/DramaBox-TTS-Pinokiov5.0updated 1d ago

Expressive TTS with voice cloning, prompt-driven speech synthesis built on LTX-2.3 by Resemble AI

#ai #tts #voice-clone

@pierrunoyt

5 check-insNVIDIAAMDApple

Image to Prompt

cocktailpeanut/image-to-promptv7.0updated 1d ago

Generate editable Ideogram JSON prompts from uploaded images.

#ai #prompt-helper #prompting

@cocktailpeanut

6 check-insNVIDIAAMDApple

FaceFusion 3.6.1

facefusion/facefusion-pinokiov5.0updated 2d ago

Industry leading face manipulation platform

#faceswap #facefusion #face #ff #1 ##faceswap-#ai-#video #ai #video

113 check-insNVIDIAAMDApple

Ultimate-TTS-Studio

pinokiofactory/Ultimate-TTS-Studiov3.7updated 4d ago

Kokoro, KittenTTS, Higgs audio, Chatterbox/Multi, Fish-Speech, F5 & index-tts & indextts2, VoxCPM and VibeVoice in one app

#tts #ai #gradio #voice

38 check-insNVIDIAAMDApple

Clarity Refiners UI

pinokiofactory/clarity-refiners-uiv3.7updated 5d ago

An enhanced local port of finegrain-image-enhancer powered by Refiners (https://huggingface.co/spaces/finegrain/finegrain-image-enhancer), which was adapted from philz1337x's Clarity Upscaler (https://github.com/philz1337x/clarity-upscaler)

#image #ai #image-edit #upscaler

5 check-insNVIDIAAMDApple

Wan2GP - AMD

6Morpheus6/wan2gp-amdv3.7updated 5d ago

[AMD ONLY] Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video, Flux and more. (On Windows supported by all dedicated AMD GPUs from RDNA 2 - RDNA 4)

#ai #wan #wan2gp

@morpheus

53 check-insNVIDIAAMDApple

Phosphene

mrbizarro/phosphenev7.0updated 5d ago

Local generative video, image, and character training on Apple Silicon. Train face + voice LoRAs in-app. Q8 HQ for character clips. MLX native — no cloud, no API key.

#video-generation #phosphene #ai #video

@bizarro

14 check-insNVIDIAAMDApple

fluxgym

cocktailpeanut/fluxgymv3.7updated 6d ago

[NVIDIA Only] Dead simple web UI for training FLUX LoRA with LOW VRAM support (From 12GB)

#lora #training #ai #gradio

@cocktailpeanut1 check-inNVIDIAAMDApple

StableDAW

cocktailpeanut/stabledaw.pinokiov7.0updated 6d ago

Browser-based AI audio DAW for Stable Audio 3 with text-to-audio, inpainting, LoRA training, FFmpeg effects, waveform editing, sequencer, piano roll, and persistent library. https://github.com/gantasmo/stabledaw

#audio #music #daw #ai #audio-generation #stableaudio #stableaudio3

@cocktailpeanut

18 check-insNVIDIAAMDApple

Comfyui

pinokiofactory/comfyv3.7updated 7d ago

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. https://github.com/comfyanonymous/ComfyUI

#comfyui #ai #video #image #image-generation #audio #comfy #video-generation #node-interface

83 check-insNVIDIAAMDApple

Wan2GP

pinokiofactory/wanv3.7updated 7d ago

Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP

#video-generation #wan #wan2gp #video #image #ai #1 #image-generation #gradio

229 check-insNVIDIAAMDApple

Qwen3-TTS MLX WebUI Enhanced

Blizaine/Qwen3-TTS-MLX-WebUI-Enhancedv5.0updated 9d ago

High-quality text-to-speech with Beautiful Web UI & API, optimized for Apple Silicon using MLX. Features include Custom Voice (preset speakers), Voice Design (natural language), and Voice Cloning. With enhanced features for saving custom voices and long-form / endless TTS streaming.

#mlx #qwen #tts #ai #mac

@blizaine

63 check-insNVIDIAAMDApple

SongGeneration Studio

BazedFrog/SongGeneration-Studiov3.7updated 10d ago

AI Song Generation with Full Style Control - Generate complete songs with lyrics, vocals, and instrumental tracks using Tencent AI Lab's SongGeneration (LeVo) model. [NVIDIA ONLY]

#music #song #ai

20 check-insNVIDIAAMDApple

RMBG-2-Studio

pinokiofactory/RMBG-2-Studiov3.7updated 11d ago

Enhanced background remove and replace app built around BRIA-RMBG-2.0 https://huggingface.co/briaai/RMBG-2.0

#ai #image-edit #remove-background

4 check-insNVIDIAAMDApple

Orpheus-TTS-FastAPI

pinokiofactory/Orpheus-TTS-FastAPIv3.7updated 11d ago

Orpheus TTS is an open-source text-to-speech system built on the Llama-3b backbone. Orpheus demonstrates the emergent capabilities of using LLMs for speech synthesis https://github.com/canopyai/Orpheus-TTS

#ai #tts

0 check-insNVIDIAAMDApple

Odysseus

cocktailpeanut/odysseus.pinokiov7.0updated 14d ago

Self-hosted AI workspace for local-first chat, agents, tools, memory, research, documents, email, and model endpoint management.

#agent #odysseus #ai #deepresearch

@cocktailpeanut

6 check-insNVIDIAAMDApple

Whisper-WebUI

pinokiofactory/whisper-webuiv3.7updated 14d ago

A Web UI for easy subtitle using whisper model.

#whisper #ai #gradio #tts

2 check-insNVIDIAAMDApple

Ideoprompt

cocktailpeanut/ideopromptv7.0updated 19d ago

Describe an image, get a 100% schema-valid Ideogram 4 JSON prompt — generated fully locally with an embedded llama.cpp (no Ollama or LM Studio required).

#ai #prompt-helper #prompting

@cocktailpeanut1 check-inNVIDIAAMDApple

IP-Adapter-FaceID

cocktailpeanutlabs/faceidv3.0updated 20d ago

Enter a face image and transform it to any other image. Demo for the h94/IP-Adapter-FaceID model https://huggingface.co/spaces/multimodalart/Ip-Adapter-FaceID

#ai #face #image-generation

2 check-insNVIDIAAMDApple

Underfit

cocktailpeanut/underfit.pinokiov7.0updated 20d ago

LoRA fine-tuning dashboard for Stable Audio 3

#ai #music-generation #stableaudio #stableaudio3

@cocktailpeanut

4 check-insNVIDIAAMDApple

Store