macOS-useFeatured
pinokiofactory/macOS-usev3.6updated 1y ago
[Mac Only] We make AI agents that control Mac apps: https://github.com/browser-use/macOS-use
0 check-insNVIDIAAMDApple
MatAnyoneFeatured
pinokiofactory/MatAnyonev3.3updated 1mo ago
MatAnyone AI is a tool for editing videos by separating objects from their backgrounds. It is an AI to remove the background from videos effectively. Stable Video Matting with Consistent Memory Propagation: https://github.com/pq-yang/MatAnyone.git
2 check-insNVIDIAAMDApple
DiffRhythmFeatured
pinokiofactory/diffrhythmv3.7updated 3mo ago
Generate songs with AI (up to 4 min 45 sec). Both with lyrics or instrumental https://github.com/ASLP-lab/DiffRhythm
2 check-insNVIDIAAMDApple
cubeFeatured
pinokiofactory/cubev3.7updated 5mo ago
Roblox Foundation Model for 3D Intelligence --- Cross Platform (Mac, Windows, Linux): Requires 16GB+ VRAM PC or 18GB+ Memory Macs https://github.com/Roblox/cube
1 check-inNVIDIAAMDApple
HunyuanVideoFeatured
pinokiofactory/hunyuanvideov3.7updated 26d ago
[NVIDIA ONLY] Super Optimized Gradio UI for Hunyuan Video Generator that works on GPU poor machines. Generate up to 10~14 sec videos https://github.com/deepbeepmeep/HunyuanVideoGP
11 check-insNVIDIAAMDApple
unoFeatured
pinokiofactory/unov3.7updated 5mo ago
[NVIDIA ONLY] Generate an image from multiple images https://github.com/bytedance/UNO
0 check-insNVIDIAAMDApple
DiaFeatured
pinokiofactory/diav3.7updated 5mo ago
Dia is a 1.6B parameter text to speech model created by Nari Labs. Dia directly generates highly realistic dialogue from a transcript. You can condition the output on audio, enabling emotion and tone control. The model can also produce nonverbal communications like laughter, coughing, clearing throat, etc. https://github.com/nari-labs/dia
0 check-insNVIDIAAMDApple
FramePackFeatured
pinokiofactory/Frame-Packv3.7updated 1mo ago
[NVIDIA ONLY] Generate Video Progressively. FramePack is a next-frame (next-frame-section) prediction neural network structure that generates videos progressively. https://github.com/lllyasviel/FramePack
10 check-insNVIDIAAMDApple
facefusion/facefusion-pinokiov5.0updated 11d ago
Industry leading face manipulation platform
82 check-insNVIDIAAMDApple
pinokiofactory/Ultimate-TTS-Studiov3.7updated 3d ago
Kokoro, KittenTTS, Higgs audio, Chatterbox/Multi, Fish-Speech, F5 & index-tts & indextts2, VoxCPM and VibeVoice in one app
26 check-insNVIDIAAMDApple
francescofugazzi/ml-sharp-pinokiov0.3updated 1mo ago
One-click 3D Gaussian Splatting generation from a single image.
@franzipol5 check-insNVIDIAAMDApple
pinokiofactory/Orpheus-TTS-FastAPIv3.7updated 2mo ago
Orpheus TTS is an open-source text-to-speech system built on the Llama-3b backbone. Orpheus demonstrates the emergent capabilities of using LLMs for speech synthesis https://github.com/canopyai/Orpheus-TTS
0 check-insNVIDIAAMDApple
Qwen3-TTSFeatured
SUP3RMASS1VE/Qwen3-TTS-Pinokiov5.0updated 1mo ago
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team
@sup3rmass1ve21 check-insNVIDIAAMDApple
e2-f5-ttsFeatured
pinokiofactory/e2-f5-ttsv3.7updated 1mo ago
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching https://huggingface.co/spaces/mrfakename/E2-F5-TTS
12 check-insNVIDIAAMDApple
pinokiofactory/vibevoice-realtimev5.0updated 14d ago
Realtime streaming TTS demo using microsoft/VibeVoice-Realtime-0.5B
5 check-insNVIDIAAMDApple
pinokiofactory/Hunyuan3d-2-lowvramv3.7updated 1mo ago
Text/Image to 3D (Cross Platform: Mac + Windows + Linux): High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models. https://github.com/deepbeepmeep/Hunyuan3D-2GP
6 check-insNVIDIAAMDApple
OpenAudioFeatured
pinokiofactory/openaudiov3.7updated 29d ago
Multilingual Text-to-Speech with Voice Cloning (Supports: English, Japanese, Korean, Chinese, French, German, Arabic, and Spanish) https://github.com/fishaudio/fish-speech
10 check-insNVIDIAAMDApple
ForgeFeatured
pinokiofactory/stable-diffusion-webui-forgev2.0updated 15d ago
[NVIDIA ONLY] The most efficient way to run FLUX (Optimized to run even on low memory machines, as low as 3GB VRAM with 512x512 resolution) https://github.com/lllyasviel/stable-diffusion-webui-forge
20 check-insNVIDIAAMDApple
pinokiofactory/whisper-webuiv3.7updated 3mo ago
A Web UI for easy subtitle using whisper model.
2 check-insNVIDIAAMDApple
ComfyuiFeatured
pinokiofactory/comfyv3.7updated 17d ago
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. https://github.com/comfyanonymous/ComfyUI
62 check-insNVIDIAAMDApple