Store
An enhanced local port of finegrain-image-enhancer powered by Refiners (https://huggingface.co/spaces/finegrain/finegrain-image-enhancer), which was adapted from philz1337x's Clarity Upscaler (https://github.com/philz1337x/clarity-upscaler)
The ultimate video editor powered by natural language and FFmpeg https://huggingface.co/spaces/huggingface-projects/ai-video-composer
MMAudio (Featured)
Generate synchronized audio from video and/or text inputs https://github.com/hkchengrex/MMAudio
Build your own voice for StyleTTS2
cube (Featured)
Roblox Foundation Model for 3D Intelligence --- Cross Platform (Mac, Windows, Linux): Requires a PC with 16GB+ VRAM or a Mac with 18GB+ memory https://github.com/Roblox/cube
uno (Featured)
[NVIDIA ONLY] Generate an image from multiple images https://github.com/bytedance/UNO
Simple Gradio app for generating images with Tongyi-MAI/Z-Image-Turbo.
Upload a clean 20-second WAV file of the vocal persona you want to mimic, type your text-to-speech prompt, and hit submit! A local version of https://huggingface.co/spaces/fffiloni/instant-TTS-Bark-cloning
Tag manager and captioner for image datasets: https://github.com/jhc13/taggui
[NVIDIA ONLY] Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation https://github.com/fudan-generative-vision/hallo
Fast, high-quality zero-shot voice-cloning text-to-speech with flow matching
A minimalist todo list with a lightweight JSON API and local storage.
Audio Transcription App with Parakeet-TDT-0.6b-v2
A FastAPI wrapper for KokoroTTS. Integrates with Open-WebUI and other API-driven AI applications.
Janus Pro 7B is a powerful multimodal AI model designed for advanced image understanding and text-to-image generation.
