Store
Voice Synthesis Platform with Smart Chunking, Batch Processing, and Voice Cloning capabilities.
DiaFeatured
Dia is a 1.6B parameter text to speech model created by Nari Labs. Dia directly generates highly realistic dialogue from a transcript. You can condition the output on audio, enabling emotion and tone control. The model can also produce nonverbal communications like laughter, coughing, clearing throat, etc. https://github.com/nari-labs/dia

A web interface for managing and interacting with Ollama models

A web interface for managing and interacting with Ollama models
zonosFeatured
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS providers. https://github.com/Zyphra/Zonos
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS providers. https://github.com/Zyphra/Zonos
Automatically create music videos. Synchronize the cuts to the music's beat.
A Step Towards Music Generation Foundation Model
echomimic2Featured
[NVIDIA ONLY] Make virtual avatars talk whatever you want with an image and an audio clip https://github.com/antgroup/echomimic_v2
pyramidflowFeatured
Pyramd Flow Video Generation AI (text-to-video & image-to-video) https://github.com/jy0205/Pyramid-Flow
One-click, permissioned recon: nmap + web enum, logs to /reports. For authorized testing only.
The Gen AI Platform for Pro Studios https://github.com/invoke-ai/InvokeAI
Allegro-txt2vidFeatured
[NVIDIA ONLY] Generate videos with Allegro txt2vid model https://github.com/rhymes-ai/Allegro
facepokeFeatured
[NVIDIA Only] Select a portrait, click to move the head around https://github.com/jbilcke-hf/FacePoke
Super fast Multilingual TTS supporting 54 voices across 8 languages.
High quality LipSync Application with a simple UI
AITownFeatured
Build and customize your own version of AI town - a virtual town where AI characters live, chat and socialize https://github.com/a16z-infra/ai-town
omnigenFeatured
A unified image generation model that you can use to perform various tasks, including but not limited to text-to-image generation, subject-driven generation, Identity-Preserving Generation, and image-conditioned generation. https://huggingface.co/spaces/Shitao/OmniGen
InstantIRFeatured
restore low-res images, restore broken images, recreate a new version of the image with a prompt https://huggingface.co/spaces/fffiloni/InstantIR