Store
LlamaFactoryFeatured
Unify Efficient Fine-Tuning of 100+ LLMs https://github.com/hiyouga/LLaMA-Factory
An Open Source Model for Audio Samples and Sound Design https://github.com/Stability-AI/stable-audio-tools
flashdiffusionFeatured
Accelerating any conditional diffusion model for few steps image generation https://gojasper.github.io/flash-diffusion-project/
audiocraft_plusFeatured
AudioCraft Plus is an all-in-one WebUI for the original AudioCraft, adding many quality features on top https://github.com/GrandaddyShmax/audiocraft_plus
moshiFeatured
[Mac only] a speech-text foundation model for real time dialogue https://github.com/kyutai-labs/moshi
Control Any Computer Using LLMs.
Quick webui for audiocraft
GUI for a Vocal Remover that uses Deep Neural Networks.
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
gradio WebUI for AdvancedLivePortrait
CosyVoice2 功能扩充(预训练音色推理/3s极速复刻/自然语言控制/自动识别/音色模型保存/API)
Contribute to SUP3RMASS1VE/Deepseek-ai-Janus-7b development by creating an account on GitHub.
Local image and music generation for Apple Silicon - GitHub - voipnuggets/flux-generator: Local image and music generation for Apple Silicon
SkyReels V1: The first and most advanced open-source human-centric video foundation model
Generate stunning illusion artwork with StableDiffusion (A space by @angrypenguinPNGAP - created with Monster Labs QR ControlNet.
Select a portrait, click to move the head around (please use your own space / GPU!)
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS providers.
Inference app for a FP8-quantized flux1-dev model. This runs on graphic cards with 16 GB of VRAM.
Make Mac apps accessible for AI agents
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
