Ripkore/stableaudiov1.5updated 1y ago
An Open Source Model for Audio Samples and Sound Design https://github.com/Stability-AI/stable-audio-tools
@ripkore0 check-insNVIDIAAMDApple
AmberSahdev/Open-Interfaceupdated 1y ago
Control Any Computer Using LLMs.
0 check-insNVIDIAAMDApple
CoffeeVampir3/audiocraft-webuiupdated 1y ago
Quick webui for audiocraft
0 check-insNVIDIAAMDApple
Anjok07/ultimatevocalremoverguiupdated 1y ago
GUI for a Vocal Remover that uses Deep Neural Networks.
0 check-insNVIDIAAMDApple
facebookresearch/audiocraftupdated 1y ago
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
0 check-insNVIDIAAMDApple
jhj0517/AdvancedLivePortrait-WebUIupdated 1y ago
gradio WebUI for AdvancedLivePortrait
0 check-insNVIDIAAMDApple
journey-ad/CosyVoice2-Exupdated 1y ago
CosyVoice2 功能扩充(预训练音色推理/3s极速复刻/自然语言控制/自动识别/音色模型保存/API)
0 check-insNVIDIAAMDApple
sup3rmass1ve/deepseek-ai-janus-7bupdated 1y ago
Contribute to SUP3RMASS1VE/Deepseek-ai-Janus-7b development by creating an account on GitHub.
@sup3rmass1ve0 check-insNVIDIAAMDApple
voipnuggets/flux-generatorupdated 1y ago
Local image and music generation for Apple Silicon - GitHub - voipnuggets/flux-generator: Local image and music generation for Apple Silicon
0 check-insNVIDIAAMDApple
SkyworkAI/SkyReels-V1updated 1y ago
SkyReels V1: The first and most advanced open-source human-centric video foundation model
0 check-insNVIDIAAMDApple
cocktailpeanut/illusion.pinokioupdated 1y ago
Generate stunning illusion artwork with StableDiffusion (A space by @angrypenguinPNGAP - created with Monster Labs QR ControlNet.
@cocktailpeanut0 check-insNVIDIAAMDApple
jbilcke-hf/FacePokeupdated 1y ago
Select a portrait, click to move the head around (please use your own space / GPU!)
0 check-insNVIDIAAMDApple
Zyphra/Zonosupdated 1y ago
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS providers.
0 check-insNVIDIAAMDApple
Neurone/flux.1-dev-fp8updated 1y ago
Inference app for a FP8-quantized flux1-dev model. This runs on graphic cards with 16 GB of VRAM.
0 check-insNVIDIAAMDApple
browser-use/macos-useupdated 1y ago
Make Mac apps accessible for AI agents
0 check-insNVIDIAAMDApple
TMElyralab/MusePoseupdated 1y ago
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
0 check-insNVIDIAAMDApple