gpt-engineer-org/gpt-engineerupdated 1y ago
CLI platform to experiment with codegen. Precursor to: https://lovable.dev
0 check-insNVIDIAAMDApple
brunostjohn/perplexideezupdated 1y ago
Search the web and your self-hosted apps using local AI agents.
0 check-insNVIDIAAMDApple
betapeanut/pyramid-flowupdated 1y ago
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
0 check-insNVIDIAAMDApple
instantX-research/InstantIRupdated 1y ago
InstantIR: Blind Image Restoration with Instant Generative Reference 馃敟
0 check-insNVIDIAAMDApple
facebookresearch/seamless_communicationupdated 1y ago
Foundational Models for State-of-the-Art Speech and Text Translation
0 check-insNVIDIAAMDApple
ai-anchorite/BRIA-RMBG-2.0v2.0updated 1y ago
@anchorite0 check-insNVIDIAAMDApple
EdAlXGoAm/e2-f5-tts-spanishv2.0updated 1y ago
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching https://huggingface.co/spaces/mrfakename/E2-F5-TTS
2 check-insNVIDIAAMDApple
promptpirate/amphion.pinokiov2.0updated 1y ago
Amphion: An Open-Source Audio, Music, and Speech Generation Toolkit: https://github.com/open-mmlab/Amphion
0 check-insNVIDIAAMDApple
cocktailpeanut/f5-ttsupdated 1y ago
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
@cocktailpeanut0 check-insNVIDIAAMDApple
ai-anchorite/Diffusers-Image-Communityv2.0updated 1y ago
Inpaint or Remove objects from an image - or Outpaint - or Outpaint Video Zoom: 16GB+ GPU | 32GB+ RAM | 20GB+ Storage -- Read the README for more!
@anchorite0 check-insNVIDIAAMDApple
AI4Bharat/Indic-TTSupdated 1y ago
Text-to-Speech for languages of India
0 check-insNVIDIAAMDApple
BAAI-Agents/Cradleupdated 1y ago
The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curation, in a standardized general environment with minimal requirements.
0 check-insNVIDIAAMDApple
Manobhiramlol/Deep-Live-Cam-mainupdated 1y ago
real time face swap and one-click video deepfake with only a single image
0 check-insNVIDIAAMDApple
pinokiofactory/allegro-txt2vidupdated 1y ago
Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.
0 check-insNVIDIAAMDApple
gpt-omni/mini-omniupdated 1y ago
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
0 check-insNVIDIAAMDApple
zhan-xu/RigNetupdated 1y ago
Code for SIGGRAPH 2020 paper "RigNet: Neural Rigging for Articulated Characters"
0 check-insNVIDIAAMDApple
ai-dock/fooocusupdated 1y ago
Fooocus web UI for Stable Diffusion
0 check-insNVIDIAAMDApple
adityapatils/LLM-GEN-AIupdated 1y ago
Discover scalable Generative AI and LLM projects for innovative NLP applications, focusing on language understanding and transformation. - adityapatils/LLM-GEN-AI
0 check-insNVIDIAAMDApple
pinokiofactory/mochiv2.0updated 1y ago
0 check-insNVIDIAAMDApple