KenjieDec/RemBG-Pinokiov1.0updated 4mo ago
Pinokio WebUI for danielgatis' RemBG. RemBG is a tool to remove images background
0 check-insNVIDIAAMDApple
ChasonJiang/GPT-SoVITSupdated 4mo ago
1 min voice data can also be used to train a good TTS model! (few shot voice cloning) - ChasonJiang/GPT-SoVITS
0 check-insNVIDIAAMDApple
neviah/Fara-Pinokiov3.7updated 4mo ago
Microsoft's 7B parameter computer use agent with Gradio interface
@ramshi0 check-insNVIDIAAMDApple
SUP3RMASS1VE/MiraTTS-Pinokiov4.0updated 4mo ago
@sup3rmass1ve0 check-insNVIDIAAMDApple
Paxurux/chatterbox-old-supermasive-vrv3.7updated 4mo ago
SoTA open-source TTS
0 check-insNVIDIAAMDApple
bigai-nlco/IMTalkerupdated 4mo ago
Contribute to bigai-nlco/IMTalker development by creating an account on GitHub.
0 check-insNVIDIAAMDApple
OpenImagingLab/FlashVSRupdated 4mo ago
[CVPR 2026] Towards Real-Time Diffusion-Based Streaming Video Super-Resolution — An efficient one-step diffusion framework for streaming VSR with locality-constrained sparse attention and a tiny conditional decoder.
0 check-insNVIDIAAMDApple
dangvansam/viet-ttsupdated 4mo ago
VietTTS: An Open-Source Vietnamese Text to Speech
0 check-insNVIDIAAMDApple
travisvn/chatterbox-tts-apiupdated 4mo ago
Local, OpenAI-compatible text-to-speech (TTS) API using Chatterbox, enabling users to generate voice cloned speech anywhere the OpenAI API is used (e.g. Open WebUI, AnythingLLM, etc.)
0 check-insNVIDIAAMDApple
leafspark/AutoGGUFupdated 4mo ago
automatically quant GGUF models
0 check-insNVIDIAAMDApple
platomav/MEAnalyzerupdated 4mo ago
Intel Engine & Graphics Firmware Analysis Tool
0 check-insNVIDIAAMDApple
6Morpheus6/IndexTTS2v3.7updated 4mo ago
Emotionally Expressive and Duration-Controlled Auto-Regressive Zero-Shot Text-to-Speech application
@morpheus1 check-inNVIDIAAMDApple
V-Sekai-fire/pinokio-image-to-3dv1.0.0updated 4mo ago
ComfyUI with TRELLIS2, GeometryPack, and UniRig custom nodes for image-to-3D generation
1 check-inNVIDIAAMDApple
heiredjio-beep/e2-f5-ttsv3.7updated 4mo ago
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching https://huggingface.co/spaces/mrfakename/E2-F5-TTS
0 check-insNVIDIAAMDApple
serpotapov/stable-diffusion-portableupdated 4mo ago
Stable Diffusion Portable
0 check-insNVIDIAAMDApple
devnen/Chatterbox-TTS-Serverupdated 4mo ago
Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), predefined voices, voice cloning, and large audiobook-scale text processing. Runs accelerated on NVIDIA (CUDA), AMD (ROCm), and CPU.
0 check-insNVIDIAAMDApple
6Morpheus6/photomaker2v3.7updated 4mo ago
Customizing Realistic Human Photos via Stacked ID Embedding https://huggingface.co/spaces/TencentARC/PhotoMaker-V2
@morpheus1 check-inNVIDIAAMDApple
tonykipkemboi/ollama_pdf_ragupdated 4mo ago
A full-stack demo showcasing a local RAG (Retrieval Augmented Generation) pipeline to chat with your PDFs.
0 check-insNVIDIAAMDApple
PierrunoYT/Kokoro-TTS-Localupdated 4mo ago
A local implementation of the Kokoro Text-to-Speech model, featuring dynamic module loading, automatic dependency management, and a web interface.
@pierrunoyt0 check-insNVIDIAAMDApple
facebookresearch/sam-3d-bodyupdated 4mo ago
The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints and datasets, and example notebooks that show how to use the model.
0 check-insNVIDIAAMDApple