LongCat AudioDiT
InstallableDiffusion-based TTS with zero-shot voice cloning (1B / 3.5B). Upload a voice reference, auto-transcribe, and generate matching speech for video pickups and ADR.
Posts
Sort
Loading…
Diffusion-based TTS with zero-shot voice cloning (1B / 3.5B). Upload a voice reference, auto-transcribe, and generate matching speech for video pickups and ADR.