qwen3-TTS-studio: ElevenLabs-style voice cloning + NotebookLM-style podcast generation, but local
- Clone any voice with just a 3-second audio sample
- Fine-tune parameters (temperature, top-k, top-p) with quality presets
- Generate complete podcasts from just a topic – AI writes the script, assigns voices, and synthesizes everything
- 10 languages supported (Korean, English, Chinese, Japanese, etc.
Currently uses gpt5.2 for script generation, but the architecture is modular – you can swap in any local LLM (Qwen, Llama, etc.) if you want fully local.
GitHub - bc-dunia/qwen3-TTS-studio: A professional-grade interface for Qwen3-TTS, designed to unlock the model's full potential with fine-grained control and intuitive workflows.
A professional-grade interface for Qwen3-TTS, designed to unlock the model's full potential with fine-grained control and intuitive workflows. - bc-dunia/qwen3-TTS-studioGitHub
