qwen3-tts-ra

Files

pi-bot-01 fef6a1b74c feat: add PCM streaming + Kokoro voice name support

- POST /audio/speech with response_format=pcm now streams raw 16-bit
  PCM (24kHz mono) via Flask generator — compatible with customtts
  extension streaming mode
- resolve_voice() handles:
    * Standard OpenAI names (alloy, echo, ...)
    * Kokoro blend syntax: 'af_bella+bf_emma+af_nicole' (picks first)
    * Kokoro prefix heuristic: af_/bf_/am_/bm_ → Ryan, zf_/zm_ → Vivian
    * Explicit Kokoro aliases for common voices (bella, emma, sky, etc.)
    * Graceful fallback to alloy for unknown voices
- app.run(threaded=True) to support concurrent streaming connections

2026-03-25 21:39:56 -07:00

app.py

feat: add PCM streaming + Kokoro voice name support

2026-03-25 21:39:56 -07:00

requirements.txt

feat: Qwen3-TTS proxy with HIP graph + CPU decoder optimisations

2026-03-25 21:18:42 -07:00