diarization-ui/app.py at 4f651fac1274a2ab85ffa2adc926d69e5a8aa36f

Files

wb 4f651fac12 fix(diarization-ui): capture thinking tokens in debug stream (Qwen3)

Ollama streaming chunks for thinking models use a separate "thinking"
field. Previously only "response" was captured, leaving the debug
window empty while the model reasoned. Now both fields are tracked
independently: thinking is shown in blue above the final answer,
both are persisted to new llm_thinking / existing llm_response columns.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

2026-05-06 15:50:40 +02:00

73 KiB

Raw Blame History

View Raw

73 KiB Raw Blame History

73 KiB

Raw Blame History