Files
diarization-ui/app.py
wb 91b8522916 feat(diarization-ui): dynamic num_ctx for Ollama based on prompt size
Estimates required context window from prompt length (chars/3 + 2048
response buffer) and rounds up to the nearest fixed tier (4096, 8192,
16384, 32768, 65536). Fixed tiers prevent Ollama from reloading the
model on every call. The chosen num_ctx is prepended to the stored
llm_prompt so it's visible in the debug window.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-05-06 15:56:59 +02:00

74 KiB