Thinking tokens count against num_predict. At 4096 the model was running out mid-response after spending ~3000 tokens on thinking; 16384 leaves enough headroom for thinking plus a full response.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
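A minimal sketch of the change described above, assuming the request goes to Ollama's `/api/generate` endpoint (model name and prompt are illustrative, not from the original):

```python
# Sketch: raising the generation budget for a reasoning model in Ollama.
# num_predict caps the total tokens generated, and that cap includes any
# hidden "thinking" tokens, so the visible answer gets whatever is left.
import json

payload = {
    "model": "some-reasoning-model",  # hypothetical model name
    "prompt": "Explain the halting problem.",
    "options": {
        # Was 4096; with ~3000 tokens spent on thinking, responses were
        # being truncated. 16384 leaves room for thinking + full answer.
        "num_predict": 16384,
    },
    "stream": False,
}

# To send, POST this payload to http://localhost:11434/api/generate
print(json.dumps(payload["options"]))
```

The key point is that the budget must cover both phases of generation; sizing it for the answer alone is what caused the mid-response cutoffs.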