Estimates the required context window (num_ctx) from the prompt length (chars/3 + 2048 response buffer) and rounds up to the next fixed tier (4096, 8192, 16384, 32768, 65536). Using fixed tiers keeps num_ctx stable across calls, which prevents Ollama from reloading the model on every call. The chosen num_ctx is prepended to the stored llm_prompt so it's visible in the debug window.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
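The tier selection described above can be sketched roughly as follows. This is a minimal illustration, not the committed code: the function names `estimate_num_ctx` and `annotate_prompt`, the `[num_ctx=...]` prefix format, integer division for chars/3, and clamping to the largest tier when the estimate exceeds 65536 are all assumptions.

```python
# Fixed context tiers and response buffer, as listed in the commit message.
NUM_CTX_TIERS = (4096, 8192, 16384, 32768, 65536)
RESPONSE_BUFFER = 2048


def estimate_num_ctx(prompt: str) -> int:
    """Estimate tokens as chars/3 plus a response buffer, then pick a tier.

    Assumption: integer division is used for chars/3, and estimates larger
    than the top tier are clamped to it.
    """
    needed = len(prompt) // 3 + RESPONSE_BUFFER
    for tier in NUM_CTX_TIERS:
        if needed <= tier:
            return tier
    return NUM_CTX_TIERS[-1]


def annotate_prompt(prompt: str) -> str:
    """Prepend the chosen num_ctx so it shows up wherever the prompt is stored.

    The prefix format here is hypothetical.
    """
    num_ctx = estimate_num_ctx(prompt)
    return f"[num_ctx={num_ctx}]\n{prompt}"
```

Because every prompt that fits in a given tier maps to the same num_ctx, consecutive calls with similar prompt sizes would send an identical value, which is what avoids the reload.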