Feedback

Chat Icon

Local AI Engineering with Ollama

Run, understand, customize, fine-tune, and build agentic apps on your own hardware

Keep-Alive and Memory Control
51%

Picking a Keep-Alive That Makes Sense

There's no official guidance on this; the right value depends on how often you reuse the model and how much memory pressure you have. Here are starting points worth using and adjusting:

Use caseStarting valueReasoning
Interactive development30m to 1hSurvives breaks without paying reload cost; frees VRAM overnight
Production API server-1Never reload on live traffic
Shared workstation5m

Local AI Engineering with Ollama

Run, understand, customize, fine-tune, and build agentic apps on your own hardware

Enroll now to unlock all content and receive all future updates for free.