Understanding Deployment Status
| Status | Meaning |
|---|---|
| Pending | Kubernetes resources created, pod starting up |
| Running | Model is loaded and the endpoint is live |
| Failed | Pod failed to start — check the error message and logs |
First deploy cold-start times
- CaseDesk Sandbox: 10–18 min (GPU node provisioning + image pull + model download)
- Your own cluster (warm node): 5–10 min (image pull + model download)
Subsequent deploys on a warm node are significantly faster as the Ollama image is already cached.
Viewing logs
From the deployment detail page, click View Logs to see the pod output in real time. This is the fastest way to diagnose a failed deployment.