Skip to main content

How it Works

  1. Connect your cluster — authenticate with AWS or Azure and select a cluster
  2. Choose a model — pick from the catalogue (Llama, DeepSeek, Phi, and more)
  3. Deploy — CaseDesk creates a Kubernetes namespace and deploys Ollama into your cluster
  4. Use the endpoint — every deployment gets a unique OpenAI-compatible URL

Your data never leaves your infrastructure.