How OpenClaw Works with Multiple AI Models (GPT-4, Claude, Gemini & More)

Most AI assistants make a quiet decision for you: which AI model you get to use. ChatGPT gives you OpenAI's models. Copilot gives you Microsoft's. Gemini gives you Google's.

OpenClaw doesn't work that way.

Because it's self-hosted and open, you configure which model powers it. You can change that at any time. You can run different models for different tasks. And if a better model releases tomorrow, you're not waiting for someone else to flip a switch — you just update a config line.

This guide explains how OpenClaw's multi-model system works, which models are supported out of the box, and how to choose the right one for your use case.

Why Model Flexibility Matters

AI models differ in ways that actually matter to daily users:

Cost per token — Claude Haiku is far cheaper than GPT-4o for simple tasks
Context window — Gemini 1.5 Pro handles 1M tokens; great for long documents
Coding quality — Claude 3.5 Sonnet and DeepSeek consistently outperform on code
Privacy — Local Ollama models never send your data anywhere
Speed — Smaller models like Mistral 7B respond faster for quick lookups

When you're locked into one provider's model, you're always making a compromise. OpenClaw lets you stop compromising.

How OpenClaw Routes Model Requests

OpenClaw uses a model string in its config — typically following the provider/model-name format. When you send a message, OpenClaw calls the configured provider's API and returns the response.

The core architecture looks like this:

# In your OpenClaw config
model: anthropic/claude-sonnet-4-6
# or
model: openai/gpt-4o
# or
model: google/gemini-1.5-pro
# or (local via Ollama)
model: ollama/llama3.1

Switching models is a config change and a restart. That's it. Your memory, your skills, your personality files — they all stay the same. The model is just the inference engine underneath.

Supported Model Providers

OpenAI (GPT-4o, GPT-4 Turbo, GPT-4o Mini)

OpenAI's models are the most widely known and well-tested. GPT-4o is OpenClaw's best general-purpose choice for most users — strong reasoning, good coding, reliable instruction-following.

model: openai/gpt-4o

For cost-conscious setups, GPT-4o Mini handles most conversational tasks at a fraction of the price. Good for high-volume automations where you're watching your API spend.

Anthropic (Claude 3.5 Sonnet, Claude 3 Haiku)

Claude models tend to excel at long-context tasks, code review, and writing. Claude 3.5 Sonnet is arguably the best all-round model for agentic work — it follows complex multi-step instructions particularly well, which matters for OpenClaw's automation features.

model: anthropic/claude-sonnet-4-6

Claude Haiku is the budget pick: blazing fast, very cheap, solid for simple tasks and quick answers.

Google Gemini (1.5 Pro, 1.5 Flash)

Gemini's standout feature is context length — up to 1 million tokens in Gemini 1.5 Pro. If you're feeding OpenClaw large documents, entire codebases, or long conversation histories, Gemini handles it without truncating.

model: google/gemini-1.5-pro

Gemini 1.5 Flash is Google's speed-optimised variant — faster and cheaper, with a still-generous 1M context window.

Mistral (Mistral Large, Mistral 7B)

Mistral models are known for efficiency. Mistral Large punches above its weight for reasoning tasks; Mistral 7B is compact enough to run locally on decent hardware.

model: mistral/mistral-large-latest

Local Models via Ollama

This is where OpenClaw diverges most sharply from cloud-only assistants. If you've set up Ollama on your server, you can point OpenClaw at any locally-running model — Llama 3.1, Phi-3, Gemma 2, Qwen 2.5, or anything else in the Ollama library.

model: ollama/llama3.1
# Make sure Ollama is running at localhost:11434

Zero API costs. Zero data leaving your machine. Ideal for sensitive personal data or if you want complete air-gap privacy.

For a deeper dive into running local models, see our guide on running your own local AI with OpenClaw and Ollama.

OpenRouter (Access 100+ Models in One Place)

OpenRouter is a meta-provider that gives you a single API key with access to most major models — OpenAI, Anthropic, Google, Mistral, Meta, and many others. It's a convenient option if you want to experiment without managing multiple API keys.

model: openrouter/anthropic/claude-3.5-sonnet

OpenRouter also has a free tier with limited access to several models — useful for testing OpenClaw on a budget before committing to paid APIs.

Using Different Models for Different Tasks

OpenClaw's cron job system lets you specify a model per scheduled job. This is powerful: you might want your cheap daily-summary cron to use Gemini Flash, while your deep research tasks use Claude Sonnet.

# In your cron config
- name: morning-summary
  model: google/gemini-flash-1.5   # cheap and fast
  schedule: 0 8 * * *

- name: weekly-research-brief
  model: anthropic/claude-sonnet-4-6   # better reasoning
  schedule: 0 9 * * 1

This kind of per-task model routing is something closed AI products simply can't offer. For more on building these kinds of automated schedules, see the guide on OpenClaw cron jobs.

Model Performance: Quick Comparison

Model	Best For	Context	Cost	Privacy
`openai/gpt-4o`	General use, tools	128K	$$	Cloud
`anthropic/claude-sonnet`	Coding, long tasks	200K	$$	Cloud
`google/gemini-1.5-pro`	Big documents	1M	$$	Cloud
`anthropic/claude-haiku`	Quick answers	200K	$	Cloud
`openai/gpt-4o-mini`	High-volume tasks	128K	$	Cloud
`ollama/llama3.1`	Full privacy	8K–128K	Free	Local ✓

Switching Models: Step-by-Step

Here's the full process for changing your OpenClaw model:

Get your API key for the provider you want (Anthropic, OpenAI, Google AI Studio, etc.)
Add the key to your OpenClaw secrets or environment config:
```
# e.g. for Anthropic
ANTHROPIC_API_KEY=sk-ant-xxxx
```
Update your model string in the OpenClaw gateway config:
```
model: anthropic/claude-sonnet-4-6
```
Restart the gateway:
```
openclaw gateway restart
```
Test it — send a message and confirm the new model is responding

Your memory files, SOUL.md, skills, and all customisations survive a model switch intact. The model is stateless — your state lives in files, not the model.

Which Model Should You Start With?

If you're new to OpenClaw and want a single recommendation: start with Claude 3.5 Sonnet. It has the best balance of capability, context length, and instruction-following for OpenClaw's agentic patterns. Anthropic's API is straightforward to set up and the free tier covers initial testing.

Once you're comfortable, experiment. Try GPT-4o for comparison. Add a cheap model for your routine crons. If privacy is a concern, test an Ollama setup on the side.

The whole point of OpenClaw's architecture is that the model isn't a lock-in decision — it's a tunable parameter. Use that freedom.

What Doesn't Change Between Models

One concern people raise: "If I switch models, does my assistant lose its personality?"

No — because in OpenClaw, personality, memory, and behaviour are defined in files (SOUL.md, USER.md, MEMORY.md), not baked into the model. Every model reads those files at the start of each session.

A well-written SOUL.md will produce a consistent persona whether you're running Claude, GPT-4o, or a local Llama model. The model changes; the assistant's character and memory don't. For a deeper look at how this system works, see the post on OpenClaw's memory system.

Summary

OpenClaw is model-agnostic — change providers with a single config line
Supported: OpenAI, Anthropic, Google Gemini, Mistral, Ollama (local), OpenRouter
Different cron jobs can use different models for cost optimisation
Memory and personality survive model switches — they live in files, not the model
For privacy: Ollama lets you run entirely locally at zero API cost

// get started

Ready to Run OpenClaw with Your Preferred Model?

Install OpenClaw on your VPS in minutes and connect it to any AI model you choose. No lock-in, full control.

Install OpenClaw Free →