GridOS ships with a static catalog of models across four providers. Once you configure an API key for a provider, every model from that provider appears in the model picker dropdown in the chat composer. You can switch models between requests without restarting.
This page is the authoritative catalog — ordered by practical usefulness, not alphabetically.
## Model catalog

### Google Gemini (recommended for first-time users)

| Model ID | Free-tier TPM | Notes |
|---|---|---|
| `gemini-3.1-flash-lite-preview` | ~250K | Recommended default. Fast, cheap, huge TPM headroom. Handles multi-intent 3-statement and DCF builds comfortably. |
| `gemini-3.1-pro` | ~250K | Higher-quality Gemini; slower and more expensive per token. Use when Flash Lite struggles on complex reasoning. |
### Anthropic Claude

| Model ID | Free tier | Notes |
|---|---|---|
| `claude-haiku-4-5-20251001` | $5 credits | Fast and cheap; strong JSON reliability. Best free-credit option if you don’t have a Gemini key. |
| `claude-sonnet-4-6` | $5 credits | Balanced; solid for mid-complexity financial models. |
| `claude-opus-4-7` | $5 credits | Most capable Claude; best for long-reasoning builds where quality matters more than cost. |
### Groq

Fast inference (~500–1000 tokens/sec) but tight free-tier TPM. See the free-tier fit table before picking a Groq model on the free tier.

| Model ID | Free-tier TPM | Notes |
|---|---|---|
| `openai/gpt-oss-120b` | ~8K | Strongest Groq model for structured JSON. Tight on the free tier for multi-intent builds — pair with chain mode or upgrade your Groq plan. |
| `openai/gpt-oss-20b` | ~20K | Fastest Groq model (~1000 tps). Also the preferred router model when you have Groq configured. |
| `qwen/qwen3-32b` | ~12K | Preview; strong at structured output. |
| `llama-3.3-70b-versatile` | ~12K | Capable but occasionally prefaces JSON with prose — GridOS’s balanced-brace parser strips that, but reliability varies. |
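The "balanced-brace parser" mentioned above can be illustrated with a minimal sketch: scan for the first `{`, then track brace depth until it returns to zero, discarding any prose preamble. This is an assumption about the approach, not GridOS's actual implementation, and the simple version below does not handle braces inside JSON string values.

```python
import json

def extract_json(text: str):
    """Pull the first complete {...} object out of model output that
    may be prefaced with prose. Returns None if no object is found."""
    start = text.find("{")
    if start == -1:
        return None
    depth = 0
    for i, ch in enumerate(text[start:], start):
        if ch == "{":
            depth += 1
        elif ch == "}":
            depth -= 1
            if depth == 0:  # matching close for the first open brace
                return json.loads(text[start:i + 1])
    return None  # unbalanced output — caller should retry or switch models
```

A production version would also need to skip braces occurring inside quoted strings, which is one reason reliability still varies by model.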
### OpenRouter

| Model ID | Free tier | Notes |
|---|---|---|
| `nousresearch/hermes-3-llama-3.1-405b:free` | Free | Large 405B model with reasoning chops. Rate-limited. |
| `meta-llama/llama-3.3-70b-instruct:free` | Free | Strong general-purpose Llama; reliable for GridOS prompts. |
| `meta-llama/llama-3.2-3b-instruct:free` | Free | Tiny and fast; fine for simple single-cell edits. |
| `openrouter/free` | Free | Meta-router — auto-picks a working free model at request time. Use when you don’t want to think about model choice. |
`llama-3.1-8b-instant` (Groq) is in the catalog but hidden from the chat composer’s model picker — its 6K TPM free-tier ceiling can’t fit GridOS’s agent prompt. It is used internally as the router classifier, where prompts are small enough to fit.
## Picking a model
Open the chat composer at the bottom of the workbook view. The model picker dropdown is embedded in the composer — click it to see every model available for your configured providers. Select any model and your next chat message uses it. The selection persists across page reloads (stored in localStorage).
If a model returns garbled or non-JSON output, switch to a stronger one (Claude Sonnet, Gemini Pro) or rephrase your prompt to be more specific. If you hit `413 — Request too large`, switch providers or enable chain mode; see Troubleshooting.
## The router model
GridOS uses a separate, smaller model for intent classification — deciding whether your prompt needs the finance agent or the general agent. This routing call is pinned to whichever fast small model has a configured key, in this preference order:
1. `llama-3.1-8b-instant` (Groq, ~560 tps)
2. `gemini-3.1-flash-lite-preview` (Google)
3. `claude-haiku-4-5-20251001` (Anthropic)
4. `meta-llama/llama-3.2-3b-instruct:free` (OpenRouter)
The router call is lightweight: it reserves only 32 completion tokens (enough for one lowercase agent id), so it fits even the tightest free-tier TPM buckets. The model you choose in the composer drives only the agent call — the part that actually writes cells.
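The preference order above amounts to a first-match scan over configured providers. A minimal sketch, assuming a hypothetical `pick_router_model` helper and a set of configured provider names (neither is GridOS's real API):

```python
# Preference order from the list above: first provider with a key wins.
ROUTER_PREFERENCE = [
    ("groq", "llama-3.1-8b-instant"),
    ("gemini", "gemini-3.1-flash-lite-preview"),
    ("anthropic", "claude-haiku-4-5-20251001"),
    ("openrouter", "meta-llama/llama-3.2-3b-instruct:free"),
]

def pick_router_model(configured_providers: set) -> "str | None":
    """Return the first preferred router model whose provider is configured."""
    for provider, model_id in ROUTER_PREFERENCE:
        if provider in configured_providers:
            return model_id
    return None  # no provider configured — routing cannot run
```

For example, with only a Gemini key configured, the router call would go to `gemini-3.1-flash-lite-preview` even if your composer selection is a Claude model.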
## Adding your own models
Self-hosted users can extend the catalog two ways:
### Via a plugin (recommended)
Create a plugin that calls `kernel.model({...})` — see Plugins for the full walkthrough. Plugins don’t require editing core files and can ship as a self-contained directory.
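As a rough sketch of the shape such a plugin might take — the `register(kernel)` entry point is an assumption for illustration; only the `kernel.model({...})` call and the entry fields come from this page, so follow the Plugins walkthrough for the real structure:

```python
def register(kernel):
    """Hypothetical plugin entry point: add one model to the catalog."""
    kernel.model({
        "id": "mistralai/mistral-7b-instruct:free",   # sent to the provider SDK
        "provider": "openrouter",                     # must be a known provider
        "display_name": "Mistral 7B (free)",
        "description": "Free Mistral 7B via OpenRouter.",
    })
```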
### By editing `core/providers/catalog.py`
Append an entry to `MODEL_CATALOG`:
```python
{
    "id": "model-id-sent-to-provider-sdk",
    "provider": "gemini",  # one of: gemini | anthropic | groq | openrouter
    "display_name": "Human-readable name",
    "description": "Short description shown in the UI.",
}
```
The UI picks up new entries on the next page load, as long as the owning provider has a configured key.
#### Groq example

```python
{
    "id": "llama-3.1-70b-versatile",
    "provider": "groq",
    "display_name": "Llama 3.1 70B (Groq)",
    "description": "Llama 3.1 70B via Groq.",
}
```

#### OpenRouter example

```python
{
    "id": "mistralai/mistral-7b-instruct:free",
    "provider": "openrouter",
    "display_name": "Mistral 7B (free)",
    "description": "Free Mistral 7B via OpenRouter.",
}
```
To restrict a model to router use only, add `"router_only": True` — the picker will hide it, but `route_prompt()` will still reach it.
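Putting the two visibility rules together — provider must have a key, and `router_only` entries stay hidden — the picker's filtering logic might look like this sketch (the helper name and the minimal catalog entries are hypothetical; only the entry fields match the format documented above):

```python
MODEL_CATALOG = [
    {"id": "gemini-3.1-pro", "provider": "gemini"},
    # Hidden from the picker, still reachable by route_prompt():
    {"id": "llama-3.1-8b-instant", "provider": "groq", "router_only": True},
]

def picker_models(catalog, configured_providers):
    """Entries shown in the composer's model picker."""
    return [
        m for m in catalog
        if m["provider"] in configured_providers
        and not m.get("router_only", False)
    ]
```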