tota supports 12 AI providers and tries them in order when the primary fails — rate limits, network errors, API outages. You configure providers once during setup; the fallback chain is automatic.

#Supported providers

ProviderTop modelKey env var
OpenAIgpt-5.2OPENAI_API_KEY
Anthropicclaude-sonnet-4-20250514ANTHROPIC_API_KEY
DeepSeekdeepseek-chatDEEPSEEK_API_KEY
Grok (xAI)grok-4XAI_API_KEY
Groqllama-3.3-70b-versatileGROQ_API_KEY
MiMomimo-v2.5-proMIMO_API_KEY
MiMo Token Planmimo-v2.5-proMIMO_TOKEN_PLAN_API_KEY
Ollama (local)qwen3.5:2b(none)
Ollama (cloud)gpt-oss:120bOLLAMA_API_KEY
OpenAI-compatibleyour chosen modelOPENAI_COMPAT_API_KEY
NVIDIA NIMnvidia/nemotron-3-super-120b-a12bNVIDIA_API_KEY
OpenRouteropenrouter/autoOPENROUTER_API_KEY
ℹ️

Env vars are optional — tota stores keys in ~/.tota/config.json. The env vars are only for CI/CD environments or if you prefer not to use the config file.

ℹ️

DeepSeek Reasoner (deepseek-reasoner) and MiMo providers use a specialized AI SDK code path for extended reasoning. Select them during tota setup llm to enable this behavior.

#How fallback works

tota tries the primary provider first. On failure (rate limit, timeout, API error), it retries once, then falls through to the next configured provider. If all fail, it surfaces a clear error message.

The fallback order follows your provider setup order. To see your current chain:

tota status