HelixML

Supported models and providers

The LLM providers Helix supports and the model identifiers to use in project and assistant configuration.

Helix routes inference to external providers — it doesn't bundle models. You configure providers under Account → AI Providers (UI) or controlplane.providers (Helm).

Providers

ProviderTypeNotes
AnthropicCloud APIClaude models. Recommended for coding agents.
OpenAICloud APIGPT-4 and o-series models
GoogleCloud APIGemini models via the Gemini API
Together AICloud APIHosted open-source models (Llama, Mistral, Qwen, etc.)
HelixBuilt-inAvailable on Helix Cloud. Useful for getting started without your own keys.
OpenAI-compatibleAnyAny endpoint serving /v1/chat/completions — vLLM, Ollama, LM Studio, Anyscale, etc.
Anthropic via VertexCloud APIAnthropic models routed through GCP Vertex AI

Model identifiers

Use these in model: fields in project and assistant YAML.

Anthropic

Model IDDescription
claude-opus-4-8Most capable Claude model
claude-sonnet-4-6Balanced capability and speed. Default for most coding tasks.
claude-haiku-4-5-20251001Fastest, lowest cost

OpenAI

Model IDDescription
gpt-4oGPT-4 Omni — multimodal, strong reasoning
gpt-4o-miniSmaller, faster, lower cost
o3High-reasoning model for complex tasks
o4-miniFast reasoning

Google

Model IDDescription
gemini-2.5-proMost capable Gemini model
gemini-2.5-flashFast and cost-efficient

Qwen (via Together AI or Helix runner)

Model IDDescription
qwen3-coder-480bLarge coding-tuned Qwen model
qwen3:8bSmall Qwen model, runs locally
qwen3:30b-a3bMid-size Qwen model

Local / self-hosted (OpenAI-compatible)

Any model served by Ollama, vLLM, LM Studio, or another OpenAI-compatible server. The model ID is whatever that server uses:

  agent:
    model: llama3.3          # or deepseek-r1, gemma3, mistral, etc.
    provider: ollama          # whatever you named this provider in Helix

Model selection for code agents

Code agentModel selection
Claude CodeManaged internally — you don't set model or provider
GooseSet model and provider in the project YAML
Qwen CodeSet model and provider. Works with any OpenAI-compatible provider.
Zed AgentSet model and provider. Routes through Helix's /v1 proxy.

Recommendations

For coding tasks (spec tasks, implementation):

  • Claude Code with Anthropic → best overall code quality
  • Sonnet 4.6 or Opus 4.8 for Goose/Zed/Qwen → strong alternatives
  • Qwen3-Coder-480b via Together AI → best open-source option

For chat assistants (knowledge bases, support bots):

  • Qwen3:8b → fast, low cost, runs locally
  • GPT-4o-mini → good for cost-sensitive cloud deployments
  • Gemini Flash → fast and multimodal

For planning and analysis (complex reasoning):

  • o3 or Claude Opus → highest reasoning quality