Overview
Aster Agents supports bring-your-own-API-key across multiple providers. You connect your own API keys in Control Hub > Providers, then select a model when building each agent. This gives you full control over cost, performance, and provider choice.
Don’t use outdated models. Models like GPT-4o, Claude 3.5 Sonnet, or GPT-4.1 are previous-generation and significantly underperform current models on agentic tasks. Always use the latest models listed below.
Supported Providers
| Provider | API Key Location | Models |
|---|---|---|
| Anthropic | console.anthropic.com | Claude Opus 4.6, Sonnet 4.6, Haiku 4.5 |
| OpenAI | platform.openai.com | GPT-5.4, GPT-5.4 Mini, GPT-5.4 Nano |
| Google | aistudio.google.com | Gemini 3.1 Pro, Gemini 3 Flash |
| xAI | console.x.ai | Grok 4-1 |
| Azure OpenAI | Azure Portal | Any deployed model |
Add your API key in Control Hub > Providers to unlock that provider’s models across all your agents.
Recommended Models
For most agents, choose among these three models:
Claude Sonnet 4.6 — Best All-Around
The default choice for most agents. Near-flagship intelligence at mid-tier pricing with the fastest output speed of any frontier model.
| Spec | Value |
|---|---|
| Input | $3.00 / 1M tokens |
| Output | $15.00 / 1M tokens |
| Context | 1M tokens |
| Max Output | 64K tokens |
| Strengths | Instruction following, tool use, structured output, long documents |
Best for: General-purpose agents, CRM workflows, document analysis, data extraction, customer-facing agents, scheduled tasks, multi-tool orchestration.
Why Sonnet over Opus? Sonnet 4.6 scores within 1-2 percentage points of Opus 4.6 on coding and agentic benchmarks (79.6% vs 80.8% on SWE-bench) while being significantly faster and 40% cheaper on both input and output tokens. For most agent tasks (querying databases, filling out forms, searching knowledge bases, writing emails) the difference is imperceptible.
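To make the 40% figure concrete, here is a minimal sketch of the per-request arithmetic at the list prices in the spec tables. The model keys and the 20K-input / 1K-output workload are hypothetical, chosen only for illustration, and caching discounts are ignored.

```python
# Per-million-token list prices (USD) copied from the spec tables in this guide.
# The dictionary keys are illustrative labels, not official model identifiers.
PRICES = {
    "claude-sonnet-4.6": {"input": 3.00, "output": 15.00},
    "claude-opus-4.6": {"input": 5.00, "output": 25.00},
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one request at list prices (no caching)."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


# Hypothetical workload: 20K input tokens and 1K output tokens per request.
sonnet = request_cost("claude-sonnet-4.6", 20_000, 1_000)
opus = request_cost("claude-opus-4.6", 20_000, 1_000)
savings = 1 - sonnet / opus

print(f"Sonnet ${sonnet:.4f} vs Opus ${opus:.4f} per request ({savings:.0%} cheaper)")
```

Because both the input and output prices differ by the same ratio, the saving stays at 40% regardless of how the tokens split between input and output.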
Claude Opus 4.6 — Maximum Capability
The most capable model for tasks requiring deep reasoning, complex multi-step planning, or handling ambiguous instructions.
| Spec | Value |
|---|---|
| Input | $5.00 / 1M tokens |
| Output | $25.00 / 1M tokens |
| Context | 1M tokens |
| Max Output | 128K tokens |
| Strengths | Complex reasoning, nuanced judgment, long-form generation, ambiguous tasks |
Best for: Legal document analysis, financial modeling, research agents, complex multi-agent workflows, agents that need to handle edge cases gracefully, tasks where accuracy matters more than cost.
When to choose Opus: When your agent regularly encounters situations that require judgment calls — interpreting vague requirements, handling conflicting information, or producing output where subtle errors have real consequences.
GPT-5.4 — Strong Alternative
OpenAI’s flagship. Competitive with Claude Opus on overall benchmarks, with slightly different strengths.
| Spec | Value |
|---|---|
| Input | $2.50 / 1M tokens |
| Output | $15.00 / 1M tokens |
| Context | 1M tokens |
| Max Output | 128K tokens |
| Strengths | Broad knowledge, web search tool, image generation, code interpretation |
Best for: Agents that need OpenAI’s built-in tools (web search, code interpreter, image generation), teams already invested in the OpenAI ecosystem, general-purpose agents where input cost sensitivity matters.
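OpenAI-exclusive tools: GPT-5.4 unlocks web_search, code_interpreter, and image_generation as provider tools. These OpenAI-hosted tools aren’t available with other providers, although Google and xAI offer their own provider tools, such as Google Search and X Search.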
Budget-Friendly Options
For high-volume or cost-sensitive workloads:
| Model | Input | Output | Best For |
|---|---|---|---|
| GPT-5.4 Mini | $0.75 / 1M | $4.50 / 1M | Capable subagents, classification, extraction |
| GPT-5.4 Nano | $0.20 / 1M | $1.25 / 1M | High-volume routing, simple classification, triage |
| Claude Haiku 4.5 | $1.00 / 1M | $5.00 / 1M | Fast responses, simple Q&A, scheduled monitoring tasks |
These models are excellent for multi-agent workflows where a cheaper model handles routine subtasks (classification, data formatting, simple lookups) while a flagship model handles the complex reasoning.
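The saving from this split can be estimated with a short calculation. The sketch below assumes a hypothetical workload (10,000 requests of 2K input and 200 output tokens each, 80% of them routine) and uses Sonnet 4.6 as the stronger model and GPT-5.4 Nano as the worker. The prices come from the tables in this guide; the model keys and function names are illustrative.

```python
# Per-million-token list prices (USD) as (input, output), from this guide's tables.
# Keys are illustrative labels, not official model identifiers.
PRICES = {
    "sonnet-4.6": (3.00, 15.00),
    "gpt-5.4-nano": (0.20, 1.25),
}


def cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """USD cost of a single request at list prices."""
    input_price, output_price = PRICES[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def blended_cost(requests: int, routine_share: float,
                 input_tokens: int, output_tokens: int) -> float:
    """Total cost when `routine_share` of requests go to the cheap worker model."""
    routine = round(requests * routine_share)
    complex_requests = requests - routine
    return (routine * cost("gpt-5.4-nano", input_tokens, output_tokens)
            + complex_requests * cost("sonnet-4.6", input_tokens, output_tokens))


# Hypothetical workload: everything on Sonnet vs. 80% of requests routed to Nano.
all_on_sonnet = blended_cost(10_000, 0.0, 2_000, 200)
mixed = blended_cost(10_000, 0.8, 2_000, 200)

print(f"All Sonnet: ${all_on_sonnet:.2f}, mixed: ${mixed:.2f}")
```

Under these assumptions the mixed setup costs roughly a quarter of the all-Sonnet setup. The real saving depends on how many requests are simple enough for the worker model to handle.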
Choosing by Use Case
| Agent Type | Recommended Model | Why |
|---|---|---|
| CRM data entry / updates | Sonnet 4.6 | Reliable tool use, fast, cost-effective |
| Document analysis & extraction | Sonnet 4.6 | Strong structured output, large context |
| Legal / compliance review | Opus 4.6 | Nuanced judgment, handles ambiguity |
| Financial analysis | Opus 4.6 | Complex reasoning, accuracy matters |
| Customer-facing chatbot | Sonnet 4.6 | Fast responses, good instruction following |
| Scheduled monitoring tasks | Haiku 4.5 | Runs frequently, cost matters, simple logic |
| Research agents | Opus 4.6 or GPT-5.4 | Deep reasoning + web search (GPT-5.4) |
| Multi-agent orchestrator | Sonnet 4.6 | Fast coordination, reliable tool dispatch |
| Subagent / worker | GPT-5.4 Mini or Haiku 4.5 | Cost-efficient for delegated subtasks |
| Data classification / triage | GPT-5.4 Nano | Ultra-cheap, high-volume |
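If you provision agents from your own scripts, the table above can be kept as a simple lookup. This is only a sketch: the agent-type keys and the `recommend` helper are hypothetical and not part of any Aster API, and the Sonnet 4.6 fallback reflects this guide's advice to start with Sonnet.

```python
# Recommendations transcribed from the use-case table above.
# The keys are hypothetical labels chosen for this example.
RECOMMENDED_MODEL = {
    "crm_updates": "Sonnet 4.6",
    "document_extraction": "Sonnet 4.6",
    "legal_review": "Opus 4.6",
    "financial_analysis": "Opus 4.6",
    "customer_chatbot": "Sonnet 4.6",
    "scheduled_monitoring": "Haiku 4.5",
    "research": "Opus 4.6 or GPT-5.4",
    "orchestrator": "Sonnet 4.6",
    "subagent": "GPT-5.4 Mini or Haiku 4.5",
    "triage": "GPT-5.4 Nano",
}

# Fallback for agent types the table does not cover.
DEFAULT_MODEL = "Sonnet 4.6"


def recommend(agent_type: str) -> str:
    """Return the recommended model for an agent type, defaulting to Sonnet 4.6."""
    return RECOMMENDED_MODEL.get(agent_type, DEFAULT_MODEL)


print(recommend("legal_review"))
print(recommend("something_else"))
```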
Cost Optimization Tips
- Start with Sonnet 4.6 — it handles 90%+ of agent tasks well. Only upgrade to Opus if you see quality issues on specific tasks.
- Use cheaper models for subagents — in multi-agent workflows, the orchestrator can run on Sonnet while worker agents run on Haiku or GPT-5.4 Mini.
- Prompt caching saves money — both Anthropic (90% off cached input) and OpenAI (50% off cached input) automatically cache repeated context. Agents with large system prompts or knowledge base content benefit significantly.
- Keep system prompts focused — longer prompts cost more on every message. A concise, well-structured prompt outperforms a verbose one and costs less.
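The effect of the caching discounts in the tips above can be estimated as a blended input price. The sketch below assumes a hypothetical agent where 80% of each request's input is repeated context and every repeated token is a cache hit. It ignores any cache-write surcharge or cache expiry, which this guide does not cover.

```python
def effective_input_price(list_price: float, cached_fraction: float,
                          cache_discount: float) -> float:
    """Blended USD price per 1M input tokens when part of the input is cached.

    list_price      -- list price per 1M input tokens
    cached_fraction -- share of input tokens served from cache (0.0 to 1.0)
    cache_discount  -- discount applied to cached tokens (0.90 means 90% off)
    """
    cached = cached_fraction * list_price * (1 - cache_discount)
    uncached = (1 - cached_fraction) * list_price
    return cached + uncached


# Hypothetical agent: 80% of input tokens are a repeated system prompt.
sonnet_input = effective_input_price(3.00, 0.8, 0.90)  # Anthropic: 90% off cached input
gpt_input = effective_input_price(2.50, 0.8, 0.50)     # OpenAI: 50% off cached input

print(f"Sonnet 4.6: ${sonnet_input:.2f} / 1M input tokens")
print(f"GPT-5.4:    ${gpt_input:.2f} / 1M input tokens")
```

Under these assumptions Sonnet 4.6 ends up cheaper per input token than GPT-5.4 despite its higher list price, so the cached share of your prompts is worth measuring before you compare providers on input cost.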
Provider Tools
Some providers include built-in tools that extend your agent’s capabilities beyond Aster’s standard tool library:
| Provider | Tool | Description |
|---|---|---|
| OpenAI | Web Search | Search the web for current information |
| OpenAI | Code Interpreter | Write and execute Python code in a sandbox |
| OpenAI | Image Generation | Generate images from text prompts |
| Google | Google Search | Ground responses with real-time search results |
| Google | Code Execution | Generate and run Python code |
| xAI | X Search | Search X (Twitter) for posts and discussions |
| xAI | Web Search | Search the web for current information |
These tools appear automatically when you select a model from the corresponding provider. They can be enabled alongside Aster’s standard tools.