Choosing a model
If you’re unsure which model to use, we recommend starting with Claude Sonnet 4.5. It offers the best balance of intelligence, speed, and cost for most use cases, with exceptional performance in coding and agentic tasks. All current Claude models support text and image input, text output, multilingual capabilities, and vision. Models are available via the Anthropic API, AWS Bedrock, and Google Vertex AI. Once you’ve picked a model, learn how to make your first API call.Latest models comparison
| Feature | Claude Sonnet 4.5 | Claude Haiku 4.5 | Claude Opus 4.1 |
|---|---|---|---|
| Description | Our smartest model for complex agents and coding | Our fastest model with near-frontier intelligence | Exceptional model for specialized reasoning tasks |
| Claude API ID | claude-sonnet-4-5-20250929Copied! | claude-haiku-4-5-20251001Copied! | claude-opus-4-1-20250805Copied! |
| Claude API alias1 | claude-sonnet-4-5Copied! | claude-haiku-4-5Copied! | claude-opus-4-1Copied! |
| AWS Bedrock ID | anthropic.claude-sonnet-4-5-20250929-v1:0Copied! | anthropic.claude-haiku-4-5-20251001-v1:0Copied! | anthropic.claude-opus-4-1-20250805-v1:0Copied! |
| GCP Vertex AI ID | claude-sonnet-4-5@20250929Copied! | claude-haiku-4-5@20251001Copied! | claude-opus-4-1@20250805Copied! |
| Pricing2 | $3 / input MTok $15 / output MTok | $1 / input MTok $5 / output MTok | $15 / input MTok $75 / output MTok |
| Extended thinking | Yes | Yes | Yes |
| Priority Tier | Yes | Yes | Yes |
| Comparative latency | Fast | Fastest | Moderate |
| Context window | / (beta)3 | ||
| Max output | 64K tokens | 64K tokens | 32K tokens |
| Reliable knowledge cutoff | Jan 20254 | Feb 2025 | Jan 20254 |
| Training data cutoff | Jul 2025 | Jul 2025 | Mar 2025 |
claude-sonnet-4-5-20250929) in production applications to ensure consistent behavior.
2 - See our pricing page for complete pricing information including batch API discounts, prompt caching rates, extended thinking costs, and vision processing fees.
3 - Claude Sonnet 4.5 supports a 1M token context window when using the context-1m-2025-08-07 beta header. Long context pricing applies to requests exceeding 200K tokens.
4 - Reliable knowledge cutoff indicates the date through which a model’s knowledge is most extensive and reliable. Training data cutoff is the broader date range of training data used. For example, Claude Sonnet 4.5 was trained on publicly available information through July 2025, but its knowledge is most extensive and reliable through January 2025. For more information, see Anthropic’s Transparency Hub.
Legacy models
Legacy models
| Feature | Claude Sonnet 4 | Claude Sonnet 3.7 | Claude Opus 4 | Claude Haiku 3.5 | Claude Haiku 3 |
|---|---|---|---|---|---|
| Claude API ID | claude-sonnet-4-20250514Copied! | claude-3-7-sonnet-20250219Copied! | claude-opus-4-20250514Copied! | claude-3-5-haiku-20241022Copied! | claude-3-haiku-20240307Copied! |
| Claude API alias | claude-sonnet-4-0Copied! | claude-3-7-sonnet-latestCopied! | claude-opus-4-0Copied! | claude-3-5-haiku-latestCopied! | — |
| AWS Bedrock ID | anthropic.claude-sonnet-4-20250514-v1:0Copied! | anthropic.claude-3-7-sonnet-20250219-v1:0Copied! | anthropic.claude-opus-4-20250514-v1:0Copied! | anthropic.claude-3-5-haiku-20241022-v1:0Copied! | anthropic.claude-3-haiku-20240307-v1:0Copied! |
| GCP Vertex AI ID | claude-sonnet-4@20250514Copied! | claude-3-7-sonnet@20250219Copied! | claude-opus-4@20250514Copied! | claude-3-5-haiku@20241022Copied! | claude-3-haiku@20240307Copied! |
| Pricing | $3 / input MTok $15 / output MTok | $3 / input MTok $15 / output MTok | $15 / input MTok $75 / output MTok | $0.80 / input MTok $4 / output MTok | $0.25 / input MTok $1.25 / output MTok |
| Extended thinking | Yes | Yes | Yes | No | No |
| Priority Tier | Yes | Yes | Yes | Yes | No |
| Comparative latency | Fast | Fast | Moderate | Fastest | Fast |
| Context window | / (beta)1 | ||||
| Max output | 64K tokens | 64K tokens / 128K tokens (beta)4 | 32K tokens | 8K tokens | 4K tokens |
| Reliable knowledge cutoff | Jan 20252 | Oct 20242 | Jan 20252 | 3 | 3 |
| Training data cutoff | Mar 2025 | Nov 2024 | Mar 2025 | Jul 2024 | Aug 2023 |
context-1m-2025-08-07 beta header. Long context pricing applies to requests exceeding 200K tokens.2 - Reliable knowledge cutoff indicates the date through which a model’s knowledge is most extensive and reliable. Training data cutoff is the broader date range of training data used.3 - Some Haiku models have a single training data cutoff date.4 - Include the beta header output-128k-2025-02-19 in your API request to increase the maximum output token length to 128K tokens for Claude Sonnet 3.7. We strongly suggest using our streaming Messages API to avoid timeouts when generating longer outputs. See our guidance on long requests for more details.Prompt and output performance
Claude 4 models excel in:- Performance: Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing. See the Claude 4 blog post for more information.
-
Engaging responses: Claude models are ideal for applications that require rich, human-like interactions.
- If you prefer more concise responses, you can adjust your prompts to guide the model toward the desired output length. Refer to our prompt engineering guides for details.
- For specific Claude 4 prompting best practices, see our Claude 4 best practices guide.
- Output quality: When migrating from previous model generations to Claude 4, you may notice larger improvements in overall performance.

Formed in 2009, the Archive Team (not to be confused with the archive.org Archive-It Team) is a rogue archivist collective dedicated to saving copies of rapidly dying or deleted websites for the sake of history and digital heritage. The group is 100% composed of volunteers and interested parties, and has expanded into a large amount of related projects for saving online and digital history.
