COLLECTED BY

Organization: Archive Team

Formed in 2009, the Archive Team (not to be confused with the archive.org Archive-It Team) is a rogue archivist collective dedicated to saving copies of rapidly dying or deleted websites for the sake of history and digital heritage. The group is 100% composed of volunteers and interested parties, and has expanded into a large amount of related projects for saving online and digital history.

History is littered with hundreds of conflicts over the future of a community, group, location or business that were "resolved" when one of the parties stepped ahead and destroyed what was there. With the original point of contention destroyed, the debates would fall to the wayside. Archive Team believes that by duplicated condemned data, the conversation and debate can continue, as well as the richness and insight gained by keeping the materials. Our projects have ranged in size from a single volunteer downloading the data to a small-but-critical site, to over 100 volunteers stepping forward to acquire terabytes of user-created data to save for future generations.

The main site for Archive Team is at archiveteam.org and contains up to the date information on various projects, manifestos, plans and walkthroughs.

This collection contains the output of many Archive Team projects, both ongoing and completed. Thanks to the generous providing of disk space by the Internet Archive, multi-terabyte datasets can be made available, as well as in use by the Wayback Machine, providing a path back to lost websites and work.

Our collection has grown to the point of having sub-collections for the type of data we acquire. If you are seeking to browse the contents of these collections, the Wayback Machine is the best first stop. Otherwise, you are free to dig into the stacks to see what you may find.

The Archive Team Panic Downloads are full pulldowns of currently extant websites, meant to serve as emergency backups for needed sites that are in danger of closing, or which will be missed dearly if suddenly lost due to hard drive crashes or server failures.

Collection: ArchiveBot: The Archive Team Crowdsourced Crawler

ArchiveBot is an IRC bot designed to automate the archival of smaller websites (e.g. up to a few hundred thousand URLs). You give it a URL to start at, and it grabs all content under that URL, records it in a WARC, and then uploads that WARC to ArchiveTeam servers for eventual injection into the Internet Archive (or other archive sites).

To use ArchiveBot, drop by #archivebot on EFNet. To interact with ArchiveBot, you issue commands by typing it into the channel. Note you will need channel operator permissions in order to issue archiving jobs. The dashboard shows the sites being downloaded currently.

There is a dashboard running for the archivebot process at http://www.archivebot.com.

ArchiveBot's source code can be found at https://github.com/ArchiveTeam/ArchiveBot.

TIMESTAMPS

The Wayback Machine - https://web.archive.org/web/20260609003635/https://platform.claude.com/docs/en/build-with-claude/effort

Cookie settings

We use cookies to deliver and improve our services, analyze site usage, and if you agree, to customize or personalize your experience and market our services to you. You can read our Cookie Policy here.

Effort

MessagesModel capabilities

Effort

Control how many tokens Claude uses when responding with the effort parameter, trading off between response thoroughness and token efficiency.

This feature is eligible for Zero Data Retention (ZDR). When your organization has a ZDR arrangement, data sent through this feature is not stored after the API response is returned.

The effort parameter allows you to control how eager Claude is about spending tokens when responding to requests. This gives you the ability to trade off between response thoroughness and token efficiency, all with a single model. The effort parameter is available on all supported models with no beta header required.

The effort parameter is supported by Claude Opus 4.8, Claude Mythos Preview, Claude Opus 4.7, Claude Opus 4.6, Claude Sonnet 4.6, and Claude Opus 4.5.

For Claude Opus 4.6 and Sonnet 4.6, effort replaces budget_tokens as the recommended way to control thinking depth. Combine effort with adaptive thinking (thinking: {type: "adaptive"}) for the best experience. While budget_tokens is still accepted on Opus 4.6 and Sonnet 4.6, it is deprecated and will be removed in a future model release. At high (default) and max effort, Claude will almost always think. At lower effort levels, it may skip thinking for simpler problems.

How effort works

By default, Claude uses high effort, spending as many tokens as needed for excellent results. You can raise the effort level to max for the absolute highest capability, or lower it to be more conservative with token usage, optimizing for speed and cost while accepting some reduction in capability.

Setting effort to "high" produces exactly the same behavior as omitting the effort parameter entirely.

The effort parameter affects all tokens in the response, including:

Text responses and explanations
Tool calls and function arguments
Extended thinking (when enabled)

This approach has two major advantages:

It doesn't require thinking to be enabled in order to use it.
It can affect all token spend including tool calls. For example, lower effort would mean Claude makes fewer tool calls. This gives a much greater degree of control over efficiency.

Effort levels

Level	Description	Typical use case
`max`	Absolute maximum capability with no constraints on token spending. Available on Claude Opus 4.8, Claude Mythos Preview, Claude Opus 4.7, Claude Opus 4.6, and Claude Sonnet 4.6.	Tasks requiring the deepest possible reasoning and most thorough analysis
`xhigh`	Extended capability for long-horizon work. Available on Claude Opus 4.8 and Claude Opus 4.7.	Long-running agentic and coding tasks (over 30 minutes) with token budgets in the millions
`high`	High capability. Equivalent to not setting the parameter.	Complex reasoning, difficult coding problems, agentic tasks
`medium`	Balanced approach with moderate token savings.	Agentic tasks that require a balance of speed, cost, and performance
`low`	Most efficient. Significant token savings with some capability reduction.	Simpler tasks that need the best speed and lowest costs, such as subagents

Effort is a behavioral signal, not a strict token budget. At lower effort levels, Claude will still think on sufficiently difficult problems, but it will think less than it would at higher effort levels for the same problem.

Recommended effort levels for Sonnet 4.6

Sonnet 4.6 defaults to high effort. Explicitly set effort when using Sonnet 4.6 to avoid unexpected latency:

Medium effort (recommended default): Best balance of speed, cost, and performance for most applications. Suitable for agentic coding, tool-heavy workflows, and code generation.
Low effort: For high-volume or latency-sensitive workloads. Suitable for chat and non-coding use cases where faster turnaround is prioritized.
High effort: For complex reasoning and tasks where quality matters more than speed or cost.
Max effort: For tasks requiring the absolute highest capability with no constraints on token spending.

Recommended effort levels for Claude Opus 4.7

Start with xhigh for coding and agentic use cases, and use high as the minimum for most intelligence-sensitive workloads. Step down to medium for cost-sensitive workloads, or up to max only when your evals show measurable headroom at xhigh.

The API default is high. To use xhigh, set effort explicitly; the value you pass overrides the default.

Effort	Guidance for Claude Opus 4.7
`low`	Efficient, but best for short, scoped tasks. Pair `low` with explicit checklists if your task has multiple sections.
`medium`	The drop-in for the average workflow where you want good results while reducing costs.
`high`	Advanced use cases that still need a balance of intelligence and token consumption. This is often the sweet spot balancing quality and token efficiency.
`xhigh`	The recommended starting point for coding and agentic work, and for exploratory tasks such as repeated tool calling, detailed web search, and knowledge-base search. Expect meaningfully higher token usage than `high`.
`max`	Reserve for genuinely frontier problems. On most workloads `max` adds significant cost for relatively small quality gains, and on some structured-output or less intelligence-sensitive tasks it can lead to overthinking.

Claude Opus 4.7 also respects effort levels more strictly than Claude Opus 4.6, especially at low and medium. At lower effort levels, the model scopes its work to what was asked rather than going above and beyond. If you observe shallow reasoning on complex problems with Claude Opus 4.7, raise effort rather than prompting around it. If you must keep effort low for latency, add targeted guidance like "This task involves multi-step reasoning. Think carefully before responding."

When running Claude Opus 4.7 at xhigh or max effort, set a large max_tokens so the model has room to think and act across subagents and tool calls. Starting at 64k tokens and tuning from there is a reasonable default.

Recommended effort levels for Claude Opus 4.8

The guidance for Claude Opus 4.7 above also applies to Claude Opus 4.8. Start with xhigh for coding and agentic use cases, use high for most other intelligence-sensitive workloads, and step down to medium or low only when you've measured that the lower level holds quality on your evals.

The default is high on all surfaces, including the Claude API and Claude Code. Set effort explicitly to use a different level; the value you pass overrides the default.

When running Claude Opus 4.8 at xhigh or max effort, set a large max_tokens so the model has room to think and act across subagents and tool calls. Starting at 64k tokens and tuning from there is a reasonable default.

Basic usage

client = anthropic.Anthropic()

response = client.messages.create(
    model="claude-opus-4-8",
    max_tokens=4096,
    messages=[
        {
            "role": "user",
            "content": "Analyze the trade-offs between microservices and monolithic architectures",
        }
    ],
    output_config={"effort": "medium"},
)

print(response.content[0].text)

When to adjust the effort parameter

Use max effort when you need the absolute highest capability with no constraints: the most thorough reasoning and deepest analysis. Available on Claude Opus 4.8, Claude Mythos Preview, Claude Opus 4.7, Claude Opus 4.6, and Claude Sonnet 4.6.
Use xhigh effort for advanced coding and complex agentic work requiring extended exploration, such as repeated tool calling and detailed search. Available on Claude Opus 4.8 and Claude Opus 4.7.
Use high effort (the default) for complex reasoning, nuanced analysis, difficult coding problems, or any task where quality matters more than speed or cost.
Use medium effort as a balanced option when you want solid performance without the full token expenditure of high effort.
Use low effort when you're optimizing for speed (because Claude answers with fewer tokens) or cost. For example, simple classification tasks, quick lookups, or high-volume use cases where marginal quality improvements don't justify additional latency or spend.

Claude Code's ultracode mode: ultracode appears in Claude Code's effort menu, but it is not an additional API effort level. The values documented on this page are the complete set the API accepts. Ultracode pairs the xhigh effort level with standing permission for Claude Code to launch multi-agent workflows, granted through Mid-conversation system messages. To build similar behavior with the API, see Build an orchestration mode.

Effort with tool use

When using tools, the effort parameter affects both the explanations around tool calls and the tool calls themselves. Lower effort levels tend to:

Combine multiple operations into fewer tool calls
Make fewer tool calls
Proceed directly to action without preamble
Use terse confirmation messages after completion

Higher effort levels may:

Make more tool calls
Explain the plan before taking action
Provide detailed summaries of changes
Include more comprehensive code comments

Effort with extended thinking

The effort parameter works alongside extended thinking. Its behavior depends on the model:

Claude Opus 4.8 uses adaptive thinking (thinking: {type: "adaptive"}), where effort is the recommended control for thinking depth. Manual extended thinking (thinking: {type: "enabled", budget_tokens: N}) is not supported and returns a 400 error. The model decides when and how much to think based on each request, so it triggers thinking only as needed. At high, xhigh, and max effort, Claude almost always thinks deeply. At lower levels, it may skip thinking for simpler problems. Set thinking: {type: "adaptive"} to enable thinking; without it, requests run without thinking.
Claude Mythos Preview uses adaptive thinking by default (no thinking configuration required). thinking: {type: "disabled"} is rejected. Effort controls thinking depth the same way as on Opus 4.7 and Opus 4.6.
Claude Opus 4.7 uses adaptive thinking (thinking: {type: "adaptive"}), where effort is the recommended control for thinking depth. Manual extended thinking (thinking: {type: "enabled", budget_tokens: N}) is no longer supported on Opus 4.7; use adaptive thinking with effort instead. At high, xhigh, and max effort, Claude almost always thinks deeply. At lower levels, it may skip thinking for simpler problems.
Claude Opus 4.6 uses adaptive thinking (thinking: {type: "adaptive"}), where effort is the recommended control for thinking depth. While budget_tokens is still accepted on Opus 4.6, it is deprecated and will be removed in a future release. At high and max effort, Claude almost always thinks deeply. At lower levels, it may skip thinking for simpler problems.
Claude Sonnet 4.6 uses adaptive thinking (where effort controls thinking depth). Manual thinking with interleaved mode (thinking: {type: "enabled", budget_tokens: N}) is still functional but deprecated.
Claude Opus 4.5 uses manual thinking (thinking: {type: "enabled", budget_tokens: N}), where effort works alongside the thinking token budget. Set the effort level for your task, then set the thinking token budget based on task complexity.

The effort parameter can be used with or without extended thinking enabled. When used without thinking, it still controls overall token spend for text responses and tool calls.

Best practices

Set effort explicitly: The API defaults to high, but the right starting point depends on your model and workload.
Use low for speed-sensitive or simple tasks: When latency matters or tasks are straightforward, low effort can significantly reduce response times and costs.
Test your use case: The impact of effort levels varies by task type. Evaluate performance on your specific use cases before deploying.
Consider dynamic effort: Adjust effort based on task complexity. Simple queries may warrant low effort while agentic coding and complex reasoning benefit from high effort.

Was this page helpful?

MessagesModel capabilities

Effort

Control how many tokens Claude uses when responding with the effort parameter, trading off between response thoroughness and token efficiency.

This feature is eligible for Zero Data Retention (ZDR). When your organization has a ZDR arrangement, data sent through this feature is not stored after the API response is returned.

The effort parameter is supported by Claude Opus 4.8, Claude Mythos Preview, Claude Opus 4.7, Claude Opus 4.6, Claude Sonnet 4.6, and Claude Opus 4.5.

How effort works

Setting effort to "high" produces exactly the same behavior as omitting the effort parameter entirely.

The effort parameter affects all tokens in the response, including:

Text responses and explanations
Tool calls and function arguments
Extended thinking (when enabled)

This approach has two major advantages:

It doesn't require thinking to be enabled in order to use it.
It can affect all token spend including tool calls. For example, lower effort would mean Claude makes fewer tool calls. This gives a much greater degree of control over efficiency.

Effort levels

Level	Description	Typical use case
`max`	Absolute maximum capability with no constraints on token spending. Available on Claude Opus 4.8, Claude Mythos Preview, Claude Opus 4.7, Claude Opus 4.6, and Claude Sonnet 4.6.	Tasks requiring the deepest possible reasoning and most thorough analysis
`xhigh`	Extended capability for long-horizon work. Available on Claude Opus 4.8 and Claude Opus 4.7.	Long-running agentic and coding tasks (over 30 minutes) with token budgets in the millions
`high`	High capability. Equivalent to not setting the parameter.	Complex reasoning, difficult coding problems, agentic tasks
`medium`	Balanced approach with moderate token savings.	Agentic tasks that require a balance of speed, cost, and performance
`low`	Most efficient. Significant token savings with some capability reduction.	Simpler tasks that need the best speed and lowest costs, such as subagents

Recommended effort levels for Sonnet 4.6

Sonnet 4.6 defaults to high effort. Explicitly set effort when using Sonnet 4.6 to avoid unexpected latency:

Medium effort (recommended default): Best balance of speed, cost, and performance for most applications. Suitable for agentic coding, tool-heavy workflows, and code generation.
Low effort: For high-volume or latency-sensitive workloads. Suitable for chat and non-coding use cases where faster turnaround is prioritized.
High effort: For complex reasoning and tasks where quality matters more than speed or cost.
Max effort: For tasks requiring the absolute highest capability with no constraints on token spending.

Recommended effort levels for Claude Opus 4.7

The API default is high. To use xhigh, set effort explicitly; the value you pass overrides the default.

Effort	Guidance for Claude Opus 4.7
`low`	Efficient, but best for short, scoped tasks. Pair `low` with explicit checklists if your task has multiple sections.
`medium`	The drop-in for the average workflow where you want good results while reducing costs.
`high`	Advanced use cases that still need a balance of intelligence and token consumption. This is often the sweet spot balancing quality and token efficiency.
`xhigh`	The recommended starting point for coding and agentic work, and for exploratory tasks such as repeated tool calling, detailed web search, and knowledge-base search. Expect meaningfully higher token usage than `high`.
`max`	Reserve for genuinely frontier problems. On most workloads `max` adds significant cost for relatively small quality gains, and on some structured-output or less intelligence-sensitive tasks it can lead to overthinking.

Recommended effort levels for Claude Opus 4.8

The default is high on all surfaces, including the Claude API and Claude Code. Set effort explicitly to use a different level; the value you pass overrides the default.

Basic usage

client = anthropic.Anthropic()

response = client.messages.create(
    model="claude-opus-4-8",
    max_tokens=4096,
    messages=[
        {
            "role": "user",
            "content": "Analyze the trade-offs between microservices and monolithic architectures",
        }
    ],
    output_config={"effort": "medium"},
)

print(response.content[0].text)

When to adjust the effort parameter

Use max effort when you need the absolute highest capability with no constraints: the most thorough reasoning and deepest analysis. Available on Claude Opus 4.8, Claude Mythos Preview, Claude Opus 4.7, Claude Opus 4.6, and Claude Sonnet 4.6.
Use xhigh effort for advanced coding and complex agentic work requiring extended exploration, such as repeated tool calling and detailed search. Available on Claude Opus 4.8 and Claude Opus 4.7.
Use high effort (the default) for complex reasoning, nuanced analysis, difficult coding problems, or any task where quality matters more than speed or cost.
Use medium effort as a balanced option when you want solid performance without the full token expenditure of high effort.
Use low effort when you're optimizing for speed (because Claude answers with fewer tokens) or cost. For example, simple classification tasks, quick lookups, or high-volume use cases where marginal quality improvements don't justify additional latency or spend.

Effort with tool use

When using tools, the effort parameter affects both the explanations around tool calls and the tool calls themselves. Lower effort levels tend to:

Combine multiple operations into fewer tool calls
Make fewer tool calls
Proceed directly to action without preamble
Use terse confirmation messages after completion

Higher effort levels may:

Make more tool calls
Explain the plan before taking action
Provide detailed summaries of changes
Include more comprehensive code comments

Effort with extended thinking

The effort parameter works alongside extended thinking. Its behavior depends on the model:

Claude Opus 4.8 uses adaptive thinking (thinking: {type: "adaptive"}), where effort is the recommended control for thinking depth. Manual extended thinking (thinking: {type: "enabled", budget_tokens: N}) is not supported and returns a 400 error. The model decides when and how much to think based on each request, so it triggers thinking only as needed. At high, xhigh, and max effort, Claude almost always thinks deeply. At lower levels, it may skip thinking for simpler problems. Set thinking: {type: "adaptive"} to enable thinking; without it, requests run without thinking.
Claude Mythos Preview uses adaptive thinking by default (no thinking configuration required). thinking: {type: "disabled"} is rejected. Effort controls thinking depth the same way as on Opus 4.7 and Opus 4.6.
Claude Opus 4.7 uses adaptive thinking (thinking: {type: "adaptive"}), where effort is the recommended control for thinking depth. Manual extended thinking (thinking: {type: "enabled", budget_tokens: N}) is no longer supported on Opus 4.7; use adaptive thinking with effort instead. At high, xhigh, and max effort, Claude almost always thinks deeply. At lower levels, it may skip thinking for simpler problems.
Claude Opus 4.6 uses adaptive thinking (thinking: {type: "adaptive"}), where effort is the recommended control for thinking depth. While budget_tokens is still accepted on Opus 4.6, it is deprecated and will be removed in a future release. At high and max effort, Claude almost always thinks deeply. At lower levels, it may skip thinking for simpler problems.
Claude Sonnet 4.6 uses adaptive thinking (where effort controls thinking depth). Manual thinking with interleaved mode (thinking: {type: "enabled", budget_tokens: N}) is still functional but deprecated.
Claude Opus 4.5 uses manual thinking (thinking: {type: "enabled", budget_tokens: N}), where effort works alongside the thinking token budget. Set the effort level for your task, then set the thinking token budget based on task complexity.

The effort parameter can be used with or without extended thinking enabled. When used without thinking, it still controls overall token spend for text responses and tool calls.

Best practices

Set effort explicitly: The API defaults to high, but the right starting point depends on your model and workload.
Use low for speed-sensitive or simple tasks: When latency matters or tasks are straightforward, low effort can significantly reduce response times and costs.
Test your use case: The impact of effort levels varies by task type. Evaluate performance on your specific use cases before deploying.
Consider dynamic effort: Adjust effort based on task complexity. Simple queries may warrant low effort while agentic coding and complex reasoning benefit from high effort.

Was this page helpful?

May	JUN	Jul
	09
2025	2026	2027