[Update] Claude Opus 4.8 is now available on Amazon Bedrock
This page has been translated by machine translation. View original
This is Ishikawa from the Cloud Business Division. On May 28, 2026, Anthropic's latest model Claude Opus 4.8 became available on AWS. It can be accessed from both Amazon Bedrock and Claude Platform on AWS.
This is an update approximately one and a half months after the previous version Claude Opus 4.7 (available since April 16, 2026), with enhancements focused on agent use cases, coding, and long-running autonomous tasks. In addition to the What's New content, this article summarizes the improvements from Opus 4.7.
What is Claude Opus 4.8
Claude Opus 4.8 is the latest generation large language model in the Opus series provided by Anthropic. Anthropic positions it as "the highest-performing generally available (GA) model at this time," supporting complex reasoning, long-running autonomous agent processing (agentic coding), and tasks requiring high autonomy.
Key use cases include coding, agent tasks, knowledge work (document creation and analysis), financial analysis, cybersecurity, and computer use.
Update Details
With this update, Claude Opus 4.8 is now available via Amazon Bedrock.
The main features are as follows.
- Supports long-running autonomous execution, handling multi-stage projects and complex dependencies with minimal supervision
- Reads codebases like an expert and plans before editing. Maintains context across long sessions
- Improved ability to avoid obstacles and self-recover from errors
- Supports cross-document synthesis of long documents, self-verification of outputs, and generation of structured deliverables
Model Basic Specifications
The information available in the Amazon Bedrock model catalog (Tokyo region) is as follows.
| Item | Details |
|---|---|
| Model ID | anthropic.claude-opus-4-8 |
| Provider | Anthropic |
| Deployment Type | Serverless |
| Inference Type | Cross-region inference |
| Maximum Input | 1M (1 million) tokens |
| Maximum Output | 128K tokens |
| Input Modalities | Text, images |
| Output Modalities | Text |
| Supported Languages | English, Japanese, Chinese, Spanish, German, and many more |
| Release Date | May 28, 2026 |
Supported Regions
At the time of writing, Claude Opus 4.8 was confirmed in the Amazon Bedrock model catalog in the Tokyo region (ap-northeast-1). Cross-region inference is supported.
For the exact list of supported regions and cross-region inference profiles, please refer to the official Amazon Bedrock documentation. At the time of writing, individual model cards and region lists on the AWS side may still be in the process of being updated, so please check the latest information before production use.

Pricing
In Amazon Bedrock, charges are based on the amount of input and output tokens for model inference and customization, as well as whether Provisioned Throughput is purchased.
At this time, the AWS Pricing page has not been updated, but according to the standard pricing on the official Anthropic site (Claude API), Claude Opus 4.8 is priced at $5 USD per 1 million tokens (MTok) for input and $25 USD per MTok for output. This is the same price as Claude Opus 4.7. Please check the AWS pricing page for exact pricing via Amazon Bedrock. Note that Claude Platform on AWS charges in CCU (Claude Consumption Unit) units.
Evolution of Claude Opus 4.8
This is the main focus of this article. Here is a summary of what has changed from Claude Opus 4.7 to Claude Opus 4.8.
Benchmark Comparison
The benchmark comparison published by Anthropic is as follows. It compares against Claude Opus 4.7, GPT-5.5, and Gemini 3.1 Pro.
| Benchmark | Opus 4.8 | Opus 4.7 | GPT-5.5 | Gemini 3.1 Pro |
|---|---|---|---|---|
| Agentic coding (SWE-Bench Pro) | 69.2% | 64.3% | 58.6% | 54.2% |
| Agentic terminal coding (Terminal-Bench 2.1) | 74.6% | 66.1% | 78.2% | 70.3% |
| Multidisciplinary reasoning (Humanity's Last Exam, no tools) | 49.8% | 46.9% | 41.4% | 44.4% |
| Multidisciplinary reasoning (Humanity's Last Exam, with tools) | 57.9% | 54.7% | 52.2% | 51.4% |
| Agentic computer use (OSWorld-Verified) | 83.4% | 82.8% | 78.7% | 76.2% |
| Knowledge work (GDPval-AA) | 1890 | 1753 | 1769 | 1314 |
| Agentic financial analysis (Finance Agent v2) | 53.9% | 51.5% | 51.8% | 43.0% |
※Source: Benchmarks from official Anthropic announcements. Values are based on Anthropic internal evaluations. GDPval-AA is a score value; all others are accuracy rates (%).
On SWE-Bench Pro, a coding benchmark close to real-world tasks, performance improved from 64.3% with Claude Opus 4.7 to 69.2% with Claude Opus 4.8. While Claude Opus 4.8 outperforms Claude Opus 4.7 in most categories, GPT-5.5 achieved the highest result on Terminal-Bench 2.1 at 78.2%, so it is worth noting that strengths and weaknesses vary depending on the use case.
Key New Features and Improvements
According to the official Anthropic documentation, the following features have been added or changed from Claude Opus 4.7 to Claude Opus 4.8.
- Mid-conversation system messages: It is now possible to insert
role: "system"messages immediately after a user turn. This allows instructions to be added during long conversations without resending the entire system prompt, while maintaining prompt cache hits from previous turns. This contributes to reducing input costs in agent loops. - Default for the effort parameter changed to
high: The default for the effort parameter is nowhighacross all surfaces including the Claude API and Claude Code (no change if explicitly set). - Fast mode (research preview): Specifying
speed: "fast"allows you to get up to 2.5x more output tokens per second from the same model (premium pricing, Claude API only). - Lower minimum token count for prompt caching: The minimum cacheable prompt length has been reduced to 1,024 tokens. Short prompts that could not be cached with Claude Opus 4.7 can now create cache entries without any code changes.
- Official documentation of Refusal stop details: The
stop_detailsobject when a request is refused is now officially documented, making it easier to handle based on the type of refusal.
In terms of capabilities, the following improvements have been made compared to Claude Opus 4.7.
- Long-horizon agentic coding: Handling of long contexts has improved, with fewer occurrences of compaction (context compression) and better recovery quality after compression.
- Reasoning effort calibration: Behavior at each effort level is more stable.
- Improved tool triggering: Fewer cases of skipping necessary tool calls for a task (this was a behavior partially reported in Claude Opus 4.7).
- More efficient adaptive thinking: Determines whether thinking is needed on a per-turn basis, reducing unnecessary thinking tokens even at the same effort level.
Notes on Migration
According to Anthropic's migration guide, there are no breaking API changes when migrating from Claude Opus 4.7 to Claude Opus 4.8, and it is stated that basically just replacing the model ID will work. However, the following constraints carried over from Claude Opus 4.7 apply.
- Setting
temperature/top_p/top_kto non-default values will result in a 400 error (control behavior via prompts). - Only adaptive thinking is supported;
thinking: {"type": "enabled", "budget_tokens": N}will result in a 400 error.
How to Use
There are two ways to access Claude Opus 4.8.
When using Amazon Bedrock, enable model access from Amazon Bedrock in the AWS Management Console. You can use it in combination with AWS managed features such as Guardrails and Knowledge Bases while keeping data within the AWS infrastructure.
Claude Platform on AWS is a platform operated by Anthropic, accessible from the AWS Management Console, and supports consolidated billing via AWS Marketplace (billed in CCU units).
Usage Notes
- The benchmark figures in this article are based on official Anthropic announcements (internal evaluations). We recommend verifying performance with your own workloads in your own environment.
- The pricing listed is the standard pricing for the Anthropic Claude API (first-party). Please check the AWS pricing page for pricing via Amazon Bedrock, and the CCU-based billing structure for Claude Platform on AWS.
- At the time of writing, individual model cards and region lists on the AWS side may still be in the process of being updated. Please check the official Amazon Bedrock documentation for the latest information before production use.
Conclusion
Claude Opus 4.8 is a practically oriented update built on Claude Opus 4.7, with improvements to long-horizon agentic coding, tool triggering, and compaction handling. For those using Amazon Bedrock or Claude Platform on AWS, you can try it out simply by replacing the model ID with anthropic.claude-opus-4-8. Why not start by experiencing the differences from Claude Opus 4.7 in a test environment?

