[Update] Claude Opus 4.8 is now available on Amazon Bedrock

Anthropic's latest model Claude Opus 4.8 is now available on AWS. We introduce the features of this latest version, which has been enhanced for agent use cases, coding, and long-running autonomous tasks, as well as the improvements from the previous version.

石川覚

2026.05.29

This page has been translated by machine translation. View original

This is Ishikawa from the Cloud Business Division. On May 28, 2026, Anthropic's latest model Claude Opus 4.8 became available on AWS. It can be accessed from both Amazon Bedrock and Claude Platform on AWS.
https://aws.amazon.com/jp/about-aws/whats-new/2026/05/claude-opus-4.8-aws/
https://aws.amazon.com/blogs/machine-learning/claude-opus-4-8-is-now-available-on-aws/
This update comes approximately one and a half months after the previous version Claude Opus 4.7 (available since April 16, 2026), with enhancements focused on agentic use cases, coding, and long-duration autonomous tasks. In this article, in addition to the What's New content, we summarize the improvements from Opus 4.7.
 What is Claude Opus 4.8Claude Opus 4.8 is the latest generation large language model in the Opus series provided by Anthropic. Anthropic positions it as "the highest-performing generally available (GA) model at this time," handling complex reasoning, long-duration autonomous agentic processing (agentic coding), and tasks requiring high autonomy.
Key application areas include coding, agentic tasks, knowledge work (document creation and analysis), financial analysis, cybersecurity, and computer use.
 Update DetailsWith this update, Claude Opus 4.8 is now available via Amazon Bedrock.
The main features are as follows.
Supports long-duration autonomous execution, handling multi-stage projects and complex dependencies with minimal supervision
Reads codebases like an expert and formulates a plan before editing. Retains context even in long sessions
Improved ability to avoid failures and self-recover from errors
Supports cross-document synthesis of long documents, self-verification of output, and generation of structured deliverables
 Model Basic SpecificationsThe information available in the Amazon Bedrock model catalog (Tokyo region) is as follows.


Item
Details


Model ID
anthropic.claude-opus-4-8

Provider
Anthropic

Deployment Type
Serverless

Inference Type
Cross-region inference

Maximum Input
1M (1 million) tokens

Maximum Output
128K tokens

Input Modalities
Text, images

Output Modalities
Text

Supported Languages
English, Japanese, Chinese, Spanish, German, and many others

Release Date
May 28, 2026

 Supported RegionsAt the time of writing this article, Claude Opus 4.8 was confirmed in the Amazon Bedrock model catalog in the Tokyo region (ap-northeast-1). In addition to cross-region inference, Japan domestic inference profiles are also supported.
For an accurate list of supported regions and inference profiles, please refer to the official Amazon Bedrock documentation. At the time of writing, individual model cards and region lists on the AWS side may still be in the process of being updated, so please check the latest information before production use.
 PricingIn Amazon Bedrock, charges are based on the amount of input and output tokens for model inference and customization, as well as whether Provisioned Throughput is purchased.
https://aws.amazon.com/bedrock/pricing/
At this time, the AWS Pricing page has not been updated, but according to Anthropic's official (Claude API) standard pricing, Claude Opus 4.8 is priced at 5 USD per 1 million tokens (MTok) for input and 25 USD per MTok for output. This is the same price as Claude Opus 4.7. Please check the AWS pricing page for the exact pricing via Amazon Bedrock. Note that Claude Platform on AWS charges in CCU (Claude Consumption Unit) units.
 Evolution of Claude Opus 4.8This is the main subject of this article. We summarize what has changed from Claude Opus 4.7 to Claude Opus 4.8.
 Benchmark ComparisonThe benchmark comparison published by Anthropic officially is as follows. It compares against Claude Opus 4.7, GPT-5.5, and Gemini 3.1 Pro.


Benchmark
Opus 4.8
Opus 4.7
GPT-5.5
Gemini 3.1 Pro


Agentic coding (SWE-Bench Pro)
69.2%
64.3%
58.6%
54.2%

Agentic terminal coding (Terminal-Bench 2.1)
74.6%
66.1%
78.2%
70.3%

Multidisciplinary reasoning (Humanity's Last Exam, no tools)
49.8%
46.9%
41.4%
44.4%

Multidisciplinary reasoning (Humanity's Last Exam, with tools)
57.9%
54.7%
52.2%
51.4%

Agentic computer use (OSWorld-Verified)
83.4%
82.8%
78.7%
76.2%

Knowledge work (GDPval-AA)
1890
1753
1769
1314

Agentic financial analysis (Finance Agent v2)
53.9%
51.5%
51.8%
43.0%

※Source: Benchmarks from official Anthropic announcements. Values are based on Anthropic internal evaluations. GDPval-AA is a score value; all others are accuracy rates (%).
On SWE-Bench Pro, a coding benchmark close to real-world scenarios, the score improved from Claude Opus 4.7's 64.3% to Claude Opus 4.8's 69.2%. While Claude Opus 4.8 outperforms Claude Opus 4.7 in many categories, GPT-5.5 achieved the highest result on Terminal-Bench 2.1 at 78.2%, so it is worth noting that strengths and weaknesses vary depending on the use case.
https://x.com/claudeai/status/2060042702150930686
 Main New Features and Improvementshttps://platform.claude.com/docs/en/about-claude/models/whats-new-claude-4-8
According to the official Anthropic documentation, the following features have been added or changed from Claude Opus 4.7 to Claude Opus 4.8.
Mid-conversation system messages: It is now possible to insert a role: "system" message immediately after a user turn. This allows adding instructions during long conversations without resending the entire system prompt, while maintaining prompt cache hits for earlier turns. This contributes to reducing input costs in agent loops
The default for the effort parameter is now high: Across all surfaces including the Claude API and Claude Code, the default for the effort parameter is now high (no change if explicitly set)
Fast mode (research preview): Specifying speed: "fast" allows you to get up to 2.5x more output tokens per second from the same model (premium pricing, Claude API only)
Lower minimum token count for prompt caching: The minimum cacheable prompt length has been reduced to 1,024 tokens. Short prompts that could not be cached in Claude Opus 4.7 can now create cache entries without any code changes
Official documentation of refusal stop details: The stop_details object when a request is refused has been officially documented, making it easier to handle according to the type of refusal
In terms of capabilities, the following improvements have been made compared to Claude Opus 4.7.
Long-horizon agentic coding: Handling of long-context has improved, compaction (context compression) occurs less frequently, and the quality of recovery after compression has improved
Calibration of reasoning effort: Behavior at each effort level is more stable
Improved tool triggering: Cases where necessary tool calls for a task are skipped have decreased (this was a behavior partially reported in Claude Opus 4.7)
Efficiency of adaptive thinking: The need for thinking is determined per turn, reducing unnecessary thinking tokens even at the same effort level
 Notes on MigrationAccording to Anthropic's migration guide, there are no breaking API changes when migrating from Claude Opus 4.7 to Claude Opus 4.8, and in principle it should work simply by replacing the model ID. However, the following constraints carried over from Claude Opus 4.7 apply.
Setting temperature / top_p / top_k to non-default values will result in a 400 error (control behavior via prompts)
Only adaptive thinking is supported; thinking: {"type": "enabled", "budget_tokens": N} will result in a 400 error
 How to UseThere are two ways to access Claude Opus 4.8.
When using Amazon Bedrock, enable model access from Amazon Bedrock in the AWS Management Console. You can use it in combination with AWS managed features such as Guardrails and Knowledge Bases while keeping data within the AWS infrastructure.
Claude Platform on AWS is a platform operated by Anthropic, accessible from the AWS Management Console, and supports consolidated billing via AWS Marketplace (billed in CCU units).
 Important NotesThe benchmark figures in this article are based on official Anthropic announcements (internal evaluations). We recommend verifying performance on actual workloads in your own environment
The pricing stated is the standard pricing for the Anthropic Claude API (first-party). Please check the AWS pricing page for pricing via Amazon Bedrock, and the CCU-based billing structure for Claude Platform on AWS
At the time of writing, individual model cards and region lists on the AWS side may still be in the process of being updated. Please check the official Amazon Bedrock documentation for the latest information before production use
 In ClosingClaude Opus 4.8 is a practically oriented update built on top of Claude Opus 4.7, with improvements to long-horizon agentic coding, tool triggering, and handling of compaction. If you are using Amazon Bedrock or Claude Platform on AWS, you can try it simply by replacing the model ID with anthropic.claude-opus-4-8. Why not start by experiencing the differences from Claude Opus 4.7 in a test environment?

[Update] Claude Opus 4.8 is now available on Amazon Bedrock

What is Claude Opus 4.8

Update Details

Model Basic Specifications

Supported Regions

Pricing

Evolution of Claude Opus 4.8

Benchmark Comparison

Main New Features and Improvements

Notes on Migration

How to Use

Important Notes

In Closing

Claudeならクラスメソッドにお任せください

AWS Topics

Trending Topics

Products & Services

Features and Series

Item	Details
Model ID	`anthropic.claude-opus-4-8`
Provider	Anthropic
Deployment Type	Serverless
Inference Type	Cross-region inference
Maximum Input	1M (1 million) tokens
Maximum Output	128K tokens
Input Modalities	Text, images
Output Modalities	Text
Supported Languages	English, Japanese, Chinese, Spanish, German, and many others
Release Date	May 28, 2026

Benchmark	Opus 4.8	Opus 4.7	GPT-5.5	Gemini 3.1 Pro
Agentic coding (SWE-Bench Pro)	69.2%	64.3%	58.6%	54.2%
Agentic terminal coding (Terminal-Bench 2.1)	74.6%	66.1%	78.2%	70.3%
Multidisciplinary reasoning (Humanity's Last Exam, no tools)	49.8%	46.9%	41.4%	44.4%
Multidisciplinary reasoning (Humanity's Last Exam, with tools)	57.9%	54.7%	52.2%	51.4%
Agentic computer use (OSWorld-Verified)	83.4%	82.8%	78.7%	76.2%
Knowledge work (GDPval-AA)	1890	1753	1769	1314
Agentic financial analysis (Finance Agent v2)	53.9%	51.5%	51.8%	43.0%