[Update] Claude Opus 4.8 is now available on Amazon Bedrock

[Update] Claude Opus 4.8 is now available on Amazon Bedrock

Anthropic's latest model Claude Opus 4.8 is now available on AWS. Here we introduce the features of this latest version, enhanced for agentic use cases, coding, and long-running autonomous tasks, as well as the improvements from the previous version.
2026.05.29

This page has been translated by machine translation. View original

This is Ishikawa from the Cloud Business Division. On May 28, 2026, Anthropic's latest model Claude Opus 4.8 became available on AWS. It can be accessed from both Amazon Bedrock and Claude Platform on AWS.

https://aws.amazon.com/jp/about-aws/whats-new/2026/05/claude-opus-4.8-aws/

https://aws.amazon.com/blogs/machine-learning/claude-opus-4-8-is-now-available-on-aws/

This is an update approximately one and a half months after the previous version Claude Opus 4.7 (available since April 16, 2026), with enhancements focused on agent use cases, coding, and long-running autonomous tasks. In addition to the What's New content, this article summarizes the improvements from Opus 4.7.

What is Claude Opus 4.8

Claude Opus 4.8 is the latest generation large language model in the Opus series provided by Anthropic. Anthropic positions it as "the highest-performing generally available (GA) model at this time," supporting complex reasoning, long-running autonomous agent processing (agentic coding), and tasks requiring high autonomy.

Key use cases include coding, agent tasks, knowledge work (document creation and analysis), financial analysis, cybersecurity, and computer use.

Update Details

With this update, Claude Opus 4.8 is now available via Amazon Bedrock.

The main features are as follows.

  • Supports long-running autonomous execution, handling multi-stage projects and complex dependencies with minimal supervision
  • Reads codebases like an expert and plans before editing. Maintains context across long sessions
  • Improved ability to avoid obstacles and self-recover from errors
  • Supports cross-document synthesis of long documents, self-verification of outputs, and generation of structured deliverables

Model Basic Specifications

The information available in the Amazon Bedrock model catalog (Tokyo region) is as follows.

Item Details
Model ID anthropic.claude-opus-4-8
Provider Anthropic
Deployment Type Serverless
Inference Type Cross-region inference
Maximum Input 1M (1 million) tokens
Maximum Output 128K tokens
Input Modalities Text, images
Output Modalities Text
Supported Languages English, Japanese, Chinese, Spanish, German, and many more
Release Date May 28, 2026

Supported Regions

At the time of writing, Claude Opus 4.8 was confirmed in the Amazon Bedrock model catalog in the Tokyo region (ap-northeast-1). Cross-region inference is supported.

For the exact list of supported regions and cross-region inference profiles, please refer to the official Amazon Bedrock documentation. At the time of writing, individual model cards and region lists on the AWS side may still be in the process of being updated, so please check the latest information before production use.

スクリーンショット 2026-05-29 2.50.51

Pricing

In Amazon Bedrock, charges are based on the amount of input and output tokens for model inference and customization, as well as whether Provisioned Throughput is purchased.

https://aws.amazon.com/bedrock/pricing/

At this time, the AWS Pricing page has not been updated, but according to the standard pricing on the official Anthropic site (Claude API), Claude Opus 4.8 is priced at $5 USD per 1 million tokens (MTok) for input and $25 USD per MTok for output. This is the same price as Claude Opus 4.7. Please check the AWS pricing page for exact pricing via Amazon Bedrock. Note that Claude Platform on AWS charges in CCU (Claude Consumption Unit) units.

Evolution of Claude Opus 4.8

This is the main focus of this article. Here is a summary of what has changed from Claude Opus 4.7 to Claude Opus 4.8.

Benchmark Comparison

The benchmark comparison published by Anthropic is as follows. It compares against Claude Opus 4.7, GPT-5.5, and Gemini 3.1 Pro.

Benchmark Opus 4.8 Opus 4.7 GPT-5.5 Gemini 3.1 Pro
Agentic coding (SWE-Bench Pro) 69.2% 64.3% 58.6% 54.2%
Agentic terminal coding (Terminal-Bench 2.1) 74.6% 66.1% 78.2% 70.3%
Multidisciplinary reasoning (Humanity's Last Exam, no tools) 49.8% 46.9% 41.4% 44.4%
Multidisciplinary reasoning (Humanity's Last Exam, with tools) 57.9% 54.7% 52.2% 51.4%
Agentic computer use (OSWorld-Verified) 83.4% 82.8% 78.7% 76.2%
Knowledge work (GDPval-AA) 1890 1753 1769 1314
Agentic financial analysis (Finance Agent v2) 53.9% 51.5% 51.8% 43.0%

※Source: Benchmarks from official Anthropic announcements. Values are based on Anthropic internal evaluations. GDPval-AA is a score value; all others are accuracy rates (%).

On SWE-Bench Pro, a coding benchmark close to real-world tasks, performance improved from 64.3% with Claude Opus 4.7 to 69.2% with Claude Opus 4.8. While Claude Opus 4.8 outperforms Claude Opus 4.7 in most categories, GPT-5.5 achieved the highest result on Terminal-Bench 2.1 at 78.2%, so it is worth noting that strengths and weaknesses vary depending on the use case.

https://x.com/claudeai/status/2060042702150930686

Key New Features and Improvements

https://platform.claude.com/docs/en/about-claude/models/whats-new-claude-4-8

According to the official Anthropic documentation, the following features have been added or changed from Claude Opus 4.7 to Claude Opus 4.8.

  • Mid-conversation system messages: It is now possible to insert role: "system" messages immediately after a user turn. This allows instructions to be added during long conversations without resending the entire system prompt, while maintaining prompt cache hits from previous turns. This contributes to reducing input costs in agent loops.
  • Default for the effort parameter changed to high: The default for the effort parameter is now high across all surfaces including the Claude API and Claude Code (no change if explicitly set).
  • Fast mode (research preview): Specifying speed: "fast" allows you to get up to 2.5x more output tokens per second from the same model (premium pricing, Claude API only).
  • Lower minimum token count for prompt caching: The minimum cacheable prompt length has been reduced to 1,024 tokens. Short prompts that could not be cached with Claude Opus 4.7 can now create cache entries without any code changes.
  • Official documentation of Refusal stop details: The stop_details object when a request is refused is now officially documented, making it easier to handle based on the type of refusal.

In terms of capabilities, the following improvements have been made compared to Claude Opus 4.7.

  • Long-horizon agentic coding: Handling of long contexts has improved, with fewer occurrences of compaction (context compression) and better recovery quality after compression.
  • Reasoning effort calibration: Behavior at each effort level is more stable.
  • Improved tool triggering: Fewer cases of skipping necessary tool calls for a task (this was a behavior partially reported in Claude Opus 4.7).
  • More efficient adaptive thinking: Determines whether thinking is needed on a per-turn basis, reducing unnecessary thinking tokens even at the same effort level.

Notes on Migration

According to Anthropic's migration guide, there are no breaking API changes when migrating from Claude Opus 4.7 to Claude Opus 4.8, and it is stated that basically just replacing the model ID will work. However, the following constraints carried over from Claude Opus 4.7 apply.

  • Setting temperature / top_p / top_k to non-default values will result in a 400 error (control behavior via prompts).
  • Only adaptive thinking is supported; thinking: {"type": "enabled", "budget_tokens": N} will result in a 400 error.

How to Use

There are two ways to access Claude Opus 4.8.

When using Amazon Bedrock, enable model access from Amazon Bedrock in the AWS Management Console. You can use it in combination with AWS managed features such as Guardrails and Knowledge Bases while keeping data within the AWS infrastructure.

Claude Platform on AWS is a platform operated by Anthropic, accessible from the AWS Management Console, and supports consolidated billing via AWS Marketplace (billed in CCU units).

Usage Notes

  • The benchmark figures in this article are based on official Anthropic announcements (internal evaluations). We recommend verifying performance with your own workloads in your own environment.
  • The pricing listed is the standard pricing for the Anthropic Claude API (first-party). Please check the AWS pricing page for pricing via Amazon Bedrock, and the CCU-based billing structure for Claude Platform on AWS.
  • At the time of writing, individual model cards and region lists on the AWS side may still be in the process of being updated. Please check the official Amazon Bedrock documentation for the latest information before production use.

Conclusion

Claude Opus 4.8 is a practically oriented update built on Claude Opus 4.7, with improvements to long-horizon agentic coding, tool triggering, and compaction handling. For those using Amazon Bedrock or Claude Platform on AWS, you can try it out simply by replacing the model ID with anthropic.claude-opus-4-8. Why not start by experiencing the differences from Claude Opus 4.7 in a test environment?


生成AI活用はクラスメソッドにお任せ

過去に支援してきた生成AIの支援実績100+を元にホワイトペーパーを作成しました。御社が抱えている課題のうち、どれが解決できて、どのようなサービスが受けられるのか?4つのフェーズに分けてまとめています。どうぞお気軽にご覧ください。

生成AI資料イメージ

無料でダウンロードする

Share this article

AWSのお困り事はクラスメソッドへ