I've summarized the concept of Claude API rate limits and spend limits

I'm explaining about Claude API rate limits and spend limits. The Claude API is separate from Claude subscriptions and needs to be managed in the Claude Console. It works on a deposit system, with spend limits and rate limits determined by your Usage Tier. I also introduce service tiers and usage through Amazon Bedrock.

千葉幸宏（チバユキ）

2025.12.09

This page has been translated by machine translation. View original

Good evening, this is Chiba (Koh).
When considering using Claude API, I was concerned about rate limits.
Claude API has the following limits:
Spend limits: Limits on maximum monthly cost
Rate limits: Limits on API requests within defined periods
Since this was difficult to understand at first glance, I'll summarize various aspects of Claude API in a roundabout way.
!The content of this blog is as of December 9, 2025. Please refer to the official pages for the latest information.
 Summary FirstWhen using Claude API, you need to create an account in Claude Console (separate from Claude subscription)
Claude Console has concepts of organizations, workspaces, and members
Claude API has Service tiers: Priority, Standard, and Batch
Claude API usage follows a deposit system (purchase credits in advance)
Usage Tier is closely related to the following Claude API limits:
Spend limits: Limits on maximum monthly cost
Rate limits: Limits on API requests within defined periods

 Claude and Claude API have different pricing structures and ratesIt's easier to understand that Claude and Claude API are separate entities.
Claude has paid subscription plans like Pro, Max, Team, and Enterprise. Selecting a higher-priced plan allows access to more features and relaxed limits.
For Claude API, pricing and limits are different.
When using Claude API, you'll use Claude Console. This blog covers pricing and limits on the Claude Console side, not Claude subscriptions.
Reference: I have a paid Claude subscription (Pro, Max, Team, or Enterprise plan). Why do I need to pay separately to use Claude API and Console? | Anthropic Help Center
For Claude subscription pricing and limits, please refer to:
Pricing | Claude
Usage Limit Best Practices | Anthropic Help Center
About Usage and Length Limits | Anthropic Help Center
 Understanding Claude Console for API UsageTo use Claude API, you create a Claude Console account. Claude Console is a platform for creating API keys, managing users and billing, and testing Claude in Workbench.
Reference: How can I access Claude API? | Anthropic Help Center
You create a Claude Console organization as the outermost container, then create one or more workspaces within it. Within workspaces, you add members and issue/manage API keys.


Claude Console organization image
Before using Claude API, you need to purchase credits (Usage Credit) in your Claude Console organization. It's a deposit system where you consume purchased credits as you use the API.
Reference: How do I pay for Claude API usage? | Anthropic Help Center
Credits and limits are shared across the Claude Console organization. You can also set limits at the narrower workspace level (except for the default workspace).
 Service tiers in Claude APIClaude API has Service tiers that define service levels. Be careful not to confuse this with Usage Tier, which is a different concept that we'll cover later when discussing limits.


Service tiers image
Service tiers consist of:
Priority Tier
Standard Tier
Batch
Only Priority Tier requires pre-commitment. Committing to usage period, model, and input/output tokens per minute means your requests are prioritized. This minimizes server overload errors and improves availability.
Which tier is used can be controlled in API requests to the model. Priority Tier has its own pricing, while Standard Tier/Batch pricing can be seen in the price table at:
Pricing | Claude
Pricing is defined for each model based on input/output token counts. The screen shows Standard Tier pricing, and toggling the switch at the bottom shows Batch pricing, which is set at 50% of Standard Tier.
Credits purchased in advance for your Claude Console organization are consumed according to charges incurred through Priority Tier, Standard Tier, and Batch usage.
References:
Service tiers - Claude Docs
Batch processing - Claude Docs

 Claude API Rate LimitsFinally, the main topic of this post. There are the following restrictions:
Spend limits: Limits on maximum monthly cost
Rate limits: Limits on API requests within defined periods
The concept of Usage Tier is important for these limits. Usage Tier is defined by the amount of credits deposited.


Usage Tier image
Each Tier has a maximum amount that can be deposited. For example, Tier 4 allows up to $5,000 monthly (as of December 9, 2025). This deposit limit becomes the Spend limit for the Claude Console organization, and smaller limits can be set for non-default workspaces.
Depositing a certain amount upgrades you to the next Tier. Previously, a waiting period of several days was required, but as of December 2025, upgrades appear to be immediate. For Tier 2, a maximum monthly deposit of $500 is allowed, which would skip Tier 3 and upgrade directly to Tier 4.
The "Spend limits" and "Tier upgrade threshold amounts" can be found in the documentation as "Credit Purchase" and "Max Credit Purchase."


Previously, this document apparently had a column showing waiting periods for Tier upgrades.
Each Tier defines maximum values for the following metrics for each model:
RPM: Requests per minute
ITPM: Input tokens per minute
OTPM: Output tokens per minute
This can also be confirmed in the documentation:


Limits for each model in Standard Tier as of December 9, 2025. The image shows Tier 4 selected
All limits can be customized. The documentation notes that it's possible to set limits higher than Tier 4, in which case you should contact sales. While Priority Tier is for improving service level, it may have different limits than Standard Tier. If the limits in the documentation don't meet your requirements, consider individual customization.

The limits outlined below are our standard tier limits. If you're seeking higher, custom limits or Priority Tier for enhanced service levels, contact sales through the Claude Console.

Reference: Rate limits - Claude Docs
 Pricing and Quotas when using Claude through Amazon BedrockAs a bonus, let's look at pricing and quotas when using Claude foundation models through Amazon Bedrock.
 Amazon Bedrock Cross-Region InferenceAs background knowledge, Amazon Bedrock has a mechanism called cross-region inference that allows use of foundation models across multiple regions. Newer models often can't be used within a single region. Cross-region inference is divided into specific region group combinations and global ones. Keep these in mind when checking pricing and quotas.


From https://pages.awscloud.com/rs/112-TZM-766/images/AWS-Black-Belt_2024_Amazon-Bedrock-Model-Inference-b_0909_v1.pdf
 Pricing for Claude through Amazon BedrockThe Amazon Bedrock pricing page is:
https://aws.amazon.com/jp/bedrock/pricing/
Tabs are separated by model provider. Selecting Anthropic shows pricing details for various Claude foundation models. Pricing based on input/output tokens is displayed alongside pricing for batch inference and when cache is utilized.
👆Pricing and available models may differ by region. Some foundation models may only support global cross-region inference.
 Quotas for Claude through Amazon BedrockLimits for using Claude foundation models through Amazon Bedrock can be checked in AWS documentation or the AWS Management Console.
Let's check the documentation:
Amazon Bedrock endpoints and quotas - AWS General Reference
Quotas are set for each foundation model, so filter to find what you're looking for.
This shows default values and whether they can be increased. Some AWS accounts may have values lower than the default. Quotas generally include:
Requests per minute
Tokens per minute (no distinction between input/output)
Tokens per day (no distinction between input/output)
Quotas are further divided based on factors like "whether it's global cross-region inference" and "whether it has 1M context length."
To request increases, use the Service Quotas screen in the AWS Management Console. Approval depends on AWS Support's evaluation of your use case and projected future usage.
Reference: Amazon Bedrock quotas - Amazon Bedrock
 Amazon Bedrock Reserved TierAt the end of November 2025, Amazon Bedrock Reserved Tier was announced.
https://aws.amazon.com/jp/about-aws/whats-new/2025/11/amazon-bedrock-reserved-service-tier/
From the documentation:
 Reserved TierThe Reserved tier provides the ability to reserve prioritized compute capacity for your mission-critical applications that cannot tolerate any downtime. You have the flexibility to allocate different input and output tokens-per-minute capacities to match the exact requirements of your workload and control cost. When your application needs more tokens-per-minute capacity than what you reserved, the service automatically overflows to the Standard tier, ensuring uninterrupted operations. The Reserved tier targets 99.5% uptime for model response. Customers can reserve capacity for 1 month or 3 month duration. Customers pay a fixed price per 1K tokens-per-minute and are billed monthly.
To get access to the Reserved tier, please contact your AWS account team.
Reference: Service tiers for optimizing performance and cost - Amazon Bedrock
This seems similar to Claude API's Priority Tier concept.
As of December 9, 2025, some Claude Sonnet 4.5 models support Reserved Tier.
 Summary (Repeated)When using Claude API, you need to create an account in Claude Console (separate from Claude subscription)
Claude Console has concepts of organizations, workspaces, and members
Claude API has Service tiers: Priority, Standard, and Batch
Claude API usage follows a deposit system (purchase credits in advance)
Usage Tier is closely related to the following Claude API limits:
Spend limits: Limits on maximum monthly cost
Rate limits: Limits on API requests within defined periods

 ConclusionI've examined Claude API's Rate limits and Spend limits.
Since there were many new concepts, I summarized them starting with the basics. I hope this is helpful for those considering using Claude API.
This has been Chiba Yuki (@batchicchi).
 ReferencesAbout Claude's Rate Limits - 429 Error #AI - Qiita
Approach to Claude API Rate Limits | Anthropic Help Center
Using Claude Code with Pro or Max Plan | Anthropic Help Center
Using Claude Code with Team or Enterprise Plan | Anthropic Help Center
About Usage and Length Limits | Anthropic Help Center
Anthropic Help Center
Claude Code on AWS Pattern Explanation – Amazon Bedrock / AWS Marketplace | Amazon Web Services Blog