AWS raises AgentCore runtime quotas by up to 5x to help enterprises scale AI agents
AWS has increased key Amazon Bedrock AgentCore runtime quotas by up to fivefold, enabling enterprises to support more concurrent AI agents and user interactions without going through the quota-increase process that often slows production deployments. While quota increase service requests are free themselves, the added capacity is more likely to translate into higher underlying compute and runtime consumption as enterprises expand AI deployments. “The new default limits support up to 5,000 active concurrent sessions in US East (N. Virginia) and US West (Oregon), and 2,500 in all other supported Regions (previously 1,000 and 500 respectively),” AWS wrote in its release notes. The hyperscaler has also increased the number of interactions each AI agent can handle from 25 tokens per second to 200 tokens per second across all supported regions, which it says will enable enterprises to support more simultaneous user requests. Further, to help enterprises scale AI applications faster during pe