aws resource_error ai_generated true

Execution aborted because the total number of executions for this state machine exceeded the concurrent execution quota

ID: aws/stepfunctions-execution-aborted-quota

Also available as: JSON · Markdown · 中文
78%Fix Rate
83%Confidence
1Evidence
2023-11-20First Seen

Version Compatibility

VersionStatusIntroducedDeprecatedNotes
Step Functions 2023-10-01 active
AWS CLI 2.16.0 active

Root Cause

The Step Functions state machine has exceeded its concurrent execution limit (default 1000 per account per region for Standard workflows).

generic

中文

Step Functions状态机已超过其并发执行限制(标准工作流默认每个账户每个区域1000个)。

Official Documentation

https://docs.aws.amazon.com/step-functions/latest/dg/limits.html

Workarounds

  1. 85% success Stop running executions that are no longer needed: # List running executions: aws stepfunctions list-executions --state-machine-arn arn:aws:states:us-east-1:123456789012:stateMachine:MyStateMachine --status-filter RUNNING # Stop each execution: aws stepfunctions stop-execution --execution-arn arn:aws:states:us-east-1:123456789012:execution:MyStateMachine:execution-id # Wait for the quota to free up.
    Stop running executions that are no longer needed:
    # List running executions:
    aws stepfunctions list-executions --state-machine-arn arn:aws:states:us-east-1:123456789012:stateMachine:MyStateMachine --status-filter RUNNING
    # Stop each execution:
    aws stepfunctions stop-execution --execution-arn arn:aws:states:us-east-1:123456789012:execution:MyStateMachine:execution-id
    # Wait for the quota to free up.
  2. 70% success Request a service quota increase for concurrent executions via AWS Support: https://console.aws.amazon.com/support/home#/case/create?issueType=service-limit-increase&limitType=service-code-step-functions
    Request a service quota increase for concurrent executions via AWS Support:
    https://console.aws.amazon.com/support/home#/case/create?issueType=service-limit-increase&limitType=service-code-step-functions
  3. 75% success Use Express workflows instead of Standard if the workload is high-throughput and short-lived: # Express workflows have a higher concurrent execution limit (100,000 per account per region). aws stepfunctions create-state-machine --name MyExpressStateMachine --definition '{}' --role-arn arn:aws:iam::123456789012:role/MyRole --type EXPRESS
    Use Express workflows instead of Standard if the workload is high-throughput and short-lived:
    # Express workflows have a higher concurrent execution limit (100,000 per account per region).
    aws stepfunctions create-state-machine --name MyExpressStateMachine --definition '{}' --role-arn arn:aws:iam::123456789012:role/MyRole --type EXPRESS

中文步骤

  1. 停止不再需要的运行中执行:
    # 列出运行中的执行:
    aws stepfunctions list-executions --state-machine-arn arn:aws:states:us-east-1:123456789012:stateMachine:MyStateMachine --status-filter RUNNING
    # 停止每个执行:
    aws stepfunctions stop-execution --execution-arn arn:aws:states:us-east-1:123456789012:execution:MyStateMachine:execution-id
    # 等待配额释放。
  2. 通过AWS支持请求增加并发执行的服务配额:
    https://console.aws.amazon.com/support/home#/case/create?issueType=service-limit-increase&limitType=service-code-step-functions
  3. 如果工作负载是高吞吐量和短生命周期的,使用Express工作流代替Standard:
    # Express工作流具有更高的并发执行限制(每个账户每个区域100,000个)。
    aws stepfunctions create-state-machine --name MyExpressStateMachine --definition '{}' --role-arn arn:aws:iam::123456789012:role/MyRole --type EXPRESS

Dead Ends

Common approaches that don't work:

  1. Delete completed executions manually 90% fail

    Completed executions do not count toward the concurrent quota; only running executions matter.

  2. Increase the state machine's timeout 95% fail

    Increasing timeout keeps executions running longer, potentially making the quota issue worse.

  3. Restart the state machine from the beginning 85% fail

    Restarting creates a new execution, which will also be blocked if the quota is still exceeded.