aws resource_error ai_generated true

执行被中止,因为此状态机的总执行次数超过了并发执行配额

Execution aborted because the total number of executions for this state machine exceeded the concurrent execution quota

ID: aws/stepfunctions-execution-aborted-quota

其他格式: JSON · Markdown 中文 · English
78%修复率
83%置信度
1证据数
2023-11-20首次发现

版本兼容性

版本状态引入弃用备注
Step Functions 2023-10-01 active
AWS CLI 2.16.0 active

根因分析

Step Functions状态机已超过其并发执行限制(标准工作流默认每个账户每个区域1000个)。

English

The Step Functions state machine has exceeded its concurrent execution limit (default 1000 per account per region for Standard workflows).

generic

官方文档

https://docs.aws.amazon.com/step-functions/latest/dg/limits.html

解决方案

  1. 停止不再需要的运行中执行:
    # 列出运行中的执行:
    aws stepfunctions list-executions --state-machine-arn arn:aws:states:us-east-1:123456789012:stateMachine:MyStateMachine --status-filter RUNNING
    # 停止每个执行:
    aws stepfunctions stop-execution --execution-arn arn:aws:states:us-east-1:123456789012:execution:MyStateMachine:execution-id
    # 等待配额释放。
  2. 通过AWS支持请求增加并发执行的服务配额:
    https://console.aws.amazon.com/support/home#/case/create?issueType=service-limit-increase&limitType=service-code-step-functions
  3. 如果工作负载是高吞吐量和短生命周期的,使用Express工作流代替Standard:
    # Express工作流具有更高的并发执行限制(每个账户每个区域100,000个)。
    aws stepfunctions create-state-machine --name MyExpressStateMachine --definition '{}' --role-arn arn:aws:iam::123456789012:role/MyRole --type EXPRESS

无效尝试

常见但无效的做法:

  1. Delete completed executions manually 90% 失败

    Completed executions do not count toward the concurrent quota; only running executions matter.

  2. Increase the state machine's timeout 95% 失败

    Increasing timeout keeps executions running longer, potentially making the quota issue worse.

  3. Restart the state machine from the beginning 85% 失败

    Restarting creates a new execution, which will also be blocked if the quota is still exceeded.