llm protocol_error ai_generated true

json.decoder.JSONDecodeError: Expecting property name enclosed in double quotes: line 1 column 1024 (char 1023) when parsing streaming function call arguments

ID: llm/function-call-arguments-truncated-in-streaming

Also available as: JSON · Markdown · 中文
85%Fix Rate
89%Confidence
1Evidence
2024-04-01First Seen

Version Compatibility

VersionStatusIntroducedDeprecatedNotes
openai==1.30.0 active
anthropic==0.25.0 active
mistralai==0.1.0 active

Root Cause

When streaming function calls, the API sends the arguments in chunks. If the argument string is long, a chunk boundary can split a JSON token (e.g., a string value or key), causing the accumulated arguments to be invalid JSON when parsed incrementally.

generic

中文

当流式传输函数调用时,API 会分块发送参数。如果参数字符串很长,块边界可能会分割 JSON 令牌(例如,字符串值或键),导致在增量解析时累积的参数成为无效 JSON。

Official Documentation

https://platform.openai.com/docs/guides/function-calling#streaming-function-calls

Workarounds

  1. 95% success Accumulate all chunks for a function call before attempting to parse. Wait for the 'finish_reason' to be 'stop' or 'function_call' before parsing the complete arguments string. Example: full_args = "" for chunk in stream: delta = chunk.choices[0].delta if delta.function_call and delta.function_call.arguments: full_args += delta.function_call.arguments if chunk.choices[0].finish_reason: break import json parsed_args = json.loads(full_args)
    Accumulate all chunks for a function call before attempting to parse. Wait for the 'finish_reason' to be 'stop' or 'function_call' before parsing the complete arguments string. Example:
    
    full_args = ""
    for chunk in stream:
        delta = chunk.choices[0].delta
        if delta.function_call and delta.function_call.arguments:
            full_args += delta.function_call.arguments
        if chunk.choices[0].finish_reason:
            break
    import json
    parsed_args = json.loads(full_args)
  2. 82% success Use the 'incremental parsing' approach with a JSON repair library like 'json-repair' or 'json5' that can handle partial JSON. Example: import json5 partial_args = "" for chunk in stream: delta = chunk.choices[0].delta if delta.function_call and delta.function_call.arguments: partial_args += delta.function_call.arguments try: parsed = json5.loads(partial_args) # use parsed incrementally except Exception: pass
    Use the 'incremental parsing' approach with a JSON repair library like 'json-repair' or 'json5' that can handle partial JSON. Example:
    
    import json5
    partial_args = ""
    for chunk in stream:
        delta = chunk.choices[0].delta
        if delta.function_call and delta.function_call.arguments:
            partial_args += delta.function_call.arguments
            try:
                parsed = json5.loads(partial_args)
                # use parsed incrementally
            except Exception:
                pass
  3. 75% success Set the API parameter 'stream_options' to {'include_usage': True} and use the 'usage' field to detect the end of function call arguments, then parse the complete string.
    Set the API parameter 'stream_options' to {'include_usage': True} and use the 'usage' field to detect the end of function call arguments, then parse the complete string.

中文步骤

  1. 在尝试解析之前,累积函数调用的所有块。等待 'finish_reason' 为 'stop' 或 'function_call' 后再解析完整的参数字符串。示例:
    
    full_args = ""
    for chunk in stream:
        delta = chunk.choices[0].delta
        if delta.function_call and delta.function_call.arguments:
            full_args += delta.function_call.arguments
        if chunk.choices[0].finish_reason:
            break
    import json
    parsed_args = json.loads(full_args)
  2. 使用 '增量解析' 方法,配合 JSON 修复库(如 'json-repair' 或 'json5')处理部分 JSON。示例:
    
    import json5
    partial_args = ""
    for chunk in stream:
        delta = chunk.choices[0].delta
        if delta.function_call and delta.function_call.arguments:
            partial_args += delta.function_call.arguments
            try:
                parsed = json5.loads(partial_args)
                # 增量使用解析结果
            except Exception:
                pass
  3. 设置 API 参数 'stream_options' 为 {'include_usage': True},并使用 'usage' 字段检测函数调用参数的结束,然后解析完整字符串。

Dead Ends

Common approaches that don't work:

  1. Using json.loads() on each individual chunk instead of accumulating 95% fail

    Individual chunks are not valid JSON; they are partial fragments. json.loads() will always fail on incomplete chunks.

  2. Increasing max_tokens to reduce chunking 80% fail

    max_tokens controls the output length, not the chunk size. The API still sends chunks of arbitrary size regardless of max_tokens.

  3. Setting stream_options={'include_usage': True} 90% fail

    This option controls whether usage information is included in the stream, it has no effect on how function call arguments are chunked or parsed.