Release date 2025/10/01
Feature
- Added `time_to_first_token` and `request_mode` for observability.
Bugfix
- Fixed an issue where the Anthropic provider mapped the wrong `stop_reason` to Chat and Completions responses.
- Fixed an issue where the Anthropic provider failed to stream function call responses.
- Fixed an issue where the Anthropic provider could truncate tokens in streaming responses.
- Fixed an issue where the `tool_choice` field of OpenAI chat completion requests was not converted to Anthropic’s format.
- Fixed an issue where the `model` field was incorrect in Gemini responses when a variable was used as the model name.
- Fixed an issue where the `model` field was missing in OpenAI-format Gemini responses in some situations.
- Fixed an issue where the Gemini provider did not properly handle multiple tool calls across different turns.
- Fixed an issue where an empty array was encoded as an empty object in structured output.
- Fixed an issue where the AWS stream parser did not parse incomplete frames correctly.
- Fixed an issue where JSON array data was not parsed correctly.
- Fixed an issue with Azure Managed Identity credentials causing improper request balancing.
- Fixed an issue where AWS Bedrock did not properly handle image editing requests through the `image/v1/images/edits` route.
- Fixed an issue where AWS Bedrock streaming requests would panic when the ai-semantic-cache plugin was enabled.
- Fixed an issue where the resource was not being passed correctly when using Azure as the provider.
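
To illustrate the kind of format translation the `tool_choice` and `stop_reason` fixes concern, here is a minimal Python sketch. It is not the gateway's implementation: the function names `to_anthropic_tool_choice` and `to_openai_finish_reason` are hypothetical, the mappings reflect the publicly documented OpenAI and Anthropic field values as best understood here, and the fallback to `"stop"` for unrecognized stop reasons is an assumption made for the example.

```python
from typing import Any, Optional

# Assumed mapping of OpenAI string-valued tool_choice to Anthropic objects.
_STRING_CHOICES = {
    "auto": {"type": "auto"},      # model decides whether to call a tool
    "required": {"type": "any"},   # model must call some tool
    "none": {"type": "none"},      # model must not call a tool
}

# Assumed mapping of Anthropic stop_reason to OpenAI finish_reason.
_FINISH_REASONS = {
    "end_turn": "stop",
    "stop_sequence": "stop",
    "max_tokens": "length",
    "tool_use": "tool_calls",
}


def to_anthropic_tool_choice(tool_choice: Any) -> Optional[dict]:
    """Convert an OpenAI-style tool_choice into an Anthropic-style one (hypothetical helper)."""
    if tool_choice is None:
        # Nothing was requested; leave the provider default in place.
        return None
    if isinstance(tool_choice, str):
        if tool_choice not in _STRING_CHOICES:
            raise ValueError(f"unsupported tool_choice: {tool_choice!r}")
        return dict(_STRING_CHOICES[tool_choice])
    if isinstance(tool_choice, dict) and tool_choice.get("type") == "function":
        # OpenAI names a specific function under the "function" key.
        name = tool_choice.get("function", {}).get("name")
        if not name:
            raise ValueError("tool_choice of type 'function' needs a function name")
        return {"type": "tool", "name": name}
    raise ValueError(f"unsupported tool_choice: {tool_choice!r}")


def to_openai_finish_reason(stop_reason: Optional[str]) -> str:
    """Map an Anthropic stop_reason to an OpenAI finish_reason (hypothetical helper)."""
    # Falling back to "stop" for unknown values is an assumption of this sketch.
    return _FINISH_REASONS.get(stop_reason, "stop")


if __name__ == "__main__":
    forced = {"type": "function", "function": {"name": "get_weather"}}
    print(to_anthropic_tool_choice(forced))
    print(to_openai_finish_reason("tool_use"))
```

Keeping both mappings as plain lookup tables makes it easy to spot a wrong or missing entry, which is the class of error the entries above describe.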