.Net: Fix non-streaming function calling text and usage aggregation #13429
Open

Cozmopolit wants to merge 3 commits into microsoft:main from
Preserve intermediate LLM text content (e.g., "Let me check that for you...") and aggregate token usage across all iterations in the auto function calling loop.

- Add StringBuilder for text aggregation across loop iterations
- Accumulate InputTokens/OutputTokens and store as "AggregatedUsage" metadata
- Apply aggregated state to final response (or filter-terminated response)
- Add 5 unit tests covering text aggregation, usage aggregation, single iteration, empty content, and filter termination scenarios

Fixes microsoft#13420
Motivation and Context
Fixes #13420
When using auto function invocation in non-streaming mode (`GetChatMessageContentAsync`), intermediate text content generated by the LLM before tool calls is silently discarded. Additionally, token usage is not aggregated across multiple API calls in the auto-invoke loop.

Problem: If the LLM responds with "Let me check that for you..." before requesting a tool call, and then provides a final answer after the tool result, only the final answer is returned. The intermediate text is lost.
Scenario: Users relying on non-streaming mode with auto function invocation expect to receive all text the LLM generated, not just the final response.
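For context, a minimal sketch of the scenario using the Semantic Kernel chat completion API; the model id, the weather plugin, and the environment variable are illustrative choices, not taken from the PR:

```csharp
using Microsoft.SemanticKernel;
using Microsoft.SemanticKernel.ChatCompletion;
using Microsoft.SemanticKernel.Connectors.OpenAI;

var kernel = Kernel.CreateBuilder()
    .AddOpenAIChatCompletion("gpt-4o", Environment.GetEnvironmentVariable("OPENAI_API_KEY")!)
    .Build();

// A trivial plugin so the model has something to call.
kernel.Plugins.AddFromFunctions("Weather", new[]
{
    KernelFunctionFactory.CreateFromMethod(
        () => "18C and sunny", "GetWeather", "Gets the current weather")
});

var settings = new OpenAIPromptExecutionSettings
{
    FunctionChoiceBehavior = FunctionChoiceBehavior.Auto()
};

var history = new ChatHistory();
history.AddUserMessage("What's the weather like?");

// The model may first answer "Let me check that for you...", then request
// Weather-GetWeather, then produce a final answer after the tool result.
// Before this fix, only the final answer came back from this single call.
var chat = kernel.GetRequiredService<IChatCompletionService>();
var result = await chat.GetChatMessageContentAsync(history, settings, kernel);

Console.WriteLine(result.Content);
Console.WriteLine(result.Metadata?.TryGetValue("AggregatedUsage", out var usage) == true
    ? $"AggregatedUsage: {usage}"
    : "no aggregated usage (single iteration)");
```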
Description
This PR modifies the non-streaming auto function calling loop in all three affected connectors to aggregate intermediate text content across loop iterations using a `StringBuilder`, and to accumulate token usage across all API calls.

Affected Connectors:

- `Connectors.OpenAI/Core/ClientCore.ChatCompletion.cs`
- `Connectors.Google/Core/Gemini/Clients/GeminiChatCompletionClient.cs`
- `Connectors.MistralAI/Client/MistralClient.cs`

Implementation approach:
- Track `aggregatedContent` (a `StringBuilder`) and token counters across loop iterations
- Join the intermediate text into the final response (with a `\n\n` separator) and sum the token counts
- Store the summed counts as `AggregatedUsage` metadata (only when multiple iterations occurred)
- Apply the aggregated state to the final response, or to the filter-terminated response, as sketched below
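The following is a simplified, self-contained sketch of that pattern; the loop shape, the `ModelReply` record, and the two delegates are stand-ins for each connector's internals, not the actual connector code:

```csharp
using System;
using System.Collections.Generic;
using System.Text;
using System.Threading.Tasks;

sealed class AggregationLoopSketch
{
    internal sealed record ModelReply(string? Text, bool HasToolCalls, int InputTokens, int OutputTokens);

    internal static async Task<(string Text, IReadOnlyDictionary<string, object?>? Metadata)> RunAsync(
        Func<Task<ModelReply>> sendRequestAsync,  // one non-streaming API call
        Func<ModelReply, Task> invokeToolsAsync)  // executes the requested functions
    {
        StringBuilder? aggregated = null;
        int inputTokens = 0, outputTokens = 0, iterations = 0;

        ModelReply reply;
        while (true)
        {
            iterations++;
            reply = await sendRequestAsync();
            inputTokens += reply.InputTokens;
            outputTokens += reply.OutputTokens;

            if (!reply.HasToolCalls)
            {
                break; // final answer reached (a filter could also end the loop here)
            }

            // Preserve intermediate text such as "Let me check that for you..."
            if (!string.IsNullOrEmpty(reply.Text))
            {
                if (aggregated is null) { aggregated = new StringBuilder(); }
                else { aggregated.Append("\n\n"); }
                aggregated.Append(reply.Text);
            }

            await invokeToolsAsync(reply);
        }

        // Join intermediate text with the final answer using the "\n\n" separator.
        string finalText = aggregated is null
            ? reply.Text ?? string.Empty
            : aggregated.Append("\n\n").Append(reply.Text).ToString();

        // Expose summed usage only when the loop ran more than once.
        IReadOnlyDictionary<string, object?>? metadata = iterations > 1
            ? new Dictionary<string, object?>
            {
                ["AggregatedUsage"] = new { InputTokens = inputTokens, OutputTokens = outputTokens }
            }
            : null;

        return (finalText, metadata);
    }
}
```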
Out of Scope:

- `FunctionInvokingChatClient`

New Tests
Added 5 unit tests for the OpenAI connector in `FunctionCallingContentAggregationTests.cs`:

- `NonStreaming_IntermediateTextBeforeToolCall_IsAggregatedInFinalResponseAsync`
- `NonStreaming_TokenUsage_IsAggregatedAcrossAllIterationsAsync` (verifies the `AggregatedUsage` metadata contains the sum of all tokens; a sketch of this test follows the list)
- `NonStreaming_SingleIteration_NoAggregationMetadataAddedAsync`
- `NonStreaming_ToolCallWithoutIntermediateText_OnlyFinalTextReturnedAsync`
- `NonStreaming_FilterTerminatesEarly_AggregatedContentStillAppliedAsync`
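A rough sketch of the usage-aggregation test's shape; `CreateServiceWithCannedResponses`, `this._kernel`, and the HTTP stubbing behind them are hypothetical fixtures assumed to replay two canned OpenAI responses (a tool call first, then the final answer):

```csharp
[Fact]
public async Task NonStreaming_TokenUsage_IsAggregatedAcrossAllIterationsAsync()
{
    // Hypothetical helper: a chat service backed by a stubbed HttpClient.
    IChatCompletionService service = this.CreateServiceWithCannedResponses();

    var history = new ChatHistory();
    history.AddUserMessage("What's the weather like?");

    var settings = new OpenAIPromptExecutionSettings
    {
        FunctionChoiceBehavior = FunctionChoiceBehavior.Auto()
    };

    var result = await service.GetChatMessageContentAsync(history, settings, this._kernel);

    // Two API calls occurred, so the summed usage should be exposed.
    Assert.NotNull(result.Metadata);
    Assert.True(result.Metadata!.ContainsKey("AggregatedUsage"));
}
```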
Contribution Checklist