test(patrol): client-side tool calling E2E test by runyaga · Pull Request #291 · soliplex/flutter

runyaga · 2026-02-11T04:35:16Z

Summary

Add Patrol E2E test validating the full client-side tool call lifecycle against a real Gemini room
Add extraOverrides parameter to pumpTestApp / pumpAuthenticatedTestApp for provider injection
Document tool-calling log patterns and extraOverrides usage in patrol skill

Changes

client_tool_calling_test.dart: New test — registers get_secret_code tool (returns "42"), sends prompt, asserts tool lifecycle via log-driven waits, verifies "42" in LLM response, checks performance < 90s for two-run flow
patrol_helpers.dart: Both pump helpers accept optional List<Override> extraOverrides appended after base overrides
SKILL.md: Added extraOverrides usage docs, Riverpod 3.x import note, tool-calling log patterns table
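As a rough illustration of the tool the test registers, the sketch below shows a registry holding a `get_secret_code` executor that returns "42". The `ToolExecutor` typedef, `ToolRegistrySketch` class, and `buildTestRegistry` function are hypothetical names invented for this example; they are not the soliplex_client API, and the real test injects its tool through `extraOverrides` rather than a standalone registry.

```dart
import 'dart:async';

/// Hypothetical signature for a client-side tool executor.
typedef ToolExecutor = FutureOr<String> Function(Map<String, dynamic> args);

/// Hypothetical in-memory registry (an assumption, not the project's type).
class ToolRegistrySketch {
  final Map<String, ToolExecutor> _tools = {};

  /// Associates [name] with [executor], replacing any previous entry.
  void register(String name, ToolExecutor executor) => _tools[name] = executor;

  /// Runs the tool registered under [name]; throws if none is registered.
  Future<String> execute(String name, Map<String, dynamic> args) async {
    final executor = _tools[name];
    if (executor == null) {
      throw ArgumentError.value(name, 'name', 'No tool registered');
    }
    return executor(args);
  }
}

/// Builds a registry containing only the tool described in this PR.
ToolRegistrySketch buildTestRegistry() =>
    ToolRegistrySketch()..register('get_secret_code', (_) => '42');
```

A fixed return value keeps the final assertion simple: if "42" shows up in the LLM response, the tool result must have reached the model through the continuation run.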

Test plan

patrol test --device macos --target integration_test/client_tool_calling_test.dart passes (2x)
patrol test --device macos --target integration_test/smoke_test.dart passes (no regression)
flutter test — 1221 tests pass
dart test (soliplex_client) — 1101 tests pass
flutter analyze --fatal-infos — 0 issues
Pre-commit hooks pass

Add Patrol integration test validating the full tool call lifecycle: register get_secret_code tool, trigger model to call it, execute client-side, and verify continuation run returns "42".

Key changes:
- patrol_helpers: add extraOverrides param for provider injection
- active_run_notifier: use createRun for continuation, new CancelToken
- agui_message_mapper: normalize empty args to '{}', add id to ToolMessage, handle failed status
- Update mapper test for empty args normalization
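The empty-args normalization mentioned for agui_message_mapper could look roughly like the sketch below. The function name `normalizeToolArgs` is hypothetical, and treating null or whitespace-only input as empty is an assumption; the commit message only states that empty args become '{}'.

```dart
/// Returns a JSON object string for tool-call arguments.
///
/// Assumption: null and whitespace-only input count as "empty".
/// Non-empty input is passed through (trimmed) without validation.
String normalizeToolArgs(String? rawArgs) {
  final trimmed = rawArgs?.trim() ?? '';
  return trimmed.isEmpty ? '{}' : trimmed;
}
```

Normalizing at the mapper boundary means downstream code can always JSON-decode the arguments, even for a tool such as get_secret_code that takes no parameters.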

With the stream drain fix, Run 1's RUN_FINISHED arrives before the continuation run starts. The test now counts RUN_FINISHED events and waits for the second one. It also always dumps logs in a finally block for diagnosis.
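Waiting for the second RUN_FINISHED can be sketched as a small helper over a stream of log lines. This is a minimal illustration in plain Dart: `waitForNthLog` is a hypothetical name, the log source is assumed to be a `Stream<String>`, and the 90-second default only mirrors the performance bound in the PR description. The actual Patrol helpers may be structured differently.

```dart
import 'dart:async';

/// Completes once [pattern] has appeared in [count] lines of [logs].
///
/// Fails with a [TimeoutException] if that does not happen within [timeout],
/// or with a [StateError] if the stream closes first.
/// Assumption: [logs] delivers lines emitted after subscription.
Future<void> waitForNthLog(
  Stream<String> logs,
  Pattern pattern, {
  int count = 1,
  Duration timeout = const Duration(seconds: 90),
}) {
  var seen = 0;
  return logs
      .where((line) => line.contains(pattern))
      .firstWhere((_) => ++seen >= count)
      .timeout(timeout);
}
```

With `count: 2`, the first RUN_FINISHED (from Run 1) is skipped and the future completes only when the continuation run finishes.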

The E2E test waits for a "Continuation run" log message to confirm the tool result handoff succeeded. Add the log after createRun in _continueWithToolResults.

runyaga added 2 commits February 11, 2026 01:32

fix(test): wait for second RUN_FINISHED in tool calling E2E test

43e253e

runyaga force-pushed the test/patrol-tool-calling branch from 8b9cadd to 43e253e February 11, 2026 07:33

runyaga changed the base branch from feat/client-tool-calling-v2 to refactor/notifier-stream-setup February 11, 2026 07:34

fix(client): add continuation run log for E2E test observability

4bf7932

runyaga self-assigned this Feb 11, 2026

runyaga marked this pull request as draft February 11, 2026 16:21

svarlet self-requested a review February 23, 2026 15:42

runyaga mentioned this pull request Feb 23, 2026

Client side toolcalling #94

Closed

8 tasks

This was referenced Mar 7, 2026

docs(audit): docs/planning/client-tool-calling-v3-reference.md — 3/6 runyaga/flutter#115

Open

docs(audit): docs/planning/client-tool-calls-plan.md — 5/6 runyaga/flutter#119

Open

runyaga wants to merge 3 commits into refactor/notifier-stream-setup from test/patrol-tool-calling
