Streaming Tool Execution Parallelism

Description

Design where tool invocations are executed as their definitions stream in from the model — not after the full model response completes — producing roughly a 40% wall-time reduction compared with traditional sequential agents.

Key claims

Relations

Sources

src-20260419-f46b0a2fccd9