Cache terminal risk assessment by command only by chrmarti · Pull Request #315663 · microsoft/vscode

chrmarti · 2026-05-11T13:01:57Z

Summary

Fixes the terminal confirmation badge showing Assessing risk… with a spinner every time, instead of using the cached assessment, when re-running the same command outside the sandbox.

The risk assessment cache key was tool.id + stableStringify(parameters). For run_in_terminal the parameter set includes model-generated explanation, goal, and requestUnsandboxedExecutionReason fields, which differ on every invocation even when the underlying command is identical — so the cache never hit on the second run.

This narrows the cache key for run_in_terminal to just `{ command }`. Other tools are unaffected and still key on their full parameter set.

Repro

Set `chat.tools.riskAssessment.enabled: true` and `chat.agent.sandbox.enabled: true`
In agent mode, run a command that requires unsandboxed access (e.g. `curl https://google.com\`)
Trigger the same command again — badge now renders instantly from cache.

Session Context

Key decisions from the development session:

Cache key scope: Only narrowed for `run_in_terminal` (via the `TerminalToolId.RunInTerminal` id). The previous full-parameter key is preserved for all other tools, since their parameters are typically the deterministic inputs of the call.
Normalized fields: Only `command` is used. Fields like `explanation`, `goal`, `requestUnsandboxedExecutionReason`, `mode`, `timeout`, and `requestUnsandboxedExecution` are excluded because they're either model-generated descriptive text or execution-mode controls that don't change the risk semantics of the command itself.
Shared helper: `getCached` and `assess` both go through a single `_cacheKey` helper so the lookup and insertion keys can't drift.

The cache key for run_in_terminal previously included the model-generated explanation, goal, and unsandboxed-execution reason fields, which differ on every invocation even for the same command. As a result, repeating the same command showed the 'Assessing risk...' spinner instead of using the cached assessment. Narrow the cache key for run_in_terminal to just the command string.

Copilot

Pull request overview

This PR fixes risk assessment cache misses for terminal tool invocations by narrowing the cache key used for run_in_terminal from “tool id + full parameter set” to “tool id + normalized parameters (command only)”. This prevents repeated terminal confirmations from showing “Assessing risk…” on every re-run when only model-generated descriptive fields changed.

Changes:

Introduced a shared _cacheKey(...) helper so getCached(...) and assess(...) use identical cache key logic.
Added parameter normalization for TerminalToolId.RunInTerminal so only { command } contributes to the cache key.
Left all other tools’ cache key behavior unchanged (still based on full parameter set via stableStringify).

Show a summary per file

File	Description
src/vs/workbench/contrib/chat/browser/tools/chatToolRiskAssessmentService.ts	Normalizes `run_in_terminal` cache keys to be command-based and centralizes cache key computation to restore cache hits on repeated runs.

Copilot's findings

Files reviewed: 1/1 changed files
Comments generated: 0

Copilot AI review requested due to automatic review settings May 11, 2026 13:01

chrmarti marked this pull request as ready for review May 11, 2026 13:02

chrmarti enabled auto-merge (rebase) May 11, 2026 13:02

Copilot started reviewing on behalf of chrmarti May 11, 2026 13:02 View session

Copilot AI reviewed May 11, 2026

View reviewed changes

lszomoru approved these changes May 11, 2026

View reviewed changes

chrmarti merged commit 3f0be37 into main May 11, 2026
29 checks passed

chrmarti deleted the chrmarti/risk-badge-cache-key branch May 11, 2026 13:50

vs-code-engineering Bot added this to the 1.121.0 milestone May 11, 2026

chrmarti added the ~release-cherry-pick Trigger: cherry-pick this PR to the latest release branch label May 11, 2026

vs-code-engineering Bot mentioned this pull request May 11, 2026

[cherry-pick] Cache terminal risk assessment by command only #315685

Merged

vs-code-engineering Bot added release-cherry-pick Automated cherry-pick between release and main branches and removed ~release-cherry-pick Trigger: cherry-pick this PR to the latest release branch labels May 11, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cache terminal risk assessment by command only#315663

Cache terminal risk assessment by command only#315663
chrmarti merged 1 commit into
mainfrom
chrmarti/risk-badge-cache-key

chrmarti commented May 11, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

chrmarti commented May 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Repro

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Copilot's findings

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

chrmarti commented May 11, 2026 •

edited

Loading