UI + UX Trust, Safety, And Privacy emerging

Dangerous-action review

Before execution, present a risk review that binds the exact action, payload, target, actor, evidence, permissions, freshness, side effects, alternatives, and audit record, then let the reviewer run, edit, cancel, defer, or escalate only the displayed action after revalidation.

Decision first

Choose this pattern when the problem matches

Use when

  • A user, agent, automation, or admin tool is about to execute a high-impact action that can affect money, access, production systems, legal/compliance state, customers, external recipients, sensitive data, or safety.
  • The system can show the exact payload, risk reason, evidence, current state, side effects, alternatives, and execution boundary before the action runs.
  • The action can be canceled, edited, reduced, deferred, escalated, approved, or revalidated before execution.

Avoid when

  • The risk is narrowly permanent deletion or loss of a named object; use destructive action confirmation.
  • The safeguard needed is exact target text reproduction; use typed confirmation inside the appropriate review.
  • Automation is paused for a separate eligible reviewer before resuming; use human approval gate.
  • The user is only checking ordinary form answers before submit; use review before submit.
  • A detected unsafe destination, download, form, certificate, or file preview is the issue; use security warning.
  • The product cannot show the action payload or enforce a pre-execution boundary; do not present a decorative review.

Problem it prevents

High-impact actions fail safely only when users can inspect what is about to happen; generic confirmations, warnings, and post-action logs often hide scope, payload, evidence, external effects, stale state, and safer alternatives until after the dangerous operation has already run.

Pattern anatomy

What a strong implementation has to make clear

User need

The action may send money, issue refunds, change access, publish or delete customer-facing content, run production commands, deploy code, contact external recipients, submit legal or compliance material, launch a physical-world operation, or let an AI agent call a privileged tool.

Pattern promise

Before execution, present a risk review that binds the exact action, payload, target, actor, evidence, permissions, freshness, side effects, alternatives, and audit record, then let the reviewer run, edit, cancel, defer, or escalate only the displayed action after revalidation.

Required state

Armed action state with verb, target, payload, actor, source, and exact execution boundary.

Recovery path

A reviewer runs a command without seeing that it targets production instead of staging.

Access contract

Use headings and labels that name the action and target before risk details.

Quality bar

The difference between expert and weak execution

Strong implementation

Specific, visible, recoverable

  • A production console shows Restart payment workers, affected region, open incidents, customer impact, rollback owner, evidence links, change window, dry-run result, and Run restart only after the reviewer checks the risk inventory.
  • An AI support tool proposes Send refund email and issue $480 credit, shows customer, amount, policy match, source gaps, external email preview, account balance, and safer Edit amount or Escalate options before execution.
  • A release manager sees that a deploy action affects production EU, has a stale smoke test, cancels execution, refreshes checks, and then runs the action with an audit record.
  • A support lead reviews an agent-proposed refund, notices that the evidence is partial, edits the amount, and runs only the revised external action.
Weak implementation

Vague, hidden, hard to recover from

  • A privileged tool button says Continue and immediately sends a customer email, changes access, and updates billing without showing the payload or external recipients.
  • A warning says This may be risky but hides the target, policy trigger, evidence, and whether the action can be undone.
  • A user approves a notification from email after the underlying payload changed, and the system executes against a different customer.
  • A reviewer sees only a confidence score and a red warning, so they cannot tell whether the action sends a message, changes permissions, or spends money.
UI guidance
  • Render dangerous-action review as a pre-execution checkpoint that names the armed action, actor, target, scope, affected systems, external effects, risk reason, evidence, freshness, permissions, alternatives, and exact outcome of Run, Cancel, Edit, or Escalate.
  • Show a structured risk inventory before execution: who or what is affected, what leaves the system boundary, what cannot be recalled, what policy or threshold triggered review, what evidence supports the action, and what still needs human attention.
UX guidance
  • Use dangerous-action review when an action is not necessarily destructive but can create high-impact consequences such as sending money, changing access, executing production commands, contacting customers, publishing content, filing a legal response, or letting an agent use a privileged tool.
  • Help users decide whether to run, revise, cancel, defer, or escalate the action by making the action payload inspectable, comparing it with safer alternatives, revalidating stale state, and recording what the reviewer saw before execution.
Implementation contract

What the implementation must handle

States

  • Armed action state with verb, target, payload, actor, source, and exact execution boundary.
  • Risk inventory state listing money, access, customer, production, legal, security, external-recipient, data, safety, or physical-world effects.
  • Evidence and confidence state with source links, gaps, stale checks, and uncertainty stated in task language.
  • Freshness revalidation state before execution when payload, target, permission, policy, or external data may have changed.

Interaction

  • The review is bound to a specific action ID, payload version, target, actor, permission scope, source context, evidence set, and policy trigger.
  • The primary action label names the real operation, such as Send refund, Restart workers, Publish policy, Grant admin access, or Run production command.
  • The review exposes what will happen outside the product boundary, including emails, webhooks, payments, permission changes, deployments, records, customer notifications, or irreversible third-party effects.
  • Run is unavailable until required risk inventory, evidence freshness, permissions, and acknowledgements pass.

Accessibility

  • Use headings and labels that name the action and target before risk details.
  • Expose risk inventory, evidence gaps, stale state, safe alternatives, and execution status as text, not only color, icons, or severity badges.
  • Keep Run, Edit, Dry run, Cancel, Defer, Escalate, and evidence links keyboard reachable and labelled by outcome.
  • Announce stale, blocked, edited, dry-run complete, executing, executed, failed, canceled, and escalated states through status text.

Review

  • What exact action, target, payload, and side effects will execute if the reviewer continues?
  • What makes this action dangerous: money, access, production, customer, legal, security, external recipient, data sensitivity, safety, or physical-world impact?
  • What evidence, policy trigger, confidence, or freshness check supports the action, and what is missing?
  • Can the reviewer edit, reduce scope, dry run, cancel, defer, request approval, or escalate instead of running?
Interactive lab

Inspect the states before you copy the pattern

Review high-impact action before execution

Inspect armed action, risk inventory, evidence and confidence, freshness revalidation, edit before run, dry run, cancel, defer, escalate, run action, blocked state, executed audit receipt, mobile compact review, and compare generic continue, hidden payload, confidence-only, stale notification, bundled actions, routine overuse, and missing audit failures.

Dangerous-action review
Interactive demo is ready

Launch the live UI/UX lab when you want to inspect states, keyboard behavior, and common failure modes.

State To Inspect

Armed action state with verb, target, payload, actor, source, and exact execution boundary.

Keyboard / Access

Tab order reaches the action summary, risk inventory, evidence, alternatives, acknowledgement or blockers, and execution controls in order.

Avoid Generating

Using vague Are you sure, Continue, or Proceed copy without naming the dangerous operation.

Evidence trail

Source-backed claims behind this guidance

Guidelines for Human-AI Interaction

Microsoft Research - checked

Supports human control, correction, uncertainty handling, and capability-boundary communication for AI-assisted high-impact actions.

AI Risk Management Framework

NIST - checked

Supports risk identification, measurement, management, monitoring, accountability, transparency, reliability, and risk treatment for AI-supported decisions.

GitHub Actions deployment protection rules

GitHub Docs - checked

Supports protected high-impact execution with required review, self-review prevention, wait timers, branch controls, bypass, and deployment protection.

Full agent/debug reference

Problem Context

  • The action may send money, issue refunds, change access, publish or delete customer-facing content, run production commands, deploy code, contact external recipients, submit legal or compliance material, launch a physical-world operation, or let an AI agent call a privileged tool.
  • The action may not be destructive in the narrow sense; risk comes from external visibility, cost, permissions, security, customer impact, legal effect, production blast radius, or inability to recall the side effect.
  • Users may encounter the review inside consoles, agent tool-call previews, workflow builders, admin tools, deployment screens, customer-support workspaces, finance tools, moderation systems, and mobile approval notifications.
  • The system can compute or disclose action payload, target, risk reason, evidence, reviewer authority, freshness, alternatives, policy triggers, expected side effects, and execution result.

Selection Rules

  • Choose dangerous-action review when the user is about to execute a high-impact action and needs to inspect the exact payload, risk, evidence, and side effects before it leaves the safe preview state.
  • Use destructive action confirmation when the central risk is permanent deletion, removal, reset, cancellation, or unrecoverable loss of a named object.
  • Use typed confirmation as an escalation when wrong-target risk is severe enough that the user must reproduce a visible target phrase.
  • Use human approval gate when automation is already paused and cannot resume until an eligible human approves the next step; dangerous-action review may be initiated by the same user before manual or agent-assisted execution.
  • Use review before submit for ordinary user-entered form answers; dangerous-action review is for high-impact side effects, privileged tools, production changes, external recipients, money, access, legal status, or safety-sensitive action.
  • Use security warning when the system detected an unsafe destination, malware, phishing, certificate, transport, or file-risk signal.
  • Show the action verb, target, payload, side effects, affected people or systems, evidence, uncertainty, policy trigger, freshness, permissions, alternatives, and execution boundary before Run is available.
  • Block or re-open review when the action payload, target, permissions, evidence, policy, external state, model output, or run context changes after the review opened.
  • Prefer safer alternatives such as dry run, preview, edit, schedule, sandbox, request approval, reduce scope, or cancel when the review exposes unresolved risk.

Required States

  • Armed action state with verb, target, payload, actor, source, and exact execution boundary.
  • Risk inventory state listing money, access, customer, production, legal, security, external-recipient, data, safety, or physical-world effects.
  • Evidence and confidence state with source links, gaps, stale checks, and uncertainty stated in task language.
  • Freshness revalidation state before execution when payload, target, permission, policy, or external data may have changed.
  • Edit before run state that changes payload or scope and reopens the review summary.
  • Dry run or preview state when the system can simulate the operation safely.
  • Cancel, defer, escalate, or request approval state when risk remains unresolved.
  • Run action state with duplicate-submit protection and one named action boundary.
  • Blocked state for missing permission, stale evidence, policy restriction, maintenance freeze, or unresolved dependency.
  • Executed state with audit receipt, side-effect result, failure handling, and rollback or follow-up status.
  • Mobile compact review state that still exposes action, target, risk, evidence, side effect, and safe alternatives.

Interaction Contract

  • The review is bound to a specific action ID, payload version, target, actor, permission scope, source context, evidence set, and policy trigger.
  • The primary action label names the real operation, such as Send refund, Restart workers, Publish policy, Grant admin access, or Run production command.
  • The review exposes what will happen outside the product boundary, including emails, webhooks, payments, permission changes, deployments, records, customer notifications, or irreversible third-party effects.
  • Run is unavailable until required risk inventory, evidence freshness, permissions, and acknowledgements pass.
  • Editing scope, payload, target, recipients, amount, command, environment, model output, or evidence returns the review to an updated pre-execution state.
  • Opening a review from email, mobile, notification, or reopened tab revalidates current state and refuses stale actions.
  • Cancel leaves the target unchanged and records no execution; defer, escalate, and request approval preserve the review context.
  • Execution produces an audit record of the shown payload, reviewer, decision, timestamp, risk reason, revalidation result, side effects, and downstream outcome.

Implementation Checklist

  • Define which operations count as dangerous by consequence: money movement, access changes, production commands, public publication, customer messaging, legal submission, sensitive data mutation, external integrations, safety-sensitive workflows, or privileged agent tool use.
  • Capture action ID, actor, target, payload, recipients, environment, permissions, risk reason, policy trigger, evidence, confidence, source freshness, dry-run result, side effects, rollback options, and audit metadata.
  • Design the review so the first screen names the action and target, then exposes risk inventory, evidence, alternatives, and execution controls without hiding the payload behind generic warnings.
  • Add revalidation for stale payloads, changed recipients, changed permissions, expired evidence, maintenance windows, policy changes, model output changes, and external-state drift.
  • Support safe alternatives such as edit, reduce scope, dry run, schedule, sandbox, request approval, escalate, cancel, or open supporting evidence.
  • Prevent duplicate execution and make the in-progress boundary clear after the reviewer chooses the dangerous action.
  • Record what was visible at review time and what actually executed, without logging secrets or unnecessary sensitive payload details.
  • Test stale notification decisions, changed payloads, permission loss, duplicate clicks, mobile wrapping, keyboard flow, screen-reader status, audit receipt, failed execution, and post-run follow-up.

Common Generated-UI Mistakes

  • Using vague Are you sure, Continue, or Proceed copy without naming the dangerous operation.
  • Showing a risk warning but hiding the action payload, target, recipients, amount, command, environment, or external effect.
  • Treating confidence, risk color, or severity score as a substitute for reviewable evidence.
  • Allowing stale email or mobile approvals to execute a changed action.
  • Combining many dangerous operations under one approval when the UI only described one of them.
  • Using dangerous-action review for routine low-impact actions until reviewers stop reading.
  • Recording only that a user clicked Run without preserving the shown payload and risk basis.

Critique Questions

  • What exact action, target, payload, and side effects will execute if the reviewer continues?
  • What makes this action dangerous: money, access, production, customer, legal, security, external recipient, data sensitivity, safety, or physical-world impact?
  • What evidence, policy trigger, confidence, or freshness check supports the action, and what is missing?
  • Can the reviewer edit, reduce scope, dry run, cancel, defer, request approval, or escalate instead of running?
  • What revalidation prevents stale notifications, changed payloads, or lost permissions from executing?
  • What audit record proves what the reviewer saw and what actually ran?
  • Would destructive action confirmation, typed confirmation, human approval gate, security warning, or review before submit be more precise?
Accessibility
  • Use headings and labels that name the action and target before risk details.
  • Expose risk inventory, evidence gaps, stale state, safe alternatives, and execution status as text, not only color, icons, or severity badges.
  • Keep Run, Edit, Dry run, Cancel, Defer, Escalate, and evidence links keyboard reachable and labelled by outcome.
  • Announce stale, blocked, edited, dry-run complete, executing, executed, failed, canceled, and escalated states through status text.
  • Wrap long commands, recipients, policy names, evidence links, and payload excerpts without horizontal scrolling.
  • Avoid reading raw secrets aloud or placing sensitive payload values in labels, URLs, logs, or analytics.
Keyboard Behavior
  • Tab order reaches the action summary, risk inventory, evidence, alternatives, acknowledgement or blockers, and execution controls in order.
  • Enter or Space activates Run, Edit, Dry run, Cancel, Defer, Escalate, and Request approval controls according to native behavior.
  • Run stays disabled while risk inventory, freshness, permission, or dependency checks fail.
  • Escape or Cancel leaves the action unexecuted while the review is still in preview state.
  • After edit, focus returns to the updated review summary rather than preserving a stale Run button.
  • After execution, focus moves to the audit receipt or result status.
Variants
  • Privileged agent tool review
  • Production command review
  • Customer email send review
  • Refund or payment review
  • Access grant review
  • Public publish review
  • Legal submission review
  • Moderation action review
  • Physical-world operation review
  • Dry-run before execute
  • Stale dangerous action review
  • Mobile dangerous action review

Verification

Last verified: