Tool-use visibility vs Agent progress trace vs Source grounding display vs Citation display vs Activity log vs Human approval gate

Choose tool-use visibility when users need inspectable AI tool calls with tool name, tool input, tool output, permission scope, target object, side-effect risk, redaction reason, raw payload eligibility, and tool status.

Tool-use visibility live example Agent progress trace live example Source grounding display live example Citation display live example Activity log live example Human approval gate live example

Decision dimensions

Dimension	Tool-use visibility	Agent progress trace	Source grounding display	Citation display	Activity log	Human approval gate
UI or UX	UI + UX - Inspectable disclosure of AI agent tool calls, inputs, outputs, permissions, and side effects	UI + UX - Live execution trace for an AI agent or automation run after work has started	UI + UX - Whole-answer source coverage and grounding evidence display	UI + UX - Inline claim-to-source evidence display for generated or summarized content	UI + UX - Searchable and exportable record of system, user, or administrative events	UI + UX - Runtime checkpoint that pauses AI or automation until an eligible human authorizes the next step
UI guidance	Render tool-use visibility as an inspectable tool-call surface with tool name, purpose, input summary, output summary, permission scope, data source, side-effect risk, status, timestamps, and redaction state.	Render an agent progress trace as a live, ordered run timeline with run ID, plan version, current step, queued steps, active tool or task, elapsed time, last event time, step status, blocked gates, retry state, and final outcome.	Render source grounding as an answer-wide evidence panel that separates source scope, searched sources, retrieved sources, used sources, supported claims, partially supported claims, unsupported claims, and unresolved source states.	Render citation markers beside the claims they support, and connect each marker to a selected source preview with title, source type, excerpt, date or version, permission state, and open source action.	Render activity logs as evidence-oriented records with event time, actor, action, object, source system, scope, result, and technical context such as IP address or location when available.	Render a human approval gate as a paused automation checkpoint with the proposed action, tool or workflow step, triggering rule, risk level, payload snapshot, requester or agent, approver eligibility, timeout, and explicit approve, reject, edit, cancel, or bypass controls.
UX guidance	Use tool-use visibility when users need to understand, verify, approve, debug, or audit how an AI agent used tools, data, or integrations during a run.	Use agent progress trace when an AI agent or automation has started multi-step work and users need to monitor progress, intervene on stalls or gates, understand partial completion, and know whether the reviewed plan is still being followed.	Use source grounding display when users need to judge whether an AI answer is backed by the right body of evidence, not merely open one citation.	Use citation display when users must verify where a generated claim, summary, or recommendation came from without leaving the answer context.	Use activity log when users need to investigate, audit, verify, or troubleshoot actions across accounts, objects, systems, settings, or security boundaries.	Use human approval gate when automation is ready to act but policy, risk, confidence, cost, access, publication, deployment, customer impact, or legal consequence requires a human decision before execution continues.
Good UI	A research agent shows Knowledge search, Web search, and CRM lookup tool cards with purpose, input summary, source scope, output summary, status, and redacted raw request details.	An account research agent trace shows Run A-204, reviewed plan P-18, completed CRM lookup, active policy search, queued draft email, approval gate pending before send, elapsed time, and a View tool details control.	A policy answer includes a Grounding panel showing 4 sources searched, 3 retrieved, 2 used, 5 supported claims, 1 partially supported claim, and 1 unsupported claim with a Review action.	A policy assistant places numbered citation chips after each sourced claim; selecting a chip opens a source preview with the document title, section, quoted excerpt, updated date, and Open source action.	An organization audit log table shows timestamp, actor, action, target object, app, IP address, result, and a Details drawer with before and after fields.	An AI support agent pauses before issuing a refund, shows the proposed amount, customer, policy match, confidence, source grounding, approver role, timeout, Approve refund, Edit amount, Reject, and Stop run controls.
Bad UI	A trace says Using tools but never names the tool, input, source, output, permission, or side effect.	A spinner says Working on it while an agent calls several tools with no step identity, elapsed time, blocked state, or recovery path.	The answer shows a green Grounded badge even though only one citation supports one paragraph.	An answer ends with five links under Sources but no marker shows which link supports which claim.	A page titled Activity shows vague entries such as Changed settings with no actor, target, timestamp, or source.	A banner says Human approval needed but does not show the tool call, payload, approver, timeout, or resume consequence.
Good UX	A user opens the active CRM lookup, sees it is read-only, verifies the account ID, and continues watching the run.	A user watches the active step move from searching policies to drafting the email, opens the blocked permission item, grants access, and sees the run continue from the same step.	A reviewer opens the grounding panel, sees that the answer used the current policy but not the outdated FAQ, and flags one unsupported claim before publishing.	A user checks a claim, opens its source preview, compares the quoted excerpt with the answer text, and copies the citation with the source title included.	An admin filters to failed SSO events, expands one entry, copies the event ID, exports the filtered range, and sees that records older than 180 days require a different archive.	A billing lead opens the paused refund gate, sees that the amount is under policy but source grounding is partial, edits the refund to the verified amount, approves, and the agent resumes only that step.
Bad UX	Users cannot tell whether the agent searched the web, read private files, or changed customer data.	Users cannot tell whether the agent is stuck, waiting for approval, or finished because all states use the same animated progress label.	A user trusts a generated answer because the product says Grounded, but the source scope was only web search and did not include internal policy.	A user trusts a generated compliance claim because it has a number beside it, but the number points to an unrelated source.	A user marks a notification read and the corresponding activity evidence disappears from the only log.	A human approves a stale agent action from email and the agent applies it to a different customer state.
Best fit	An AI agent or automation calls tools, functions, APIs, retrieval systems, commands, or integrations.	An agent or automation run has started and spans multiple steps, tools, gates, or side effects.	Users need answer-wide evidence coverage before trusting generated content.	Users need to verify generated claims, summaries, recommendations, or extracted facts against source material.	Users need to inspect recorded user, admin, system, security, or integration events.	An AI agent, workflow, deployment, or automation is ready to perform a high-impact step and must pause for human authorization.
Avoid when	The system cannot reliably identify tool calls, inputs, outputs, status, permissions, or side effects.	Execution has not started and users need to inspect or edit a proposed plan.	The system cannot determine source scope, retrieval status, or claim support reliably.	The product cannot reliably map claims to sources or label unresolved citations honestly.	The goal is only to show a readable milestone history for one case or process.	The action has already happened and users only need an audit log.
Required state	Pending tool call state with tool name, purpose, requested permission, and side-effect risk.	Run started state tied to run ID, plan version, objective, and user who started the run.	Default grounded state with source scope, searched sources, retrieved sources, used sources, and supported-claim count.	Default answer with cited claims and inline citation markers.	Default log state with event records, result count, visible timezone, retention window, and permission scope.	Paused gate state with proposed action, payload snapshot, reason for gate, and run context.
Accessibility burden	Expose tool name, status, permission, risk, input summary, output summary, and redaction reason as text.	Expose trace status, run ID, current step, elapsed time, blocked state, final outcome, and details availability as text.	Expose grounding summary, source scope, status counts, unsupported claims, and source groups as text.	Give citation markers accessible names that include their selected state and source status, such as Citation 2, verified source, or Citation pending.	Use table or structured list semantics so actor, action, object, timestamp, result, and scope are perceivable together.	Expose gate status, proposed action, target, payload summary, risk, approver rule, timeout, and current run state as text.
Common misuse	Showing a vague Using tools label without names, inputs, outputs, or permissions.	Using one spinner or vague Thinking label for a multi-step agent run.	Showing a global Grounded badge when only some claims have evidence.	Displaying a link dump below the answer instead of mapping sources to specific claims.	Calling a social feed or notification drawer an activity log without event evidence.	Showing Approve without the exact action, payload, target, risk, or resume consequence.

Tool-use visibility

UI or UX: UI + UX - Inspectable disclosure of AI agent tool calls, inputs, outputs, permissions, and side effects
UI guidance: Render tool-use visibility as an inspectable tool-call surface with tool name, purpose, input summary, output summary, permission scope, data source, side-effect risk, status, timestamps, and redaction state.
UX guidance: Use tool-use visibility when users need to understand, verify, approve, debug, or audit how an AI agent used tools, data, or integrations during a run.
Good UI: A research agent shows Knowledge search, Web search, and CRM lookup tool cards with purpose, input summary, source scope, output summary, status, and redacted raw request details.
Bad UI: A trace says Using tools but never names the tool, input, source, output, permission, or side effect.
Good UX: A user opens the active CRM lookup, sees it is read-only, verifies the account ID, and continues watching the run.
Bad UX: Users cannot tell whether the agent searched the web, read private files, or changed customer data.
Best fit: An AI agent or automation calls tools, functions, APIs, retrieval systems, commands, or integrations.
Avoid when: The system cannot reliably identify tool calls, inputs, outputs, status, permissions, or side effects.
Required state: Pending tool call state with tool name, purpose, requested permission, and side-effect risk.
Accessibility burden: Expose tool name, status, permission, risk, input summary, output summary, and redaction reason as text.
Common misuse: Showing a vague Using tools label without names, inputs, outputs, or permissions.

Agent progress trace

UI or UX: UI + UX - Live execution trace for an AI agent or automation run after work has started
UI guidance: Render an agent progress trace as a live, ordered run timeline with run ID, plan version, current step, queued steps, active tool or task, elapsed time, last event time, step status, blocked gates, retry state, and final outcome.
UX guidance: Use agent progress trace when an AI agent or automation has started multi-step work and users need to monitor progress, intervene on stalls or gates, understand partial completion, and know whether the reviewed plan is still being followed.
Good UI: An account research agent trace shows Run A-204, reviewed plan P-18, completed CRM lookup, active policy search, queued draft email, approval gate pending before send, elapsed time, and a View tool details control.
Bad UI: A spinner says Working on it while an agent calls several tools with no step identity, elapsed time, blocked state, or recovery path.
Good UX: A user watches the active step move from searching policies to drafting the email, opens the blocked permission item, grants access, and sees the run continue from the same step.
Bad UX: Users cannot tell whether the agent is stuck, waiting for approval, or finished because all states use the same animated progress label.
Best fit: An agent or automation run has started and spans multiple steps, tools, gates, or side effects.
Avoid when: Execution has not started and users need to inspect or edit a proposed plan.
Required state: Run started state tied to run ID, plan version, objective, and user who started the run.
Accessibility burden: Expose trace status, run ID, current step, elapsed time, blocked state, final outcome, and details availability as text.
Common misuse: Using one spinner or vague Thinking label for a multi-step agent run.

Source grounding display

UI or UX: UI + UX - Whole-answer source coverage and grounding evidence display
UI guidance: Render source grounding as an answer-wide evidence panel that separates source scope, searched sources, retrieved sources, used sources, supported claims, partially supported claims, unsupported claims, and unresolved source states.
UX guidance: Use source grounding display when users need to judge whether an AI answer is backed by the right body of evidence, not merely open one citation.
Good UI: A policy answer includes a Grounding panel showing 4 sources searched, 3 retrieved, 2 used, 5 supported claims, 1 partially supported claim, and 1 unsupported claim with a Review action.
Bad UI: The answer shows a green Grounded badge even though only one citation supports one paragraph.
Good UX: A reviewer opens the grounding panel, sees that the answer used the current policy but not the outdated FAQ, and flags one unsupported claim before publishing.
Bad UX: A user trusts a generated answer because the product says Grounded, but the source scope was only web search and did not include internal policy.
Best fit: Users need answer-wide evidence coverage before trusting generated content.
Avoid when: The system cannot determine source scope, retrieval status, or claim support reliably.
Required state: Default grounded state with source scope, searched sources, retrieved sources, used sources, and supported-claim count.
Accessibility burden: Expose grounding summary, source scope, status counts, unsupported claims, and source groups as text.
Common misuse: Showing a global Grounded badge when only some claims have evidence.

Citation display

UI or UX: UI + UX - Inline claim-to-source evidence display for generated or summarized content
UI guidance: Render citation markers beside the claims they support, and connect each marker to a selected source preview with title, source type, excerpt, date or version, permission state, and open source action.
UX guidance: Use citation display when users must verify where a generated claim, summary, or recommendation came from without leaving the answer context.
Good UI: A policy assistant places numbered citation chips after each sourced claim; selecting a chip opens a source preview with the document title, section, quoted excerpt, updated date, and Open source action.
Bad UI: An answer ends with five links under Sources but no marker shows which link supports which claim.
Good UX: A user checks a claim, opens its source preview, compares the quoted excerpt with the answer text, and copies the citation with the source title included.
Bad UX: A user trusts a generated compliance claim because it has a number beside it, but the number points to an unrelated source.
Best fit: Users need to verify generated claims, summaries, recommendations, or extracted facts against source material.
Avoid when: The product cannot reliably map claims to sources or label unresolved citations honestly.
Required state: Default answer with cited claims and inline citation markers.
Accessibility burden: Give citation markers accessible names that include their selected state and source status, such as Citation 2, verified source, or Citation pending.
Common misuse: Displaying a link dump below the answer instead of mapping sources to specific claims.

Activity log

UI or UX: UI + UX - Searchable and exportable record of system, user, or administrative events
UI guidance: Render activity logs as evidence-oriented records with event time, actor, action, object, source system, scope, result, and technical context such as IP address or location when available.
UX guidance: Use activity log when users need to investigate, audit, verify, or troubleshoot actions across accounts, objects, systems, settings, or security boundaries.
Good UI: An organization audit log table shows timestamp, actor, action, target object, app, IP address, result, and a Details drawer with before and after fields.
Bad UI: A page titled Activity shows vague entries such as Changed settings with no actor, target, timestamp, or source.
Good UX: An admin filters to failed SSO events, expands one entry, copies the event ID, exports the filtered range, and sees that records older than 180 days require a different archive.
Bad UX: A user marks a notification read and the corresponding activity evidence disappears from the only log.
Best fit: Users need to inspect recorded user, admin, system, security, or integration events.
Avoid when: The goal is only to show a readable milestone history for one case or process.
Required state: Default log state with event records, result count, visible timezone, retention window, and permission scope.
Accessibility burden: Use table or structured list semantics so actor, action, object, timestamp, result, and scope are perceivable together.
Common misuse: Calling a social feed or notification drawer an activity log without event evidence.

Human approval gate

UI or UX: UI + UX - Runtime checkpoint that pauses AI or automation until an eligible human authorizes the next step
UI guidance: Render a human approval gate as a paused automation checkpoint with the proposed action, tool or workflow step, triggering rule, risk level, payload snapshot, requester or agent, approver eligibility, timeout, and explicit approve, reject, edit, cancel, or bypass controls.
UX guidance: Use human approval gate when automation is ready to act but policy, risk, confidence, cost, access, publication, deployment, customer impact, or legal consequence requires a human decision before execution continues.
Good UI: An AI support agent pauses before issuing a refund, shows the proposed amount, customer, policy match, confidence, source grounding, approver role, timeout, Approve refund, Edit amount, Reject, and Stop run controls.
Bad UI: A banner says Human approval needed but does not show the tool call, payload, approver, timeout, or resume consequence.
Good UX: A billing lead opens the paused refund gate, sees that the amount is under policy but source grounding is partial, edits the refund to the verified amount, approves, and the agent resumes only that step.
Bad UX: A human approves a stale agent action from email and the agent applies it to a different customer state.
Best fit: An AI agent, workflow, deployment, or automation is ready to perform a high-impact step and must pause for human authorization.
Avoid when: The action has already happened and users only need an audit log.
Required state: Paused gate state with proposed action, payload snapshot, reason for gate, and run context.
Accessibility burden: Expose gate status, proposed action, target, payload summary, risk, approver rule, timeout, and current run state as text.
Common misuse: Showing Approve without the exact action, payload, target, risk, or resume consequence.

Decision rules

Choose tool-use visibility when users need inspectable AI tool calls with tool name, tool input, tool output, permission scope, target object, side-effect risk, redaction reason, raw payload eligibility, and tool status.
Choose agent progress trace when users need the run-level ordered sequence of steps, current step, queued step, blocker, retry, cancellation, and final outcome.
Choose source grounding display when users need whole-answer evidence coverage, source scope, retrieved sources, used evidence, and unsupported claims.
Choose citation display when users need claim-level source preview and claim-to-source mapping.
Choose activity log when users need retained event evidence, export, retention, actor, timestamp, and audit investigation after the run.
Choose human approval gate when one tool call or automation step is a paused authorization decision and an eligible human must approve, reject, edit, or bypass before execution continues.
Tool-use visibility should show pending, running, succeeded, failed, denied, redacted, permission-limited, retrying, approval-required, side-effect preview, side-effect executed, and stale result states.
The tool surface should summarize safely before exposing raw request, raw response, stack trace, command output, file path, source excerpt, private record, credential, or hidden prompt detail.
Use redaction labels, role-gated details, copy tool ID, open source, open target, view audit, retry, cancel, grant permission, and open approval gate actions according to tool state.
Do not replace tool-use visibility with a vague Using tools label, raw JSON dump, progress spinner, citation list, grounding badge, or post-hoc audit row.
Do not treat tool output as verified evidence unless citation display or source grounding display shows how the output supports the generated claim.

Inspect live examples

Tool-use visibility Agent progress trace Source grounding display Citation display Activity log Human approval gate

Failure modes

Tool calls are hidden behind a spinner or Thinking label.
Raw payloads leak secrets, credentials, private records, hidden prompts, or irrelevant internal events.
A side-effecting tool call executes without preview, permission scope, or human approval gate.
A failed, denied, partial, stale, or redacted tool output is shown as simple success.
Retrying a side-effecting tool creates duplicate changes because idempotency and target object are hidden.
The final audit record cannot be tied back to tool call ID, run ID, step ID, and reviewed plan version.